Yonggan Fu

Yonggan Fu is an upcoming research scientist at NVIDIA Research and is currently a PhD student at the Georgia Institute of Technology, advised by Dr. Yingyan (Celine) Lin. His research focuses on developing efficient and robust AI algorithms for language models and vision models.
Avatar photo

Posts by Yonggan Fu

Generative AI

Hymba Hybrid-Head Architecture Boosts Small Language Model Performance

Transformers, with their attention-based architecture, have become the dominant choice for language models (LMs) due to their strong performance,... 12 MIN READ