Gerald Shen

Gerald Shen is a deep learning algorithms engineer at NVIDIA, specializing in model alignment. He leads the development of NeMo-Aligner, a scalable toolkit for aligning large language models. The toolkit has been used at NVIDIA to align models with algorithms such as reinforcement learning from human feedback (RLHF).

Posts by Gerald Shen