Reach out directly about this role
We are GigaChat Alignment. We make the model useful and reliable: SFT/DPO, distillation into small models, LoRA service, metrics and validation pipelines. We quickly test hypotheses, accelerate training, and roll out improvements to production – first for internal clients, then for all of Russia.
Areas:
improving SFT / DPO: testing new training approaches, accelerating pipelines, generating new data, distilling knowledge from large LLMs to small ones.
Developing GigaChat's quality metrics, for example, by assessing its ability to solve international-level Olympiad problems. Developing internal LLM-AS-A-JUDGE.
Developing the LoRA training service for GigaChat and GigaEmbedder. Increasing the stability and reproducibility of runs, creating validation and data generation pipelines using LLMs.
For these roles, we are looking for a talented NLP Engineer to work together to improve and develop GigaChat. For all these experiments, we have a cluster with a large number of A/H100s.
1-3 years
Experience
Full-time
Employment
Onsite
Work Format
Middle
Grade
Data Science & ML
Specialization
IT & Tech
Industry
Corporation
Company Type
By city
Data Science & ML
Specialization
IT & Tech
Industry
Corporation
Company Type