Description

We are GigaChat Alignment. We make the model useful and reliable: SFT/DPO, distillation into small models, LoRA service, metrics and validation pipelines. We quickly test hypotheses, accelerate training, and roll out improvements to production – first for internal clients, then for all of Russia.

Areas:

improving SFT / DPO: testing new training approaches, accelerating pipelines, generating new data, distilling knowledge from large LLMs to small ones.

Developing GigaChat's quality metrics, for example, by assessing its ability to solve international-level Olympiad problems. Developing internal LLM-AS-A-JUDGE.

Developing the LoRA training service for GigaChat and GigaEmbedder. Increasing the stability and reproducibility of runs, creating validation and data generation pipelines using LLMs.

For these roles, we are looking for a talented NLP Engineer to work together to improve and develop GigaChat. For all these experiments, we have a cluster with a large number of A/H100s.

Responsibilities

improve the quality of GigaChat's performance in Russian and English
help solve business problems using our technology, first for internal clients at Sber, and then for external ones
come up with and implement new applications for LLMs
help deploy to production everything we train
constantly stay up-to-date with the latest research papers.

Requirements

confident knowledge of Python, PyTorch
knowledge of basic algorithms and mathematics
knowledge in DL, experience training simple and large models
experience training models for production
understanding of the current state of evolution of large LLMs
having publications is a plus.

Conditions

remote work within Russia.
possibility of employment through an accredited IT company
annual performance-based bonus
regular salary reviews
corporate gym and relaxation areas
more than 400 SberUniversity programs for growth
adaptation program and manager support at the start
largest DS&AI community – more than 600 DS professionals from the bank, regular knowledge exchange, experience sharing, and best practices, interactive lectures and master classes from leading universities and experts of tech companies, digest of the latest developments in the field of DS&AI and reports from major world conferences, regular internal meetups
voluntary health insurance (DMS), preferential family insurance, corporate pension program
employee mortgage under a discount program
SberPrime+ and discounts with partners
referral bonus for team recommendations.

Contacts

Description

Responsibilities

Requirements

Conditions

Similar vacancies

ML Engineer LLM GigaChat

Senior NLP Researcher (RnD GigaChat)

Senior LLM Researcher (Center for Applied Artificial Intelligence)

ML Engineer (GigaChat Data)

Senior NLP Engineer (GigaChat)

NLP/LLM Researcher

NLP Engineer (GigaChat Pretrain)

Team Lead Data Science NLP

ML Engineer (GigaChat Data)

Senior-Midle NLP Engineer

DS/LLM Engineer (Center for Practical AI)

Team Lead ML TTS GigaChat Data

NLP Engineer at GigaChat Alignment

Key Skills

Details

Average salary for this role