Description

Hello! This is GigaChat Reasoning — the team that gives the model the superpower to reason. We create environments, train via online RL, accelerate learning, and bring solutions to production.

Directions

Improving GigaChat Reasoning: full training cycle from cold start to deploying the model to production. Adding new domains, creating datasets, and answer evaluation functions.

Developing agent skills and tool calling using Online RL: creating training environments for LLMs, training and testing models.

Improving the Deep Research product

For these roles, we are looking for a talented NLP Engineer with knowledge and experience in Reinforcement Learning. For all these experiments, we have a cluster with a large number of A/H 100s.

Responsibilities

Improve the quality of GigaChat Reasoning in Russian and English languages
Accelerate the training pipeline: profiling bottlenecks, efficient sampling.
Test new loss functions and training approaches
Help deploy to production everything we train.
Constantly stay up-to-date with the latest papers.

Requirements

Experience in online RL and good theoretical knowledge
Proficient in Python, PyTorch.
Knowledge of basic algorithms and mathematics.
Knowledge in DL, experience training simple and large models.
Experience training models for production.
Understanding of the current state of evolution of large LLMs.
Having publications is a plus.

Conditions

Remote within Russia.

Possibility of employment in an IT-accredited company
Annual performance bonus up to 6 months' salary.
Regular salary reviews.
Corporate gym and relaxation areas.
More than 400 programs of SberUniversity for growth.
Onboarding program and manager's support at the start.
Largest DS&AI community – over 600 DS from the bank, regular exchange of knowledge, experience and best practices, interactive lectures and master classes from leading universities and experts from technology companies, digest of the latest developments in DS&AI and reports from the world's largest conferences, regular internal meetups.
Extended health insurance, preferential family insurance, corporate pension program.
Employee mortgage under a discount program.
SberPrime+ and discounts with partners.
Referral bonus for recommendations to the team

Contacts

Description

Responsibilities

Requirements

Conditions

Similar vacancies

Senior NLP Researcher (RnD GigaChat)

NLP/LLM Researcher

Team Lead Data Science NLP

Senior NLP Engineer (GigaChat)

NLP Engineer at GigaChat Alignment

Senior Data Scientist

Senior NLP Data Scientist (AI Agents)

Senior NLP Data Scientist (Knowledge Management team)

Senior LLM Researcher (Center for Applied Artificial Intelligence)

Data Scientist (R&D NLP)

Senior Data Scientist NLP | RND TeamLead in LegaTech

Data Scientist / ML Researcher (Multi-agent AI Assistant)

Senior NLP Engineer with knowledge and experience in Reinforcement Learning

Key Skills

Details

Average salary for this role