Analyst Developer at Neuro

We are developing a new Neuro technology for Alice. It solves search tasks in a minute that typically take users several hours. Our goal is to confidently defeat competitors in the Russian market, which includes very strong international players.

The technology is based on an LLM neural network. It communicates with the user in dialogue mode, analyzes data from the internet, and writes accessible, informative, clear, and reliable answers. The first product release was launched in 2024 under the name Neuro and brought significant profit to Yandex Search. This technology has now become part of Alice and continues to evolve actively.

Our analytics team is key to the development of the new Alice. Every six months, together with ML and product teams, we prepare the next technology release. Each release is based on offline quality labeling of answers, which we develop. Using labeling, we assess how well Alice's answers meet our product goals and user expectations: * We compare answer quality Side-by-Side * We measure the amount of false information and hallucinations in answers through fact-checking * We use crowd labeling (human task completion), train LLM-as-a-judge, and assemble LLM-based assistants to help performers in Yandex Crowd.

Labeling is used as the main tool for measuring quality — just like a target for RLHF when training neural networks. Our projects are among the most complex and large-scale offline quality labeling projects at Yandex. We have unique expertise in this area.

Our team is involved in the entire technology creation cycle, from formulating general product requirements to debugging and accepting finished models. To successfully develop our projects, we need to solve various types of tasks: from technical and analytical to product-related.

What tasks await you

Formalizing product requirements You will need to delve into ambiguous product requirements, help the product team turn them into logical principles and rules that become clear instructions for performers in Yandex Crowd and neural networks. Figure out how to properly reason about answer quality, how to find all factual errors in it and assess their significance for the product.

Finding and fixing LLM bugs through training You will quantify and improve the quality of training for RLHF, together with ML developers constantly analyze problems in current LLM versions and fix them using our labeling, develop and maintain convenient tools for collecting training datasets and measuring metrics in experiments.

Deep analysis of labeling quality You will need to collect and update reference labeling sets (gold sets) that perfectly match product goals and become benchmarks for project development, work with Yandex's AI trainers team, analyze complex task examples with them, formulate and refine product principles based on data.

Developing large projects in Yandex Crowd You will recruit, train, and test performers for labeling, develop processes for ongoing quality control, banning, and rehabilitation, create convenient dashboards that allow quick answers to questions about the performance, cost, and quality of our labeling.

More about analytics at Yandex — in the channel Yandex for Analytics

We expect you to

Understand why it's important to always look at the data
Write confidently in Python and know algorithms
Know mathematical statistics and have used it for hypothesis testing

Will be a plus

Are interested in the product, strive to understand what it should be and why
Can communicate, clearly express thoughts, understand and consider colleagues' opinions, persuade
Have worked with tabular data using SQL
Have experience with crowd projects, for example, have done labeling in Toloka
Know the basics of machine learning

What tasks await you

More about analytics at Yandex — in the channel Yandex for Analytics

Will be a plus

Are interested in the product, strive to understand what it should be and why

Can communicate, clearly express thoughts, understand and consider colleagues' opinions, persuade

Have worked with tabular data using SQL

Have experience with crowd projects, for example, have done labeling in Toloka

Know the basics of machine learning

Analyst Developer at Neuro

Key Skills

Contacts

Details

What tasks await you

We expect you to

Will be a plus

Similar vacancies

Analyst-Developer for the Neuro Quality Assessment Team

Alice AI LLM Analyst-Developer

ML Browser Team Analyst-Developer

Analyst-Developer in the Generative Answers Quality Team

Analyst-Developer for Maps

Verification Analyst-Developer for Search with Alisa

Аналитик-разработчик в команду безопасности Алисы

Analyst-Developer AliceAI (Education)

Analyst-Developer in R&D

Analyst-Developer for the Yandex Crowd Product Analytics Team

ML Analyst for Neurojurist Service

Analyst-Developer for the Elektro Team

Analyst Developer at Neuro

Key Skills

Contacts

Details

What tasks await you

We expect you to

Will be a plus

Similar vacancies

Analyst-Developer for the Neuro Quality Assessment Team

Alice AI LLM Analyst-Developer

ML Browser Team Analyst-Developer

Analyst-Developer in the Generative Answers Quality Team

Analyst-Developer for Maps

Verification Analyst-Developer for Search with Alisa

Аналитик-разработчик в команду безопасности Алисы

Analyst-Developer AliceAI (Education)

Analyst-Developer in R&D

Analyst-Developer for the Yandex Crowd Product Analytics Team

ML Analyst for Neurojurist Service

Analyst-Developer for the Elektro Team