Our team is working on high-quality speech synthesis for all Yandex products. This includes, for example, video translation and voice-over in the Browser, Bookmate audiobooks, Alice, and geo-products. We are looking for a colleague who wants to improve the intonation of synthesized speech together with us.

What tasks await you

Training TTS models for Alice and Bookmate
The higher the quality of speech synthesis, the more comfortable the experience is for the user. If the synthesis is monotonous and emotionless, the user will not want to listen to an audiobook or talk to a voice assistant. That is why we are improving intonation and adding emotion to the synthesis. You will do a lot of research and train SOTA models.

Synthesis prompting
Many datasets are now appearing that contain not only audio and text but also a prompt describing the speaking style, for example, 'fast reading in a high-pitched female voice with expressive pauses.' Your task is to make the synthesis controllable through such prompts. To do this, you will need to implement many modern approaches and generate new ideas.

Dataset generation
Many prompt datasets are generated synthetically. You will need to develop pipelines of many neural networks that help collect such datasets and, where existing models fall short, train new ones from scratch or fine-tune them.

We expect that you

Have excellent knowledge of ML and DL
Are well acquainted with Python

It will be a plus if you

Have accelerated neural networks and deployed them to production
Have worked with ML in the field of voice technologies: in ASR, voice biometrics, text-to-speech, etc.
Have worked with NLP
Have trained diffusion models
