Reach out directly about this role
ML Research Engineer for Video Translation in the Browser
We are seeking a strong ML Research Engineer ready to build from scratch and innovate. You will work on a critically important direction of the speech synthesis service — the video translation technology in Yandex Browser. Our goal is to bring the voice of translated video to a level where the original delivery, naturalness and emotionality, intonation patterns, and even singing style are preserved.
Your developments will provide millions of users with the most natural and emotionally accurate translation of video content, breaking down language barriers and making content even more accessible and vibrant. You will work on the most challenging areas of ML, solving tasks related to transferring and preserving all the nuances of speech.
Development and Implementation of ML Models For accurate analysis of the source audio track and transferring intonation, emotional coloring, and speech pace into the synthesized translation.
Work on Reproducing Non-Verbal Communication Elements Laughter, sighs, pauses, interjections, and other natural speech features are critically important for maximum translation naturalness.
Research and Creation of Algorithms For successful copying and synthesis of singing in the target language.
Analysis and Improvement of Algorithms To minimize pronunciation errors in speech synthesis.
More about ML at Yandex — in the channel Yandex for ML
3 years
Experience
Full-time
Employment
Senior
Grade
Data Science & ML
Specialization
IT & Tech
Industry
Corporation
Company Type
IT & Tech
Industry
Corporation
Company Type