Description
Our team is responsible for the quality of speech synthesis models in GigaChat. We are looking for people who will work on model quality, multimodal GigaChat, and other exciting projects.
Responsibilities
- improve existing speech synthesis models and create new solutions
- ensure our models lead on key quality metrics
- support the deployment of developed models into the production environment
- participate in research projects, including training acceleration, voice cloning, low-resource learning, and the application of reinforcement learning methods
- read scientific papers and discuss the acquired knowledge and results within the team
- present your solutions at internal seminars and publish on Habr and in our Telegram channel.
Requirements
- commercial experience with Python; knowledge of fundamental data structures and algorithms; a solid understanding of mathematics and its applications in the field
- data science experience developing and training deep neural networks, with a focus on audio processing
- a track record of integrating models into production systems
- broad knowledge of fields directly related to speech synthesis: NLP, linguistics, the Russian language, biology, and physics
- programming skills in C++
- publications and experience participating in scientific conferences are an advantage.
Conditions
- hybrid or remote work format (Russia)
- annual salary review and annual bonus
- corporate gym and recreation areas
- more than 400 educational programs from SberUniversity for professional and career development
- extended voluntary health insurance, preferential family insurance, and a corporate pension program
- flexible mortgage discount equal to 1/3 of the Central Bank's key rate
- free SberPrime+ subscription and discounts on products from partner companies.