Reach out directly about this role
Developer for the Core Team of the Managed Databases Service in Yandex Cloud
We are the team building the foundation for working with data in Yandex Cloud. The solutions we develop are used in truly large-scale systems with high loads and critical availability requirements.
A key focus is building and developing the control plane — the infrastructure that automates the entire database lifecycle: creation, configuration, scaling, and maintenance. Our tasks lie at the intersection of infrastructure and product: from developing services that coordinate operations on database hosts to auto-scaling DB clusters.
We are looking for talented and passionate developers ready to dive into creating and developing high-performance infrastructure around popular open-source DBMSs.
Developing services for DB cluster operations and their coordination mechanisms Cluster availability is affected by factors such as hardware failure, user operations, maintenance, and others. Your task is to design and develop smart systems that, by interacting and coordinating actions with each other, will orchestrate cluster operations (updates, migrations, backups, failover procedures, and others) to be seamless for users, guaranteeing compliance with availability SLAs.
Developing an IaaS service for managing a fleet of thousands of physical servers Our resource allocator must efficiently use computing power, minimizing fragmentation. At the same time, associated operations (creating/changing isolated environments, data migration between host machines, etc.) must be performed quickly and reliably.
Deployment service development Delivering changes (binary artifacts, configurations, OS updates) to hundreds of thousands of hosts (and their number is constantly growing) requires non-trivial solutions in the field of scalability. You will improve and rethink the architecture of our deployment service so it can efficiently and predictably handle multiple increases in the number of managed systems.
Finding and eliminating architectural bottlenecks in terms of reliability, scaling, and maintainability We understand that in a service responsible for data integrity, the focus must be on reliability. Therefore, you will participate in architectural sessions, improve observability, conduct deep performance analysis of key components, and implement best practices to increase the fault tolerance of the entire system. In turn, we are wisely breaking down monolithic components that hinder service development and support into isolated microservices.
More about backend at Yandex — in the channel Yandex for Backend
3-5 years
Experience
Full-time
Employment
Hybrid, Remote, Onsite
Work Format
Middle
Grade
Backend
Specialization
IT & Tech
Industry
Corporation
Company Type
By job title
Backend
Specialization
IT & Tech
Industry
Corporation
Company Type