Cloud Engineer (OpenStack)
We are looking for a Cloud Engineer (OpenStack) to help develop and scale Server Moll Cloud, an enterprise-grade public cloud platform based on OpenStack.
You will take part in an infrastructure modernization and automation project: upgrading OpenStack to the latest release, migrating to a KVM + Ceph stack, implementing consumption metering for pay-as-you-go billing, developing PaaS-level services (Kubernetes, DBaaS), and improving the cloud's fault tolerance.
This is an opportunity to apply deep technical expertise, solve non-trivial problems, and influence the architectural decisions of a rapidly growing cloud platform.
Format: remote (office/hybrid optional).
Office: Saint Petersburg.
Team: you will join as the second key engineer, with room for the team to grow.
Responsibilities
- OpenStack Administration and Development: maintaining the existing platform (currently VMware Integrated OpenStack on the Train/Ussuri releases from 2019–2020), and planning and carrying out upgrades to current OpenStack releases. Implementing consumption metering for the pay-as-you-go model and rolling out functionality from new releases.
- Project Isolation and Security: modernizing the RBAC system to ensure strict isolation between client projects (multi-tenancy). Configuring and managing access, roles, and quotas in OpenStack.
- Integrations and Tools: helping refine the UI, CLI, and API for our deployment; working with the development team on the extensions it requires.
- Monitoring and Reliability: configuring monitoring and alerting, improving fault tolerance, identifying bottlenecks, and optimizing performance.
- Support and Incidents: third-line support for the cloud platform, including diagnosing complex incidents, participating in post-mortems, and preventing recurrence.
- Documentation and Automation: maintaining up-to-date technical documentation; automating deployment and operations using Terraform and Ansible.
- Further Development: participating in the launch and development of the Kubernetes and database-as-a-service (DBaaS) offerings, as well as the migration to a KVM + Ceph stack.
Requirements
- Linux and Infrastructure: 3+ years of experience with Linux and cloud systems; deep understanding of how Linux works at the process, network, and storage levels.
- OpenStack: hands-on operational experience and an understanding of the architecture and key components (Nova, Neutron, Cinder, Glance, Keystone, etc.); experience with upgrades and patching is a plus.
- Containers and Orchestration: experience with Docker and Kubernetes.
- Virtualization and Networking: knowledge of virtualization technologies (KVM/QEMU, VMware vSphere) and network technologies (TCP/IP, VLAN, VPN, etc.); understanding of network services and protocols.
- Automation and CI/CD: Terraform, Ansible, GitLab CI/CD.
- Monitoring and Related Services: familiarity with Prometheus, Grafana, and VictoriaMetrics, plus a working understanding of MySQL/PostgreSQL sufficient to work with adjacent services.
- Programming Skills: proficiency in Bash and Python; ability to write automation scripts and utilities, read and understand source code, and develop auxiliary tools for integration and administration.
- Troubleshooting: a systematic approach to finding the root causes of problems in distributed systems.
- Communication: teamwork, collaboration with development and other departments, documentation, and knowledge sharing.
- English: sufficient to read technical documentation.
Conditions
- Format: remote work, with the option of working from the Saint Petersburg office or visiting it on a flexible schedule at your discretion. Flexible, results-oriented working hours.
- Projects and Development: the opportunity to help build and develop a large cloud platform, work with OpenStack, Kubernetes, and DBaaS, and see the results of your work. Participation in large-scale infrastructure modernization projects (e.g., OpenStack upgrades and the launch of new services).
- Team: a small, highly professional team with direct interaction with the lead cloud architect.