DevOps Engineer (Senior)
We are looking for a Senior DevOps Engineer to develop a universal Kubernetes platform and related infrastructure services. The role focuses on building a reliable, scalable, and automated platform, including CI/CD, observability, security, and IAM elements.
Responsibilities:
- Design, develop, and maintain CI/CD pipelines for platform services.
- Administer and develop Kubernetes clusters in production.
- Implement and maintain SRE practices (incidents, post-mortems, SLI/SLO).
- Develop infrastructure as code (Terraform, Ansible).
- Configure monitoring, logging, and alerting (Prometheus, Grafana, ELK/Loki).
- Support containerized applications and Docker environments.
- Work with Nginx as a reverse proxy and ingress layer.
- Automate infrastructure tasks (Bash/Python).
- Participate in incident analysis and system performance optimization.
- Maintain technical documentation (runbooks, architecture, processes).
What we expect from our future colleague:
- 3-5 years of experience as a DevOps/SRE engineer.
- Experience with Kubernetes in production (deployment, maintenance, troubleshooting).
- Strong Linux skills (systemd, cgroups, networking, problem diagnostics).
- Experience with Docker (building, optimization, containerizing services).
- Experience building CI/CD (Jenkins and/or GitLab CI/CD).
- Experience with Helm and the Kubernetes ecosystem.
- Experience with Prometheus, Grafana, and logging systems.
- Understanding of networks: TCP/IP, DNS, TLS, routing fundamentals, and diagnostics (tcpdump, curl).
- Experience with Nginx (reverse proxy, TLS termination, basic security).
- Automation skills with Bash and/or Python.
- Experience with Git and standard branching workflows.
- Basic understanding of IAM, OAuth2 / OIDC.
- Basic understanding of TLS/PKI (certificates, HTTPS, CA).
- Experience with Terraform and/or Ansible (basic or intermediate level).
Will be an advantage:
- Deep experience operating Keycloak (HA, clustering, Infinispan, session management).
- Expert understanding of IAM platforms and OAuth2/OIDC architectures.
- Experience with Istio Service Mesh: mTLS, traffic splitting / canary deployments, AuthorizationPolicy / RequestAuthentication, service mesh troubleshooting.
- Experience building SLI/SLO and mature SRE practices.
- Experience with advanced Kubernetes (networking, security, CRD/operators).
- Experience with Service Mesh observability (Kiali, Jaeger, distributed tracing).
- Experience with Vault, HSM, or Sealed Secrets.
- Experience implementing GitOps (ArgoCD / FluxCD).
- Experience with OpenShift or enterprise Kubernetes distributions.
- Experience with TeamCity, Hadoop, JBoss/WildFly.
Conditions
- Engagement period: 3–6 months (with possibility of extension)
- Employment type: Full-time
- Work format: Remote (only from Russia)
- Salary: 800–1050 RUB/hour
- Selection process: a small test task (up to 1 hour)
Key Skills
Kubernetes / Docker / CI/CD / Terraform / Linux / Prometheus