Senior Dev Ops Engineer (AWS + PostgreSQL)
CluneTech is a global technology group that provides solutions to simplify international business, including digital sales, global payroll, global VAT, tax compliance, and cross-border payments. The group is headquartered in Kilkenny, Ireland, and publicly lists more than 1,200 employees, 35 offices, and operations across 20 countries.
About the role:
We are seeking a Senior AWS DevOps Engineer to support, secure, and evolve mission-critical cloud infrastructure for regulated financial technology workloads.
The successful candidate will play a key role in AWS infrastructure engineering, database administration, Linux systems operations, CI/CD automation, cost optimisation, observability, and application modernisation. They will also help introduce approved AI-assisted engineering practices to improve documentation, operational efficiency, code quality, and incident response.
This role requires strong ownership, excellent technical judgement, and the ability to produce clear, audit-ready documentation that supports compliance, knowledge transfer, and operational continuity.
Responsibilities:
1. AWS Infrastructure Engineering
- Design, deploy, and manage AWS infrastructure supporting production financial workloads.
- Work with services including EC2, ECS/Fargate, RDS, Aurora, S3, VPC, Lambda, CloudWatch, Secrets Manager, IAM, DynamoDB, SQS, SNS, EventBridge, API Gateway, AWS IAM Identity Center, and QuickSight.
- Implement and maintain Infrastructure as Code using Terraform, CloudFormation, or AWS CDK.
- Manage infrastructure across multiple AWS accounts and environments.
- Build and optimise CI/CD pipelines using tools such as GitLab CI, GitHub Actions, AWS CodePipeline, or equivalent.
- Design and maintain secure networking, including VPCs, subnets, routing, security groups, NACLs, Site-to-Site VPN, Transit Gateway, and related controls.
- Implement monitoring, alerting, logging, and observability using CloudWatch, dashboards, metrics, alarms, and log aggregation.
- Execute cost optimisation initiatives across compute, storage, databases, data transfer, and managed services.
- Build and maintain AWS QuickSight dashboards for billing analytics, cost allocation, usage trends, and financial reporting.
- Support recurring FinOps activities including monthly spend reviews, rightsizing, Savings Plans / Reserved Instance analysis, and waste elimination.
2. Database Administration
- Administer PostgreSQL environments, including RDS for PostgreSQL, Aurora PostgreSQL, and self-managed PostgreSQL where applicable.
- Support PostgreSQL version lifecycle management, including supported production versions and upgrade planning.
- Manage replication, backup and recovery, point-in-time recovery, vacuuming, indexing, query tuning, and performance troubleshooting.
- Manage Amazon Aurora PostgreSQL clusters, including scaling, failover, parameter groups, monitoring, and Performance Insights.
- Administer SQL Server on Amazon RDS, including backup strategies, index maintenance, Query Store analysis, and parameter tuning.
- Plan and execute database migrations, including SQL Server to PostgreSQL migrations using AWS DMS and native database tooling.
- Implement database security controls, including encryption at rest and in transit, IAM authentication where appropriate, audit logging, access control, and secrets management.
3. Linux Systems Administration
- Manage Amazon Linux 2023 and RHEL-based systems.
- Perform patching, hardening, performance tuning, log management, and operational troubleshooting.
- Administer Apache HTTP Server, including virtual hosts, SSL/TLS, module configuration, and runtime troubleshooting.
- Manage systemd services, EBS/EFS storage, filesystem permissions, and automated deployment workflows.
- Write and maintain Bash scripts for automation, monitoring, deployment, and operational tasks.
- Implement host-level security controls including SSH hardening, firewall rules, least-privilege access, log forwarding, and vulnerability remediation.
4. AI-Assisted Engineering and Automation
- Use approved AI-assisted engineering tools to improve operational workflows, documentation, code review, and troubleshooting.
- Work with Amazon Bedrock and supported large language models, including Anthropic Claude models where approved for company use.
- Use Kiro, AWS's agentic coding service, where appropriate to support spec-driven development, documentation, testing, and implementation planning.
- Evaluate AI-assisted tooling for infrastructure operations, anomaly detection, documentation generation, and incident response support.
- Build automation that improves alert enrichment, operational insight, and repeatable engineering workflows.
- Ensure AI usage follows company security, data protection, privacy, and compliance policies.
5. Application Modernisation and Rearchitecture
- Contribute to the rearchitecture of monolithic transactional applications into modern, resilient AWS-native patterns.
- Support containerisation strategies using Docker, ECS, and Fargate.
- Design and implement event-driven patterns using SQS, SNS, and EventBridge.
- Support blue/green and canary deployment strategies for safer releases and reduced downtime.
- Improve application reliability, scalability, observability, and operational maintainability.
6. Documentation, Security, and Compliance
- Produce clear technical documentation for completed work, including architecture decisions, runbooks, migration plans, configuration records, and operational procedures.
- Maintain standard operating procedures and incident response runbooks.
- Support ISO 27001 audit activity by providing evidence of infrastructure controls, change history, access controls, monitoring, and operational procedures.
- Contribute to change management processes, including risk assessment, implementation planning, rollback planning, and CAB submissions.
- Apply least-privilege, secure-by-design, and auditability principles across all infrastructure and operational work.
Working conditions:
- Participation in an on-call rotation for production systems, expected to be approximately 1 week in 4.
- Occasional out-of-hours work for planned maintenance, release activity, and incident response.
- Minimal travel, primarily between CluneTech office locations.
Requirements:
- 5+ years in DevOps, SRE, Cloud Engineering, or Infrastructure Engineering, including at least 3 years of hands-on AWS experience.
- Deep hands-on experience with EC2, RDS, Aurora, ECS/Fargate, S3, VPC, IAM, Lambda, CloudWatch, Secrets Manager, and QuickSight.
- Strong Terraform, CloudFormation, or AWS CDK experience.
- Experience building and maintaining deployment pipelines using GitLab CI, GitHub Actions, AWS CodePipeline, or equivalent.
- Strong PostgreSQL administration experience, including replication, performance tuning, backup and recovery, upgrades, and pg_dump / pg_restore.
- Working knowledge of SQL Server on RDS, including query optimisation, index management, backups, and maintenance tasks.
- Advanced Linux administration experience, preferably with Amazon Linux and RHEL-based distributions.
- Strong Bash scripting and automation skills.
- Practical experience with VPCs, routing, subnets, security groups, NACLs, VPNs, and Transit Gateway.
- Strong understanding of IAM, encryption, secrets management, patching, access control, and least-privilege design.
- Practical interest or experience in AI-assisted engineering, Amazon Bedrock, Kiro, LLMs, prompt engineering, or AI-supported development workflows.
- Ability to produce clear, complete, audit-ready technical documentation.
- Excellent written and spoken English, with the ability to explain complex technical topics to varied audiences.
Desirable skills:
- Experience in regulated financial services, payments, banking, or similar environments.
- ISO 27001, SOC 2, PCI DSS, or other audit / compliance exposure.
- AWS Solutions Architect Professional, AWS DevOps Engineer Professional, AWS Database Specialty, or equivalent.
- Cross-engine migration experience, especially SQL Server to PostgreSQL using AWS DMS or native tooling.
- Direct Connect, Transit Gateway, Site-to-Site VPN, IPAM, and multi-account networking.
- AWS cost optimisation, CUR analysis, QuickSight billing dashboards, Savings Plans, Reserved Instances, and tagging strategies.
- Docker, ECS task definitions, Fargate, service discovery, and service mesh concepts.
- Amazon Bedrock, Kiro, SageMaker, Claude models, AI-assisted development, or AI-enabled operational automation.
- Experience with log aggregation, distributed tracing, SLOs, dashboards, and incident response workflows.
Benefits:
What’s in it for you?
- Flexible Working Policy with a combination of remote and office working*
- Multisport Cards
- 22 Annual Leave Days for all employees plus additional days with length of service
- Lifework days: ½ day each quarter to get your life work done
- Flex hours where you can work your 40 hours across 4.5 days
- Additional Health Insurance including dental and vision care
- Team Buildings & Parties
- Life Insurance
- Refer a Friend Bonus
- Discounts across various stores, theatres, restaurants