We are hiring a Senior DevOps Engineer to help scale and evolve a large, production-heavy platform running on AWS and GCP.
You will work on a high-scale Kubernetes environment (EKS) supporting 1,000+ microservices, driving improvements across developer experience, CI/CD, observability, and cost efficiency.
This role is part of a newly formed DevOps team structure that splits ownership into domains and modernizes tooling and workflows.
What You'll Do
- Design and operate large-scale Kubernetes platforms on AWS (EKS), including scaling, networking, security, and reliability.
- Improve developer experience across environments: faster builds, better feedback loops, and streamlined workflows.
- Lead CI/CD modernization: migrate and standardize pipelines (e.g., CircleCI → GitHub Actions, Jenkins → ArgoCD); build scalable, reusable pipeline patterns.
- Own and evolve GitOps workflows using ArgoCD.
- Build and maintain infrastructure with Terraform (modular, production-grade patterns).
- Drive observability improvements using tools like Groundcover, Prometheus, and Grafana.
- Optimize cloud usage and costs across AWS and GCP (including BigQuery).
- Work closely with engineering teams to support high-scale distributed systems (MongoDB, Elasticsearch, RDS).
- Contribute to domain ownership as part of a newly structured DevOps organization.
- Participate in production troubleshooting, incident response, and system reliability improvements.
Requirements
- 6+ years in DevOps / SRE / Platform Engineering.
- Strong Kubernetes (EKS) experience in large-scale production environments.
- Proven experience building and operating CI/CD systems at scale.
- Hands-on experience with GitHub Actions, Jenkins, and GitOps (ArgoCD).
- Strong Terraform expertise (modules, environments, team workflows).
- Solid AWS experience (networking, IAM, compute, managed services); familiarity with GCP (BigQuery).
- Experience with observability stacks (Prometheus, Grafana; modern tools like Groundcover are a plus).
- Strong Linux and networking fundamentals.
- Experience working in microservices-heavy environments (hundreds of services or more).
Nice to Have
- Experience restructuring DevOps teams or working in domain-based ownership models.
- Cost optimization experience and a FinOps mindset.
- Experience with high-scale data systems (Elasticsearch, MongoDB).
- Background supporting AI/ML or data-heavy workloads.
- Familiarity with modern observability platforms beyond traditional ELK stacks.