Posted Jun 2, 2026

Lead Engineer, DevOps – SRE

Job Description:
• Own and evolve Launch Potato's cloud infrastructure, CI/CD platform, and compliance posture.
• Build the SRE function from the ground up so product teams can ship faster without compromising reliability, security, or cost control.
• Stand up the SRE practice from scratch: on-call rotation, PagerDuty configuration, SLA/SLO definitions for core infrastructure services, runbook library, and observability dashboards that tie site performance to business metrics.
• Complete the AWS multi-account migration: move production workloads to an isolated account with zero unplanned downtime.
• Deliver a SOC 2 Type I audit-ready infrastructure evidence package: own the technical controls implementation end-to-end.
• Version and publish the Terraform module library (30+ modules) to a private registry to eliminate ad hoc git consumption by product teams.
• Implement automated deployment rollback for ECS and Lambda: gate production deployments on passing integration tests.
• Stand up monthly cost reporting to leadership: budget anomaly detection, savings plan recommendations, and spend by service, team, and environment.

Requirements:
• 5+ years of production AWS infrastructure experience with deep Terraform expertise.
• Hands-on experience building an SRE function from scratch with complete ownership.
• Experience at a multi-site company where PaaS or microservices are required.
• CI/CD pipeline ownership in one or more previous roles.
• PagerDuty experience, including standing up an on-call rotation.

Benefits:
• Profit-sharing bonus
• Competitive benefits