infrastructure
Posted Aug 12, 2025Senior Software Engineer, Infrastructure
at Retell AI
On-site
You are nearing today's limit. Upgrade for unlimited access.
Responsibilities
- RESPONSIBILITIES - Own CI/CD end-to-end: design, implement, and operate pipelines with blue/green, canary, and phased rollouts; define graceful draining for HA systems.
- - Implement robust observability (metrics, logs, traces), SLOs/error budgets, and automated rollback/one-click restore.
- - Lead incident response & postmortems; drive resilience, cost, and performance improvements.
- - Build production-grade CI/CD with GitHub Actions / GitLab CI / Jenkins (or similar), including complex rollout strategies.
- System Design: Architect a production-ready system. 3.
Requirements
- ABOUT RETELL AI Retell AI is using first principles to reimagine the call center with cutting-edge voice AI.
- Thousands of companies now utilize Retell’s AI voice agents to handle sales, support, and logistics calls that once required large teams of human agents.
- Instead of basic automation that needs constant human tuning, we’re creating intelligent AI “workers” that can act as frontline agents, QA analysts, and managers — continuously executing, monitoring, and improving customer interactions.
- We’re growing quickly and looking for ambitious builders who want to tackle hard technical problems, move fast, and have real impact at one of the fastest-growing voice AI startups.
- Let’s build the future together. - We’re a top 50 AI app in a16z list: https://tinyurl.com/5853dt2x - #4 on Brex's Fast-Growing Software Vendors of 2025: https://www.brex.com/journal/brex-benchmark-december-2025 - We're also one of the top ranking startups on: https://leanaileaderboard.com/ - Enterprise tech 30: https://www.wing.vc/et30/overview
- - Architect, maintain, and harden Kubernetes-based runtime (Docker, Kubernetes, Helm), including multi-cluster and multi-tenant concerns.
- - Manage cloud deployments across AWS/Azure/GCP and coordinate with on-prem infrastructure teams; standardize with IaC (e.g., Terraform).
- - Partner with compliance to integrate SOC 2 / ISO 27001 / HIPAA controls into pipelines (artifact signing, SBOMs, change management, access/keys).
- experience with a major cloud (AWS, Azure, or GCP) and container orchestration (Kubernetes, Helm).
- - Are comfortable with networking fundamentals, security hardening, and performance tuning.