Posted 5 hours ago
Senior Platform Engineer
at Attio
Poland, Remote
Responsibilities
- Drive a culture of blameless post-mortems, ensuring root causes are identified, and long-term preventative measures are implemented as code (e.g., via runbooks, automation, or system design changes).
- Tooling & Automation: Own the stack of supporting tools necessary for operational excellence and developer enablement, including:
  - Continuous Integration and Continuous Delivery (CI/CD) Pipelines: Implement, maintain, and evolve the fully automated CI and CD pipelines.
Attio is the CRM built for the AI era. Designed for the most ambitious go-to-market teams, it gives companies the power to understand every customer, automate at scale, and build their go-to-market motion exactly as they need.
Requirements
WHAT YOU'LL BRING
- Applied DevOps and SRE Principles:
  - Must have: Demonstrable, hands-on experience applying core DevOps and Site Reliability Engineering (SRE) principles to manage, monitor, and scale production systems.
  - Must have: A deep understanding of the SRE mindset, including SLO/SLA creation and monitoring, error budget management, toil reduction, and post-incident review (blameless postmortems).
  - Desirable: Proven ability to drive cultural and process change that fosters a collaborative approach between development and operations teams.
- Cloud Infrastructure and Containerisation Expertise:
  - Must have: Expertise in one or more major public cloud providers (AWS, GCP, or Azure), encompassing network configuration, security best practices (IAM, security groups, etc.), compute services (EC2, GKE, ECS, etc.), and managed services (databases, queues, serverless functions).
  - Must have: In-depth knowledge of container technologies, specifically Docker, and extensive experience orchestrating them at scale using Kubernetes (K8s). This includes designing, deploying, and managing Kubernetes clusters, and understanding networking (CNI), storage (CSI), and security configurations within the Kubernetes ecosystem.
- Automation and Programming Skills:
  - Must have: Proficiency in one or more modern software languages (e.g., TypeScript, Go, Python, Rust) and associated frameworks used for building high-performance, resilient production systems.
  - Must have: Proven experience developing robust, maintainable, and well-tested automation scripts, services, and pipelines to manage infrastructure, deployments, and operational tasks.
- Operational Tooling and Observability Management:
  - Must have: