infrastructure
Posted 3 weeks ago
Site Reliability Engineer
at Alpaca
Remote
We're deeply committed to open-source contributions and fostering a vibrant community, continuously enhancing our award-winning, developer-friendly API and the robust infrastructure behind it.

As a Site Reliability Engineer at Alpaca, you'll help keep our brokerage platform reliable, observable, and operable as we grow, working across our cloud infrastructure, Kubernetes platform, observability stack, messaging layer, and data layer.

We're especially interested in candidates with strong PostgreSQL fundamentals who'd like to grow into deeper ownership of our database reliability posture. PostgreSQL sits on the trading-critical path, and we want this person to spend a meaningful share of their time leveling it up while still being a well-rounded SRE the rest of the week.

Responsibilities
- Operate production day-to-day: on-call, incident response, postmortems, and the follow-ups that actually close the loop.
- Own reliability practice: define and refine SLIs/SLOs and error budgets, and help product teams live within them.
- Ship infrastructure through code in a GitOps workflow, covering cloud resources and Kubernetes workloads alike.
- Look after PostgreSQL: performance tuning, schema and migration review, online migrations on large tables, HA/DR, and CDC pipelines.
- Mentor engineers on reliability and database fundamentals through code review, design review, and pairing.

Requirements
Who You Are (must-haves)
- Experience operating production services on Kubernetes, and shipping infrastructure as code in a GitOps workflow.
- Solid working knowledge of PostgreSQL in production: query plans, pg_stat_*, indexing and schema trade-offs, and what a safe online migration looks like on a non-trivial table.
- Cloud networking fundamentals (VPCs, routing, L4/L7 load balancing, DNS, TLS) and comfort debugging cross-service connectivity.
- Comfortable with a modern observability stack and proficient with Linux at the operator level.
- At least working proficiency in Go or Python, plus strong written and verbal communication.
- Genuine interest in databases and in growing your PostgreSQL/DBA expertise.

Who You Might Be (Nice-to-Haves): Deeper PostgreSQL
- Experience with typed SQL access layers in Go (e.g. pgx, gorm, sqlc). Production