infrastructure
Posted Jan 30Senior Site Reliability Engineer
at Stellar3
San Francisco, United StatesRemote
Responsibilities
- - Monitor, triage and respond to alerts in our high availability environments.
Requirements
- Interested in working on cutting-edge blockchain technology and creating equitable access to the global financial system? Since 2014, the mission-driven team at the Stellar Development Foundation (SDF) has helped fuel the tremendous growth of the Stellar blockchain network, an open-source platform that operates at high-scale today.
- you will: - Maintain, improve, scale and secure our AWS/GCP infrastructure and Linux systems.
- - Build, maintain, monitor and improve our Kubernetes clusters.
- - Work with development teams on migrating applications to Kubernetes.
- - Be responsible for maintenance and improvements to multiple internal services, for example Kubernetes, Prometheus, ELK.
- experience of working in cloud-based systems operations, as a SRE or DevOps engineer. - First-hand
- experience with configuration management and infrastructure as code (Ansible, Puppet, Terraform).
- - A strong understanding of computer networking, TCP/UDP, load balancing, distributed computing, web services, and the fundamental protocols used by the internet (HTTP, HTTPS, DNS, etc.).
- - Proficiency in at least one programming language.
- Approachable, empathetic, and proactive in promoting collaboration and innovation. - Excels in working independently, demonstrating the ability to accomplish tasks without constant monitoring. - Production
- experience building and maintaining Kubernetes clusters.
Experience
- You have: - 5+ years of
Benefits
- Bonus Points if: - Ability to understand Go, Rust, C++ and TypeScript source code -
- Experience experimenting with AI-driven approaches to operations We offer competitive pay with a base salary range for this position of 165,000 - $235,000 depending on job-related knowledge, skills, experience, and location.