infrastructure
2 hours ago
Staff Site Reliability Engineer, Data Platform
at Visa
📍 Austin, United States · 🏢 Remote
Requirements
- Job Description: The Staff Site Reliability Engineer (Azure) is responsible for designing, building, and evolving cloud-native, containerized infrastructure on Microsoft Azure that powers our data products and services.
- As a Staff Engineer, you will bring deep expertise in Azure cloud architecture, Azure infrastructure implementation, systems design, networking, databases, and modern data technologies.
- Experience with complex technology adoption, infrastructure automation, and high-scale distributed systems, with a strong emphasis on building and operating secure, resilient, and scalable solutions in Microsoft Azure environments.
- Experience architecting, implementing, and optimizing Azure-based platforms and services, including cloud networking, compute, storage, identity and access management, observability, and container orchestration.
- The ideal candidate will be capable of leading the design and delivery of enterprise-grade cloud solutions using Azure-native and hybrid-cloud patterns, and of driving best practices for reliability, security, and operational excellence across the data platform.
- Remote positions may be required to be present at a Visa office with scheduled notice. Visa requires at least 3 days in office; expectations for these days will be confirmed by your Hiring Manager.
- Qualifications
- Basic: experience with a Bachelor's degree, or at least 2 years of work experience with an advanced degree (e.g. Masters, MBA, JD, MD), or 0 years of work experience with a PhD.
- Preferred: experience with a Bachelor's degree, or 4 or more years of relevant experience with an advanced degree (e.g. Masters, MBA, JD, MD), or up to 3 years of relevant experience with a PhD.
- Bachelor’s degree in Computer Science, Engineering, or related field (desirable but not mandatory).
- Technical Expertise:
- Advanced experience designing and operating large-scale, cloud-native infrastructure (AWS preferred).
- Strong hands-on proficiency with Infrastructure as Code (Terraform), including building reusable modules and platform-level components.
- Deep understanding of Kubernetes and container orchestration for data platforms and distributed systems.
- Knowledge of CI/CD systems, pipeline design, automation, and secure deployment practices.
- Understanding of database technologies including SQL, NoSQL, and data storage patterns.
- Experience with observability stacks (Prometheus, Grafana, OpenTelemetry, ELK/EFK, Datadog, or similar).