jobloom

JobLoom finds jobs directly from company career sites before many job boards, then routes you into detailed role pages like this one.

infrastructure

Posted 3 hours ago

Senior Site Reliability Engineer

Responsibilities

  • - Build and improve the platform foundations that help engineering teams ship safely and quickly, including CI/CD, deployment workflows, and infrastructure automation.
  • - Design and operate resilient, observable systems that support our real-time ingestion, detection, and response workloads.
  • - Lead improvements in scalability, performance, and operational maturity across services and environments.
  • - Drive modernization of our infrastructure stack, including Kubernetes-based workloads, infrastructure as code, and standardized operational patterns. - Improve developer

Requirements

  • We sit at the intersection of cybersecurity and blockchains—an adversarial space where attacks are constant and response time matters.
  • We’re building a real-time on-chain detection and response platform that combines ML-driven threat intelligence, custom detections, and an LLM-powered rules engine to protect on-chain assets.
  • experience by creating internal tooling, paved roads, and clear operational standards so engineers can own services from development through production. - Partner closely with backend and platform engineers to reduce toil, prevent recurring classes of failure, and raise the reliability bar across the organization. - Take part in incident management, root cause analysis, and follow-through on systemic fixes—not just short-term mitigation.
  • experience in SRE, infrastructure engineering, platform engineering, or a closely related role with strong hands-on production ownership. - Strong
  • experience with Kubernetes, AWS, and infrastructure as code tools such as Terraform or Pulumi. - Strong understanding of observability, monitoring, debugging, and performance tuning for distributed systems. -
  • - Solid coding ability in Python, Go, or Rust, with the ability to build internal tools and automate operational workflows.
  • NICE TO HAVE - Web3 or blockchain experience, or strong interest in the space. -
  • Experience with real-time or streaming systems. - Background in security, fraud, risk, or other adversarial domains. -
  • TECHNOLOGIES WE USE - Languages: Python, plus Rust and Go in the platform. - Infrastructure: Kubernetes, AWS, Grafana, Pulumi. - Streaming and messaging: NATS. - Data stores: PostgreSQL, Redis, S3.
  • Real impact. - Small team, high ownership, close to customers. - Best of both worlds: startup pace with the backing and data advantage of the world’s leading blockchain intelligence company.
  • AI at Chainalysis AI is not a feature at Chainalysis - it is a new way of working.
  • As the world's most trusted blockchain analytics platform, Chainalysis sits at a rare intersection of proprietary data, regulatory relationships and crypto expertise that makes it uniquely placed to shape and lead the next era of AI-driven intelligence - and we expect everyone here, regardless of role, to be an active part of it.
  • AI fluency is tied directly to how we measure performance and how we plan to win.
  • We are not using AI to do less.
  • About Chainalysis Chainalysis is the blockchain data platform, making it easy to connect the movement of digital assets to real-world services.
  • Powered by deep blockchain data and AI, organizations can investigate illicit activity, manage risk exposure, and develop innovative market solutions built on the industry's most trusted blockchain intelligence.
  • Our mission is to build trust in blockchains, blending safety and security with an unwavering commitment to growth and innovation. You belong here.

Experience

  • WHAT WE’RE LOOKING FOR - 5+ years of

Contact

  • You can learn more here https://go.chainalysis.com/rs/503-FAP-074/images/Interview%20Accommodations%20Request.pdf.

Additional details

  • The Hexagate team builds cybersecurity products that protect on-chain and web3 assets.
  • What you’ll do - Own reliability as a product capability: define and evolve SLOs, alerting standards, incident response practices, and production readiness expectations across the platform.
  • experience building and operating cloud-native systems in production, ideally in high-growth SaaS, fintech, cybersecurity, or similarly demanding environments. - Deep practical
  • Experience building or maintaining CI/CD systems, deployment tooling, and operational automation.
  • - Sound judgment around reliability trade-offs, failure modes, and safe delivery practices.
  • - A collaborative mindset: you enjoy enabling product teams, mentoring engineers on operational best practices, and building a culture of shared ownership rather than acting as a ticket-based support function.
  • - High ownership and a bias toward durable solutions that make systems more stable, secure, and scalable over time.
  • Experience building internal developer platforms or defining reliability standards used across multiple teams.
  • One that turns instructions into work done, and helps us move faster than the threats we're built to counter, and we expect our employees to take ownership of the output and ensure quality.
  • We provide the tools, workflows, and space to experiment - but the expectation is that you develop these capabilities yourself, bring ideas, and collaborate across teams to reinvent the way work gets done.

Find more real-time jobs on JobLoom.