engineering
Posted Jul 10, 2025Software Engineer, Data Platform
at Notion
San Francisco, United StatesRemote
Responsibilities
- - Own critical data pipelines and services Design, implement, and harden batch and streaming pipelines (Spark, Kafka, etc.) that move and transform data reliably across regions.
- - Drive reliability and observability Raise the operational bar for our data stack: improve on-call experience, debugging, and alerting for data jobs and services.
Requirements
- In today's world of endless apps and tabs, Notion provides one place for teams to get everything done, seamlessly connecting docs, notes, projects, calendar, and email—with AI built in to find answers and automate work.
- WHAT YOU'LL DO: - Design and evolve the data lakehouse Build and operate core lakehouse components (e.g., Iceberg/Hudi/Delta tables, catalogs, schema management) that serve as the source of truth for analytics, AI, and search.
- Experience: 2+ years building and operating data platforms or large-scale infrastructure for SaaS or similar environments. - Programming: Strong skills in at least one of Python, Scala, or Typescript; comfortable working with SQL for analytics and data modeling. - Distributed data systems: Hands-on
- experience with Spark or similar distributed processing systems, including debugging and performance tuning. - Streaming & ingestion:
- Experience with Kafka or equivalent streaming systems; familiarity with CDC/ingestion patterns (e.g., Debezium, Fivetran, custom connectors). - Lakehouse / storage:
- Experience with data lakes and table formats (Iceberg, Hudi, or Delta) and/or data catalogs and schema evolution. - Operations: Comfortable owning services and pipelines in production, including on-call, incident response, and reliability improvements. NICE TO HAVES: -
- Experience working in an applied data platform setting, such as Trust and Safety, and/or directly with enterprise customers or on features like data residency, analytics product, EKM, or compliance-driven auditing. - Security & governance: Practical understanding of access control, encryption at rest/in transit, and auditing as they apply to data platforms.
- Prior work on Databricks, Unity Catalog, Lake Formation, or similar catalog/governance systems. -
- Experience designing or improving observability for data platforms (e.g., Honeycomb, OpenTelemetry, metrics/trace-heavy debugging).