Optimize the platform for distributed training, model evaluation, and batch/real-time inference.
Ensure consistent, low-latency feature delivery, preventing data leakage between training pipelines and real-time production inference.
Design, build, and secure high-throughput APIs, data connectors, and reverse-ETL patterns to sync data back into business systems (e.g., CRMs, ERPs, marketing automation).
Compute Optimization for Data Science: Configure and optimize compute engines tailored for heavy mathematical and data science workloads (e.g., Ray, Spark/EMR GPU instances).
Requirements
We’re obsessed with making hard work pay off, for our people, our customers, and the world around us. As a Fortune 500 company and a leading AI platform for managing people, money, and agents, we’re shaping the future of work so teams can reach their potential and focus on what matters most.
Operating across two specialized verticals—Data Engineering (ingestion, enrichment, and semantic layers) and Data Platform (foundational infrastructure, security architecture, and AI enablement)—we are designing a scalable, cloud-native foundation from the ground up.
About the Role We are seeking a highly specialized Senior Data Engineer - Cybersecurity to serve as the Subject Matter Expert (SME) for AI/ML and Platform Integration .
Your primary mission is to optimize our data platform to serve as a high-performance engine for Data Science, Machine Learning (ML), and Generative AI (GenAI) workloads.
Responsibilities AI/ML Data Infrastructure & Tooling: Design, provision, and maintain the platform infrastructure required for end-to-end machine learning lifecycles.
Vector Infrastructure for GenAI: Architect and maintain vector databases and indexing pipelines required to support Large Language Models (LLMs), Retrieval-Augmented Generation (RAG) patterns, and semantic search.
MLOps Collaboration & Automation: Partner closely with Data Scientists and MLOps teams to establish CI/CD automation for ML (MLOps).
About You Basic Qualification
experience with AWS SageMaker , MLflow, or equivalent cloud-native ML platforms. Feature Stores & Vector DBs: Proven
experience implementing feature store frameworks (e.g., Feast, SageMaker Feature Store) and vector databases (e.g., Pinecone, Milvus, Qdrant, or Pgvector). Distributed Compute & ML Libraries: Strong
experience using Apache Spark / AWS EMR , Ray, or Dask to process massive datasets for feature extraction and model preparation.
Integration Patterns: Expert knowledge of building rest APIs, Webhooks, and utilizing streaming tools (e.g., AWS Kinesis, Kafka) for real-time integration.
Languages & CI/CD: Advanced proficiency in Python (including ML ecosystems like Pandas, NumPy, Scikit-Learn) and SQL . Extensive
experience with GitHub Actions, GitLab CI, or Jenkins for data/ML pipelines. Other
Experience deploying and fine-tuning open-source LLMs or orchestrating AI agents using frameworks like LangChain or LlamaIndex.
Experience with reverse-ETL tools (e.g., Census, Hightouch) or enterprise integration platforms.
Experience
Experience: 5+ years of data engineering experience, with at least 2+ years dedicated to supporting machine learning platforms, MLOps, or complex platform integrations. ML Data Stack: Deep hands-on
Benefits
In return, we’ll give you the trust to take risks, the tools to grow, the skills to develop and the support of a company invested in you for the long haul.
Workday Pay Transparency Statement The annualized base salary ranges for the primary location and any additional locations are listed below.
Workday pay ranges vary based on work location.
As a part of the total compensation package, this role may be eligible for the Workday Bonus Plan or a role-specific commission/bonus, as well as annual refresh stock grants.
Each candidate’s compensation offer will be based on multiple factors including, but not limited to, geography, experience, skills, job duties, and business need, among other things.
Primary Location: USA.VA.Reston Primary Location Base Pay Range: $159,600 USD - $239,400 USD Additional US Location(s) Base Pay Range: $144,400 USD - $258,000 USD Our Approach to Flexible Work With Flex Work, we’re combining the best of both worlds: in-person time and remote.
Additional details
Not just in the products we build, but in how we show up for each other.
Our culture is rooted in integrity, empathy, and shared enthusiasm.
We’re in this together, tackling big challenges with bold ideas and genuine care.
We look for curious minds and courageous collaborators who bring sun-drenched optimism and drive.
Whether you're building smarter solutions, supporting customers, or creating a space where everyone belongs, you’ll do meaningful work with Workmates who’ve got your back.
So, if you want to inspire a brighter work day for everyone, including yourself, you’ve found a match in Workday, and we hope to be a match for you too.
About the Team We are a newly formed, forward-looking Cybersecurity Data Engineering & Enablement Team driving the future of our enterprise defense strategy.
Our mission is to build a next-generation, centralized data lakehouse that unifies all security telemetry into a single, high-performance ecosystem.
By combining cutting-edge data architecture with advanced analytics, we empower our threat hunters, data scientists, and incident responders with the real-time, trusted intelligence needed to protect the enterprise at scale.
This critical role sits at the intersection of core data platform infrastructure, advanced analytics, and external system integrations.