engineering
Posted Apr 14Software Engineer - RL Environments
at AfterQuery
San Francisco, United StatesOn-site
$200,000
Responsibilities
- What You'll Do - Design data slides and explore data shapes that expose meaningful model failure modes across domains like finance, code, and enterprise workflows - Build and refine evaluation rubrics and reward signals for RLHF and RLVR training pipelines - Model annotator behavior and run experiments to improve different model capabilities - Develop quantitative frameworks for measuring dataset quality, diversity, and downstream impact on model alignment and capability - Create and manage both real world
Requirements
- We serve every frontier AI lab with the mission of delivering the best data to power the best models.
- This is a rare opportunity to join a company at a defining moment in AI.
- We're based in San Francisco and backed by leading investors including Altos Ventures, BoxGroup, and Y Combinator and angels from Google DeepMind, OpenAI, Anthropic, Meta Superintelligence Labs, and Microsoft AI and are based in San Francisco.
- You'll work hands-on with research teams at top AI labs, experimenting with data collection strategies, diagnosing model failure modes, and developing the metrics that determine whether a model is actually improving.
Benefits
- Since raising our $30M Series A at a $300M valuation, AfterQuery has grown well over a $100M revenue run rate.
- Compensation Structure: $200k base + profit share (around 150% of base) + competitive equity
Contact
- ABOUT AFTERQUERY AfterQuery https://www.afterquery.com/ is an applied research lab curating data solutions for foundation model development.
Additional details
- In doing so, we can make expertise that once took a lifetime to build available to anyone who needs it.