data
Posted Oct 7, 2025Lead AI Engineer
at Sphere
On-site
$20K
Requirements
- Our AI (TRAM) ingests global trade law, interprets it, resolves conflicts across jurisdictions, and produces compliance determinations more reliable than human experts.
- THE ROLE You'll lead development of TRAM, our proprietary AI reasoning model that reads and interprets global trade law.
- WHAT YOU'LL DO Within weeks: - Lead development of new features aimed at increasing TRAM’s test-time accuracy - Work on the underlying data and retrieval pipelines that help power our AI workflows - Work directly with our internal tax experts to understand how TRAM can better reason like them Within months: - Own TRAM’s eval framework and workflows - Work directly with leading frontier labs to reinforce fine tune models on our proprietary data
- experience building AI enabled products, particularly RAG systems -
- Experience fine tuning base models, ideally via RFT - Willingness to dive into tax technical problems - if you aren’t willing to dive deep on how the model should reason through the tax research process you won’t be effective - A strong understanding of how LLMs and reasoning models function NICE TO HAVES -
- Experience working with LLMs on legal applications -
- Experience with RAG data pipelines and collecting/curating data for the pipeline WHO YOU ARE You'll thrive here if: - You're a Dog.
Benefits
- We handle the entire lifecycle — calculation, registration, filing, remittance — at millisecond latency with zero downtime. - Backed by a16z and YC. $21M Series A, 30%+ month-over-month growth, customers include ElevenLabs, Replit, Deel, Runway, and Lovable. - Small team, global surface area.
- Expanding into input tax, withholding, e-invoicing, tariffs — each multiplies the complexity.
Additional details
- ABOUT SPHERE Every breakthrough in trade infrastructure has followed the same pattern: reduce a transaction cost, expand the market.
- Compliance is one of the last and largest — and the hardest, because trade rules aren't data to be looked up.
- They're a complex adaptive system with 190+ sovereign jurisdictions, in different languages, changing constantly, reacting to each other.
- Everyone owns a domain that would be a full team at a larger company.
- San Francisco, five days in office. - The problem keeps compounding.
- Tens of millions of transactions today, billions ahead.
- This isn't a lookup problem, it's a reasoning problem — and it only became solvable with LLMs.
- You'll build the data pipelines that ingest legal sources, the model stack that produces structured evidence, the evaluation frameworks that measure accuracy, and the fine-tuning loops that improve performance.
- The unusual constraint: you need speed, scale, correctness, and robustness simultaneously — at millisecond latency, zero downtime, heading toward billions of transactions where a single error costs a customer $20K.
- You've been underestimated, gone through struggle, and never stopped running.