Posted Jan 27
Research Engineer, Machine Learning
at Mistral AI
On-site
Responsibilities
- Accelerate researchers by taking on the heavy parts of large-scale ML pipelines and building robust tools.
- Conduct experiments on the latest deep-learning techniques (sparsified 70B+ runs, distributed training on thousands of GPUs).
- Design, implement and benchmark ML algorithms; write clear, efficient code in Python.
- Deliver prototypes that become production-grade components for Le Chat and our enterprise API.
About Mistral
- At Mistral AI, we believe in the power of AI to simplify tasks, save time, and enhance learning and creativity.
- We democratize AI through high-performance, optimized, open-source and cutting-edge models, products and solutions.
- Our comprehensive AI platform is designed to meet enterprise as well as personal needs.
- Our offerings include Le Chat, La Plateforme, Mistral Code and Mistral Compute - a suite that brings frontier intelligence to end-users.
- We are a dynamic, collaborative team passionate about AI and its potential to transform society.
- Join us to be part of a pioneering company shaping the future of AI.
- See more about our culture on https://mistral.ai/careers.
- As a Research Engineer – ML track, you’ll build and optimise the large-scale learning systems that power our open-weight models.
Requirements
- Master’s or PhD in Computer Science (or equivalent proven track record).
- Hands-on with PyTorch, JAX or TensorFlow; comfortable with distributed training (DeepSpeed / FSDP / SLURM / K8s).
Experience
- 4+ years working on large-scale ML codebases.
- Experience in deep learning, NLP or LLMs; bonus for CUDA or data-pipeline chops.