data
Posted Aug 29, 2024Research Engineer, Machine Learning - Paris/London/Zurich/Warsaw
at Mistral AI
Paris, FranceOn-site
Responsibilities
- Accelerate researchers by taking on the heavy parts of large-scale ML pipelines and building robust tools.
- Conduct experiments on the latest deep-learning techniques (sparsified 70 B + runs, distributed training on thousands of GPUs).
- Design, implement and benchmark ML algorithms; write clear, efficient code in Python.
- Deliver prototypes that become production-grade components for Le Chat and our enterprise API. About you
Requirements
- Master’s or PhD in Computer Science (or equivalent proven track record).
- Hands-on with PyTorch, JAX or TensorFlow; comfortable with distributed training (DeepSpeed / FSDP / SLURM / K8s).
Experience
- 4 + years working on large-scale ML codebases.
Benefits
- Experience in deep learning, NLP or LLMs; bonus for CUDA or data-pipeline chops.
Additional details
- Interface cutting-edge research with production: integrate checkpoints, streamline evaluation, and expose APIs.
- Strong software-design instincts: testing, code review, CI/CD.