Paris, France | Onsite | $180,000 - $300,000 | 1 month ago
full-time, senior, mistral, mixtral, custom
About the Role
Mistral AI is looking for a Foundation Model Engineer to work on training our next generation of open and efficient language models. You will develop training pipelines, implement novel architectures, and optimize our models for both capability and efficiency.
As one of Europe's leading AI labs, Mistral AI offers the opportunity to work on frontier models that are used by developers and enterprises worldwide. You will help define the direction of open-weight AI models.
The ideal candidate has experience training large models and is passionate about making AI more accessible and efficient.
Requirements
- 5+ years of ML engineering experience
- Experience training large language models from scratch
- Expert PyTorch and CUDA skills
- Experience with distributed training frameworks
- Understanding of model architecture design
- PhD in ML/CS preferred
Required Skills
Python, PyTorch, CUDA, Distributed Training, Transformers
About Mistral AI
European AI lab building open and efficient language models.