San Francisco, CAHybrid$200,000 - $350,0002 weeks ago
full-timeseniorclaudecustom
About the Role
We are looking for a Senior Machine Learning Engineer to help build and improve the next generation of AI systems at Anthropic. You will work on training large language models, developing new alignment techniques, and improving the reliability and safety of our AI systems.
You will collaborate closely with research scientists and other engineers to translate research insights into production-ready systems. This role requires deep expertise in modern ML infrastructure, distributed training, and a strong understanding of transformer architectures.
As part of the team, you will help design and implement training pipelines that operate at scale across thousands of GPUs, optimize model performance, and develop tools that accelerate research iteration cycles.
Requirements
- 5+ years of experience in machine learning engineering
- Strong proficiency in Python and PyTorch
- Experience with distributed training (DeepSpeed, FSDP, or Megatron)
- Familiarity with CUDA and GPU optimization
- Experience training large transformer models
- Strong understanding of RLHF and alignment techniques
- MS or PhD in Computer Science, Machine Learning, or related field preferred
Required Skills
PythonPyTorchDistributed TrainingCUDARLHF
About Anthropic
AI safety company building reliable, interpretable, and steerable AI systems. Makers of Claude.