San Francisco, CARemote$280,000 - $400,0001 months ago
full-timeseniorclaudecustom
About the Role
Help develop safer AI systems through Anthropic's Constitutional AI approach. Work on methods to train AI systems that are helpful, harmless, and honest.
Your work will include:
- Researching new techniques for AI alignment
- Developing evaluation methods for AI safety
- Contributing to the development of Claude's personality and values
- Publishing research that advances the field of AI safety
Requirements
- PhD in ML, AI, or related field
- Deep understanding of language model training
- Experience with RLHF or related techniques
- Strong research track record
Required Skills
PythonPyTorchRLHFFine-tuning
About Anthropic
AI safety company building reliable, interpretable, and steerable AI systems. Makers of Claude.