Reinforcement Learning Research Engineer

Google DeepMind

London, UK | Hybrid | $200,000 - $350,000 | 3 weeks ago

full-time, senior, gemini, custom

About the Role

Google DeepMind is hiring a Senior RL Research Engineer to work on reinforcement learning for language model alignment and reasoning. You will develop new RL algorithms that improve the capabilities and safety of our foundation models. This role involves designing reward models, implementing RL training loops at scale, and developing new techniques for improving model reasoning through self-play and multi-agent interactions. The ideal candidate has deep expertise in RL theory and practice, combined with strong engineering skills for implementing algorithms at scale.

Requirements

- 5+ years in RL research or engineering
- Strong background in RL theory and algorithms
- Experience with RLHF and language model alignment
- Expert JAX or PyTorch skills
- Published research in RL or related areas
- PhD in ML/RL preferred

Required Skills

Python, JAX, PyTorch, RLHF

About Google DeepMind

Building AI systems that can solve complex problems and advance scientific discovery.
