Build production AI systems at Meta AI. Work on distributed systems infrastructure that powers our products and serves millions of users.
What you'll do:
- Design and implement scalable distributed systems systems
- Optimize performance and reliability
- Collaborate with research teams to deploy new capabilities
- Build tools and frameworks for internal teams
- Participate in on-call rotations and incident response
Requirements
- 10+ years of experience in software engineering or ML
- Strong programming skills in Python, PyTorch, Distributed Training
- Experience with machine learning frameworks and tools
- Track record of delivering complex projects
- Experience mentoring and leading teams
- Strong communication and collaboration skills
- Publications in top venues preferred
- Industry recognition and thought leadership
- Experience setting technical strategy
Required Skills
PythonPyTorchDistributed Training
About Meta AI
Building the future of human connection through AI. Makers of Llama and open source AI tools.