LH
LLMHire
Browse JobsAgentsNewSalary InsightsCompaniesBlogPricing

Never Miss an AI Job

Get weekly AI job alerts delivered to your inbox.

Join the AI hiring radar. Unsubscribe anytime.

LH
LLMHire

The #1 job board for AI & LLM engineers. Find your next role in the AI revolution.

Jobs

  • Browse Jobs
  • Companies
  • Job Alerts
  • Post a Job
  • Pricing

Resources

  • Blog
  • CyberOS.devScan code for vulnerabilities
  • EndOfCoding.comStay ahead with AI news
  • Vibe Coding AcademyLearn skills employers want
  • Vibe Coding Ebook22 chapters, 200+ prompts
  • Video Tutorials@endofcoding on YouTube

Company

  • About
  • Contact
  • Privacy
  • Terms

Contact

  • hello@llmhire.com
  • Get in Touch

© 2026 LLMHire. All rights reserved.

VeriduxLabsBuilt by VeriduxLabs
Back to all jobs
I

Member of Technical Staff – Model Training

Inflection AI
Palo Alto, CAOnsite4 days ago
full-timeseniorcustom

About the Role

<div class="content-intro"><p><strong>At Inflection AI, our public benefit mission is to harness the power of AI to improve human well-being and productivity.</strong></p> <p>The next era of AI will be defined by agents we trust to act on our behalf.&nbsp;</p> <p>We’re pioneering this future with human-centered AI models that unite emotional intelligence (EQ) and raw intelligence (IQ)—transforming interactions from transactional to relational, to create enduring value for individuals and enterprises alike.</p> <p>Our work comes to life in two ways today:</p> <p><a href="https://pi.ai" target="_blank">Pi, your personal AI</a>, designed to be a kind and supportive companion that elevates everyday life with practical assistance and perspectives.</p> <p><a href="https://developers.inflection.ai" target="_blank">Platform</a> — large-language models (LLMs) and APIs that enable builders, agents, and enterprises to bring Pi-class emotional intelligence into experiences where empathy and human understanding matter most.</p> <p>We are building toward a future of AI agents that earn trust, deepen understanding, and create aligned, long-term value for all.</p></div><h2>About the Role</h2> <p>As a Model Training engineer, you will design, build, and scale the post-training pipelines that turn a general LLM into a brand-fluent, production-ready assistant. Your innovations in fine-tuning and preference optimization (RLHF, DPO, GRPO, RLAIF) will directly improve reliability, alignment, and cost.</p> <p><strong>This is a good role for you if you:</strong></p> <ul> <li>Have hands-on experience training and fine-tuning large transformer models on multi-GPU / multi-node clusters.</li> <li>Are fluent in PyTorch and its ecosystem tools (Torchtune, FSDP, DeepSpeed) and enjoy digging into distributed-training internals, mixed precision, and memory-efficiency tricks.</li> <li>Have shipped or published work in RLHF, DPO, GRPO, or RLAIF and understand their practical trade-offs.</li> <li>Care deeply about training tools, pipelines, and reproducibility—you automate the boring parts so you can iterate on the fun parts.</li> <li>Balance research curiosity with product pragmatism—you know when to run an ablation and when to ship.</li> <li>Communicate crisply with both technical and non-technical teammates.</li> <li>Have a bachelor’s degree or equivalent in a related field to the offered position requirements.</li> </ul> <p><strong>Responsibilities include:</strong></p> <ul> <li>Contribute to end-to-end post-training workflows—dataset curation, hyper-parameter search, evaluation, and rollout—using PyTorch, Torchtune, FSDP/DeepSpeed, and our internal orchestration stack.</li> <li>Prototype and compare alignment techniques (e.g., curriculum RL, multi-objective reward modeling, tool-use fine-tuning) and push the best ideas into production.</li> <li>Automate training at scale: build robust pipeline components, tools, scripts, and dashboards so experiments are reproducible and easy to trace.</li> <li>Define the metrics that matter; run A/B tests and iterate quickly to meet aggressive quality targets.</li> <li>Collaborate with inference, safety, and product teams to land improvements in customer-facing systems.</li> </ul> <h2><strong>Employee Pay Disclosures</strong></h2> <p>At Inflection AI, we aim to attract and retain the best employees and compensate them in a way that appropriately and fairly values their individual contributions to the company. For this role, Inflection AI estimates a starting annual base salary to fall within the range of <strong>$</strong><strong>175,000</strong><strong> to $</strong><strong>350,000</strong>, depending on a candidate’s qualifications and level of experience. This role also includes a meaningful equity component, allowing employees to share in the long-term success of the company.<br><br></p> <h3><strong>Benefits</strong></h3> <p>Inflection AI values and supports our team’s mental and physical health. We are focused on building a positive, safe, inclusive and inspiring place to work. Our benefits include:&nbsp;</p> <ul> <li>Diverse medical, dental and vision options&nbsp;</li> <li>401k matching program&nbsp;</li> <li>Unlimited paid time off&nbsp;</li> <li>Parental leave and flexibility for all parents and caregivers</li> <li>Support of country-specific visa needs for international employees living in the Bay Area</li> </ul>

Required Skills

PyTorchNode.jsRustRAGRLHFFine-tuningAgent Orchestration

About Inflection AI

Creating personal AI for everyone. Makers of Pi.

Visit Company Website

Ready to Apply?

Join Inflection AI and work on cutting-edge AI technology