LH
LLMHire
Browse JobsAgentsNewSalary InsightsCompaniesBlogPricing

Never Miss an AI Job

Get weekly AI job alerts delivered to your inbox.

Join the AI hiring radar. Unsubscribe anytime.

LH
LLMHire

The #1 job board for AI & LLM engineers. Find your next role in the AI revolution.

Jobs

  • Browse Jobs
  • Companies
  • Job Alerts
  • Post a Job
  • Pricing

Resources

  • Blog
  • CyberOS.devScan code for vulnerabilities
  • EndOfCoding.comStay ahead with AI news
  • Vibe Coding AcademyLearn skills employers want
  • Vibe Coding Ebook22 chapters, 200+ prompts
  • Video Tutorials@endofcoding on YouTube

Company

  • About
  • Contact
  • Privacy
  • Terms

Contact

  • hello@llmhire.com
  • Get in Touch

© 2026 LLMHire. All rights reserved.

VeriduxLabsBuilt by VeriduxLabs
Back to all jobs
x

Member of Technical Staff - Applied Inference

xAI
Palo Alto, CAOnsite4 days ago
full-timeseniorcustom

About the Role

<div class="content-intro"><h3><strong><span style="font-family: arial, helvetica, sans-serif;">About xAI</span></strong></h3> <p><span style="font-family: arial, helvetica, sans-serif;">xAI’s mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge. </span><span style="font-family: arial, helvetica, sans-serif;">Our team is small, highly motivated, and focused on engineering excellence. This organization is for individuals who appreciate challenging themselves and thrive on curiosity. </span><span style="font-family: arial, helvetica, sans-serif;">We operate with a flat organizational structure. All employees are expected to be hands-on and to contribute directly to the company’s mission. Leadership is given to those who show initiative and consistently deliver excellence. Work ethic and strong prioritization skills are important. </span><span style="font-family: arial, helvetica, sans-serif;">All employees are expected to have strong communication skills. They should be able to concisely and accurately share knowledge with their teammates.</span></p></div><h3>Responsibilities</h3> <ul> <li>Architect and implement scalable distributed infrastructure for model serving, such as load balancing, auto scaling, batch scheduling, and global KVcache systems.</li> <li>Ensure the reliability of inference services, targeting 100% uptime, a 0% error rate, and good tail performance, through proactive monitoring, fault-tolerant designs, and rigorous testing.</li> <li>Create custom tools to trace, replay, and fix issues or crashes across the entire stack, from cluster orchestration to GPU kernels.</li> <li>Benchmark and fine-tune inference engines to deliver optimal performance under diverse, production workloads.</li> <li>Develop robust CI/CD infrastructure to enable seamless endpoint deployment, image publishing, feature rollouts, and inference engine updates.</li> </ul> <h3>Required Qualifications</h3> <ul> <li>Worked on large-scale, high-concurrent production serving.</li> <li>Worked on GPU inference engines.</li> <li>Worked on testing, benchmarking, and the reliability of inference services.</li> <li>Worked on designing and implementing CI/CD infrastructure.</li> </ul> <h3><strong>Annual Salary Range</strong></h3> <p>$180,000 - $440,000 USD</p> <h3><strong>Benefits</strong></h3> <p>Base salary is just one part of our total rewards package at xAI, which also includes equity, comprehensive medical, vision, and dental coverage, access to a 401(k) retirement plan, short &amp; long-term disability insurance, life insurance, and various other discounts and perks.</p><div class="content-conclusion"><p><em>xAI is an equal opportunity employer. For details on data processing, view our&nbsp;</em><em><a href="https://x.ai/legal/recruitment-privacy-notice" target="_blank">Recruitment Privacy Notice</a>.</em></p></div>

Required Skills

ScalaRAGFine-tuning

About xAI

Elon Musk's AI company building Grok and advancing AI understanding.

Visit Company Website

Ready to Apply?

Join xAI and work on cutting-edge AI technology