LH
LLMHire
Browse JobsAgentsNewSalary InsightsCompaniesBlogPricing

Never Miss an AI Job

Get weekly AI job alerts delivered to your inbox.

Join the AI hiring radar. Unsubscribe anytime.

LH
LLMHire

The #1 job board for AI & LLM engineers. Find your next role in the AI revolution.

Jobs

  • Browse Jobs
  • Companies
  • Job Alerts
  • Post a Job
  • Pricing

Resources

  • Blog
  • CyberOS.devScan code for vulnerabilities
  • EndOfCoding.comStay ahead with AI news
  • Vibe Coding AcademyLearn skills employers want
  • Vibe Coding Ebook22 chapters, 200+ prompts
  • Video Tutorials@endofcoding on YouTube

Company

  • About
  • Contact
  • Privacy
  • Terms

Contact

  • hello@llmhire.com
  • Get in Touch

© 2026 LLMHire. All rights reserved.

VeriduxLabsBuilt by VeriduxLabs
Back to all jobs
C

Applied AI/ML Scientist

Cerebras
UAEOnsite4 days ago
full-timemidgpt-5customopen-source

About the Role

<div class="content-intro"><p><span data-contrast="none">Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to effortlessly run large-scale ML applications, without the hassle of managing hundreds of GPUs or TPUs.&nbsp;</span><span data-ccp-props="{"134233117":false,"134233118":false,"201341983":0,"335559685":0,"335559737":240,"335559738":240,"335559739":240,"335559740":279}">&nbsp;</span></p> <p>Cerebras' current customers include top model labs, global enterprises, and cutting-edge AI-native startups.&nbsp;<a href="https://openai.com/index/cerebras-partnership/">OpenAI recently announced a multi-year partnership with Cerebras</a>, to deploy 750 megawatts of scale, transforming key workloads with ultra high-speed inference.&nbsp;</p> <p>Thanks to the groundbreaking wafer-scale architecture, Cerebras Inference offers the fastest Generative AI inference solution in the world, over 10 times faster than GPU-based hyperscale cloud inference services. This order of magnitude increase in speed is transforming the user experience of AI applications, unlocking real-time iteration and increasing intelligence via additional agentic computation.</p></div><h4>About The Role</h4> <p><span data-contrast="auto">As an Applied AI Scientist in the&nbsp;FieldML&nbsp;team, you will&nbsp;be responsible for&nbsp;developing and customizing large language models and more broadly large-scale deep learning models to solve specific customer problems. You&nbsp;won't&nbsp;just advise; you will build.&nbsp;You will bridge the gap between&nbsp;state-of-the-art&nbsp;research and real-world&nbsp;applications&nbsp;by helping customers harness the power of the Cerebras Wafer-Scale Engine (WSE) for their AI initiatives.&nbsp;</span><span data-ccp-props="{}">&nbsp;</span></p> <p><span data-contrast="auto">We are looking for&nbsp;experienced&nbsp;AI Scientists who are passionate about the "applied" side of machine learning&nbsp;-&nbsp;those who enjoy not just reading papers, but implementing, training, and scaling models to solve complex business and scientific problems. You will work on a diverse range of projects, from training bespoke models from scratch to fine-tuning and&nbsp;optimizing&nbsp;the latest Large Language Models (LLMs) for specific industry verticals, to designing and building components for custom agentic systems.</span><span data-ccp-props="{}">&nbsp;</span></p> <p><span data-contrast="auto">The ideal candidate has experience in large model training and/or post-training, a deep understanding of training dynamics and model convergence, and&nbsp;expertise&nbsp;in data curation, combined with&nbsp;strong communication&nbsp;skills.&nbsp;&nbsp;</span><span data-ccp-props="{}">&nbsp;</span></p> <h4><span data-ccp-props="{}">Key Responsibilities&nbsp;</span></h4> <ul> <li><strong><span data-contrast="auto">Customer Use Case Discovery &amp; Project Scoping</span></strong><span data-ccp-props="{}">&nbsp;</span> <ul> <li><span data-contrast="auto">Collaborate with customer stakeholders to&nbsp;identify&nbsp;the best approaches&nbsp;to their&nbsp;business problem with AI.</span><span data-ccp-props="{}">&nbsp;</span></li> <li><span data-contrast="auto">Contribute to the technical scoping of engagements, including feasibility analysis, data quality/availability/readiness assessments, and the selection of&nbsp;optimal&nbsp;model architectures.</span><span data-ccp-props="{}">&nbsp;</span></li> <li><span data-contrast="auto">Define project milestones, success metrics, and rigorous evaluation benchmarks to ensure the solution delivers measurable value to the customer’s business.</span><span data-ccp-props="{}">&nbsp;</span></li> </ul> </li> <li><strong><span data-contrast="auto">Custom SOTA Models and AI Systems&nbsp;Development</span></strong><span data-ccp-props="{}">&nbsp;</span> <ul> <li><span data-contrast="auto">Architect and execute&nbsp;end-to-end training recipes for custom models,&nbsp;tailoring&nbsp;model architecture and training recipes&nbsp;to meet customer-specific performance and accuracy requirements.</span><span data-ccp-props="{}">&nbsp;</span></li> <li><span data-contrast="auto">Design and implement sophisticated adaptation strategies, including continuous pre-training on private datasets, supervised fine-tuning (SFT), and post-training alignment via RLHF or DPO.</span><span data-ccp-props="{}">&nbsp;</span></li> <li><span data-contrast="auto">Take full ownership of the training pipeline, from high-performance data preprocessing and tokenization to hyperparameter tuning and loss-curve analysis.</span><span data-ccp-props="{}">&nbsp;</span></li> <li><span data-contrast="auto">Navigate the nuances of model convergence on specialized hardware, performing deep-dive analysis into loss dynamics and gradient stability.</span><span data-ccp-props="{}">&nbsp;</span></li> <li><span data-contrast="auto">Scale training workloads across Cerebras clusters, ensuring efficient&nbsp;utilization&nbsp;of the hardware for multi-billion parameter models.</span><span data-ccp-props="{}">&nbsp;</span></li> <li><span data-contrast="auto">Build and&nbsp;optimize&nbsp;the core components of agentic systems, focusing on tool-use capabilities, long-context reasoning, and multi-step planning.</span><span data-ccp-props="{}">&nbsp;</span></li> </ul> </li> <li><strong><span data-contrast="auto">Technical Customer Leadership</span></strong><span data-ccp-props="{}">&nbsp;</span> <ul> <li><span data-contrast="auto">Serve as an AI/ML subject matter expert during technical&nbsp;deep-dives, translating customer requirements into precise training recipes.</span><span data-ccp-props="{}">&nbsp;</span></li> <li><span data-contrast="auto">Build and&nbsp;maintain&nbsp;strong customer relationships to become their go-to&nbsp;AI/ML&nbsp;expert.</span><span data-ccp-props="{}">&nbsp;</span></li> </ul> </li> <li><strong><span data-contrast="auto">Internal Research and Engineering Collaboration</span></strong><span data-ccp-props="{}">&nbsp;</span> <ul> <li><span data-contrast="auto">Act as the "voice of the customer" for internal R&amp;D and engineering teams to drive improvements in our software stack and hardware&nbsp;utilization.</span><span data-ccp-props="{}">&nbsp;</span></li> <li><span data-contrast="auto">Partner with internal ML teams and product teams on prioritization of novel model architectures with Cerebras software stack, development of training recipes and internal case studies.</span><span data-ccp-props="{}">&nbsp;</span></li> <li><span data-contrast="auto">Distill customer-facing&nbsp;successful projects&nbsp;into internal playbooks, helping scale the&nbsp;FieldML&nbsp;team’s ability to deliver specialized models.</span></li> </ul> </li> </ul> <h4><span data-contrast="none">Skills And Qualifications&nbsp;</span> <span data-ccp-props="{"134233117":false,"134233118":false,"201341983":0,"335559685":0,"335559737":240,"335559738":240,"335559739":240,"335559740":279}">&nbsp;</span></h4> <ul> <li><span data-contrast="auto">Education:</span><span data-contrast="auto">&nbsp;Master’s or PhD in Computer Science, Machine Learning, or&nbsp;related&nbsp;fields.</span><span data-ccp-props="{}">&nbsp;</span></li> <li><span data-contrast="auto">Broad Deep Learning Expertise:</span><span data-contrast="auto">&nbsp;Expert-level understanding of modern model architectures, including dense transformers,&nbsp;MoEs, multimodal and sequence models, scaling laws and training dynamics.&nbsp;</span><span data-ccp-props="{}">&nbsp;</span></li> <li><span data-contrast="auto">Hands-on&nbsp;Trainig&nbsp;Experience:</span><span data-contrast="auto">&nbsp;Proven&nbsp;track record&nbsp;of training and/or fine-tuning large models (1B+ parameters) and direct experience with the challenges of large-scale model training.&nbsp;</span><span data-ccp-props="{}">&nbsp;</span></li> <li><span data-contrast="auto"><span data-ccp-charstyle="citation-6" data-ccp-charstyle-defn="{"ObjectId":"0a950e52-e491-55c3-8b16-6fa80c2fcfc6|1","ClassId":1073872969,"Properties":[201342446,"1",201342447,"5",201342448,"3",201342449,"1",469777841,"Aptos",469777842,"Arial",469777843,"MS 明朝",469777844,"Aptos",201341986,"1",469769226,"Aptos,Arial,MS 明朝",268442635,"24",469775450,"citation-6",201340122,"1",134233614,"true",469778129,"citation-6",335572020,"1",469778324,"Default Paragraph Font"]}">Engineering Proficiency:</span></span><span data-contrast="auto"><span data-ccp-charstyle="citation-6">&nbsp;Mastery of Python and&nbsp;</span><span data-ccp-charstyle="citation-6">PyTorch</span><span data-ccp-charstyle="citation-6">, e</span>xperience with distributed training frameworks and large-scale distributed data processing pipelines and tools.</span><span data-ccp-props="{}">&nbsp;</span></li> <li><span data-contrast="auto">Strong&nbsp;Interpersonal and&nbsp;Communication&nbsp;Skills</span><span data-contrast="auto">: Effective in collaborative and fast-paced team settings, able to work autonomously and within a team in a dynamic environment, managing multiple projects and pivoting as customer needs evolve. Able to present complex technical results to diverse&nbsp;audience&nbsp;- from C-level executives to research scientists, and to work collaboratively to solve customers’ unique challenges.&nbsp;</span><span data-ccp-props="{}">&nbsp;</span></li> </ul><div class="content-conclusion"><h4><strong>Why Join Cerebras</strong></h4> <p>People who are serious about software make their own hardware. At Cerebras we have built a breakthrough architecture that is unlocking new opportunities for the AI industry. With dozens of model releases and rapid growth, we’ve reached an inflection&nbsp; point in our business. Members of our team tell us there are five main reasons they joined Cerebras:</p> <ol> <li>Build a breakthrough AI platform beyond the constraints of the GPU.</li> <li>Publish and open source their cutting-edge AI research.</li> <li>Work on one of the fastest AI supercomputers in the world.</li> <li>Enjoy job stability with startup vitality.</li> <li>Our simple, non-corporate work culture that respects individual beliefs.</li> </ol> <p>Read our blog:&nbsp;<a href="https://www.cerebras.net/blog/5-reasons-to-join-cerebras" target="_blank" data-auth="NotApplicable" data-linkindex="0">Five Reasons to Join Cerebras in 2026.</a></p> <h4>Apply today and become part of the forefront of groundbreaking advancements in AI!</h4> <hr> <p><em>Cerebras Systems is committed to creating an equal and diverse environment and is proud to be an equal opportunity employer.&nbsp;</em><em>We celebrate different backgrounds, perspectives, and skills. We believe inclusive teams build better products and companies. </em><em>We try every day to build a work environment that empowers people to do their best work through continuous learning, growth and support of those around them.</em></p> <hr> <p><em>This website or its third-party tools process personal data. For more details, click <a href="https://www.cerebras.net/privacy/" target="_blank">here</a> to review our CCPA disclosure notice.</em></p></div>

Required Skills

PythonPyTorchAWSRAGTransformersRLHFFine-tuningDistributed Training

About Cerebras

Building the largest AI chips in the world for training massive models.

Visit Company Website

Ready to Apply?

Join Cerebras and work on cutting-edge AI technology