LH
LLMHire
Browse JobsMarket TrendsNewSalariesTrendsCompaniesPricingBlog

Never Miss an AI Job

Get weekly AI job alerts delivered to your inbox.

Join the AI hiring radar. Unsubscribe anytime.

LH
LLMHire

The AI Labor Market Intelligence Platform. Real-time job data, salary benchmarks, and hiring trends from 160+ companies.

Jobs

  • Browse Jobs
  • Companies
  • Job Alerts
  • Post a Job
  • Pricing

Resources

  • Blog
  • CyberOS.devScan code for vulnerabilities
  • EndOfCoding.comStay ahead with AI news
  • Vibe Coding AcademyLearn skills employers want
  • Vibe Coding Ebook22 chapters, 200+ prompts
  • Video Tutorials@endofcoding on YouTube

Company

  • About
  • Contact
  • Privacy
  • Terms

Contact

  • hello@llmhire.com
  • Get in Touch

© 2026 LLMHire. All rights reserved.

VeriduxLabsBuilt by VeriduxLabs
Back to all jobs
F

Member of Technical Staff

Fireworks AI
New York, NYOnsite3 days agovia Greenhouse
full-timeseniorcustomopen-source

About the Role

<div class="content-intro"><h2><strong>About Us:</strong></h2> <p data-start="107" data-end="729">At Fireworks, we’re building the future of generative AI infrastructure. Our platform delivers the highest-quality models with the fastest and most scalable inference in the industry. We’ve been independently benchmarked as the leader in LLM inference speed and are driving cutting-edge innovation through projects like our own function calling and multimodal models. Fireworks is a Series C company valued at $4 billion and backed by top investors including Benchmark, Sequoia, Lightspeed, Index, and Evantic. We’re an ambitious, collaborative team of builders, founded by veterans of Meta PyTorch and Google Vertex AI.</p></div><p style="text-align: left;"><strong>Job Duties: </strong>Design, develop, and maintain large-scale backend and cloud-native infrastructure to<br>support distributed machine learning training, inference, and data processing pipelines for generative AI platform.<br>Architect and build scalable, resilient backend infrastructure to support distributed training, inference, and data<br>processing pipelines. Lead technical design discussions, mentor engineers, and establish best practices for<br>large-scale machine learning systems. Design and implement core backend services with a focus on efficiency<br>and low latency. Drive infrastructure optimization initiatives for compute cost, storage lifecycle management, and<br>network performance. Collaborate with machine learning, DevOps, and product teams to translate research and<br>product requirements into robust infrastructure solutions. Evaluate and integrate cloud-native and open-source<br>technologies such as Kubernetes, Ray, Kubeflow, and MLFlow to enhance platform reliability. Own end-to-end<br>systems from design to deployment, emphasizing reliability, fault tolerance, and operational excellence.</p> <p style="text-align: left;"><strong>Minimum Education &amp; Experience Required</strong>: Bachelor’s degree or equivalent in Computer Science or related<br>field plus four (4) years of experience in software engineering or related role</p> <p style="text-align: left;"><strong>Minimum Skills Required:</strong> 4 years of experience designing, building, and optimizing large-scale backend<br>infrastructure and distributed data systems (e.g., PostgreSQL, MySQL, DynamoDB, Apache Spark, Apache<br>Flink, Apache Kafka) in cloud environments (AWS, GCP, Azure, or equivalent), including cloud-native platforms,<br>core infrastructure components, and optimization techniques (caching, indexing, sharding, replication,<br>transactions, ACID). 4 years of experience with major server-side programming languages and frameworks<br>(e.g., Python, C++, Go, TypeScript). 4 years of experience writing technical design documentation, leading cross-<br>functional projects, and collaborating with cross-functional teams to achieve business impact. 3 years of<br>experience developing and maintaining data processing and API systems, including client-server communication<br>frameworks (e.g., gRPC, Thrift). 3 years of experience conducting A/B testing and scientific experimentation<br>(e.g., Statsig, Meta Deltoid, Optimizely) to measure software impact. 3 years of experience conducting coding<br>interviews and providing systematic feedback for engineering candidates. 2 years of experience with cloud-native<br>tools and infrastructure, such as Docker and Kubernetes. 2 years of experience defining and implementing data-<br>driven metrics to support company or team goals.</p> <p style="text-align: left;"><strong>How to Apply</strong>: Submit resume and apply online at http://www.fireworks.ai/careers and search for job by title.</p> <p style="text-align: left;">&nbsp;</p><div class="content-pay-transparency"><div class="pay-input"><div class="description"><p>Total compensation for this role also includes meaningful equity in a fast-growing startup, along with a competitive salary and comprehensive benefits package. Base salary is determined by a range of factors including individual qualifications, experience, skills, interview performance, market data, and work location. The listed salary range is intended as a guideline and may be adjusted.</p></div><div class="title">Base Pay Range (Plus Equity)</div><div class="pay-range"><span>$175,000</span><span class="divider">&mdash;</span><span>$220,000 USD</span></div></div></div><div class="content-conclusion"><h2><strong>Why Fireworks AI?</strong></h2> <ul> <li>Solve Hard Problems: Tackle challenges at the forefront of AI infrastructure, from low-latency inference to scalable model serving.</li> <li>Build What’s Next: Work with bleeding-edge technology that impacts how businesses and developers harness AI globally.</li> <li>Ownership &amp; Impact: Join a fast-growing, passionate team where your work directly shapes the future of AI—no bureaucracy, just results.</li> <li>Learn from the Best: Collaborate with world-class engineers and AI researchers who thrive on curiosity and innovation.</li> </ul> <p><em>Fireworks AI is an equal-opportunity employer. We celebrate diversity and are committed to creating an inclusive environment for all innovators.</em></p></div>

Required Skills

PythonTypeScriptPyTorchKubernetesDockerAWSGCPAzure

About Fireworks AI

Fast and affordable AI inference platform for production workloads.

Visit Company Website

Ready to Apply?

Join Fireworks AI and work on cutting-edge AI technology