<div class="content-intro"><h2><strong>About Us:</strong></h2> <p data-start="107" data-end="729">At Fireworks, we’re building the future of generative AI infrastructure. Our platform delivers the highest-quality models with the fastest and most scalable inference in the industry. We’ve been independently benchmarked as the leader in LLM inference speed and are driving cutting-edge innovation through projects like our own function calling and multimodal models. Fireworks is a Series C company valued at $4 billion and backed by top investors including Benchmark, Sequoia, Lightspeed, Index, and Evantic. We’re an ambitious, collaborative team of builders, founded by veterans of Meta PyTorch and Google Vertex AI.</p></div><p style="text-align: left;"><strong>Job Duties: </strong>Design, develop, and maintain large-scale backend and cloud-native infrastructure to support distributed machine learning training, inference, and data processing pipelines for a generative AI platform. Architect and build scalable, resilient backend infrastructure to support distributed training, inference, and data processing pipelines. Lead technical design discussions, mentor engineers, and establish best practices for large-scale machine learning systems. Design and implement core backend services with a focus on efficiency and low latency. Drive infrastructure optimization initiatives for compute cost, storage lifecycle management, and network performance. Collaborate with machine learning, DevOps, and product teams to translate research and product requirements into robust infrastructure solutions. Evaluate and integrate cloud-native and open-source technologies such as Kubernetes, Ray, Kubeflow, and MLflow to enhance platform reliability. 
Own end-to-end systems from design to deployment, emphasizing reliability, fault tolerance, and operational excellence.</p> <p style="text-align: left;"><strong>Minimum Education & Experience Required</strong>: Bachelor’s degree or equivalent in Computer Science or related field plus four (4) years of experience in software engineering or related role.</p> <p style="text-align: left;"><strong>Minimum Skills Required:</strong> 4 years of experience designing, building, and optimizing large-scale backend infrastructure and distributed data systems (e.g., PostgreSQL, MySQL, DynamoDB, Apache Spark, Apache Flink, Apache Kafka) in cloud environments (AWS, GCP, Azure, or equivalent), including cloud-native platforms, core infrastructure components, and optimization techniques (caching, indexing, sharding, replication, transactions, ACID). 4 years of experience with major server-side programming languages and frameworks (e.g., Python, C++, Go, TypeScript). 4 years of experience writing technical design documentation, leading cross-functional projects, and collaborating with cross-functional teams to achieve business impact. 3 years of experience developing and maintaining data processing and API systems, including client-server communication frameworks (e.g., gRPC, Thrift). 3 years of experience conducting A/B testing and scientific experimentation (e.g., Statsig, Meta Deltoid, Optimizely) to measure software impact. 3 years of experience conducting coding interviews and providing systematic feedback for engineering candidates. 2 years of experience with cloud-native tools and infrastructure, such as Docker and Kubernetes. 
2 years of experience defining and implementing data-driven metrics to support company or team goals.</p> <p style="text-align: left;"><strong>How to Apply</strong>: Submit resume and apply online at http://www.fireworks.ai/careers and search for job by title.</p> <p style="text-align: left;"> </p><div class="content-pay-transparency"><div class="pay-input"><div class="description"><p>Total compensation for this role also includes meaningful equity in a fast-growing startup, along with a competitive salary and comprehensive benefits package. Base salary is determined by a range of factors including individual qualifications, experience, skills, interview performance, market data, and work location. The listed salary range is intended as a guideline and may be adjusted.</p></div><div class="title">Base Pay Range (Plus Equity)</div><div class="pay-range"><span>$175,000</span><span class="divider">—</span><span>$220,000 USD</span></div></div></div><div class="content-conclusion"><h2><strong>Why Fireworks AI?</strong></h2> <ul> <li>Solve Hard Problems: Tackle challenges at the forefront of AI infrastructure, from low-latency inference to scalable model serving.</li> <li>Build What’s Next: Work with bleeding-edge technology that impacts how businesses and developers harness AI globally.</li> <li>Ownership & Impact: Join a fast-growing, passionate team where your work directly shapes the future of AI—no bureaucracy, just results.</li> <li>Learn from the Best: Collaborate with world-class engineers and AI researchers who thrive on curiosity and innovation.</li> </ul> <p><em>Fireworks AI is an equal-opportunity employer. We celebrate diversity and are committed to creating an inclusive environment for all innovators.</em></p></div>