LH
LLMHire
Browse JobsMarket TrendsNewSalariesTrendsCompaniesPricingBlog

Never Miss an AI Job

Get weekly AI job alerts delivered to your inbox.

Join the AI hiring radar. Unsubscribe anytime.

LH
LLMHire

The AI Labor Market Intelligence Platform. Real-time job data, salary benchmarks, and hiring trends from 160+ companies.

Jobs

  • Browse Jobs
  • Companies
  • Job Alerts
  • Post a Job
  • Pricing

Resources

  • Blog
  • CyberOS.devScan code for vulnerabilities
  • EndOfCoding.comStay ahead with AI news
  • Vibe Coding AcademyLearn skills employers want
  • Vibe Coding Ebook22 chapters, 200+ prompts
  • Video Tutorials@endofcoding on YouTube

Company

  • About
  • Contact
  • Privacy
  • Terms

Contact

  • hello@llmhire.com
  • Get in Touch

© 2026 LLMHire. All rights reserved.

VeriduxLabsBuilt by VeriduxLabs
Back to all jobs
S

Senior Site Reliability Engineer

Stability AI
United StatesOnsite6 days agovia Greenhouse
full-timesenior

About the Role

<p><strong>< Remote - United States ></strong></p> <p><strong>Job</strong> <strong>Description</strong>:<br>Stability AI’s Engineering Operations team is looking for a Senior Site Reliability Engineer (SRE) to join our growing team and play a pivotal role in improving and shaping our cloud infrastructure. The person will closely work with engineering, IT, security, and product teams to drive innovation and reliability in an evolving environment. Candidates should have the initiative to build and improve a maturing cloud landscape.</p> <h4><strong>Responsibilities:</strong></h4> <ul> <li>Developing and enforcing SRE best practices and standards across the organization.</li> <li>Architecting and managing scalable systems in AWS and other cloud environments, focusing on high availability and resilience.</li> <li>Implementing and maintaining infrastructure as code using Terraform.</li> <li>Setting up and refining monitoring, logging, and alerting systems.</li> <li>Driving incident management and root cause analysis to improve system reliability.</li> <li>Championing SRE principles and mentoring junior team members.</li> </ul> <h4><strong>Qualifications:</strong></h4> <ul> <li>Collaborating with development teams to enhance CI/CD pipelines.</li> <li>Experience scaling resource intensive systems, be it storage, networking, or compute.</li> <li>Knowledge and experience with Kubernetes or other container scaling solutions</li> <li>Background in software development or automation scripting.</li> <li>Knowledge and experience with Grafana, ELK stack, or similar tools.</li> <li>Cloud security experience.</li> </ul> <p><strong>Equal Employment Opportunity:</strong></p> <p>We are an equal opportunity employer and do not discriminate on the basis of race, religion, national origin, gender, sexual orientation, age, veteran status, disability or other legally protected statuses.</p> <p>&nbsp;</p>

Required Skills

KubernetesAWSScalaRAG

About Stability AI

Open source generative AI. Makers of Stable Diffusion.

Visit Company Website

Ready to Apply?

Join Stability AI and work on cutting-edge AI technology