Gem.com

Senior Site Reliability Engineer (Copy)

Gem.com$170K — $215K *
Information Technology
Less than 5 years of experience
Job Overview by Ladders

Qualifications

  • 3-6+ years in Site Reliability Engineering (SRE), DevOps, or production infrastructure roles.
  • Background in software development with a passion for coding.
  • Proficiency in Bash, Python, TypeScript, and Postgres SQL.
  • Experience with AWS services including EC2, Lambda, RDS, IAM, and VPCs.
  • Strong understanding of GitHub workflows, particularly GitHub Actions and release automation.
  • Familiarity with CI/CD principles and complete deployment lifecycle management.
  • Ability to audit and adapt AI-generated code for production standards.

Responsibilities

  • Lead the release and deployment process, ensuring safe and observable workflows.
  • Manage GitHub branching strategies and enforce collaboration standards.
  • Build and maintain AWS-based infrastructure, setting up observability tools and alarms.
  • Automate processes and write scripts to enhance deployment confidence and reduce friction.
  • Contribute code for internal tools and system reliability improvements as needed.
  • Support global teams, handling off-hours tasks to maintain deployment velocity.
  • Work in high-autonomy and fast-paced environments.

Benefits

  • Competitive salary with meaningful equity options.
  • 401(k) retirement plan.
  • Comprehensive health, dental, vision, and life insurance coverage.
  • Flexible time off policy including holidays.
  • Supportive high-autonomy work environment.
  • Opportunities for team gatherings and offsite events.
Full Job Description
Who We're Looking For

We're looking for a hands-on, high-agency Site Reliability Engineer to help shape and scale the reliability layer of our stack. You'll own the release pipeline end-to-end - managing daily releases, weekly deploys, and hotfixes - while also automating infrastructure, monitoring systems, and GitHub workflows. This is a software engineering role, deeply embedded in DevOps culture, with significant autonomy and direct impact on the pace and safety of our shipping process.

You'll work closely with engineers, product leads, and company leadership to ensure uptime, speed, and confidence in every deploy.

What You'll Do

  • Own Deployments: Lead our release and deployment process - from daily rollouts to weekly deploys and hotfix coordination. Build safe, repeatable, and observable workflows.
  • GitHub Operations: Manage GitHub branching strategies, pull request flows, merge policies, and GitHub Actions. Set and enforce collaboration standards for the engineering team.
  • Infrastructure & Monitoring: Build and maintain resilient AWS-based infrastructure. Set up and manage observability tools (logs, metrics, traces), configure alarms, and be the first responder for incidents. Triage, escalate, or resolve based on impact.
  • Automation & Internal Tooling: Write scripts, services, and automations that reduce friction and improve deployment confidence. Using AI tools to generate code is encouraged and expected - you'll be comfortable guiding, adapting, and integrating AI-assisted outputs into production workflows.
  • Software Development: You'll contribute code when needed - whether that's building internal tools, improving system reliability, or unblocking a deploy. This is not a sprint-based role, but strong software fundamentals are key to success.
  • Support Global Teams: Work off-hours as needed to unblock offshore teams and maintain deployment velocity across time zones.


You're a Great Fit If You

  • Have 3-6+ years in SRE, DevOps, or infrastructure roles with production ownership.
  • Started your career in software development - and still enjoy writing code.
  • Are fluent in or at least familiar with Bash, Python, TypeScript, and Postgres SQL.
  • Are a confident AWS operator and know your way around EC2, Lambda, RDS, IAM, and VPCs.
  • Have strong experience with GitHub workflows, including GitHub Actions and release automation.
  • Are comfortable using AI tools (Claude, ChatGPT, etc.) to generate code - and have the skill to audit and adapt that code to meet production standards.
  • Are familiar with CI/CD principles and enjoy owning the full deployment lifecycle.
  • Are comfortable being on-call and understand how to design systems for both speed and safety.
  • Can operate with a high level of autonomy in fast-moving, ambiguous environments.


Compensation & Benefits

  • Competitive Salary + Meaningful Equity
  • 401(k)
  • Health, Dental, Vision, and Life Insurance
  • Flexible Time Off + Holidays
  • High-autonomy work environment
  • Team gatherings and offsites


Salary

As an early-stage startup, we offer a competitive compensation package that includes base salary, meaningful equity, and benefits. Equity grants are designed to ensure employees share in the long-term success and upside of the company.

Base Salary: $170,000 - $215,000 annually

Actual compensation will be determined based on a candidate's qualifications, skills, experience, and geographic location.

About Gem.com

Industry
Founded
2013

Similar Jobs

More Jobs at Gem.com

More Information Technology Jobs

Find similar Senior Site Reliability Engineer (Copy) jobs: