Senior Site Reliability Engineer

Navan • $120K — $160K *

Austin, TX 78745In-Person

Enterprise Technology

5 - 7 years of experience

1 month ago

By clicking Apply, I agree with Ladders' Terms of Use and Privacy Policy

Job Overview by Ladders

Qualifications

5+ years of experience as a Senior SRE or DevOps Lead
2+ years in a production, 24x7 environment
Strong problem-solving and technology learning mindset
Excellent communication skills for stakeholder collaboration
Experience mentoring junior and mid-level engineers
Demonstrated ownership in shipping production-quality code
Operational experience with Java, with Python, Node.js, and Go as a plus
Hands-on experience with cloud-based distributed systems, preferably AWS

Responsibilities

Design and develop tooling, automation, and infrastructure services for Navan
Build high-growth, reliable service infrastructure using infrastructure as code
Identify and resolve reliability anti-patterns to enhance system health
Automate workflow processes and empower users through tool development
Leverage AI tools to enhance autonomy and observability in operations
Define and enforce system reliability standards and practices across teams
Drive AI-assisted developer tools to boost productivity and code quality

Benefits

Work in a fast-paced, startup-like environment
Opportunity to drive impactful changes in a rapidly evolving product
Collaborative atmosphere with cross-functional team engagement
Focus on innovation in problem-solving and system reliability
Flexible work location options in Dallas, TX or Austin, TX

Full Job Description

We are constantly striving to make the most reliable and scalable systems possible to ensure that our services are available to our travelers when they need it most. With our exponential growth, we have many exciting challenges ahead and we9re looking for a passionate Site Reliability Engineer to join our team in Dallas, TX or Austin, TX. As an SRE you will design and develop tooling, automation and infrastructure services that power the Navan services, used by thousands of travelers on a daily basis. You will work closely with development teams, release and productivity teams and security teams to identify customer needs and build innovative solutions to solve them.

You will work across a vast array of systems and technologies, aiming to build an autonomous, monitored, fault-tolerant infrastructure that is optimized for both simplicity and uptime. You will collaborate with the backend and frontend engineering teams to ensure that product solutions are scalable, efficient, and reliable. You will design infrastructure to support our massive growth and work with the team to maintain the highest level of service.

What You9ll Do:

Building a fast moving, high growth service. Navan is revolutionizing travel and expense services for the enterprise, and the product is evolving quickly. You are comfortable in a startup environment, enjoy seeing the product take shape, and have strong ownership of the success of your services.
Designing, implementing and operating cloud infrastructure. You9re a fit for us if you think in terms of infrastructure as code, deployment pipelines, and building the guardrails to make going fast also going safely.
Identifying reliability anti-patterns and solving them systemically. You dive deep into the data to evaluate the health of your systems, and you use it to improve visibility and reliability across the fleet of services.
Finding and automating the toil out of our processes. You9d prefer to automate it entirely, or build a tool to empower your users rather than be the gatekeeper to the tool.
Leveraging AI tools and platforms in your daily work to achieve autonomous operations, reduce toil, and improve system observability.
Defining and driving the adoption of system reliability standards, including formalizing SLO/SLI frameworks, observability standards, and blameless post-mortem practices across multiple engineering teams.
Driving the adoption of AI-assisted developer tools and platforms to increase engineering productivity, enforce code quality standards, and enable real-time architectural validation.

What We9re Looking For:

5+ years of progressive experience as a Senior SRE or DevOps Lead (or equivalent role)
2+ years of experience in working on a production, 24x7 product environment
Passionate about solving problems and learning new tools and technologies
Excellent communication skills working with stakeholders and domain experts across the company to design solutions to user problems
Thrive in a fast-paced environment
Demonstrated experience mentoring and leading junior and mid-level engineers, and acting as a technical owner for cross-functional infrastructure projects.
Operate with a strong sense of ownership demonstrated through shipping production-quality code and infrastructure equipped with testing, monitoring and documentation
Hands-on operational experience with Java based applications and services including JVM profiling and performance tuning (python, Node.js and Go are a plus)
Hands-on experience building and operating distributed systems in a public cloud environment (preferably AWS), using CI/CD to deploy, manage and operate production systems, focusing on tooling and automation using tools such as maven and Jenkins.
Hands-on experience with microservice architecture and related reliability and resiliency patterns such as throttling, queueing, and retries
Hands-on experience with writing Infrastructure as Code in Terraform or Cloudformation or similar tools
A passion for automating away everything, using scripting languages such as python, bash groovy (we prefer lazy engineers)
Built, using, and automating monitoring systems such as NewRelic, DataDog, SignalFX, Kibana,
Hands-on experience deploying, operating, and monitoring production-grade AI/ML microservices (e.g., RAG pipelines, agentic systems) on cloud platforms like AWS Fargate/ECS.
Experience leveraging AI/LLM platforms (e.g., Gemini, Braintrust) and managing their secrets and infrastructure using Infrastructure as Code (Terraform) and AWS SSM.
Demonstrated ability to integrate AI-specific telemetry and advanced observability practices to enable predictive insights and systemic root-cause analysis.

About Navan

Navan is a mining company that focuses on the exploration and development of mineral properties. The company was founded in 2019 and is headquartered in Vancouver, Canada. Navan's primary focus is on the exploration and development of gold and silver properties in North America. The company's management team has extensive experience in the mining industry, and is committed to responsible and sustainable mining practices. Navan is a publicly traded company, and its shares are listed on the Canadian Securities Exchange.

Learn more about Navan

Size

10 employees

Industry

Manufacturing & Automotive

Founded

2015

* Ladders Estimates

Similar Jobs

Principal Product Support Engineer, Level 4 (Clearance Required - Secret), Oklahoma City, Oklahoma, Dallas or Houston, TX, Montgomery, AL
$152K — $349K *
Hewlett Packard Enterprise Development LP
Montgomery, TX 77356 (Montgomery County)
Reposted Today
Lead Systems Integrator
$120K — $150K *
Tyto Athene
Remote
Today
Systems Engineer (Splunk)
$100K — $130K *
Charles Schwab
Southlake, TX 76092 (Tarrant County)
Reposted Today
Staff Engineer - Capacity Planning and Management
$110K — $230K *
Geico
Dallas, TX 75217 (Dallas County)
Reposted Today
Senior Emulation Methodology Engineer
$120K — $160K *
Advanced Micro Devices, Inc
Austin, TX 78745 (Travis County)
Reposted Today
Mainframe VTAM Engineer - Remote - Multiple locations
$90K — $120K *
Truist Financial
Charlotte, NC 28269 (Mecklenburg County)
Reposted Today

Get Ready For Your
Next Interview

More Jobs at Navan

Launch Manager, Navan Premier
$97K — $170K *
New York, NY 10025 (New York County)
2 days ago
Business Services
In-Person
Launch Manager, Navan Premier
$90K — $120K *
Austin, TX 78745 (Travis County)
2 days ago
Business Services
In-Person
Integrations Manager
$90K — $120K *
Austin, TX 78745 (Travis County)
3 days ago
Technical Services
In-Person
Integrations Manager
$82K — $162K *
New York, NY 10025 (New York County)
3 days ago
Enterprise Technology
In-Person
Senior Software Engineer, AI
$113K — $252K *
New York, NY 10025 (New York County)
3 days ago
Enterprise Technology
In-Person

More Enterprise Technology Jobs

Sourcing Manager, Technology
$90K — $120K *
American International Group
Atlanta, GA 30349 (Fulton County)
Today
SAP PLM Product Owner
$105K — $130K *
Kimberly-Clark Corporation
Dallas, TX 75217 (Dallas County)
Reposted Today
.Net Lead (Hybrid - Flexible Options)
$165K — $175K *
Broadridge
New York, NY 10025 (New York County)
Today
Software asset Management (SAM) Analyst
$120K — $135K *
Broadridge
Kansas City, MO 64118 (Clay County)
Today
Senior Project Manager
$135K — $165K *
SSOE Group
Phoenix, AZ 85032 (Maricopa County)
Today

Find similar Senior Site Reliability Engineer jobs:

Nationwide Austin, TX

Senior Site Reliability Engineer

Job Overview by Ladders

Full Job Description

Get Ready For Your Next Interview

Find similar Senior Site Reliability Engineer jobs:

Get Ready For Your
Next Interview