Site Reliability Engineer

NOV, Inc.$100K — $130K *
Information Technology
5 - 7 years of experience
Job Overview by Ladders

Qualifications

  • 5+ years in SRE, DevOps, or Infrastructure Engineering roles.
  • Expertise in Kubernetes and container orchestration at scale.
  • Strong experience with AKKA.NET or actor-based frameworks.
  • Proficient in scripting and automation with Bash, PowerShell, or Python.
  • Familiar with observability tools like Prometheus and Grafana.
  • Hands-on with cloud platforms such as AWS, Azure, or GCP.
  • Strong knowledge of PostgreSQL performance tuning and optimization.

Responsibilities

  • Maintain and monitor production systems for uptime and performance.
  • Lead incident response efforts with communication and documentation.
  • Design health checks, alerting systems, and automated workflows.
  • Conduct root cause analysis and implement permanent fixes.
  • Setup observability stacks for logging, metrics, and tracing.
  • Analyze telemetry data for trends and improvement opportunities.
  • Optimize distributed systems and PostgreSQL performance.

Benefits

  • Opportunity to work with cutting-edge technologies and frameworks.
  • Collaborative and high-performance culture that values innovation.
  • Focus on operational excellence with a builder's mindset.
  • Engagement in performance tuning and architecture evolution.
Full Job Description
Job Description

As a Site Reliability Engineer, you will be responsible for: Operational Excellence & Incident Management

- Maintain and monitor production systems for availability, latency, and performance.

- Lead incident response efforts, including communication, resolution, and postmortem documentation.

- Design and implement health checks, alerting systems, and automated remediation workflows.

- Drive root cause analysis and implement permanent resolutions for recurring issues.

Observability & Insights

- Set up and maintain full observability stacks (logging, metrics, tracing) using tools like Prometheus, Grafana, Datadog, OpenTelemetry, or ELK.

- Analyze telemetry and logs to identify trends, anomalies, and opportunities for improvement.

- Conduct post-incident reviews and use insights to inform future engineering investments.

Performance & Systems Optimization

- Tune and optimize distributed systems, including AKKA.NET actors, for performance and resource efficiency.

- Work with developers to evolve architecture and improve system throughput, latency, and stability.

- Optimize PostgreSQL performance, queries, and maintenance strategies.

CI/CD & Automation

- Design and maintain modern CI/CD pipelines using GitHub Actions, Azure Pipelines, or GitLab CI.

- Automate deployment, testing, and rollback processes to reduce friction and increase deployment frequency.

- Standardize infrastructure as code practices across environments.

We'd love to talk to you if you have:

- 5+ years of experience in SRE, DevOps, or Infrastructure Engineering roles.

- Expertise in Kubernetes and container orchestration at scale.

- Strong experience with AKKA.NET or similar actor-based frameworks.

- Proficiency with scripting and automation (Bash, PowerShell, Python).

- Experience with observability tools (Phobos,Datadog, Prometheus, Grafana, OpenTelemetry, ELK).

- Hands-on experience with cloud platforms (AWS, Azure, or GCP).

- Strong PostgreSQL knowledge-performance tuning, query optimization, maintenance.

- Proven ability to lead incident management and drive postmortem processes.

- A builder's mindset with high standards for operational excellence and technical ownership.

Preferred Tools & Ecosystem Experience

- CI/CD: GitHub Actions, Azure Pipelines, GitLab CI

- Infrastructure: Kubernetes, Docker, Terraform

- Monitoring: Phobos (AKKA.NET), Datadog, Prometheus

- Source Control: GitHub, GitLab, Azure DevOps

- Programming: C#, Python, Bash, PowerShell

About NOV, Inc.

NOV, Inc. Careers

Joining NOV, Inc. presents an unparalleled opportunity to advance a career in the energy sector with a company at the forefront of driving innovation and growth. NOV, Inc. is actively seeking professionals who are ready to engage with a global team that values leadership, diversity, and professional development.

Explore Job Opportunities

NOV, Inc. offers a variety of job opportunities that cater to a range of skills and experiences. Whether it's in engineering, finance, or project management, NOV, Inc. positions itself as a leader in career advancement in the energy industry. Explore open positions that align with professional skills and career interests.

Internship Programs

NOV, Inc. believes in nurturing talent from the ground up. Internship programs at NOV, Inc. provide invaluable industry exposure and hands-on experience, making them a cornerstone of professional development for students and recent graduates eager to make their mark.

Commitment to Employee Growth and Benefits

At NOV, Inc., employee growth is a priority, supported by comprehensive benefits and diversity training programs designed to foster an inclusive workplace. NOV, Inc. invests in its team, ensuring access to the tools and training necessary to excel both professionally and personally.

Cultivating a Culture of Innovation

The culture at NOV, Inc. is built on a foundation of innovation and collaborative problem-solving. Employees are encouraged to bring fresh ideas and perspectives to the table, driving the company’s leadership in the energy sector.

Professional Development and Networking

NOV, Inc. is dedicated to the continuous professional development of its team members through leadership training, networking opportunities, and robust career paths. Employees at NOV, Inc. enjoy a dynamic environment where they can build strong professional networks and enhance their career trajectory.

Hiring Process

The hiring process at NOV, Inc. is designed to be transparent and engaging, starting from the initial job posting to the final interview. Candidates are encouraged to showcase their skills and experiences through a detailed resume and during the interview process, ensuring a fit that is beneficial both for the individual and for NOV, Inc.

Join the NOV, Inc. Team

NOV, Inc. is looking for passionate, curious, and innovative team players. Search for open positions that match skills and interests on the NOV, Inc. careers page. Discover how a position at NOV, Inc. can propel a career to new heights.

Stay Connected with NOV, Inc. Careers

Keep up to date with career tips, industry insights, and the latest job openings at NOV, Inc. Personalize subscriptions for job alerts and insider tips tailored to specific career preferences. Explore the rewarding opportunities that await at NOV, Inc.

SEARCH NOV, INC. JOBS

READ CAREERS BLOG

JOB ALERT EMAILS

Embark on a journey with NOV, Inc., where career aspirations turn into achievements, and professional growth is intertwined with innovation and leadership.
Learn more about NOV, Inc.

Similar Jobs

More Jobs at NOV, Inc.

More Information Technology Jobs

Find similar Site Reliability Engineer jobs: