VoltaGrid

DevOps & Site Reliability Engineer

VoltaGrid$90K — $130K *
Information Technology
Less than 5 years of experience
Job Overview by Ladders

Qualifications

  • 4+ years of experience in DevOps, SRE, or infrastructure engineering roles.
  • Strong experience with at least one major cloud provider (AWS, GCP, or Azure, with a preference for AWS).
  • Deep hands-on experience with Kubernetes and Docker in production environments.
  • Proficiency with infrastructure as code tools, particularly Terraform.
  • Experience building and maintaining CI/CD pipelines (GitHub Actions, GitLab CI, Jenkins, or similar).
  • Strong scripting skills in languages such as Bash, Python, or Go.
  • Strong Linux systems administration skills (Ubuntu, RHEL/CentOS, or similar).

Responsibilities

  • Design, build, and maintain cloud infrastructure.
  • Manage and optimize Kubernetes clusters and containerized workloads in production.
  • Develop and maintain infrastructure as code using Terraform (or equivalent tooling).
  • Build and improve CI/CD pipelines to enable fast, safe, and reliable deployments.
  • Implement and maintain monitoring, alerting, and observability systems (Prometheus, Grafana, Datadog, or similar).
  • Define and track SLIs/SLOs, and participate in incident response and root cause analysis.
  • Identify and eliminate toil through automation and self-service tooling.

Benefits

  • Collaborative environment with opportunities to work closely with engineering teams.
  • Encouragement of operational excellence and innovative practices.
  • On-call rotations that foster a culture of shared responsibility and readiness.
  • Opportunity to participate in initiatives for developing internal platforms.
  • Exposure to various cloud technologies and infrastructure management methodologies.
Full Job Description
Position Title: DEVOPS & SRE ENGINEER
Location: HOUSTON, TX
FLSA Class: EXEMPT
Responsible to: Directo of Software Engineering

Position Summary: DevOps / Site Reliability Engineer to implement and evolve the infrastructure, deployment pipelines, and reliability posture of our systems. You'll work closely with engineering teams to build scalable, observable, and resilient infrastructure while driving a culture of operational excellence.

Essential Duties and Responsibilities:
  • Design, build, and maintain cloud infrastructure
  • Manage and optimize Kubernetes clusters and containerized workloads in production
  • Develop and maintain infrastructureascode using Terraform (or equivalent tooling)
  • Build and improve CI/CD pipelines to enable fast, safe, and reliable deployments
  • Implement and maintain monitoring, alerting, and observability systems (Prometheus, Grafana, Datadog, or similar)
  • Define and track SLIs/SLOs, participate in incident response, root cause analysis, and blameless postmortems
  • Identify and eliminate toil through automation and selfservice tooling
  • Configure and maintain onprem baremetal servers and Linuxbased infrastructure
  • Configure, maintain, and optimize virtualized assets
  • Collaborate with development teams on system design, capacity planning, and performance optimization
  • Participate in oncall rotations and ensure production readiness of new services

Other Requirements:
  • 4+ years of experience in DevOps, SRE, or infrastructure engineering roles
  • Strong experience with at least one major cloud provider (AWS, GCP, or Azure AWS preferred)
  • Deep hands-on experience with Kubernetes and Docker in production environments
  • Proficiency with infrastructureascode tools, particularly Terraform
  • Experience building and maintaining CI/CD pipelines (GitHub Actions, GitLab CI, Jenkins, or similar)
  • Solid understanding of monitoring and observability (metrics, logs, traces)
  • Strong scripting skills (Bash, Python, or Go)
  • Experience with incident management, SLObased reliability practices, and capacity planning
  • Strong Linux systems administration skills (Ubuntu, RHEL/CentOS, or similar)
  • Experience with virtualization platforms including VM provisioning, storage, networking, and cluster management
  • Solid understanding of networking, DNS, load balancing, and security fundamentals

Nice to Have:
  • Contributions to internal developer platforms or platform engineering initiatives
  • Proxmox VE experience
  • Certifications in cloud platforms (AWS SA, CKA, etc.)

The above statements are intended to describe the general nature and level of work being performed by employees assigned to this classification. All personnel may be required to perform duties outside of their normal responsibilities from time to time, as needed.

About VoltaGrid

VoltaGrid is a renewable energy company that specializes in developing and operating distributed energy resources. The company's mission is to accelerate the transition to a clean energy future by providing reliable, affordable, and sustainable energy solutions to businesses and communities. VoltaGrid's innovative technology platform enables customers to optimize their energy usage, reduce their carbon footprint, and save money on their energy bills. The company is committed to delivering exceptional customer service and building long-term relationships with its clients.
Learn more about VoltaGrid
Size
50 employees
Industry
Net Income
$1 million
Founded
2010
5 Year Trend
+20%
Revenue
$5 million
NASDAQ

Similar Jobs

More Jobs at VoltaGrid

More Information Technology Jobs

Find similar DevOps & Site Reliability Engineer jobs: