IT - Infrastructure Systems - DevOps Engineer II (Remote in CA only)

Golden 1

$123K - $135K
US-Anywhere
Remote in Sacramento, CA
Information Technology
Less than 5 years of experience
Job Overview by Ladders

Qualifications

  • Bachelor's degree in computer science, engineering, or related field.
  • 4+ years as a DevOps Engineer in medium to large-scale environments.
  • Proficient in Windows Server, Linux, Microsoft Azure, and VMware.
  • Skilled in Git/GitHub, Terraform, Python, PowerShell, and container orchestration (Kubernetes/Docker).
  • Experienced with CI/CD tools (Jenkins, GitLab CI, Azure DevOps) and observability platforms (Prometheus, Grafana).
  • Knowledge of log management (ELK Stack) and database technologies (PostgreSQL, MySQL).
  • Strong background in automating infrastructure and application deployments.

Responsibilities

  • Lead infrastructure-as-code development using Terraform and scripting languages.
  • Manage Linux/Kubernetes cluster environments.
  • Deploy solutions following Change Management Processes.
  • Support development teams on API integration strategies.
  • Ensure systems are secure against cybersecurity threats.
  • Propose automation solutions to improve workloads and create efficiencies.
  • Design and optimize CI/CD pipelines for reliable software releases.

Benefits

  • Flexible working hours with a possibility for remote work.
  • Collaborative environment with cross-team interactions.
  • Focus on continuous learning with exposure to new technologies.
  • Opportunity to work on scalable projects in cloud environments.
  • Health and wellness benefits supporting work-life balance.
Full Job Description
TITLE: DevOps Engineer II
STATUS: Exempt
REPORTS TO: Mgr - DevOps - SRE
DEPARTMENT: IT - Infrastructure Systems
JOB CODE: 12041

PAY RANGE: $123,600.00 - $135,000.00 Annually

POSITION AND PURPOSE

The DevOps Engineer II is responsible for leading the automation processes for deploying Infrastructure as Code in both Microsoft Azure and on-premises environments. The engineer will deploy product updates, identify production issues, and implement integrations that meet our customers' needs. The ideal candidate will have a solid background in DevOps and Site Reliability Engineering, with significant experience in Terraform, Python, and PowerShell. The engineer will lead the infrastructure-as-code process, manage Linux/Kubernetes cluster environments, and support development teams on API integration strategies. The engineer will also design, implement, and optimize CI/CD pipelines for faster and more reliable software releases; monitor systems to ensure application uptime and performance; provision and set up metrics; create alerts and manage alert suppression; and propose automation solutions to reduce workload. This role is responsible for implementing and operating cloud platform services and standards defined by Cloud Engineering, with a focus on reliability, security, and scalability.

THE WORK

GOLDEN 1 RESPONSIBILITIES INCLUDE:
  • Independently lead infrastructure-as-code development using Terraform and scripting languages such as Python and PowerShell to support scalable and reliable deployments.
  • Manage Linux/Kubernetes cluster environments.
  • Deploy solutions in accordance with Change Management Processes.
  • Support development teams on API integration strategy and standards development.
  • Ensure systems are secure against cybersecurity threats.
  • Identify technical problems and develop software updates and fixes.
  • Administer Splunk, including query optimization, alerting, and dashboard development.
  • Build tools to reduce errors and improve customer experience.
  • Propose ideas and solutions within the Infrastructure Department to reduce workload through automation.
  • Design, implement, and optimize CI/CD pipelines for faster and more reliable software releases.
  • Independently conduct root cause analysis and implement corrective actions.
  • Design and write tests to investigate infrastructure failure and scaling.
  • Create and maintain response playbooks across incident management and monitoring tools.
  • Develop automation to ensure repeatability, eliminate toil, and reduce the time needed to act on and repair services.
  • Analyze key operational metrics to identify opportunities to improve availability.
  • Implement effective monitoring and alerting, and reduce alert fatigue.
  • Manage container orchestration environments and optimize deployment workflows to enhance scalability, reliability, and operational efficiency.
  • Design, build, and manage containerized environments using Docker.
  • Create and maintain SLIs, SLOs, and error budgets.
  • Design and optimize monitoring dashboards and alerting systems to proactively detect and address application performance and uptime issues.
  • Implement code branching strategies using GitHub functions.
  • Apply advanced Terraform syntax and configure GitLab CI/CD pipelines and jobs.
  • Provision and set up metrics in Prometheus, Thanos, and Grafana, and create and manage alerts.
  • Implement cloud engineering standards, reusable modules, and platform patterns in Microsoft Azure.
  • Operate shared cloud platform services according to architectures defined by Cloud Engineering.
  • Ensure infrastructure changes comply with reliability, security, and cost controls established by Cloud Engineering.
  • Maintain operational documentation and runbooks for cloud platform services.


QUALIFICATIONS

EDUCATION: Bachelor of Science degree (or equivalent) in computer science, engineering, or a relevant field.

EXPERIENCE:
  • Over 4 years as a DevOps Engineer in medium to large-scale environments.
  • Proficient in Windows Server, Linux, and hybrid cloud deployments using Microsoft Azure and VMware.
  • Skilled in Git/GitHub workflows, Terraform, Python, PowerShell, and container orchestration (Tanzu, Docker, Kubernetes, OpenShift).
  • Experienced with CI/CD tools (Jenkins, GitLab CI, Azure DevOps) and observability platforms (Datadog, Prometheus, Grafana, ThousandEyes).
  • Knowledgeable in log management (ELK Stack) and database technologies (PostgreSQL, MySQL, NoSQL).
  • Strong background in automating infrastructure provisioning and application deployment using Terraform, Ansible, and Kubernetes.
  • Proficient in creating and maintaining monitoring dashboards, SLIs, SLOs, and error budgets to ensure application uptime and performance.
  • Experienced in ensuring infrastructure security, driving automation initiatives, and collaborating across teams to improve reliability and scalability.
  • Experienced in building observability pipelines and performing advanced queries in log management tools like Splunk for troubleshooting.
  • Experienced in implementing and operating Azure-based shared services defined by platform or cloud engineering teams.

KNOWLEDGE/SKILLS:
  • Microsoft Azure DevOps Engineer Expert Certification (Required)
  • Kubernetes Administration Certification (Required)
  • Linux Certification (Desired)

CORE COMPETENCIES:
  • Takes Initiative - Owns tasks and responsibilities
  • Delivers Results with Agility - Meets deadlines and adapts
  • Collaborates Across Teams - Works well within the team
  • Solves Problems Proactively - Handles day-to-day challenges
  • Builds Trust and Credibility - Demonstrates reliability

ORGANIZATIONAL CONTACTS & RELATIONSHIPS

INTERNAL: Regular interaction with Infrastructure Engineers, Computer Operations, IT Programming, Information Security, Network and Storage teams, and IT Service Management staff to support enterprise systems, respond to incidents, and perform scheduled maintenance activities.

EXTERNAL: Interaction with approved technology vendors, hardware and software support providers, and service partners, typically in coordination with the IT Systems Manager, for troubleshooting, maintenance, and support escalation.

WORKING CONDITIONS

Work time includes weekend and after-hours time, based on organizational needs. This position works in-office where working conditions, lighting, temperature, audio, and workspace are all sufficient.

PHYSICAL REQUIREMENTS

Work requires the ability to constantly operate a computer and the ability to read, type, and communicate. Work may require the ability to move work-related supplies weighing up to 10-15 pounds.




Pursuant to the San Francisco Fair Chance Ordinance, we will consider for employment qualified applicants with arrest and conviction records.
