DevOps Engineer - AWS

TensorWave

$100K — $130K *
Information Technology
5 - 7 years of experience
Job Overview by Ladders

Qualifications

  • 5+ years in cloud infrastructure, DevOps, SRE, or platform operations
  • Extensive AWS expertise: VPCs, EC2, S3, IAM, CloudWatch, Route 53
  • Proficient in Infrastructure-as-Code tooling, especially Terraform
  • Solid Linux knowledge: networking, process management, troubleshooting
  • Experience with CI/CD processes and monitoring tools
  • Excellent documentation and communication skills for cross-team collaborations

Responsibilities

  • Manage the complete lifecycle of AWS infrastructure across various environments
  • Develop and maintain Infrastructure-as-Code with Terraform or similar tools
  • Design and implement cloud architectures for high availability and scalability
  • Establish and oversee CI/CD workflows for cloud services
  • Enhance system observability through metrics and dashboards
  • Diagnose and resolve AWS network and deployment issues
  • Engage in incident response and conduct post-incident analyses

Benefits

  • Stock options
  • 100% paid medical, dental, and vision insurance for employees
  • Company contributions to Health Savings Accounts
  • Fully paid short-term and long-term disability insurance
  • Life insurance and optional voluntary supplemental insurance
  • Access to various supplementary health benefits
  • Flexible Spending Accounts
  • 401(k) plan
  • Employee Assistance Programs
  • Flexible PTO and paid holidays
  • Parental leave
  • Additional in-office perks
Full Job Description
About the Role

We are hiring an AWS Cloud Engineer to design, provision, optimize, and support the AWS infrastructure powering our AMD GPU AI/HPC platform. This is a hands-on execution role - you'll work closely with Rust backend engineers, TypeScript developers, SREs, and platform teams to keep cloud infrastructure reliable, cost-efficient, and scalable. The goal is simple: reduce cloud bottlenecks and give our engineering teams a solid foundation to build on.

What You'll Do
  • Own the full lifecycle of AWS infrastructure across dev, staging, production, and customer-facing environments - provisioning, scaling, monitoring, security, cost optimization, and decommissioning
  • Build and maintain Infrastructure-as-Code (Terraform, Pulumi, AWS CDK, CloudFormation)
  • Implement cloud patterns for high availability, auto-scaling, secure service communication, and customer environment provisioning
  • Build and maintain CI/CD workflows for cloud infrastructure and hosted services
  • Improve observability through metrics, logging, alerting, dashboards, and runbooks
  • Troubleshoot AWS networking, compute, storage, IAM, and deployment issues
  • Participate in incident response, post-incident reviews, and root cause analysis
  • Document architecture, operational processes, and best practices


Who You Are

Required Qualifications
  • 5+ years in cloud infrastructure, DevOps, SRE, or platform operations
  • Hands-on AWS experience: VPCs, EC2, S3, IAM, CloudWatch, Route 53, load balancers, security groups, private networking
  • Proficiency with IaC tooling (Terraform strongly preferred)
  • Strong Linux fundamentals - networking, process management, storage, troubleshooting
  • Experience with CI/CD, Git-based workflows, and monitoring/alerting platforms
  • Clear communicator who can document infrastructure and collaborate across engineering teams

Preferred Qualifications
  • Experience with AI/ML, GPU, or HPC workloads
  • Kubernetes on AWS (EKS or self-managed)
  • Observability platforms: Prometheus, Grafana, Loki, OpenTelemetry, Datadog
  • AWS cost optimization: right-sizing, savings plans, lifecycle policies, tagging
  • Startup or high-growth infrastructure environment background


What We Offer
  • Stock Options
  • 100% paid Medical, Dental, and Vision insurance for Employees
  • Company Health Savings Account Contributions
  • 100% paid Short Term and Long Term Disability Insurance for Employees
  • Life and Voluntary Supplemental Insurance Options
  • Other Insurance Options, such as Pet & Legal Insurance
  • Various Supplementary Health Benefits, such as discounted Virtual Healthcare Appointments and Serious Illness Support
  • Flexible Spending Account
  • 401(k)
  • Employee Assistance Program
  • Flexible PTO
  • Paid Holidays
  • Parental Leave
  • Other In-Office Perks


Similar Jobs

More Jobs at TensorWave

More Information Technology Jobs

Find similar DevOps Engineer - AWS jobs: