Senior Infrastructure Engineer (SF)

Sysdig   •  

San Francisco, CA

Industry: Information Technology

  •  

Not Specified years

Posted 53 days ago

We're looking for a Senior/Infrastructure Engineer to help us lead the container revolution. You'll build solutions to enhance the availability, performance, and stability of the Sysdig SaaS and On-Prem offering. Being part of the engineering team, you will support Sysdig through building automated self-healing systems.

Role Responsibilities:

  • Enhancing our application running in Kubernetes with self-healing and stability improvements
  • Building and managing various components of internal and production environments with a focus on configuration management, continuous integration, and platform automation
  • Building and managing software delivery, systems integration, and developer support tools
  • Enhancing developer CI/CD pipeline using Jenkins and Github
  • Automating our infrastructure and EC2 deployments as well as our build automation systems
  • Conducting performance tuning, load testing, and optimization of information/data processing of the production environment

Key technologies:

Go, Python, Cassandra, Kafka, Kubernetes

Required Qualifications:

  • Solid full-cycle development experience in a high-level language, preferably Golang/Python/Java
  • Worked with containers such as Docker, Rkt (Rocket), containerd
  • Aptitude for troubleshooting complex problems in high-throughput web applications and network services
  • Solid understanding of Linux systems and networking

Desired Qualifications:

  • Deployed Kubernetes or OpenStack clusters
  • Managed any of these clusters – Cassandra, HBase, HDFS, Elasticsearch, Kafka, Redis
  • Proficiency with configuration management tools like Terraform (or at least Puppet, Chef, or SaltStack)
  • Experience in diagnosing and troubleshooting customer facing production service outages
  • Experience in monitoring cloud services using tools like Sysdig, Datadog, Prometheus, Grafana, Graphite, Nagios, or Zabbix
  • Experience in managing AWS resources including EC2, RDS, Auto Scaling groups, ALB/NLB, IAM