Senior Site Reliability Engineer

Royall & Company

Birmingham, AL

Industry: Professional, Scientific & Technical Services


Experience: 5 - 7 years

Posted 102 days ago

This position is open to candidates in Washington, D.C.; Birmingham, AL; or working remotely.

Primary Responsibilities:

  • Configuration and deployment automation
  • Monitoring and disaster recovery
  • High availability architecture practices
  • Work with infrastructure and product teams to execute against standards of practice, support initial implementations, and coach teams on successful DevOps practices
  • Write software to improve the availability, efficiency, and scalability of operations and services
  • Forecast capacity needs, draft resource utilization reports, and optimize infrastructure spend
  • Resolve service outages and build automated responses for recurrence prevention
  • Influence development and administration decisions to enhance application integration and delivery
  • Address security and compliance concerns, in accordance with company policies
  • Identify and roll out new infrastructure technologies and methodologies
  • Advance infrastructure as code practices to enable a fully automated and resilient environment
  • Identify capacity metrics at various points in the application and track load against them to determine when to scale

Basic Qualifications:

  • Education: Bachelor's degree in Computer Science or a related field, or equivalent work experience
  • 5+ years of experience in production software development and/or operations
  • Passion for learning new technologies and finding the best fits for your colleagues
  • Hands-on experience with AWS
  • Familiarity with one or more application frameworks and associated deployment practices: Python/Django, Ruby/Rails, Java/Spring
  • Experience in Linux configuration and administration
  • Strong scripting capabilities: Bash, Ruby, or Python preferred
  • Experience with distributed Source Code Management (SCM) tools, e.g. Git
  • Some experience with operational CM/provisioning tools, e.g. Ansible, Chef, or Puppet

Ideal Qualifications:

  • Hands-on coding experience in Python, Rails, or Java development and related operations
  • Leadership experience in defining a public cloud strategy: initial greenfield implementation or migration of existing applications
  • Integration experience with third-party/open-source system monitoring tools, e.g. Nagios, New Relic, Splunk
  • Experience with Continuous Integration (CI) tools, e.g. Bamboo, CircleCI, Jenkins, Travis
  • Some experience with task management tools, JIRA preferred
  • Knowledge of deployment strategies (like blue/green and canary deployments)
  • An understanding of Agile methodology, through either formal education or on-the-job experience