Site Reliability Engineer

Carbon Black   •  


Industry: Information Technology


Less than 5 years

Posted 187 days ago

This job is no longer available.

Why Carbon Black?
At Carbon Black, you’ll have the chance to make an impact in the ever-evolving cybersecurity space. Our advanced technology tackles even the toughest challenges and stays ahead of the latest threats. If you want to join an agile company that’s building bleeding edge technology in the cloud, Carbon Black is the place for you. Driven by passionate people who are dedicated to making the world safer, it’s no wonder we’ve been named a “Top Place to Work” by the Boston Globe for four consecutive years. Join us!

Why You Matter
We’re looking for a Cloud Operations Engineer who will perform operations and development support for our cloud product line. This individual will work with development and have responsibility for the health of our services.
If you are:
• Ready for your next challenge
• Some experience as an operations engineer
• Able to maintain the delicate balance between quality, speed, user experience and customer expectations in a 24x7 operations environment
• Apt to take the stairs…two at a time
Then you’re exactly the person we need. Join us in the battle to secure the world’s intellectual property.

What You’ll Do
• Share responsibility for health, scalability and availability of our cloud services
• Maintain deployed infrastructure on the AWS platform
• Participate in on-call rotation for production issues
• Work with the team to ensure cloud architecture meets scalability, availability and cost requirements
• Follow good operational practices such as use of playbooks, always having a backout plan, comfort in escalating when necessary

What You’ll Bring
• B.S. in Computer Science or related fields
• 1-3 years of experience managing cloud infrastructure, preferably with AWS running hundreds of EC2 instances
• Minimum of 2 years of experience with technical operations and software development support that worked on enterprise scale, mission critical, highly available Linux systems
• Experience with configuration management tools and cloud management automation, e.g. Cloud Formation, OpsWorks, Puppet, Chef
• Working knowledge of monitoring such as AWS CloudWatch, Splunk, and various alerting systems
• Introductory understanding of web services, databases and how they interact
• Understanding of backup/restore best practices in the cloud
• Ability to read and troubleshoot using a modern scripting language
• Excellent troubleshooting skills
• Thrive in a fast-paced, results oriented environment, with a KanBan workflow
• Ability to work independently and take specific instruction, switching as required
• Knowledge of AWS APIs a plus
• Security Experience a plus