General Motors

Lead Site Reliability Engineer

General Motors$120K — $150K *
Enterprise Technology
5 - 7 years of experience
Job Overview by Ladders

Qualifications

  • 5-7 years supporting Linux production environments
  • 5-7 years administering Apache Spark
  • 3-5 years scripting in bash, perl, ruby, or python
  • 3-5 years experience with Docker Datacenter
  • 2-4 years administering Machine Learning platforms
  • 1+ year experience with Mesos/Kubernetes/OpenShift orchestration
  • Bachelor's degree or equivalent experience required

Responsibilities

  • Manage and deploy Kubernetes and Spark cluster environments on various infrastructures
  • Refine site reliability engineering processes and procedures
  • Set up and maintain scalable Kubernetes environments for high availability
  • Collaborate with cross-functional teams to design environments for data management
  • Evaluate and experiment with technologies for data processing and scalability
  • Document, monitor, and troubleshoot data processing and automation issues
  • Contribute to architectural planning of distributed systems to improve scalability and reliability

Benefits

  • 401K matching
  • Bonding leave for new parents (12 weeks, 100% paid)
  • Training opportunities
  • GM employee auto discount
  • Community service pay
  • Nine company holidays
  • Generous benefits package available on day one
  • Flexible hybrid work environment, 2-days a week in office
Full Job Description
Job Description

Responsibilities

About the Role:

We are expanding our efforts into complementary data technologies for decision support in areas of ingesting and processing large data sets. Our interests are in enabling data science and search based applications on large and low latent data sets in both a batch and streaming context for processing. To that end, this role will incorporates aspects of software engineering and operations, combining SRE and DevOps skills to come up with efficient ways of managing and operating applications. The role will require a high level of responsibility and accountability to deliver technical solutions. The data sets we deal with support both off-line and in-line machine learning training and model execution. Other data sets support search engine based analytics. Exploration and deployment of technologies activities include identifying opportunities that impact business strategy, selecting data solutions software, and defining hardware requirements based on business requirements. Responsibility also includes documentation of procedures for deployment, monitoring, managing and switching the environments in production and disaster recovery sites. This role participates along with team counterparts to architect an end-to-end framework developed on a group of core data technologies
  • Manage/Administer/Deploy Kubernetes and Spark cluster environments, on bare-metal and container infrastructure, including service allocation and configuration for the cluster, capacity planning, performance tuning, and ongoing monitoring
  • Define and refine processes and procedures for the site reliability engineering practice
  • Setup, manage and maintain Kubernetes based scalable environments for high-availability and work with vendors for smooth and continuous operations
  • Work closely with data scientists, data architects, data engineers, ETL developers, cybersecurity, network, Linux, other IT counterparts, and business partners to design and setup the environments to manage the ingested and processed datasets from the external sources, internal systems, and the data warehouse to extract features of interest
  • Evaluate, research, experiment with data processing, management and scalability technologies in a lab to keep pace with industry innovation while assessing business impact and viability for use cases associated with efforts in hand
  • Design, setup, test, deploy, monitor, document, and troubleshoot data processing and associated automation issues from the operations perspective
  • Work with IT Operations and Information Security Operations with monitoring and troubleshooting of incidents to maintain service levels
  • Work with Information Security Vulnerability Management and vendors to remediate known impacting vulnerabilities
  • Contribute to the evolving distributed systems architecture to meet changing requirements for scaling, reliability, performance, manageability, and cost
  • Report utilization and performance metrics to user communities
  • Contribute to planning and implementation of new/upgraded hardware and software releases
  • Responsible for monitoring the Linux, Kubernetes, Object Storage(MinIO), Feature Store, and Spark
  • Research and recommend innovative, and where possible, automated approaches for administration tasks
  • Identify approaches to efficiencies in resource utilization, provide economies of scale, and simplify support issues
  • Responsible for administration of Machine Learning platforms & Operations (MLOps) Such as Kubeflow/Jupyterhub/Python
  • This role will support GMF international operations and will closely align with our GMF IT NorthStar architecture and operating Principles


Qualifications

What Makes You an Ideal Candidate?
  • Excellent knowledge of Kubernetes Administration, Deployments & Upgrades
  • Excellent Knowledge on Apache Spark administration on various platforms
  • Strong working knowledge of Object Store(MinIO) and Spark cluster security, networking connectivity and IO throughput along with other factors that affect distributed system performance
  • Strong working knowledge of disaster recovery, incident management, and security best practices
  • Working knowledge of containers (e.g., docker) and major orchestrators (e.g., Mesos, Kubernetes, Docker Datacenter)
  • Working knowledge of software defined networking
  • Working knowledge of hardening Data at Rest with key based encryption technologies
  • Working knowledge of setting up and customize interactive data analytics tools (e.g., Apache Zeppelin, Jupyter notebooks)
  • Excellent knowledge on building the docker images to provide Containers-as-a-service
  • Working knowledge on Azure Administration, Azure DevOps & Azure Kubernetes Service (AKS)
  • Working knowledge of Pipeline Automation: Azure DevOps (YAML, ARM), Terraform, Jenkins, Chef/Puppet, Ansible
  • Working knowledge of CICD methodologies like Artifactory/Git/Gitops/Jenkins
  • Working knowledge of Code Scanning tools: SonarQube, Checkmarx/Blackduck/Twistlock
  • Working knowledge of Object Storage like S3/MinIO, Bucket policies and administration
  • Working knowledge of Kubernetes Storage protocols
  • Experienced with networking infrastructure including VLAN and firewalls
  • Working knowledge of hardening Kubernetes clusters with network policies like Calico/Tigera, service meshes like Istio, Internal & external load balancers
  • Proven track record with Red Hat Enterprise Linux & Kubernetes administration
  • Proficiency in a high-level language like Python, Go, Ruby and/or Java
  • Solid experience in High Availability and distributed systems, Linux , Data and SAN Storage Networks, NAS and Networking, leveraging tools to instrument and automate proactively and eventually predictive availability solutions
  • Proven track record leading complex enterprise production support efforts adhering to a mix of DevOps & SRE frameworks
  • Experience transitioning platforms to the cloud, with knowledge of cloud frameworks & design patterns, micro-service architectures
  • Extensive Knowledge of networking, including DNS, DHCP, firewalls, load balancers and IP routing
  • Experience in Monitoring tools - Splunk, Zenoss, Elastic, Appdynamics, Dynatrace, Grafana, Promotheus, Kiali etc.
  • Ability to grasp difficult concepts, large architectures, and sophisticated designs quickly and troubleshoot with debugging skills across a variety of integrated platforms
  • Proven capability to provide operational visibility on environment health to Senior Leadership, Technology and Business partners
  • Receptive, approachable teammate, with the ability to positively interact with business partners, technology teams, offshore, and professional services
  • Strong customer advocate with excellent written and verbal communication skills


Education and Experience:
  • 5-7 years of hands-on experience with supporting Linux production environments required
  • 5-7 years of hands-on administration experience on Spark required
  • 3-5 years hands-on experience with scripting with bash, perl, ruby, or python required
  • 3-5 years experience with Docker Datacenter required
  • 2-4 years of hands-on administration experience on Machine learning platforms required
  • Minimum of 1 year of experience in Mesos, Kubernetes, OpenShift and/or Deis or other such container/platform-as-a-service orchestrator required
  • Minimum of 1 year of hands-on experience on CICD tools & Technologies required
  • Minimum of 1 year of lead experience of site reliability engineering team required
  • Hands-on experience in cloud technologies with Microsoft Azure required
  • High School Diploma or equivalent required
  • Bachelor's Degree in related field or equivalent experience required
  • Master's Degree Preferred


What We Offer: Generous benefits package available on day one to include: 401K matching, bonding leave for new parents (12 weeks, 100% paid), training, GM employee auto discount, community service pay and nine company holidays.

Compensation: Competitive salary and bonus eligibility

Work Life Balance: Flexible hybrid work environment, 2-days a week in office

#GMFJobs #LI-Hybrid #LI-KC1

#GMFjobs

About General Motors

General Motors Company engages in the manufacture and sale of cars and trucks in the United States, China, Brazil, Germany, the United Kingdom, Canada, and Italy. It offers sedans, crossovers, sport utility vehicles, pick-up trucks, coupes, sports/convertibles and hybrid vehicles, hatchbacks/wagons, and vans, as well as mini cars in India. The company also provides parts and accessories, such as iPod and MP3 compatibility, mobility accessories, performance parts, AC parts and services, and merchandise. In addition, it offers vehicle safety, security, and information services. The company provides used vehicles. It offers its products through dealers and distributors. General Motors Company was formerly known as NGMCO, Inc. and changed its name to General Motors Company in July 2009. The company was incorporated in 2009 and is based in Detroit, Michigan. It operates manufacturing facilities in India, the United States, and Canada. General Motors Company operates as a subsidiary of the United States Department of The Treasury. General Motors led global vehicle sales for 77 consecutive years from 1931 through 2007, longer than any other automaker, and is currently among the world's largest automakers by vehicle unit sales. General Motors acts in most countries outside the USA via wholly-owned subsidiaries but operates in China through 10 joint ventures. GM's OnStar subsidiary provides vehicle safety, security, and information services. In 2009, General Motors shed several brands, closing Saturn, Pontiac, and Hummer, and emerged from a government-backed Chapter 11 reorganization. In 2010, GM made an initial public offering IPOs to date and returned to profitability later that year.

General Motors Careers

Join the dynamic team at General Motors, a global leader in automotive innovation and technology. At General Motors, we offer unparalleled job opportunities that propel your career forward while contributing to a legacy of engineering excellence.

Work You’ll Do

Embark on a career with General Motors to drive the future of mobility. Our team is dedicated to redefining the automotive landscape through innovation and leadership in electric vehicles and sustainable solutions. By joining us, you will be part of a culture that values diversity, teamwork, and continuous professional growth.

Transform Your Career

General Motors is not just a company; it's a community where you can grow your skills alongside the best in the industry. Our leadership is committed to providing every employee—from interns to senior professionals—with opportunities for career advancement, leadership development, and diversity training.

Innovate and Lead

At General Motors, innovation is at the core of everything we do. From research and development to manufacturing, our teams work collaboratively to lead the industry with cutting-edge technologies and sustainable practices. We encourage our employees to think big and push the boundaries of what’s possible.

Join Our Global Team

As part of our global workforce, you will collaborate with talented individuals who are passionate about shaping the future of transportation. General Motors offers a variety of career paths in engineering, design, IT, marketing, and more. With over 155,000 employees worldwide, our network provides expansive opportunities for networking and professional development.

Internship Programs and Employment Benefits

Start your career journey with a General Motors internship, where you can apply your academic knowledge to real-world projects. Our internships provide a robust foundation in the automotive industry, with mentorship from experienced leaders. Full-time employees enjoy a wealth of benefits, including comprehensive health care, retirement plans, and performance bonuses, ensuring that your hard work is rewarded.

Explore Job Opportunities

Whether you’re a seasoned professional or a recent graduate, General Motors offers positions that leverage your unique skills. Our hiring process is designed to identify and nurture talent, focusing on aligning your capabilities with the right opportunities for growth within the company.

Stay Connected

Join Our Team Search open positions that match your skills and interests. At General Motors, we look for innovative, driven, and solution-oriented team players. Explore the possibilities that await you in a career at General Motors.

Keep Up to Date

Stay ahead with career tips, insider perspectives, and industry-leading insights you can put to use today—all from the people who drive success at General Motors.

Job Alert Emails

Customize your subscription to receive job alerts, latest news, and insider tips tailored to your preferences. Discover the exciting and rewarding career opportunities available at General Motors. Embark on a journey of growth, innovation, and leadership at General Motors. Shape your future in an environment that fosters diversity, learning, and the pursuit of excellence. Join us and redefine the roads of tomorrow.
Learn more about General Motors
Size
157,000 employees
Market Cap
$46.9 billion
Industry
Net Income
$6.4 billion
Founded
1908
5 Year Trend
-3.2%
Revenue
$122.4 billion
NASDAQ

Similar Jobs

More Jobs at General Motors

More Enterprise Technology Jobs

Find similar Lead Site Reliability Engineer jobs: