Walmart

Senior Site Reliability Engineer

$112K to $180K *
Information Technology
Less than 5 years of experience
Job Overview by Ladders

Qualifications

  • Master's or Bachelor's degree in relevant field with related experience
  • Experience in site reliability engineering or infrastructure management
  • Proficiency with Kubernetes, including helm charts
  • Familiarity with AWS services for server and enterprise management
  • Strong skills in CI/CD pipeline construction including GitHub Actions and CodePipeline
  • Experience in database management (both RDBMS and non-RDBMS)
  • Ability to write unit and integration tests, and proficiency in scripting with Bash, Python, and TypeScript.

Responsibilities

  • Assist in creating modular and functional designs adhering to requirements
  • Evaluate trade-offs in multi-component product designs
  • Convert high-level designs into detailed functional specifications
  • Design and create minimum viable products (MVPs) to clarify requirements
  • Automate infrastructure coding tasks
  • Maintain documentation for program development and revisions
  • Monitor site reliability and recommend improvements based on data analysis.

Benefits

  • Comprehensive health coverage including medical, vision, and dental
  • 401(k) plans with company contributions
  • Company-paid life insurance and performance incentives
  • Paid time off for various needs including family care and bereavement
  • Short- and long-term disability options
  • Education assistance with full coverage for college degrees
  • Company discounts and support for military service.
Full Job Description
What you'll do...

Position: Senior Site Reliability Engineer

Job Location: 14901 Quorum Drive, Dallas, TX 75254

Duties: Assist in creating simple, modular, extensible, and functional design for the product/solution in adherence to the requirements. Evaluate trade-offs while designing across multiple components in a product based on business requirements. Convert high-level design (HLD) into detailed design using mock screens, pseudocode, and detailed functional logic for specific modules and components of a product/system. Understand nuances of designing for disaster recovery. Design and create a minimum viable product (MVP) to clarify requirements and design and to uncover risks. Refine the MVP design for early defects and revised customer requirements.

Undertake infrastructure coding automation. Adhere to all relevant coding guidelines while writing/configuring code. Create/configure minimalistic (less complex, highly robust, and high quality) code for a component/module under guidance. Maintain records by documenting program development and revisions. Stay updated on the prevalent coding languages and frameworks in the industry outside the immediate scope of delivery. Identify repetitive and routine tasks in Continuous Integration/Continuous Delivery (CI/CD), testing, or any other process that can be automated. Implement telemetry features as required under guidance. Apply security policy requirements to component/module during code development/configuration.

Detect and document defects, bugs, and errors for assigned component/module and conduct analysis to determine the sources under guidance. Troubleshoot performance and availability bottlenecks for assigned application under guidance. Work with business partners to identify and document critical applications. Interpret and follow procedures in contingency plans. Explain the contingency and disaster recovery plans for assigned environment. Execute established procedures necessary to continue operations in an emergency. Participate in the design of a minimum operating environment for a computer-based facility.

Utilize established criteria (for example, probability of failure, frequency of failure) to measure site reliability. Monitor site reliability conditions and new reliability requirements. Assist in the design and development of a reliability program plan for a specific site environment. Apply appropriate tools, services, or applications for reliability prediction and other site improvements. Research and assess various reliability models for different site environments. Suggest metrics to monitor software or system performance. Monitor current performance data to ensure compliance with defined SLOs for multiple applications/systems. Determine thresholds for monitoring metrics and trigger alerts based on thresholds. Help with specific procedures to proactively check the health of applications and infrastructure, including a variety of operating systems, hardware, and software. Make recommendations regarding situational awareness and alerting. Make recommendations regarding instrumentation gaps and alerting logic, including a variety of operating systems, hardware, and software.

Minimum education and experience required: Master's degree or the equivalent in Computer Science, Computer Engineering, Computer Information Systems, Software Engineering, Electrical Engineering, or related area and 1 year of experience in site reliability engineering, site and system administration, infrastructure management, or related area; OR Bachelor's degree or the equivalent in Computer Science, Computer Engineering, Computer Information Systems, Software Engineering, Electrical Engineering, or related area and 3 years of experience in site reliability engineering, site and system administration, infrastructure management, or related area.

Skills required: Experience with the management and orchestration of Kubernetes clusters with Helm charts. Experience with networking solutions including VPN systems, firewall technologies, and storage systems. Experience building scalable monitoring and observability systems using CloudWatch, PRTG, Grafana, and PagerDuty. Experience with server management in AWS with orchestration tools, including Ansible, Puppet, and Terraform. Experience managing DNS and SSL certificates in AWS. Experience managing enterprise workloads in an AWS infrastructure. Experience building CI/CD pipelines using GitHub Actions, CodeBuild, CodePipeline, and CircleCI. Experience managing RDBMS including PostgreSQL and MSSQL Server and non-RDBMS including Redshift and MongoDB. Experience writing unit and integration tests. Experience with tool development, including scripting with Bash and high-level languages: Python and TypeScript. Employer will accept any amount of experience with the required skills.

Salary Range: $112,923/year to $180,000/year. Additional compensation includes annual or quarterly performance incentives.

Benefits: At Walmart, we offer competitive pay as well as performance-based incentive awards and other great benefits for a happier mind, body, and wallet. Health benefits include medical, vision and dental coverage. Financial benefits include 401(k), stock purchase and company-paid life insurance. Paid time off benefits include PTO (including sick leave), parental leave, family care leave, bereavement, jury duty and voting. Other benefits include short-term and long-term disability, education assistance with 100% company paid college degrees, company discounts, military service pay, adoption expense reimbursement, and more.

Eligibility requirements apply to some benefits and may depend on your job classification and length of employment. Benefits are subject to change and may be subject to a specific plan or program terms. For information about benefits and eligibility, see One.Walmart.com.

About Walmart

WalmartLabs is accelerating its development to redefine the shopping experience and meet the changing needs of our customers wherever they are: in a store, on our website, or on their mobile device.

Walmart Careers

Joining Walmart means becoming part of a world-renowned team that leads with innovation and is committed to creating impact. As the largest retailer globally, Walmart offers unparalleled job opportunities and career growth in an environment that values diversity and leadership.

Work You’ll Do

At Walmart, you will be part of a dynamic team that drives our mission to help people save money so they can live better. Engage in work that matters with a company that offers both stability and the flexibility to explore different career paths. Transform the retail landscape with your skills and help shape the future of millions of customers worldwide. Walmart is at the forefront of combining retail with advanced technology, making it an exciting place for professional growth and innovation.

Lead with Us

Step into a role that harnesses your potential and places you at the intersection of retail and technology. Walmart is not just a company; it's a community where you can develop your leadership skills and contribute to a culture that nurtures professional growth. Work with a diverse team of experts who bring a wealth of knowledge and experience to the table. Our commitment to diversity training ensures that all team members are valued and can thrive.

Walmart Careers and Employment Opportunities

We are continuously expanding our team to include enthusiastic professionals eager to drive change and make a significant impact. Explore a range of positions from entry-level to executive, each offering competitive benefits and the opportunity to advance.

Innovate with Us

Join Walmart and be part of a team that is dedicated to innovation and excellence. With over 2.3 million associates worldwide, you are joining the largest private employment group, ready to innovate, lead, and impact the global market.

Internship Programs

Kickstart your career with a Walmart internship. Gain invaluable experience, build your resume, and develop networking connections that will empower your career journey. Our internships provide a platform to apply your academic knowledge in real-world scenarios, preparing you for future employment.

Be Part of a Great Team

At Walmart, our team is our strength. We invest in our employees through robust training programs, leadership development, and opportunities for career advancement. Enjoy the benefits of working in a supportive and inclusive environment where every member’s contribution is valued.

Future-Proof Your Career

Your journey at Walmart can be as vast as your ambitions. With endless opportunities to grow, learn, and lead, you can take your professional experience to new heights. Benefit from our comprehensive training programs and develop the skills needed for tomorrow’s challenges.

Stay Connected

Join Our Team

Search open positions that match your skills and interests. We are looking for passionate, curious, creative, and solution-driven team players. Explore the diverse job opportunities at Walmart and find where you can make a difference.

Keep Up to Date

Stay ahead with career tips, insider perspectives, and industry-leading insights you can put to use today, all from the people who work here.

Job Alert Emails

Personalize your subscription to receive job alerts, latest news, and insider tips tailored to your preferences. Discover the exciting and rewarding opportunities that await at Walmart.

Join Walmart today and be part of a story of growth, innovation, and leadership.
Learn more about Walmart
Size
2.3 million employees
Market Cap
$387 billion
Industry
Net Income
$13.5 billion
Founded
1962
5 Year Trend
+3.3%
Revenue
$559.1 billion
NASDAQ
