Engineering Manager, Reliability Platform

DoorDash

$193K - $285K
Information Technology
5 - 7 years of experience
Job Overview by Ladders

Qualifications

  • 5+ years leading teams of high caliber engineers
  • Proven experience in infrastructure, platform, or backend engineering
  • Platform mindset with a focus on products/platforms/customers
  • Strong influencer through conversations and presentations
  • Commitment to establishing new best practices
  • Broad understanding of AWS primitives and containerization
  • Experience with SLOs and incident response
  • Embraces AI tools for productivity

Responsibilities

  • Design and operate services and infrastructure on the Reliability Platform team
  • Collaborate with Infrastructure and Product teams on mission-critical systems
  • Recruit and retain top engineering talent
  • Manage team performance with regular coaching and feedback
  • Establish project rituals that maximize productivity
  • Align planning and culture with European counterparts
  • Oversee budget for Cloud Provider Infra and third-party vendor spend

Benefits

  • 401(k) plan with employer matching
  • 16 weeks of paid parental leave
  • Wellness benefits
  • Paid time off and paid sick leave
  • Medical, dental, and vision benefits
  • Disability and basic life insurance
  • Family-forming assistance
  • Mental health program
Full Job Description
About the Role

On the Reliability Platform team, you'll help design, build, and operate services and infrastructure that deliver on the team's broad mandate. This team has a unique opportunity for breadth, often in collaboration with expert peers across the Infrastructure and Product teams. Depending on need and interest, you may be working on mission-critical back-end services or pipelines, complex orchestration workflows, self-service UI, or AI Agent continuous improvements.

We have fully embraced the use of AI tools in everything we do, and we believe in the incredible potential this provides while remaining pragmatic enough to ensure the critical infrastructure we maintain cannot be compromised. Our goal is to deliver innovative next-generation capabilities, and to make the data in our custody available to others pursuing the same.

A few examples of efforts the team has owned in recent years:
  • Delivered a framework to capture, alert on, and report SLO quality across tens of thousands of endpoints, ensuring all teams are accountable for the quality of their delivered services
  • Replaced our escalation management tools, including alignment with our internal Asset/Team Catalog to allow automated alert routing and cross-brand alignment
  • Delivered an MCP back-end for Reliability Platform data/tools, and enabled the same for peer teams across the Core Infrastructure organization
  • Designed and delivered orchestration tools to enable self-service provisioning of critical infrastructure (Kafka topics, Databases, CPU/GPU Pools, Service Scaffolding, etc.)
  • Built a PoC for internal SRE AI agentic tooling that leverages internal MCPs and domain-specific profiles to facilitate troubleshooting and Q&A capabilities, replacing FAQs/Runbooks
  • Delivered per-pod real-time key-value configuration tooling, enabling runtime feature flag management from a central source of truth across the fleet (100K+ pods)

As the leader of this team, you will take an active role across the organization to:
  • Recruit, hire, and retain world-class engineering talent, and continuously level up the team's capabilities and outcomes
  • Manage team performance, including ongoing and annual assessment in addition to regular 1:1 alignment, coaching, and feedback
  • Establish rituals and expectations for project execution that maximize productivity while minimizing overhead/meetings/administrative work
  • Align with European counterparts to forge a shared global culture and planning aligned with a shared mission
  • Own global processes/policies for incident response, communications, and reporting. We enable our colleagues, but do not own the response itself.
  • Manage the team's budget for Cloud Provider Infra and 3rd party vendor spend within the team's mandate, including forecasting

We are proud of our engineering culture, and many of our greatest successes are born from an individual with an idea spending some time hacking out a rudimentary demonstrable prototype. The mandate of this team is ripe for individuals with this creative pioneering mindset, and the ability to execute.
You're excited about this opportunity because you will...
  • Deliver Innovative Capabilities: You don't want to just 'turn the crank' somewhere; you want to contribute to frontier thinking and help us push the industry forward
  • Build Great Infrastructure: You know great infrastructure often goes unnoticed by design. You are content knowing your efforts allow you to claim a portion of everyone's success.
  • Balance Practical and Possible: Sometimes our pragmatic perspective is needed to maintain a high quality service; your experience will support finding the right risk balance
  • Be Customer Obsessed: We want to learn from our customers to ensure we are solving the right challenges, and also share our perspective to influence in areas of expertise
  • Automate Everything: Well... not everything... but if your first instinct is to ask how this toil could be automated or better yet avoided then you're on the right team
  • Shape the Future of Operations: Experiment with agentic, AI-assisted workflows that can propose, validate, and safely execute production changes - moving DoorDash toward proactive, self-healing systems in step with industry first movers.
We're excited about you because you have...
  • Leading Teams: You have 5+ years leading teams of high-caliber engineers, providing structure and rituals that enable the team to thrive
  • Proven Experience: You have 5+ years of experience in an infrastructure, platform, or backend engineering role, showing you can deliver and maintain complex systems through a team or as an individual contributor.
  • Platform Mindset: You think in terms of products/platforms/customers while designing systems that other engineers depend on every day.
  • Influence: You are comfortable influencing others via conversations, presentations, demonstrations, and policies.
  • Consistency: Your influence and leadership seek outcomes that become the new best practice, applied consistently across the organization.
  • Cloud/Infra Fundamentals: You're comfortable discussing a broad range of infrastructure topics, including AWS primitives, security best practices, containerization, and Infrastructure as Code.
  • SRE Experience: You understand concepts like SLOs, error budgets, and incident response, though this is a platform development team, not an SRE/on-call team.
  • AI Alignment: You embrace the use of AI tools to be a more productive Engineering Manager, and instill the same mindset in your team.
  • Curiosity About the Future: You're excited about automation and agentic, AI-assisted operations and want to help shape how engineers interact with production systems.


Compensation

The successful candidate's starting pay will fall within the pay range listed below and is determined based on job-related factors including, but not limited to, skills, experience, qualifications, work location, and market conditions. Base salary is localized according to an employee's work location. Ranges are market-dependent and may be modified in the future.

In addition to base salary, the compensation for this role includes opportunities for equity grants. Talk to your recruiter for more information.

DoorDash cares about you and your overall well-being. That's why we offer a comprehensive benefits package to all regular employees, which includes a 401(k) plan with employer matching, 16 weeks of paid parental leave, wellness benefits, commuter benefits match, and paid time off and paid sick leave in compliance with applicable laws (e.g., Colorado Healthy Families and Workplaces Act). DoorDash also offers medical, dental, and vision benefits, 11 paid holidays, disability and basic life insurance, family-forming assistance, and a mental health program, among others.

To learn more about our benefits, visit our careers page.

See below for paid time off details:
  • For salaried roles: flexible paid time off/vacation, plus 80 hours of paid sick time per year.
  • For hourly roles: vacation accrued at about 1 hour for every 25.97 hours worked (e.g. about 6.7 hours/month if working 40 hours/week; about 3.4 hours/month if working 20 hours/week), and paid sick time accrued at 1 hour for every 30 hours worked (e.g. about 5.8 hours/month if working 40 hours/week; about 2.9 hours/month if working 20 hours/week).


The national base pay range for this position within the United States, including Illinois and Colorado:

$193,800-$285,000 USD
