Wealthsimple

Staff Software Developer, Production Engineering

Wealthsimple • $120K — $150K *
US-AnywhereRemote in Canada
Information Technology
8 - 10 years of experience
Job Overview by Ladders

Qualifications

  • 8+ years of experience in software engineering, particularly in platform, infrastructure, or SRE roles
  • Proven success in enhancing reliability at scale through reducing incidents and establishing operational standards
  • Strong background in backend systems and distributed architectures with the ability to diagnose complex failures
  • Experience in load testing and capacity planning with capability to implement engineering improvements based on findings
  • Demonstrated ability to influence cross-functional engineering teams and promote best practices without direct authority
  • Familiarity with Kubernetes, Helm, and modern deployment tools
  • Excellent written and verbal communication skills for presenting ideas to both technical teams and leadership

Responsibilities

  • Design and implement platform improvements to minimize incidents through guardrails and engineering standards
  • Develop tools to expedite incident mitigation, including AI-assisted incident response products
  • Lead investigations on load test outcomes and translate insights into reliability enhancements
  • Collaborate as a technical influencer across engineering teams in architecture reviews and coaching service owners
  • Identify and implement platform-level solutions for repeating failure patterns
  • Contribute to syncing reliability efforts with product engineering to align on incident themes and critical risks

Benefits

  • Comprehensive health benefits and life insurance
  • Long-term group savings plan with employer matching through Wealthsimple for Business
  • Generous leave policy: 20 vacation days, 4 wellness days, unlimited sick and mental health days annually
  • Opportunity to work remotely outside Canada for up to 90 days per year
  • Active employee resource groups supporting diversity and inclusion in the workplace
  • Hybrid work environment fostering collaboration with talented and driven colleagues across North America
Full Job Description
About the team

We build the products and infrastructure that millions of Canadians trust with their financial lives. Data & Engineering at Wealthsimple spans everything from the client-facing apps to the systems running underneath them - and we hold ourselves to a high bar on both. We move fast, but we build thoughtfully: quality, security, and scalability aren't trade-offs here, they're the standard.

The Production Engineering team sits within Platform Experience, on the boundary between Platform and Product. Our mandate is to raise reliability across Wealthsimple's most critical flows - reducing incidents, helping service teams ship safely, and turning individual fixes into platform-wide improvements. We measure ourselves against two targets: 99.9% uptime on critical flows, and fewer than 1% of weekly active users experiencing errors in the app. If you want to work on hard problems with people who care deeply about craft, you'll fit right in.

About the role

This is a new role - one that doesn't yet exist at Wealthsimple - and it's a meaningful one. As a Staff Software Developer on Production Engineering, you'll bring senior technical leadership to the work of making Wealthsimple more reliable at scale. You'll work across platform and product teams, identify the highest-leverage reliability problems, and build solutions that don't just fix the immediate issue but raise the floor for everyone. This isn't a role where you sit in one corner of the codebase. It's a role where you shape how engineering gets done across the company.

What you'll do
  • Improve the platform to prevent incidents - designing and driving adoption of guardrails, sensible defaults, and engineering standards that reduce the likelihood of failures across services
  • Build tooling that reduces time to mitigation when incidents occur, including contributing to our in-house product on AI-assisted incident response
  • Own the investigation and follow-through on load test findings - translating results into concrete reliability improvements across critical flows
  • Work across platform and product engineering teams as a technical influencer - participating in architecture and readiness reviews, coaching service owners, and driving adoption of scalable reliability practices
  • Identify recurring failure patterns and design platform-level fixes that prevent them from showing up again in a different service
  • Contribute to the team's reliability syncs with product engineering, helping align on incident themes, critical-flow risks, and the next highest-leverage initiatives


Skills you bring
  • 8+ years of software engineering experience, with significant time in platform, infrastructure, or SRE work
  • Demonstrated track record of improving reliability at scale - reducing incidents, building guardrails, or driving operational standards across multiple teams
  • Strong proficiency in backend systems and distributed architecture; you can diagnose complex failure modes across a service mesh
  • Experience with load testing and capacity planning, and the ability to translate findings into concrete engineering improvements
  • Proven ability to work across engineering teams as a technical influencer - driving adoption of standards and practices without direct authority
  • Familiarity with Kubernetes, Helm, Argo and modern deployment tooling
  • Strong written and verbal communication - comfortable presenting findings and recommendations to both engineering teams and senior leadership


Who you are
  • You think in systems - you're not looking for the fix, you're looking for what caused the problem and how to make sure it doesn't happen elsewhere
  • You're comfortable working without direct authority; you build credibility through the quality of your thinking and the clarity of your recommendations
  • You hold a high bar for operational excellence without making it someone else's problem to catch up to - you bring people along
  • You're energised by ambiguity, not slowed down by it; you know how to prioritise when everything feels urgent
  • You're curious about where AI-assisted tooling is headed in reliability engineering, and you want to help shape how we use it - not just observe it from a distance


🌸 Top-tier health benefits and life insurance

Long-term group savings with employer match, through Wealthsimple for Business

20 vacation days, 4 wellness days, and unlimited sick and mental health days per year

90 days away: work outside Canada for up to 90 days per year

Employee resource groups, including Rainbow (2SLGBTQ), Women of WS, and Black at WS

We are a hybrid team with over 1,500 employees across North America. The people are one of the best parts of working here: you'll collaborate with incredibly talented, curious, and driven teammates who are deeply committed to doing great work.

About Wealthsimple

Wealthsimple is a financial services company that provides online investment management and trading services. The company's platform allows users to invest in a variety of financial products, including stocks, bonds, and exchange-traded funds (ETFs), and offers a range of tools and resources to help users manage their investments. Wealthsimple also offers a high-interest savings account and a tax preparation service. The company was founded in 2014 and is headquartered in Toronto, Canada.
Learn more about Wealthsimple
Size
500 employees
Industry
Founded
2014

Similar Jobs

More Jobs at Wealthsimple

More Information Technology Jobs

Find similar Staff Software Developer, Production Engineering jobs: