To get the best candidate experience, please consider applying for a maximum of 3 roles within 12 months to ensure you are not duplicating efforts.
Job Category
Software Engineering
Job Details
Job Details:
Salesforce hosts web services and applications written by thousands of internal developers and tens of thousands of customers to provide the largest business automation cloud in the world. The underlying infrastructure that enables this innovation and value is evolving to fully adopt lights-out operations, single-click deploy to tens of thousands of nodes, and services that self-heal and self-optimize.
Join a team building the Service Mesh and Ingress Gateway load balancing and proxy platform.
The Microservices Platform Service Mesh team is building a highly scalable and distributed load balancing and gateway service to front all customer traffic coming into Salesforce. We provide simple declarative interfaces for L4/L7 load balancing, TLS termination, end-to-end encryption, along with support for richer traffic policies such as blue/green deployments, access control, etc. The team owns the application networking layer and enables secure, resilient, observable communication.
We are looking for people who can drive the design and implementation of the next generation Ingress Gateway data plane and control plane. We intend to transform our current software stack to adopt more cloud native primitives to build a more reliable, scalable, and feature-rich service mesh. Our software stack is based on leading edge open source software like Envoy for data plane and Istio for control plane. The opportunities to enhance the capabilities of the OSS software and contribute back to the community are immense. The team has already made active upstream contributions to the Envoy project. We intend to transform the way our north-south traffic is secured, load balanced and proxied before entering our core service mesh. We are looking to add experienced distributed systems engineers who are passionate, hungry for new challenges, can step up and own large areas of that vision.
Some attributes of successful candidates:
* Passion for service ownership, building reliable, and self-healing services.
* Proven ability to work in complex team environments and deliver under pressure with dependency constraints.
* Expertise in building large scale distributed systems, especially in public cloud environments (i.e. AWS, GCP).
* Skilled in balancing live-site management, feature delivery, and retirement of technical debt.
* Familiarity with crash-only and recovery-oriented software design.
* Proficient in designing, developing, debugging, and operating resilient distributed systems that run across thousands of compute nodes in multiple data centers.
* Capable of driving and delivering thin slices of end-to-end functionality on a regular cadence with data-driven feedback loops.
Requirements:
* 10+ years of experience with strong infrastructure background.
* Define and drive the technical strategy for infrastructure platforms, including compute, networking, storage, and observability systems that serve as the
foundation for all engineering teams 6 owning architecture decisions, setting multi-year roadmaps, and ensuring systems scale reliably to meet business growth.
* Raise the engineering bar across the organization by establishing paved-path patterns for reliability, performance, and operational excellence 6 mentoring senior engineers, leading design reviews, and driving cross-team alignment on infrastructure standards that reduce toil and accelerate delivery.
* Experience with Istio/Envoy is a huge plus.
* Proficiency with Golang, Java and/or C++ in a Linux/UNIX data center environment.
* Experience in operating large scale cluster management systems (e.g. Kubernetes) of a mission critical service.
* Strong knowledge of network technologies, such as TCP/IP, DNS, TLS termination, HTTP proxies, network load balancing, etc.
* Experience with cloud infrastructure automation tools, frameworks, workflows, and validation platforms.
* Working knowledge of CI/CD, configuration management and Infrastructure as Code principles (e.g. Spinnaker, Terraform).
* Experience with Agile development methodology (e.g. Scrum) and Test-Driven Development, with attention to code quality, and delivering secure code.
* Participation in the teams on-call rotation to address complex problems in real-time and keep services operational and highly available.
* Experience in using telemetry and metrics to drive operational excellence.
At Salesforce, we believe in equitable compensation practices that reflect the dynamic nature of labor markets across various regions.
The typical base salary range for this position is $197,300 - $313,700 annually. In select cities within the San Francisco and New York City metropolitan area, the base salary range for this role is $237,700 - $344,700 annually.
The range represents base salary only, and does not include company bonus, incentive for sales roles, equity or benefits, as applicable.