Software Engineer, Machine Learning Platform

VXI Global Solutions • $187K — $259K *

San Francisco, CA 94112 • In-Person

Information Technology

5 - 7 years of experience

2 weeks ago

Job Overview by Ladders

Qualifications

5+ years of experience in ML infrastructure or production ML systems
Knowledge of ML model development lifecycle
Experience with distributed systems and cloud computing
Strong foundation in computer science principles
Hands-on experience with CI/CD pipelines and DevOps practices
Proficient in programming languages such as Python, Go, or Java
Familiarity with infrastructure-as-code tools like Terraform

Responsibilities

Design and operate scalable ML infrastructure on AWS
Develop distributed training and batch processing systems using Ray
Build and maintain infrastructure-as-code using Terraform
Support and evolve the feature store and feature pipelines
Develop data ingestion and streaming systems using technologies such as Kafka or Spark
Improve CI/CD workflows for ML models
Enhance observability and reliability across ML workloads

Benefits

In-office work policy with flexibility for remote work
Subsidized commuter benefits and in-office perks
401k match along with comprehensive medical, dental, and vision coverage
Generous vacation policy and company-wide paid days off
Annual wellness stipend for wellness-related expenses
Up to 24 weeks of paid parental leave
Access to fertility benefits and family planning tools

Full Job Description

About the role

Chime's Machine Learning Platform (MLP) team builds and operates the infrastructure, tooling, and developer experience that powers machine learning across the company. We enable data scientists and ML engineers to develop, train, deploy, and monitor models reliably and efficiently.

As a Machine Learning Platform Engineer, you will design and build scalable systems that support model training, feature computation, real-time inference, and experimentation. You'll work at the intersection of distributed systems, cloud infrastructure, and applied machine learning.

This role focuses on building robust foundations that allow ML teams to move quickly while maintaining reliability, governance, and cost efficiency.

The base salary offered for this role and level of experience begins at $187,000.00 and goes up to $259,000.00. Full-time employees are also eligible for a bonus, competitive equity package, and benefits. The actual base salary offered may be higher, depending on your location, skills, qualifications, and experience.

In this role, you can expect to

Design, build, and operate scalable ML infrastructure on AWS
Develop distributed training and batch processing systems using Ray
Build and maintain infrastructure-as-code using Terraform
Support and evolve the feature store and feature pipelines
Develop data ingestion and streaming systems (e.g., Kinesis, Kafka, Flink, Spark, or similar technologies)
Improve CI/CD workflows for ML models and platform components
Enhance observability, reliability, and cost visibility across ML workloads
Partner closely with Data Science and ML Engineering teams to improve developer experience
Contribute to platform architecture decisions and technical roadmaps
Participate in on-call rotations to support production systems

To thrive in this role, you have

5+ years of experience in ML infrastructure, platform engineering, or production ML systems
Knowledge of the machine learning model development lifecycle, including data preprocessing, model training, evaluation, and deployment
Experience with distributed systems, cloud computing, or large-scale data processing
Strong foundation in computer science and software engineering principles
Deep interest in the impact and evolution of advanced AI technologies
Hands-on experience with CI/CD pipelines, DevOps practices, and infrastructure as code
Experience with containerization and orchestration technologies such as Docker and Kubernetes
Knowledge of cloud platforms such as AWS and distributed computing frameworks such as Spark and Ray
Experience with GPU programming (CUDA) and GPU cost optimization
Strong programming skills in Python, Go, Scala, Java, or similar languages
Familiarity with infrastructure-as-code (e.g., Terraform, CloudFormation)
Solid understanding of software engineering fundamentals (testing, version control, code review, observability)

Nice-to-have

Experience with distributed compute frameworks such as Ray
Experience building or operating a feature store
Experience with real-time ML systems or model serving
Familiarity with streaming technologies (Kafka, Kinesis, Flink, Spark Streaming, etc.)
Experience supporting ML lifecycle workflows (training, evaluation, deployment, monitoring)
Knowledge of ML experimentation platforms and model governance practices

What we offer for our full-time, regular employees

Our in-office work policy is designed to keep you connected, with four days a week in the office and Fridays from home for those near one of our offices, plus team and company-wide events depending on location. Whether you're coming in regularly or are part of our fully remote program, you'll stay engaged with your work and teammates.
In-office perks including backup child, elder, and/or pet care, plus a subsidized commuter benefit to support your regular commute
Competitive salary based on experience
401k match plus great medical, dental, vision, life, and disability benefits
Generous vacation policy and Chime Days, our bonus company-wide paid days off
1% of your time off to support local community organizations of your choice
Annual wellness stipend to use towards eligible wellness-related expenses
Up to 24 weeks of paid parental leave for birthing parents and 12 weeks of paid parental leave for non-birthing parents
Access to Maven, a family planning tool, with $15k lifetime reimbursement for egg freezing, fertility treatments, adoption, and more.
In-person and virtual events to connect with your fellow Chimers: think cooking classes, guided meditations, music festivals, mixology classes, paint nights, etc., and delicious snack boxes, too!
A challenging and fulfilling opportunity to join one of the most experienced teams in FinTech and help millions unlock financial progress

We know that great work can't be done without a diverse team and inclusive environment. That's why we specifically look for individuals of varying strengths, skills, backgrounds, and ideas to join our team. We believe this gives us a competitive advantage to better serve our members and helps us all grow as Chimers and individuals.

About VXI Global Solutions

VXI Global Solutions is a business process outsourcing company that provides customer care, technical support, and back-office services to its clients. The company was founded in 1998 and has since grown to have over 30,000 employees across 42 locations worldwide. VXI Global Solutions prides itself on its ability to provide high-quality customer service and support to its clients, which range from small startups to Fortune 500 companies. The company's services are designed to help its clients improve customer satisfaction, reduce costs, and increase revenue.

Size

30,000 employees

Industry

Business Services

Founded

1998

* Ladders Estimates
