Senior Software Engineer, AI Infra

RoboForce

$130K — $180K *
Consumer Technology
5 - 7 years of experience
Job Overview by Ladders

Qualifications

  • 5+ years of software engineering experience in AI or robotics.
  • Bachelor's or Master's degree in Computer Science or a related field.
  • Proficient in C++, Python, and ML frameworks like PyTorch or JAX.
  • Deep experience with at least one major cloud provider (GCP, AWS, Azure).
  • Strong understanding of SQL and NoSQL databases.

Responsibilities

  • Build and maintain distributed training pipelines using PyTorch, JAX, or similar.
  • Develop tools for data collection and deployment on RoboForce robots.
  • Design cloud-native GPU infrastructure in multi-cloud setups.
  • Create high-throughput data workflows for dataset management.
  • Optimize training performance, focusing on CPU-GPU interactions and I/O pipelines.
  • Integrate training components into on-robot inference systems.

Benefits

  • Competitive stock options and equity programs.
  • Comprehensive health, dental, and vision insurance.
  • 401(k) plan.
  • Visa sponsorship and green card assistance for eligible candidates.
  • Free lunches and dinners, a fully stocked kitchen, plus team-building activities.

Full Job Description

Why RoboForce

RoboForce is an AI robotics company building Physical AI and Robo-Labor systems for dull, dirty, and dangerous work. Our flagship robot, TITAN, is a super humanoid robot designed for industrial environments. We are based in Milpitas, CA, and require 5 days/week of in-office collaboration.

We are looking for a Senior Software Engineer to build scalable AI infrastructure. In this role, you will architect and implement the core training infrastructure that enables large-scale model training, validation workflows, and production deployment for physical robots. You will work across cloud systems, GPU clusters, data pipelines, and robotics runtime environments to create a high-performance platform for Physical AI.

Responsibilities
  • Build and maintain distributed training pipelines leveraging PyTorch, JAX, or equivalent frameworks across multi-GPU and multi-node clusters.
  • Build tools for data collection, training, and deployment of neural networks on RoboForce robots.
  • Architect robust cloud-native and on-prem GPU infrastructure across multi-cloud environments.
  • Build high-throughput data workflows to support large-scale dataset ingestion, versioning, and distributed storage.
  • Optimize end-to-end training performance: CPU-GPU transfers, NVMe caching, I/O pipelines, containerized runtime environments, and CUDA-level optimizations.
  • Integrate training artifacts into on-robot inference stacks.

Requirements
  • Bachelor's or Master's degree in Computer Science or a related field, with 5+ years of experience.
  • Strong proficiency with C++, Python, and ML frameworks (e.g., PyTorch, JAX).
  • Deep experience with at least one major cloud provider (GCP, AWS, Azure) and familiarity with Kubernetes, containerization, and GPU machine provisioning.
  • Strong understanding of SQL and NoSQL data stores (Postgres, MySQL, BigQuery, Elasticsearch, Redis).
  • Willingness to work in the office 5 days/week, collaborating with the team.

Bonus Qualifications
  • Expertise in profiling and optimizing CPU-GPU interactions.
  • Experience scaling neural network training jobs and GPU programming with CUDA.
  • Proven ability to develop annotation and dataset management tools.

Benefits
  • Competitive stock options/equity programs.
  • Health, dental, and vision insurance, 401(k) plan.
  • Visa sponsorship and green card support for qualified candidates.
  • Lunches and dinners, a fully stocked kitchen, and regular team-building events.
