Senior Software Engineer, ML DataSan Francisco, CA • Hybrid • Reports to Head of Vision & AI
About the RoleVoxel's perception system is the technical core of everything we ship. Our models detect human activity, equipment interactions, environmental hazards, and operational state in real time across thousands of cameras in manufacturing, logistics, retail, and pharmaceutical environments. Safety was our wedge; it proved our platform works. Now customers are pulling us into operations: equipment utilization, workflow compliance, process efficiency. Every new use case runs through the perception team.
We're hiring a strong software engineer to own the data infrastructure required to train and evaluate our ML models. You'll work with petabytes of data streaming from thousands of live customer cameras, and build pipelines to ingest, search, label, version and store the highest quality of data required to train state-of-the-art computer vision models. Your work directly shapes how our applied ML team measures and improves model quality. You'll set technical direction, write code, make architecture calls, and partner closely with applied CV, ML infra and Platform engineers.
What You'll Do- Drive the roadmap for the data infrastructure that powers Voxel's vision and ML capabilities.
- Build petabyte-scale pipelines for ingestion, search, labeling, versioning and storage of camera data.
- Develop methods for data mining and automated data collection.
- Partner with applied ML engineers on dataset quality for training and evaluation.
- Collaborate with the Data Ops team (HITL) to design processes to measure and improve data quality.
- Understand the data needs of Vision and AI engineers and design scalable infra solutions that support model improvement and vision capabilities.
- Collaborate with the Platform team to store and retrieve data efficiently on the cloud.
What We're Looking For- 4+ years of experience building and shipping large scale software solutions.
- Working knowledge of ML training and evaluation. Understand what makes a good dataset, how to measure model quality, and how data quality affects model performance.
- Strong Python. Comfortable across the stack: data mining, data labeling, storage and retrieval.
- Track record of owning something end to end: building data products valuable to internal customers.
- Bias toward shipping. You'd rather ship something good this week than something perfect next quarter.
- Strong communication skills.
Nice to Have- Experience with implementing data compliance & data governance solutions.
- Deep understanding of data for computer vision - object detection, tracking, video understanding.
- Familiarity with human in the loop data annotation, auto labeling.
Compensation & Benefits- Equity through Voxel's Equity Incentive Plan
- Total compensation includes base salary, annual bonus, and equity
- Comprehensive health, dental, and vision insurance
- Competitive paid parental leave
- Unlimited PTO and flexible work arrangements
- Daily meals in-office, team events, annual company onsite