We are looking for a Data Platform Engineer to build the foundational data systems that power our robotics and machine learning development. In this role, you will design and implement the infrastructure for collecting, storing, processing, and transforming the vast amounts of data generated by our robots-from sensor telemetry and video streams to operational logs and performance metrics.
You'll work closely with ML teams to ensure data is accessible, well-structured, and ready for training. Your work will enable research teams to iterate faster and operations teams to monitor fleet performance as we scale.
Key job responsibilities
- Design and build scalable data pipelines for ingesting and processing robotics data (sensor streams, video, telemetry, logs)
- Develop and maintain data storage solutions optimized for diverse data types and access patterns
- Create tools and APIs for researchers and engineers to efficiently query and analyze large datasets
- Build real-time data processing systems for monitoring robot fleet performance
- Build and maintain data transformation pipelines that prepare robotics data for ML training
- Collaborate with ML and robotics teams to ensure data platforms meet their evolving needs
BASIC QUALIFICATIONS
- Bachelor's degree or above in computer science, computer engineering, or related field, or experience in data science, machine learning or data mining
- 3+ years of data engineering experience
- Experience in scripting for automation (e.g. Python) and advanced SQL skills.
- Experience in Kafka, or experience in Hive/Spark/Hbase/Yarn and experience in software development
- Experience with cloud computing technologies
- Knowledge of distributed systems as it pertains to data storage and computing
- Proficiency with data storage technologies (e.g., PostgreSQL, object storage)
PREFERRED QUALIFICATIONS
- Experience working with robotics or IoT data (time-series, video, point clouds)
- Knowledge of streaming architectures and real-time analytics
- Familiarity with ML techniques and how data preparation impacts model training
- Experience with data cataloging, metadata management, and data discovery tools
The base salary range for this position is listed below. Your Amazon package will include sign-on payments and restricted stock units (RSUs). Final compensation will be determined based on factors including experience, qualifications, and location. Amazon also offers comprehensive benefits including health insurance (medical, dental, vision, prescription, Basic Life & AD&D insurance and option for Supplemental life plans, EAP, Mental Health Support, Medical Advice Line, Flexible Spending Accounts, Adoption and Surrogacy Reimbursement coverage), 401(k) matching, paid time off, and parental leave. Learn more about our benefits at https://amazon.jobs/en/benefits.
USA, NY, New York - 145,300.00 - 196,600.00 USD annually