Senior Staff Software Engineer ML Platform

Stack AV

$120K — $160K *
Transportation
5 - 7 years of experience
Job Overview by Ladders

Qualifications

  • 6+ years of experience in multimodal data indexing and inference pipelines.
  • Demonstrated experience building semantic search services for video/images and vector databases.
  • Proven ability to develop large-scale ML pipelines and optimize models.
  • Experience in collaborating effectively across teams, particularly within fast-paced environments.
  • Deep understanding of design trade-offs with the ability to communicate effectively.
  • Prior involvement in autonomous vehicles and ML applications is a strong plus.
  • Experience with data processing pipelines and model optimization.

Responsibilities

  • Build advanced multimodal data mining and search solutions for AV development.
  • Design and implement a data understanding platform for real-time and batch processing.
  • Deliver comprehensive data mining solutions across both onboard and offboard systems.
  • Create real-time semantic search services for various data types and optimize database access.
  • Identify and enhance issues within existing ML infrastructure to boost performance.
  • Construct efficient batch and stream processing pipelines.
  • Lead technical discussions across organizations and ensure timely solution delivery.

Benefits

  • Innovative work environment focused on cutting-edge AI technology.
  • Opportunity to collaborate with experts in autonomous systems.
  • Access to a culture that prioritizes diversity and inclusion.
  • Potential for professional growth within a rapidly evolving tech field.
  • Exposure to state-of-the-art tools and methodologies.
Full Job Description
About Stack:

Stack is developing revolutionary AI and advanced autonomous systems designed to enhance safety, reliability, and efficiency of modern operations. Stack's autonomous technology incorporates cutting-edge advancements in artificial intelligence, robotics, machine learning, and cloud technologies, empowering us to create innovative solutions that address the needs and challenges of the dynamic trucking transportation industry. With decades of experience creating and deploying real world systems for demanding environments, the Stack team is dedicated to developing an autonomous solution ecosystem tailored to the trucking industry's unique demands.

About the Role:

In the ML Data Understanding team, our mission is to provide trusted and useful data to efficiently power all of Stack's ML applications end-to-end from mining to training to safety evaluation. We work hand in hand with AV autonomy teams to provide cutting edge solutions to all their data needs, working across data engineering, mining, modeling and infrastructure. In particular, we provide services to find (data mining), curate (datasets), annotate (data labeling), search and serve (high throughput data access) data for all ML needs.
  • Data Mining: We are building a framework and infrastructure to find interesting events quickly and flexibly. As part of this mission, you would be setting the direction for and helping us build an inference service using LLMs, open-world models and vector databases.
  • Semantic Search for Data Mining: We are building the infrastructure of a highly scalable semantic search service for multimodal data to find interesting events quickly and flexibly. As part of this mission, you would be setting the direction for and helping us build an inference service using the latest AI models & approaches.
  • Dataset management for training: We are building state of the art infrastructure to support machine learning training and inference workloads using OSS components such as Ray, Spark, Lance and Iceberg.

Responsibilities:
  • Build state-of-art multimodal data mining and semantic search solutions to power AV product development.
  • Develop data understanding platform infrastructure for real-time querying/vector databases and batch/stream processing using technologies like Ray, Spark, Lance, or similar.
  • Deliver end-to-end data mining solutions that span onboard (C++) and offboard (ML & Data Infra) infrastructure to accelerate AV product development.
  • Develop e2e solution for real-time semantic search services (text/images/videos) and vector DBs.
  • Discover and identify key issues in existing ML infra and proactively improve system performance.
  • Build low latency/high throughput batch or stream processing pipelines.
  • Drive technical discussions across multiple orgs and deliver solutions on a timely basis.
  • Architect and tune ETL pipelines to maximize GPU/CPU/Ram utilization.
  • Write readable and high-performance Python/C++ code.

Qualifications:
  • Experience with both ML platforms and building ML-based applications (modeling experience is a bonus).
  • Proven track record of building scalable, reliable infrastructure in a fast-paced environment.
  • Ability to collaborate effectively across teams.
  • Experience building or using ML infrastructure for a large number of customer teams.
  • Deep understanding of design trade-offs with the ability to articulate those trade-offs and achieve alignment with others.
  • Experience in building ML models or infrastructure in domains such as autonomous vehicles, perception, and decision-making (desirable but not required).
  • Experience with model training, model optimization, or large data processing pipelines.
  • Prior experience in autonomous vehicles (AV) is a plus.
  • 6+ years of experience with:
    • Multimodal data indexing and inference pipelines.
    • Building semantic search service, embedding generation for video/images and vector DB.
    • Large scale ML pipelines (Airflow/Flyte) and model optimization.


We are proud to be an equal opportunity workplace. We believe that diverse teams produce the best ideas and outcomes. We are committed to building a culture of inclusion, entrepreneurship, and innovation across gender, race, age, sexual orientation, religion, disability, and identity.

Check out our Privacy Policy.

Please Note: Pursuant to its business activities and use of technology, Stack AV complies with all applicable U.S. national security laws, regulations, and administrative requirements, which can restrict Stack AV's ability to employ certain persons in certain positions pursuant to a range of national security-related requirements. As such, this position may be contingent upon Stack AV verifying a candidate's residence, U.S. person status, and/or citizenship status. This position may also involve working with software and technologies subject to U.S. export control regulations. Under these regulations, it may be necessary for Stack AV to obtain a U.S. government export license prior to releasing its technologies to certain persons. If Stack AV determines that a candidate's residence, U.S. person status, and/or citizenship status will require a license, prohibit the candidate from working in this position, or otherwise be subject to national security-related restrictions, Stack AV expressly reserves the right to either consider the candidate for a different position that is not subject to such restrictions, on whatever terms and conditions Stack AV shall establish in its sole discretion, or, in the alternative, decline to move forward with the candidate's application.

Similar Jobs

More Jobs at Stack AV

More Transportation Jobs

Find similar Senior Staff Software Engineer ML Platform jobs: