Data Engineer

Robot.com

$120K — $150K *
Information Technology
Less than 5 years of experience
Job Overview by Ladders

Qualifications

  • Strong proficiency in Python for data engineering and scripting.
  • Extensive experience with SQL and relational databases (PostgreSQL preferred).
  • Proven expertise with Google Cloud Platform (GCP) services, especially BigQuery.
  • Experience designing, building, and maintaining robust ETL/ELT data pipelines.
  • Familiarity with data orchestration tools (e.g., Apache Airflow).
  • Experience with real-time data processing technologies (e.g., Kafka, MQTT).

Responsibilities

  • Design, develop, and maintain scalable ETL/ELT pipelines for data processing.
  • Manage and optimize data warehousing solutions in Google BigQuery.
  • Implement data quality assertions to ensure data integrity.
  • Collaborate with teams to understand data needs and deliver tailored solutions.
  • Contribute to the design evolution of data architecture for scalability and performance.
  • Develop monitoring solutions for data pipeline health and reporting accuracy.

Benefits

  • Dynamic and innovative work environment at a leading robotics company.
  • Opportunities for professional growth and mentorship.
  • Collaborative culture that encourages continuous learning.
  • Hybrid work model providing flexibility for remote and onsite work.
Full Job Description
The Opportunity:

We are seeking a highly skilled and motivated Data Engineer to join our dynamic Data Team. This critical role will be instrumental in designing, developing, and maintaining our robust data architecture and pipelines, ensuring data quality, and providing essential data support for our AI and Robotics initiatives. The ideal candidate will be an end-to-end data professional, capable of leading complex projects, collaborating with cross-functional teams, and upholding the highest standards of data governance and security.

Key Responsibilities:
  • Data Pipeline Development & Management:
    • Design, develop, and maintain scalable, reliable, and efficient ETL/ELT pipelines for batch and real-time data processing (e.g., MQTT/Kafka data ingestion).
    • Manage and optimize our data warehousing solutions, primarily Google BigQuery, ensuring efficient data storage, querying, and cost-effectiveness.
    • Implement and maintain data quality assertions across all data pipelines to ensure data integrity from source to consumption.
    • Develop and integrate new data sources into our existing data ecosystem.
    • Troubleshoot and resolve data pipeline issues, ensuring minimal disruption to data availability.
  • Data Support:
    • Collaborate closely with company teams to understand their data needs and develop tailored data solutions.
    • Design and implement data workflows to support machine learning workflows
    • Contribute to the development of data-driven insights that improve robot autonomy and performance.
  • Data Architecture & Infrastructure:
    • Contribute to the design and evolution of our overall data architecture, ensuring scalability, performance, and maintainability.
    • Implement and adhere to best practices for data modeling, schema design, and data governance.
    • Work with cloud infrastructure (GCP preferred) to deploy and manage data services.
    • Knowledge of spatial data wrangling and best practices for warehousing and consumption.
  • Monitoring, Reporting & Dashboards:
    • Develop and maintain monitoring solutions for data pipeline health and performance.
    • Ensure data consistency and accuracy in reporting tools.
  • Team Collaboration & Leadership:
    • Collaborate effectively with cross-functional teams to gather requirements and deliver data solutions.
    • Mentor junior team members and contribute to a culture of continuous learning and knowledge sharing within the data team.
    • Take ownership of projects from conception to deployment, ensuring timely and high-quality deliverables.

Technical Skills & Qualifications:
  • Required:
    • Strong proficiency in Python for data engineering and scripting.
    • Extensive experience with SQL and relational databases (PostgreSQL preferred).
    • Proven expertise with Google Cloud Platform (GCP) services, especially BigQuery, Cloud Storage,, Cloud Functions.
    • Experience designing, building, and maintaining robust ETL/ELT data pipelines.
    • Familiarity with data orchestration tools (e.g., Apache Airflow,).
    • Experience with real-time data processing technologies (e.g., Kafka, MQTT).
    • Understanding of data modeling techniques (e.g., dimensional modeling, Kimball).
    • Familiarity with version control systems (Git/GitHub).


  • Plus:
    • Exposure to AI/ML data pipelines and MLOps principles.
    • Knowledge of AI Agents for internal product development.
    • Familiarity with containerization technologies (Docker, Kubernetes).

Exposure to companies KPIs and OKRs among other performance metrics.

Department Robot.com Role Data & Analytics Locations San Francisco Remote status Hybrid

Similar Jobs

More Jobs at Robot.com

  • Data Engineer
    $120K — $150K *
    San Francisco, CA 94112 (San Francisco County)
    Information Technology
    Hybrid
  • Performance Marketing Analyst
    $80K — $120K *
    San Francisco, CA 94112 (San Francisco County)
    Business Services
    Hybrid
  • Senior Accountant
    $90K — $120K *
    San Francisco, CA 94112 (San Francisco County)
    Legal & Accounting
    Hybrid
  • Human Resources Analyst
    $80K — $120K *
    San Francisco, CA 94112 (San Francisco County)
    Business Services
    Hybrid
  • Industrial/Product Designer
    $90K — $120K *
    San Francisco, CA 94112 (San Francisco County)
    Manufacturing & Automotive
    Hybrid

More Information Technology Jobs

Find similar Data Engineer jobs: