Staff Engineer, Data Platform

Lila Sciences

$192K - $272K
Enterprise Technology
8 - 10 years of experience
Job Overview by Ladders

Qualifications

  • Bachelor's or Master's degree in Computer Science, Engineering, or related field
  • 8+ years as a software or data engineer focused on data infrastructure
  • Experience designing and shipping data platform components from the ground up
  • Proficient in Python and SQL; writes production-quality code
  • Familiarity with relational and NoSQL databases, schema design, and operational concerns
  • Cross-functional collaboration experience with scientists, ML researchers, and engineers
  • Hands-on experience with cloud infrastructure and containerized deployment (AWS, Kubernetes)

Responsibilities

  • Design and evolve core data infrastructure for scientific and ML workflows
  • Build reliable data pipelines from diverse sources
  • Operate and extend workflow orchestration systems for scientific pipelines
  • Define and maintain data models and schema strategies
  • Partner with ML researchers and lab scientists to translate requirements into capabilities
  • Establish coding, review, and design standards for the team
  • Mentor engineers and lead design reviews to raise technical standards

Benefits

  • Comprehensive medical, dental, and vision coverage
  • Employer-paid life and disability insurance
  • Flexible time off with generous holidays
  • Paid parental leave and educational assistance program
  • Commuter benefits including bike share memberships
  • Company subsidized lunch program

Full Job Description

We are looking for a Staff Engineer to set the technical direction for our core data infrastructure: ingestion frameworks, storage architecture, orchestration patterns, and the interfaces that let scientists and ML researchers work with data reliably at scale. You will work closely with software engineers, machine learning researchers, and lab scientists to understand requirements and translate them into durable platform capabilities.

This is a role for engineers who care deeply about how data systems are designed. You will establish the architectural patterns and engineering standards the broader team builds on, mentor engineers across the data platform group, and make technical decisions that compound over time.

What You'll Be Building

  • Data Platform Architecture: Design and evolve the core data infrastructure that ingests, stores, and serves data across scientific and ML workflows. Make principled build-vs-buy decisions and establish architectural patterns adopted by the broader engineering organization.
  • Ingestion and Integration: Build reliable pipelines that bring in data from diverse sources: laboratory instruments, public scientific datasets, and external research literature. Own the interfaces between upstream producers and downstream consumers.
  • Orchestration and Reliability: Operate and extend workflow orchestration systems that run complex, multi-step scientific pipelines. Ensure observability, fault tolerance, and reproducibility across the data stack.
  • Data Modeling and Schema Strategy: Define and maintain data models, schema evolution practices, and data contracts that ensure consistency, discoverability, and long-term durability of scientific and platform data assets.
  • Cross-Functional Technical Leadership: Partner with ML researchers, lab scientists, and product engineers to translate scientific and research requirements into platform capabilities. Drive alignment on data standards and integration patterns across teams.
  • Engineering Standards and Mentorship: Establish coding, review, and design standards for the data platform team. Mentor engineers, lead design reviews, and raise the technical bar across the group.

What You'll Need to Succeed

  • Bachelor's or Master's degree in Computer Science, Engineering, or a related field, and 8+ years as a software or data engineer with a focus on building and operating data infrastructure.
  • Designed and shipped data platform components from the ground up, including ingestion frameworks, storage abstractions, and orchestration systems. Fluent in Python and SQL and writes production-quality code.
  • Production experience with relational and NoSQL databases, schema design, query optimization, and operational concerns at scale. Comfortable working across structured, semi-structured, and unstructured data.
  • Proven track record of working cross-functionally with scientists, ML researchers, and engineers. Able to translate domain requirements into platform decisions and explain technical trade-offs to diverse audiences.
  • Experience with cloud infrastructure and containerized deployment (AWS, Kubernetes).
  • Hands-on experience with modern table formats and open lakehouse patterns (Iceberg, Delta Lake, Hudi).

Bonus Points For

  • Experience with workflow orchestration systems (Flyte, Airflow, Dagster, or similar).
  • Experience building data infrastructure that serves agentic and LLM-driven workflows, including vector databases, RAG infrastructure, and retrieval-optimized data access patterns.
  • Background in scientific computing, life sciences, or research software.
  • Proficiency with AI-assisted development tools (Cursor, Claude Code, or similar) and ability to incorporate them effectively into day-to-day engineering work.

Compensation

We offer competitive base compensation with bonus potential and generous early-stage equity. Your final offer will reflect your background, expertise, and expected impact.

U.S. Benefits. Full-time U.S. employees receive a comprehensive benefits program including medical, dental, and vision coverage; employer-paid life and disability insurance; flexible time off with generous company wide holidays; paid parental leave; an educational assistance program; commuter benefits, including bike share memberships for office based employees; and a company subsidized lunch program.

International Benefits. Full-time employees outside the U.S. receive a comprehensive benefits program tailored to their region. USD salary ranges apply only to U.S.-based positions; international salaries are set to local market.

Expected Base Salary Range: $192,000-$272,000 USD
