Roblox's data infrastructure processes petabytes of data daily, powering analytics, ML, and product decisions. As a
Senior Software Engineer in our
Data Infra org, you will design, build, and scale the distributed data infrastructure platforms that power Roblox. You will own and drive the next-generation architecture of our core platforms, which span Kafka, Flink, Spark, Trino, Druid, Airflow and Data Catalog. This role combines high ambiguity and ownership to push the boundaries of what our infrastructure can handle at massive scale, giving you the unique opportunity to steer the evolution of the data landscape.
You Will:- Own and Scale Core Platform Components: Take responsibility for the design, architecture, and implementation of 1-2 key data platform frameworks within our stack
- Collaborate and Align: Partner with infra, data science, and product engineering teams to ensure your target platform's capabilities are directly guided by platform governance and product requirements.
- Optimize Performance at Scale: Dive deep into engine internals, query planning, state management, memory optimization, serialization efficiency to maximize throughput and reliability under heavy load.
- Drive Infrastructure Robustness: Lead the design, testing, and operational lifecycle of next-generation infrastructure features running on Kubernetes across cloud environments.
- Agentic Interface: Embed AI/ML capabilities within our data platforms, leveraging LLMs for data discovery and generation, building autonomous, self-serve mechanisms for platform interaction layer.
You Have:- 5+ years of experience building, designing, testing and maintaining production-grade, large-scale distributed systems.
- Data Platform Depth: Deep technical experience building or strong familiarity with at least 1 or 2 foundational technologies within our stack: Kafka, Flink, Spark, Trino, Druid, Airflow, or Data Catalog/Metadata systems.
- Strong Engineering Foundations: Robust proficiency in Java, Go, or Scala, with a track record of writing clean, highly performant backend code.
- Cloud Fluency: Experience operating and troubleshooting data infrastructure at scale on top of Kubernetes in AWS or GCP.
- Technical Leadership: Demonstrated ability to influence technical direction across teams, mentor engineers, and drive alignment on complex cross-cutting initiatives.
- B.S. equivalent in CS or sufficient experience.
Nice to Have:- Contributions to open-source projects in the data infrastructure ecosystem.
- Experience operating infrastructure at consumer-internet scale (100M+ users).
For roles that are based at our headquarters in San Mateo, CA: The starting base pay for this position is as shown below. The actual base pay is dependent upon a variety of job-related factors such as professional background, training, work experience, location, business needs and market demand. Therefore, in some circumstances, the actual salary could fall outside of this expected range. This pay range is subject to change and may be modified in the future. All full-time employees are also eligible for equity compensation and for benefits as described on
this page.
Annual Salary Range
$243,290-$295,250 USD
Roles that are based in an office are onsite Tuesday, Wednesday, and Thursday, with optional presence on Monday and Friday (unless otherwise noted).