Requirements
Bachelor's degree in Computer Science, Software Engineering, or related field.
3+ years of experience in backend engineering and cloud infrastructure.
Understanding of distributed system principles like replication and consistency models.
Experience designing or building distributed databases and storage systems.
Strong knowledge of storage engine internals and optimization techniques.
Proficiency in programming languages such as Go, Java, or Python.
Hands-on experience with Ansible for automating operational workflows.
Responsibilities
Design and evolve cloud-native infrastructure storage platforms for data services.
Create reliable storage capabilities, including replication, sharding, and backup frameworks.
Integrate open-source technologies to meet enterprise storage requirements.
Automate infrastructure provisioning and operational workflows using Terraform and Ansible.
Develop reusable platform components for managing distributed data systems.
Collaborate with DBA teams to optimize storage solutions and address performance issues.
Establish observability and diagnostics to enhance system performance and transparency.
Benefits
Free snacks and drinks
Fully paid medical, dental, and vision insurance for employees
401(k) contributions
Bi-annual reviews with potential for pay increases
Health and wellness benefits including gym membership
Quarterly team-building events
Full Job Description
KEY RESPONSIBILITIES
Design, build, and continuously evolve cloud-native infrastructure storage platforms that support distributed databases, middleware systems, and large-scale data services.
Design scalable and reliable storage platform capabilities, including data placement, sharding, replication, indexing, compaction, compression, validation, backup, and recovery frameworks.
Build infrastructure storage solutions by integrating and extending open-source database, storage, and middleware technologies to meet enterprise-grade platform requirements.
Drive automation for infrastructure provisioning, configuration management, CI/CD pipelines, observability, and operational workflows using Terraform, Ansible, Go, and Python. This includes authoring Ansible playbooks and roles that orchestrate database operational tasks across large fleets of distributed database nodes, covering provisioning, configuration, rolling upgrades, patching, backup/restore, and recovery.
Develop reusable platform components and standardized storage capabilities that enable application teams and database teams to efficiently manage large-scale distributed data systems.
Work closely with DBA teams to understand database platform requirements, storage-layer constraints, performance bottlenecks, and data governance needs, and translate them into scalable infrastructure solutions.
Partner with DBAs to design and improve underlying database capabilities, including storage format optimization, indexing strategy, data partitioning, compaction policy, backup architecture, and recovery design.
Evaluate architecture trade-offs across performance, scalability, availability, consistency, cost, and operational complexity for distributed storage and database platforms.
Design and implement performance optimization solutions across storage engines, system I/O, network communication, resource scheduling, and distributed data-processing paths.
Build observability, diagnostics, and automation capabilities as part of the platform design to improve system transparency, troubleshooting efficiency, and engineering productivity.
Collaborate with application engineering, DBA, SRE, architecture, and platform teams to standardize storage infrastructure capabilities and support the long-term evolution of the company's database and storage platforms.
Requirements
REQUIRED QUALIFICATIONS
Bachelor's degree or above in Computer Science, Software Engineering, Computer Engineering, or a related field.
3+ years of experience in backend engineering, cloud infrastructure, distributed systems, database systems, storage infrastructure, or related areas.
Solid understanding of distributed system principles, including CAP, BASE, replication, sharding, load balancing, failure detection, consistency models, and fault tolerance.
Hands-on experience designing or building distributed databases, storage systems, middleware platforms, message queues, or large-scale data-processing systems.
Strong understanding of storage engine or database internals, including B+ trees, LSM trees, indexing, compaction, compression, data validation, and storage optimization.
Experience building platform-level capabilities or reusable infrastructure components for database, storage, or data-processing systems.
Proficiency in at least one programming language such as Go, Java, C++, Python, or Rust.
Hands-on experience with Ansible (playbooks, roles, inventory management, idempotent task design) and using it to automate database or distributed-system operational workflows at scale.
Solid knowledge of Linux systems, networking fundamentals, file systems, I/O performance, and distributed system troubleshooting.
Ability to work closely with DBA teams and translate database requirements into infrastructure platform design and technical solutions.
Strong problem-solving skills with the ability to analyze complex performance, scalability, and reliability challenges in distributed systems.
Strong communication and cross-functional collaboration skills.
PREFERRED QUALIFICATIONS
Experience with cloud-native infrastructure, Kubernetes, containerized platforms, or cloud storage services.
Experience with open-source systems such as Cassandra, Kafka, Redis, MySQL, VictoriaMetrics, Loki, ELK, or similar technologies.
Experience designing systems with high QPS, low latency, large-scale storage, or strict availability requirements.
Experience with multi-region architecture, cross-site disaster recovery design, active-active architecture, hot/cold standby, or automated failover solutions.
Experience partnering with DBA teams on database platform design, data governance, backup and recovery architecture, or storage-layer optimization.
Experience contributing to architecture reviews, technical design documents, platform standards, or infrastructure solution design.
Benefits
Salary range: TBD
Free snacks and drinks
Fully paid medical, dental, and vision insurance (partial coverage for dependents)
401(k) contributions
Bi-annual reviews and annual pay increases
Health and wellness benefits, including free gym membership