Bachelor's degree in Computer Science or related field.
7+ years of experience in backend engineering and cloud infrastructure.
Strong grasp of distributed systems and their principles.
Hands-on experience with distributed databases and storage systems.
Proficient in programming languages like Go, Java, or Python.
Experience with Ansible for automation in operational workflows.
Solid knowledge of Linux systems and networking fundamentals.
Responsibilities
Design and evolve cloud-native infrastructure storage platforms.
Build scalable storage capabilities for data management.
Integrate open-source technologies for enterprise storage solutions.
Automate workflows using Terraform, Ansible, Go, and Python.
Develop reusable components for managing distributed data systems.
Collaborate with DBA teams to understand storage and performance needs.
Evaluate architecture trade-offs for distributed solutions.
Benefits
Free snacks and drinks.
Fully paid medical, dental, and vision insurance.
401(k) contributions.
Bi-annual reviews and annual pay increases.
Health and wellness benefits, including free gym membership.
Quarterly team-building events.
Full Job Description
KEY RESPONSIBILITIES
Design, build, and continuously evolve cloud-native infrastructure storage platforms that support distributed databases, middleware systems, and large-scale data services.
Design scalable and reliable storage platform capabilities, including data placement, sharding, replication, indexing, compaction, compression, validation, backup, and recovery frameworks.
Build infrastructure storage solutions by integrating and extending open-source database, storage, and middleware technologies to meet enterprise-grade platform requirements.
Drive automation for infrastructure provisioning, configuration management, CI/CD pipelines, observability, and operational workflows using Terraform, Ansible, Go, and Python. This includes authoring Ansible playbooks and roles that orchestrate database operational tasks across large fleets of distributed database nodes, covering provisioning, configuration, rolling upgrades, patching, backup/restore, and recovery.
Develop reusable platform components and standardized storage capabilities that enable application teams and database teams to efficiently manage large-scale distributed data systems.
Work closely with DBA teams to understand database platform requirements, storage-layer constraints, performance bottlenecks, and data governance needs, and translate them into scalable infrastructure solutions.
Partner with DBAs to design and improve underlying database capabilities, including storage format optimization, indexing strategy, data partitioning, compaction policy, backup architecture, and recovery design.
Evaluate architecture trade-offs across performance, scalability, availability, consistency, cost, and operational complexity for distributed storage and database platforms.
Design and implement performance optimization solutions across storage engines, system I/O, network communication, resource scheduling, and distributed data-processing paths.
Build observability, diagnostics, and automation capabilities as part of the platform design to improve system transparency, troubleshooting efficiency, and engineering productivity.
Collaborate with application engineering, DBA, SRE, architecture, and platform teams to standardize storage infrastructure capabilities and support the long-term evolution of the company's database and storage platforms.
Requirements
REQUIRED QUALIFICATIONS
Bachelor's degree or above in Computer Science, Software Engineering, Computer Engineering, or a related field.
7+ years of experience in backend engineering, cloud infrastructure, distributed systems, database systems, storage infrastructure, or related areas.
Solid understanding of distributed system principles, including CAP, BASE, replication, sharding, load balancing, failure detection, consistency models, and fault tolerance.
Hands-on experience designing or building distributed databases, storage systems, middleware platforms, message queues, or large-scale data-processing systems.
Strong understanding of storage engine or database internals, including B+-tree, LSM-tree, indexing, compaction, compression, data validation, and storage optimization.
Experience building platform-level capabilities or reusable infrastructure components for database, storage, or data-processing systems.
Proficiency in at least one programming language such as Go, Java, C++, Python, or Rust.
Hands-on experience with Ansible (playbooks, roles, inventory management, idempotent task design) and using it to automate database or distributed-system operational workflows at scale.
Solid knowledge of Linux systems, networking fundamentals, file systems, I/O performance, and distributed system troubleshooting.
Ability to work closely with DBA teams and translate database requirements into infrastructure platform design and technical solutions.
Strong problem-solving skills with the ability to analyze complex performance, scalability, and reliability challenges in distributed systems.
Strong communication and cross-functional collaboration skills.
PREFERRED QUALIFICATIONS
Experience with cloud-native infrastructure, Kubernetes, containerized platforms, or cloud storage services.
Experience with open-source systems such as Cassandra, Kafka, Redis, MySQL, VictoriaMetrics, Loki, ELK, or similar technologies.
Experience designing systems with high QPS, low latency, large-scale storage, or strict availability requirements.