The HubSpot Data Infrastructure HBase team is responsible for all of HubSpot's HBase databases and Hadoop infrastructure. We aim to provide 99.99% reliability for our HBase clusters, patch our HBase distribution, and provide HubSpot developers with libraries that make HBase easier to use and monitor.
We spend a lot of time building automation, tooling, and monitoring that equip developers to understand and optimize their datastore usage and that minimize the impact of operational issues on customers and developers alike. For example, we’ve built database clients that make it easier for HubSpot teams to monitor and analyze usage patterns, and we are working to contribute these changes back to the community to better support and monitor HBase in a multi-tenant environment.
We’re also operationally responsible for a huge volume of traffic to and from these datastores. Our HBase clusters serve over 3 million requests/second across 220+ tables, while our Elasticsearch clusters serve over 20k searches/second and 50k indexes/second across 90+ billion documents. Streaming that data to and from applications amounts to more than 3 GB/sec of data through our Kafka clusters, with hundreds of producers and consumers.
What we’re looking for
Confidence can sometimes hold us back from applying for a job. But we'll let you in on a secret: there's no such thing as the 'perfect' candidate. HubSpot is a place where everyone can grow. So however you identify and whatever background you bring with you, please apply if this is a role that would make you excited to come into work every day.
HubSpot helps millions of organizations grow better, and we’d love to grow better with you. Our business builds the software and systems that power the world’s small to medium-sized businesses. Our company culture builds connections, careers, and employee growth. How? By creating a workplace that values flexibility, autonomy, and transparency. If that sounds like something you’d like to be part of, we’d love to hear from you.