Do you want to be part of the transformation that's driving forward a truly data-driven company? As a Data Engineer you will work closely with Data Scientists and Machine Learning Developers to build the Big Data foundation that enables advanced AI and machine learning capabilities for strategically important initiatives.
You will be part of an innovative, highly collaborative team involved in data munging and integration, developing machine learning models, building simulation and forecasting tools, and operationalizing solutions on top of the Big Data platform. This position requires excellent technical skills, a strong desire to learn, good communication skills, attention to detail, and the ability to self-manage. You will get great exposure as you work directly with a collaborative team to tackle tough business challenges.
As part of the Data Science team, you will help build the infrastructure required for optimal extraction, transformation, and loading of data from a wide variety of data sources in our data lake using Spark and Big Data technologies. You will help design and build data stores that integrate various data sources, suitable for development and deployment of important data science models.
You will also create and maintain optimal data pipelines that feed machine learning services in production, which provide real-time, data-driven decisions for key business functions in the company.
You will identify, design, and implement internal process improvements: automating manual processes, optimizing data delivery, and redesigning infrastructure for greater scalability and usability. You will recommend technical solutions and work with the appropriate teams to implement them.
You will have an opportunity to work with cutting-edge researchers in advanced analytics and help build an extensive graph database scaled for millions of entities and relationships. You will then optimize the graph database for real-time scoring of machine learning models that will support a mission-critical application.
- 3+ years of experience in IT, with at least one year of experience in ETL/ELT methods, Java or Scala programming, and Spark and Spark SQL
- Experience in data modeling (both transactional and dimensional), using tools such as ERWin
- Exposure to Spark Streaming and SparkML
- Knowledge of NoSQL databases (such as HBase or Cassandra) and graph/network analysis
- Most importantly, you have the curiosity, passion, and ability to quickly learn, adapt, and implement open source technologies
About GEICO
For more than 75 years, GEICO has stood out from the rest of the insurance industry! We are one of the nation's largest and fastest-growing auto insurers thanks to our low rates, outstanding service, and clever marketing. We're an industry leader employing thousands of dedicated and hard-working associates. As a wholly owned subsidiary of Berkshire Hathaway, we offer associates training and career advancement in a financially stable and rewarding workplace.