Senior Data Engineers

CCC Intelligent Solutions, Inc.

$108K to $125K
Information Technology
Less than 5 years of experience
Job Overview by Ladders

Qualifications

  • Master's degree in Computer Science, Computer Engineering, or related field
  • 2 years of software development/data processing experience
  • Hands-on experience with Python & PySpark
  • Proficiency in Hadoop, AWS EMR, and Redshift
  • Strong SQL skills for data profiling and validation
  • Experience with Apache Kafka and Airflow

Responsibilities

  • Develop large-scale end-to-end data pipeline applications
  • Create data flows to extract and store ingested data
  • Automate software tests for data flow components
  • Guide junior team members in application performance tuning
  • Perform root cause analysis and identify improvement opportunities
  • Mentor junior engineers to enhance their skillset

Benefits

  • Telecommuting permitted
  • Opportunities for professional development
  • Collaborative work environment
  • Access to advanced data technologies
  • Exposure to a variety of projects across different worksites

Full Job Description
The Role

Key Responsibilities:

Senior Data Engineers for various and unanticipated worksites throughout the U.S. (HQ: Chicago, IL).

Develop large-scale, end-to-end data pipeline applications covering multiple data sources spread across the data center and the AWS cloud. Use developed software applications to locate and analyze source data; create data flows to extract, profile, and store ingested data; define and build data cleansing and imputation; map to a common data model; transform to satisfy business rules and statistical computations; and validate data content. Produce software data building blocks, data models, and data flows, such as dimensional data, data feeds, dashboard reporting, and data science research and exploration. Produce automated software tests of data flow components and of data content quality. Automate orchestration and error handling for use by production operation teams. Provide technical expertise to diagnose errors reported by production support teams. Guide junior team members in tuning the performance of applications in distributed computing environments. Perform root cause analysis on all data and processes and identify opportunities for improvement. Develop metadata-driven and fully parameterized data processing tools. Mentor junior engineers.

Technical Environment: Programming using Python & PySpark; Hadoop; HDFS, MapReduce, YARN, AWS EMR, Redshift, Terraform; Hive, HBase, Parquet, ORC, Spark SQL, Sqoop, Apache Hudi; Orchestrating ETL pipelines involving data sourcing, transformations & publishing using Apache Airflow; Performance tuning of applications in distributed computing environments; Designing & developing data pipeline applications with Apache Kafka; Advanced SQL for data profiling & data validation; Unix commands & scripting; Performing root cause analysis on internal & external data & processes to identify opportunities for improvement; JIRA, GitLab, Subversion; Development of metadata-driven & fully parameterized data processing tools; AWS.

Requirements:

Master's degree in Computer Science, Computer Engineering, Management Information Systems, or a related field, plus 2 years of experience in software development, data processing, or analysis required.

Required skills: Hands-on experience with: Programming using Python & PySpark; Hadoop; HDFS, MapReduce, YARN, AWS EMR, Redshift, Terraform; Hive, HBase, Parquet, ORC, Spark SQL, Sqoop, Apache Hudi; Orchestrating ETL pipelines involving data sourcing, transformations & publishing using Apache Airflow; Performance tuning of applications in distributed computing environments; Designing & developing data pipeline applications with Apache Kafka; Advanced SQL for data profiling & data validation; Unix commands & scripting; Performing root cause analysis on internal & external data & processes to identify opportunities for improvement; JIRA, GitLab, Subversion; Development of metadata-driven & fully parameterized data processing tools; AWS.

Telecommuting permitted.

Salary:

$108,077/yr. to $125,282/yr., plus benefits: www.cccis.com/careers
