Data Engineer in Santa Clara, CA

Trianz   •  

Santa Clara, CA 95050

Industry: Business Services

  •  

Less than 5 years

Posted 56 days ago

Trianz is a global professional services firm committed to enabling leaders to develop and execute operational strategies, leverage new business and technology paradigms, and achieve results expected by senior management in their organizations- predictably.

What We Stand For

Our clients are transforming their businesses, competitive strategies, product and service portfolios, customer-partner-employee interactions and their ecosystem. The cost of misses is not financial alone but a lost window of opportunity. So getting things right the first time is absolutely critical.

As a result, Trianz is focusing on three important themes in our engagement model with clients.

Crystallize business impact from a top management point of view

Help Clients achieve results from strategy-by making execution predictable through innovative execution techniques

Create a positive, enriching partnership experience in everything we do

Industries, Clients & Practices

Trianz works with clients across High Technology, Banking, Insurance, Manufacturing, Retail, Telecom, e-businesses and Public Services. Most clients are Fortune 1000 organizations and our relationships are sponsored by senior leaders in Enterprise Analytics Sales, Finance, Marketing, Human Resources, Operations and Information Technology. We partner with our clients to address the following key service areas:

Cloud

Analytics

Digitization

Infrastructure

Security

Job Description

Overview

Data is the way our clients make decisions. It is the core to their business, helping create an experience for customers and providing insights into the effectiveness of our product launch & features.


As a Data Engineer , you will be a part of an early stage team that builds the data pipelines, collection, and storage, and exposes services that make data a first-class citizen. We are looking for a Data Engineer to build a scalable data platform. You'll have ownership of core data pipelines that powers top line metrics; You will also use data expertise to help evolve data models in several components of the data stack; You will help architect, building, and launching scalable data pipelines to support growing data processing and analytics needs. Your efforts will allow access to business and user behavior insights, using huge amounts of data to fuel several teams such as Analytics, Data Science, Marketplace and many others.


Responsibilities

  • Owner of the core company data pipeline, responsible for scaling up data processing flow to meet the rapid data growth
  • Evolve data model and data schema based on business and engineering needs
  • Implement systems tracking data quality and consistency
  • Develop tools supporting self-service data pipeline management (ETL)
  • SQL and MapReduce job tuning to improve data processing performance

Experience

  • 3+ years of relevant professional experience
  • Experience with Hadoop (or similar) Ecosystem (MapReduce, Yarn, HDFS, Hive, Spark, Presto, Pig, HBase, Parquet)
  • Proficient in at least one of the SQL languages (MySQL, PostgreSQL, SqlServer, Oracle)
  • Good understanding of SQL Engine and able to conduct advanced performance tuning
  • Strong skills in scripting language (Python, Ruby, Bash)
  • 1+ years of experience with workflow management tools (Airflow, Oozie, Azkaban, UC4)
  • Comfortable working directly with data analytics to bridge Lyft's business goals with data engineering


Valid Through: 2019-11-11