Senior Big Data Engineer

Burlingame, CA

Industry: Media

5 - 7 years

Posted 24 days ago by Pete Locke

GREEN CARD OR CITIZENSHIP REQUIRED

ENTERTAINMENT MEDIA FIRM IS SEEKING BIG DATA DEVELOPERS IN BURLINGAME AND CHATSWORTH, CALIFORNIA

BENEFITS

Access to concerts, sporting events, and movies; comprehensive medical, dental, and vision programs; a fully stocked kitchen; several catered meals weekly; annual profit sharing based on growth and employee contribution; 30 days of vacation after one full-time year; and nine holidays.

WHAT YOU'LL DO:

  • Software development and design in Python
  • Hands-on experience with Apache Spark / Hadoop
  • Deep expertise in SQL and ETL batch processing
  • Proficiency in Unix/Linux (CentOS)
  • Hands-on experience with AWS: S3 / EMR / Data Pipeline / Lambda

WHAT YOU'LL LEARN:

You will use your analytical skills, enthusiasm for telling stories through data and knowledge of economics and statistics to build brand and credibility. You will work closely with the rest of the research team to respond to data requests, conduct analysis on digital trends, and build data that drives insights and results.

CRITERIA

Ideal candidates are self-starters who love a challenge and can work in a fast-paced, deadline-driven environment. The Engineering team works directly with the Managing Director for Data Mining & Analytics and the rest of the Data Science team to develop solutions.

  • 5 years of relevant experience
  • Bachelor's degree in computer science, data science, computer engineering, or a related field from an accredited university; master's degree a plus
  • Proven experience building and optimizing data pipelines, architectures, and datasets from both structured and unstructured sources
  • Demonstrated ability to build processes that support data transformation, structures, metadata, accuracy checking, and workload management
  • Experience and proficiency in the following tools and technologies:
      • Command-line tools and Git version control
      • Parallel computing (e.g., Hadoop, Kafka, Spark, Hive) and multithreading
      • Cloud-based services (Google Cloud, Amazon Web Services)
      • Relational SQL and NoSQL databases; graph database (e.g., Neo4j) experience a plus
      • Programming languages (e.g., Python, R, Java, C++, C#, Ruby, JavaScript)
      • Front-end web development or visualization experience a plus (e.g., Tableau, D3.js, Leaflet, R Shiny)
      • Systems Development Life Cycle ("SDLC") best practices
      • Interfacing with application programming interfaces ("APIs"); building interfaces a plus
      • Specific ETL tools/schedulers (e.g., Alteryx, Dask, Airflow, Luigi) a plus
      • Algorithms and data structures a plus
  • Organized, self-directed, and resourceful with the ability to appropriately prioritize work in a fast-paced environment
  • Excellent communication and visualization skills, with the ability to synthesize disparate data and information into a strong narrative
  • Team player with an outgoing personality and a high level of integrity

Salary

$120K - $150K