Perficient

Databricks Lead Data Engineer

Perficient$73K — $170K *
Information Technology
8 - 10 years of experience
Job Overview by Ladders

Qualifications

  • 8+ years of data engineering experience with 3 years in a lead role.
  • 5+ years of hands-on Databricks experience with Python, SQL, and Pyspark.
  • Strong background in developing large scale data processing and distribution systems using Databricks.
  • Experience with cloud environments such as AWS and Azure.
  • Proficient in DevOps practices including CI/CD with Databricks and Git.
  • Ability to work in complex multi-vendor environments with various teams.
  • Bachelor's degree in Computer Science or related field.

Responsibilities

  • Lead technical planning and requirements gathering for development.
  • Serve as a mentor and provide technical leadership to the team.
  • Develop and maintain effective working relationships within the team.
  • Work with the data engineering team on data ingestion and transformation processes.
  • Lead design and deployment of scalable data pipelines using Databricks.
  • Enforce best practices in notebook development and job orchestration.
  • Document code and participate in user documentation development.

Benefits

  • Opportunities for professional growth and development.
  • Mentorship from senior engineers and industry leaders.
  • Flexibility to work from various office locations in major cities.
  • Engagement with innovative and cutting-edge technologies in data engineering.
  • Collaborative work environment culture; frequent interaction with data scientists and analysts.
Full Job Description
Job Description

Job Overview:

As a Databricks Lead Data Engineer, you are expected to lead the development team and have strong development and programming skills in Python, SQL, Pyspark, Databricks, Notebook Repositories, Medallion Architecture, CI/CD, and ETL platforms associated with Data Lakehouse solutions. You are expected to be experienced and fluent in agile development and agile tools.

You will work with architects and infrastructure teams to develop, test, deploy and troubleshoot your code as well as provide input into solutions and design of the system. You will collaborate with some of the best talent in the industry to create and implement innovative high-quality solutions focused on our clients' business needs.

From time to time, you will participate in customer discovery sessions, responding to customer proposal requests etc. The ability to report to a Perficient office is required. We have office locations in major cities such as Houston, TX, Dallas, TX, Chicago, IL, Saint Louis, MO, Detroit, MI, Irvine, CA, New York, NY, Minneapolis, MN

Responsibilities

  • Lead the technical planning and requirements gathering.
  • Serve as a technical lead and mentor. Provide technical support and leadership in the development and continual improvement of service.
  • Develop and maintain effective working relationships with team members.
  • Demonstrate the ability to adapt and work with team members of various experience levels.
  • Work with the data engineering team to define and develop data ingestion, validation, transformation, and data engineering code.
  • Lead the design, development, testing and deployment of scalable batch/streaming data pipelines using Databricks. Data jobs include data ingestion, cleansing, transformation, aggregation, governance, and sharing.
  • Define and enforce best practices for Databricks usage, including notebook development, job orchestration, metadata-driven frameworks, data quality checks, and optimization of spark job performance.
  • Develop Databricks platform components using Python, SQL, Pyspark, Notebook Repositories (medallion architecture L2, L3, L4), and CI/CD.
  • Integrate Databricks solutions with cloud platforms like Azure (ADF, ADLS, Event Hub, Azure DevOps) or AWS (S3, Lamda, Glue).
  • Design and implement analytical data models and star schemas for BI and AI usage.
  • Collaborate with data scientists and analysts to meet data requirements.
  • Implement logging, monitoring and exception handling for data workflows.
  • Participate in code reviews and CI/CD pipeline development.
  • Document code artifacts and participate in developing user documentation and run books
  • Troubleshoot deployment to various environments and provide test support.
  • Participate in design sessions, demos, and prototype sessions, testing and training workshops with business users and IT associates.


Qualifications

  • Passionate developer with 8+ years of data engineering experience, with at least 3 years in a lead or senior data engineer role.
  • Minimum 5 years of hands-on Databricks experience using Python, SQL, Pyspark, Notebook Repositories (medallion architecture L2, L3, L4), CI/CD.
  • Minimum 5 years of Databricks experience in developing large scale data processing/data storage/data distribution systems.
  • Experience with AWS and/or Azure cloud environment
  • DevOps experience with Databricks CI/CD (Databricks Asset Bundles) and Git
  • Minimum 5 years of working with large multi-vendor environment with multiple teams as a part of a project
  • Minimum 5 years of working with a complex Big Data/Lakehouse environment
  • Minimum of 5 years of experience with JIRA/GitHub/Git and other code management toolsets
  • Experience with ELT/ETL development, patterns and tooling is recommended
  • Experience with SQL including Postgres, RDBMS platforms
  • Experience with various IDE and code repositories as well as unit testing frameworks.
  • Flexible and adaptable attitude, disciplined to manage multiple responsibilities and adjust to varied environments.
  • Ability to produce high quality products within deadlines
  • Strong written and oral communication skills; Ability to communicate effectively with technical and non-technical staff.
  • Excellent problem-solving skills.
  • Be independent and self-driven.
  • Bachelor's degree in Computer Science, Engineering, Information Systems, or a related field.
  • Work is expected to be out of the local Perficient office.
  • Must be open to travel and work at client site when required.


Disclaimer: The above statements are not intended to be a complete statement of job content, rather to act as a guide to the essential functions performed by the employee assigned to this classification. Management retains the discretion to add or change the duties of the position at any time.

#LI-RS1

About Perficient

Perficient is a leading digital consultancy that helps companies transform their businesses and operations through technology. They deliver solutions to clients that range from Fortune 500 companies to emerging businesses. Perficient has a broad range of capabilities, including strategy, design, technology, and operations. They have expertise in a variety of industries, including healthcare, financial services, retail, and energy. Perficient has been recognized as a top employer and a top company for women technologists. They are committed to giving back to their communities through philanthropy and volunteerism.
Learn more about Perficient
Size
6,079 employees
Market Cap
$2.4 billion
Industry
Net Income
$30.1 million
Founded
1998
5 Year Trend
+9.3%
Revenue
$612.1 million
NASDAQ

Similar Jobs

More Jobs at Perficient

More Information Technology Jobs

Find similar Databricks Lead Data Engineer jobs: