Perficient

Databricks Lead Data Engineer

Perficient$73K — $170K *
Information Technology
8 - 10 years of experience
Job Overview by Ladders

Qualifications

  • 8+ years of data engineering experience, with at least 3 years in a lead role.
  • 5 years of hands-on Databricks experience with Python, SQL, Pyspark, and CI/CD.
  • 5 years developing large scale data processing and storage systems in Databricks.
  • Experience with AWS and/or Azure cloud environments.
  • Strong skills in DevOps practices and tools, specifically with Databricks CI/CD.
  • Proficient in code management tools like JIRA, GitHub, and Git.

Responsibilities

  • Lead technical planning and gather requirements for data projects.
  • Mentor team members and provide ongoing technical support.
  • Build and maintain effective relationships within the team.
  • Collaborate with the data engineering team to develop data engineering code.
  • Design and deploy scalable data pipelines using Databricks for batch and streaming data.
  • Enforce best practices in Databricks usage and performance optimization.
  • Work on integrating Databricks solutions with cloud platforms such as Azure or AWS.

Benefits

  • Participation in customer discovery sessions and proposal processes.
  • Opportunities to work with top talent in the industry.
  • Access to major office locations for flexible remote work arrangements.
  • Engagement in design sessions and trainings to enhance skills.
Full Job Description
Job Description

Job Overview:

As a Databricks Lead Data Engineer, you are expected to lead the development team and have strong development and programming skills in Python, SQL, Pyspark, Databricks, Notebook Repositories, Medallion Architecture, CI/CD, and ETL platforms associated with Data Lakehouse solutions. You are expected to be experienced and fluent in agile development and agile tools.

You will work with architects and infrastructure teams to develop, test, deploy and troubleshoot your code as well as provide input into solutions and design of the system. You will collaborate with some of the best talent in the industry to create and implement innovative high-quality solutions focused on our clients' business needs.

From time to time, you will participate in customer discovery sessions, responding to customer proposal requests etc. The ability to report to a Perficient office is required. We have office locations in major cities such as Houston, TX, Dallas, TX, Chicago, IL, Saint Louis, MO, Detroit, MI, Irvine, CA, New York, NY, Minneapolis, MN

Responsibilities

  • Lead the technical planning and requirements gathering.
  • Serve as a technical lead and mentor. Provide technical support and leadership in the development and continual improvement of service.
  • Develop and maintain effective working relationships with team members.
  • Demonstrate the ability to adapt and work with team members of various experience levels.
  • Work with the data engineering team to define and develop data ingestion, validation, transformation, and data engineering code.
  • Lead the design, development, testing and deployment of scalable batch/streaming data pipelines using Databricks. Data jobs include data ingestion, cleansing, transformation, aggregation, governance, and sharing.
  • Define and enforce best practices for Databricks usage, including notebook development, job orchestration, metadata-driven frameworks, data quality checks, and optimization of spark job performance.
  • Develop Databricks platform components using Python, SQL, Pyspark, Notebook Repositories (medallion architecture L2, L3, L4), and CI/CD.
  • Integrate Databricks solutions with cloud platforms like Azure (ADF, ADLS, Event Hub, Azure DevOps) or AWS (S3, Lamda, Glue).
  • Design and implement analytical data models and star schemas for BI and AI usage.
  • Collaborate with data scientists and analysts to meet data requirements.
  • Implement logging, monitoring and exception handling for data workflows.
  • Participate in code reviews and CI/CD pipeline development.
  • Document code artifacts and participate in developing user documentation and run books
  • Troubleshoot deployment to various environments and provide test support.
  • Participate in design sessions, demos, and prototype sessions, testing and training workshops with business users and IT associates.


Qualifications

  • Passionate developer with 8+ years of data engineering experience, with at least 3 years in a lead or senior data engineer role.
  • Minimum 5 years of hands-on Databricks experience using Python, SQL, Pyspark, Notebook Repositories (medallion architecture L2, L3, L4), CI/CD.
  • Minimum 5 years of Databricks experience in developing large scale data processing/data storage/data distribution systems.
  • Experience with AWS and/or Azure cloud environment
  • DevOps experience with Databricks CI/CD (Databricks Asset Bundles) and Git
  • Minimum 5 years of working with large multi-vendor environment with multiple teams as a part of a project
  • Minimum 5 years of working with a complex Big Data/Lakehouse environment
  • Minimum of 5 years of experience with JIRA/GitHub/Git and other code management toolsets
  • Experience with ELT/ETL development, patterns and tooling is recommended
  • Experience with SQL including Postgres, RDBMS platforms
  • Experience with various IDE and code repositories as well as unit testing frameworks.
  • Flexible and adaptable attitude, disciplined to manage multiple responsibilities and adjust to varied environments.
  • Ability to produce high quality products within deadlines
  • Strong written and oral communication skills; Ability to communicate effectively with technical and non-technical staff.
  • Excellent problem-solving skills.
  • Be independent and self-driven.
  • Bachelor's degree in Computer Science, Engineering, Information Systems, or a related field.
  • Work is expected to be out of the local Perficient office.
  • Must be open to travel and work at client site when required.


#LI-RS1

About Perficient

Perficient is a leading digital consultancy that helps companies transform their businesses and operations through technology. They deliver solutions to clients that range from Fortune 500 companies to emerging businesses. Perficient has a broad range of capabilities, including strategy, design, technology, and operations. They have expertise in a variety of industries, including healthcare, financial services, retail, and energy. Perficient has been recognized as a top employer and a top company for women technologists. They are committed to giving back to their communities through philanthropy and volunteerism.
Learn more about Perficient
Size
6,079 employees
Market Cap
$2.4 billion
Industry
Net Income
$30.1 million
Founded
1998
5 Year Trend
+9.3%
Revenue
$612.1 million
NASDAQ

Similar Jobs

More Jobs at Perficient

More Information Technology Jobs

Find similar Databricks Lead Data Engineer jobs: