Job DescriptionJob Overview:As a Databricks Lead Data Engineer, you are expected to lead the development team and have strong development and programming skills in Python, SQL, Pyspark, Databricks, Notebook Repositories, Medallion Architecture, CI/CD, and ETL platforms associated with Data Lakehouse solutions. You are expected to be experienced and fluent in agile development and agile tools.
You will work with architects and infrastructure teams to develop, test, deploy and troubleshoot your code as well as provide input into solutions and design of the system. You will collaborate with some of the best talent in the industry to create and implement innovative high-quality solutions focused on our clients' business needs.
From time to time, you will participate in customer discovery sessions, responding to customer proposal requests etc. The ability to report to a Perficient office is required. We have office locations in major cities such as Houston, TX, Dallas, TX, Chicago, IL, Saint Louis, MO, Detroit, MI, Irvine, CA, New York, NY, Minneapolis, MN
Responsibilities- Lead the technical planning and requirements gathering.
- Serve as a technical lead and mentor. Provide technical support and leadership in the development and continual improvement of service.
- Develop and maintain effective working relationships with team members.
- Demonstrate the ability to adapt and work with team members of various experience levels.
- Work with the data engineering team to define and develop data ingestion, validation, transformation, and data engineering code.
- Lead the design, development, testing and deployment of scalable batch/streaming data pipelines using Databricks. Data jobs include data ingestion, cleansing, transformation, aggregation, governance, and sharing.
- Define and enforce best practices for Databricks usage, including notebook development, job orchestration, metadata-driven frameworks, data quality checks, and optimization of spark job performance.
- Develop Databricks platform components using Python, SQL, Pyspark, Notebook Repositories (medallion architecture L2, L3, L4), and CI/CD.
- Integrate Databricks solutions with cloud platforms like Azure (ADF, ADLS, Event Hub, Azure DevOps) or AWS (S3, Lamda, Glue).
- Design and implement analytical data models and star schemas for BI and AI usage.
- Collaborate with data scientists and analysts to meet data requirements.
- Implement logging, monitoring and exception handling for data workflows.
- Participate in code reviews and CI/CD pipeline development.
- Document code artifacts and participate in developing user documentation and run books
- Troubleshoot deployment to various environments and provide test support.
- Participate in design sessions, demos, and prototype sessions, testing and training workshops with business users and IT associates.
Qualifications- Passionate developer with 8+ years of data engineering experience, with at least 3 years in a lead or senior data engineer role.
- Minimum 5 years of hands-on Databricks experience using Python, SQL, Pyspark, Notebook Repositories (medallion architecture L2, L3, L4), CI/CD.
- Minimum 5 years of Databricks experience in developing large scale data processing/data storage/data distribution systems.
- Experience with AWS and/or Azure cloud environment
- DevOps experience with Databricks CI/CD (Databricks Asset Bundles) and Git
- Minimum 5 years of working with large multi-vendor environment with multiple teams as a part of a project
- Minimum 5 years of working with a complex Big Data/Lakehouse environment
- Minimum of 5 years of experience with JIRA/GitHub/Git and other code management toolsets
- Experience with ELT/ETL development, patterns and tooling is recommended
- Experience with SQL including Postgres, RDBMS platforms
- Experience with various IDE and code repositories as well as unit testing frameworks.
- Flexible and adaptable attitude, disciplined to manage multiple responsibilities and adjust to varied environments.
- Ability to produce high quality products within deadlines
- Strong written and oral communication skills; Ability to communicate effectively with technical and non-technical staff.
- Excellent problem-solving skills.
- Be independent and self-driven.
- Bachelor's degree in Computer Science, Engineering, Information Systems, or a related field.
- Work is expected to be out of the local Perficient office.
- Must be open to travel and work at client site when required.
The salary range for this position takes into consideration a variety of factors, including but not limited to skill sets, level of experience, applicable office location, training, licensure and certifications, and other business and organizational needs. The new hire salary range displays the minimum and maximum salary targets for this position across all US locations, and the range has not been adjusted for any specific state differentials. It is not typical for a candidate to be hired at or near the top of the range for their role, and compensation decisions are dependent on the unique facts and circumstances regarding each candidate. A reasonable estimate of the current salary range for this position is $73,008 to $170,640. Please note that the salary range posted reflects the base salary only and does not include benefits or any potential variable compensation programs. Information regarding the benefits available for this position are in our benefits overview.
Disclaimer: The above statements are not intended to be a complete statement of job content, rather to act as a guide to the essential functions performed by the employee assigned to this classification. Management retains the discretion to add or change the duties of the position at any time.
#LI-RS1