Location: NYC, NY (Hybrid).Key responsibilities: Architecture & Platform Engineering- Develop end-to-end architectural patterns leveraging Databricks, Spark, Delta Lake, Unity catalog and Azure Data services
- Design scalable Lakehouse foundations, ensuring performance, reliability, and maintainability of data layers.
- Establish architectural frameworks and standards for ingestion, curation, storage, and consumption.
- Experience in building a data platform with unity catalog and design the multi-tenant solution with data governance principles.
Collaboration & Leadership- Serve as a primary technical advisor for engineering teams, analysts, and data science partners.
- Participate in planning ceremonies, architectural councils, and cross-functional strategy discussions.
Platform Innovation & Best Practices- Continuously evaluate emerging tools, Databricks capabilities, and Azure offerings to enhance platform maturity.
- Maintain and refine coding conventions, DevOps standards, and operational playbooks.
- Drive automation across deployment processes using Azure DevOps, or equivalent CI/CD toolchains.
Skills: - Bachelor's degree in Computer Science, Engineering, or related discipline.
- 10 to 15+ years of experience in data engineering and/or data architecture roles.
- Advanced hands-on experience with Databricks, Apache Spark, and the Azure data stack.
- Strong practical knowledge of Lakehouse patterns, Delta Lake, and modern data warehousing.
- Demonstrated background designing resilient ETL/ELT systems with both batch and streaming patterns.
- Proficiency with SQL data stores, schema design, and performance optimization.
- Experience working with structured and semi-structured formats such as Parquet, ORC, and JSON.
- Solid understanding of cloud security, access governance, and metadata/catalog strategies (e.g., Unity Catalog).
- Experience implementing automated build/deploy workflows for data solutions.
- Excellent analytical, problem-solving, and communication skills.
Preferred Skills - Hands-on experience with data lineage tools, or governance frameworks.
- Exposure to domain-driven design or product-oriented data architecture.
The base compensation range for this role in the posted location is: 125,000 - 135,000.
Capgemini provides compensation range information in accordance with applicable national, state, provincial, and local pay transparency laws. The base compensation range listed for this position reflects the minimum and maximum target compensation Capgemini, in good faith, believes it may pay for the role at the time of this posting. This range may be subject to change as permitted by law.
The actual compensation offered to any candidate may fall outside of the posted range and will be determined based on multiple factors legally permitted in the applicable jurisdiction.
These may include, but are not limited to: Geographic location, Education and qualifications, Certifications and licenses, Relevant experience and skills, Seniority and performance, Market and business consideration, Internal pay equity.
It is not typical for candidates to be hired at or near the top of the posted compensation range.
In addition to base salary, this role may be eligible for additional compensation such as variable incentives, bonuses, or commissions, depending on the position and applicable laws.
Capgemini offers a comprehensive, non-negotiable benefits package to all regular, full-time employees. In the U.S. and Canada, available benefits are determined by local policy and eligibility and may include:
- Paid time off based on employee grade (A-F), defined by policy: Vacation: 12-25 days, depending on grade, Company paid holidays, Personal Days, Sick Leave
- Medical, dental, and vision coverage (or provincial healthcare coordination in Canada)
- Retirement savings plans (e.g., 401(k) in the U.S., RRSP in Canada)
- Life and disability insurance
- Employee assistance programs
- Other benefits as provided by local policy and eligibility