SugarCRM

Senior Data Engineer - Databricks

SugarCRM$155K — $185K *
US-AnywhereRemote in United States
Enterprise Technology
Less than 5 years of experience
Job Overview by Ladders

Qualifications

  • 4+ years of data engineering experience
  • 2+ years on Databricks or Apache Spark ecosystem
  • Proficient in PySpark, SQL, and Python for production pipelines
  • Hands-on experience with Delta Lake and its lifecycle management
  • Knowledge of PostgreSQL for production data integration
  • Familiarity with legacy ETL tools and frameworks
  • Experience with multi-tenant architectures and data privacy principles

Responsibilities

  • Own Databricks production support for Sugar Predict platform
  • Maintain SLA performance metrics for data pipeline delivery
  • Implement pipeline optimizations for cost and performance
  • Migrate legacy ETL/ELT pipelines to Databricks
  • Support customer onboarding with reliable tenant data pipelines
  • Design high-performance Databricks pipelines for ERP/CRM data
  • Enforce data security best practices in Databricks environments

Benefits

  • Excellent healthcare package for you and your family
  • 401(k) match for savings and investment
  • Unlimited Paid Time Off policy
  • Paid Parental Leave
  • Access to online legal services
  • Financial planning services available
  • Discounted pet insurance offered
  • Corporate benefit program with exclusive discounts
  • Health and wellness reimbursement program
  • Travel discounts provided
  • Educational resources for career development
  • Employee referral bonus program
  • Merit-based opportunities for professional growth
Full Job Description
Where You Fit In:

The Sugar Predict platform powers revenue intelligence for mid-market enterprises by fusing ERP and CRM data into actionable insights. As a Senior Data Engineer, you will own the Databricks pipelines that make this possible, driving production reliability, cost efficiency, and platform growth through customer onboarding and legacy modernization. You will work closely with ML engineers, product teams, and the Enterprise Architecture team to ensure the data backbone behind Sugar Predict is always fast, clean, and ready to deliver at a global scale.

Impact You Will Make in the Role:

  • Own Databricks production support for the Sugar Predict data platform, including monitoring, alerting, and incident response across all production data flows


  • Maintain and report on SLA performance metrics for data pipeline delivery, ensuring visibility into platform health and accountability across internal and external stakeholders


  • Identify and implement pipeline optimizations that reduce Databricks compute costs, improve throughput, and reduce processing windows while tracking impacts through measurable KPIs


  • Migrate legacy ETL/ELT pipelines to Databricks, building automation tooling to reduce manual intervention and ensure uninterrupted data delivery during transitions


  • Support new customers onboarding by provisioning, validating, and hardening tenant data pipelines that deliver reliable, isolated data from day one


  • Design and build high-performance Databricks pipelines that ingest, transform, and serve ERP and CRM data at scale across both Azure and AWS environments


  • Own the Delta Lake architecture including schema design, partitioning strategies, data quality enforcement, and incremental processing patterns


  • Enforce data security best practices across Databricks environments, including role-based access control, secrets management, and compliance requirements for enterprise CRM and ERP data


  • Implement data quality monitoring and observability across pipeline health and ML model inputs, ensuring data integrity that directly supports Sugar Predict prediction accuracy


  • Apply and enforce multi-tenant data isolation patterns ensuring reliable, secure data delivery across Sugar Predict enterprise customers


  • Partner with the Enterprise Architecture team to ensure Sugar Predict data pipelines integrate seamlessly with the broader SugarAI product ecosystem


  • Support a globally distributed operation through on-call rotation and after-hours incident response, meeting SLAs across multiple time zones


  • Maintain technical documentation, runbooks, and architectural decision records, contributing to team knowledge sharing and operational readiness across on-call and incident response scenarios


  • Apply CI/CD best practices to data pipeline development, including version control, automated testing, and deployment tooling to ensure reliable and repeatable pipeline delivery


What You Will Bring:

  • 4+ years of data engineering experience


  • At least 2 years on Databricks or the Apache Spark ecosystem across Azure and/or AWS


  • Proficiency in PySpark, SQL, and Python with a strong track record building and operating production-grade pipelines under SLA constraints


  • Hands-on experience with Delta Lake including schema evolution, ACID transactions, optimize/vacuum lifecycle, and both incremental and streaming processing patterns


  • Hands-on experience with pipeline performance tuning and compute optimization in production Databricks environments


  • Solid working knowledge of PostgreSQL including query optimization, schema design, and use as a source or sink in production data pipelines


  • Experience supporting and maintaining legacy ETL tooling (SSIS, Informatica, custom Python/SQL pipelines, or similar) in production


  • Experience supporting large-scale multi-tenant architectures with a focus on tenant isolation, per-tenant performance, and data privacy, including navigating tools and platforms that default to single-tenant assumptions


  • Proven ability to work collaboratively across data science, product, and infrastructure teams, owning end-to-end delivery in a cross-functional environment


  • Strong understanding of data governance, security, and compliance principles, including access control, data privacy, and protection of sensitive enterprise data across multi-tenant environments


Preferred Qualifications/Experience:

  • Experience operating Databricks workspaces across both Azure and AWS, including cost governance, cluster management, and cross-cloud data access


  • Experience optimizing Databricks workloads in a Serverless environment, including compute cost governance and performance tuning for serverless compute


  • Experience with Microsoft SQL Server in a data engineering or ETL context


  • Exposure to ML feature engineering or feature stores (Databricks Feature Store, Feast, or similar) supporting predictive analytics


  • Experience with customer onboarding automation or IaC patterns for provisioning tenant data pipelines at scale


  • Databricks Certified Data Engineer Associate or Professional certification


$155,000 - $185,000 a year

Expected salary range, depending on experience.

We understand that no candidate is perfectly qualified for any job. Experience comes in different forms; many skills are transferable; and passion goes a long way. Even more important than your resume is a clear demonstration of dedication, impact, and the ability to thrive in a fluid and collaborative environment. We want you to learn new things in this role, and we encourage you to apply if your experience is close to what we're looking for. We also know that diversity of background and thought makes for better problem solving and more creative thinking, which is why we're dedicated to adding new perspectives to the team.

Benefits and Perks:

Beyond a stellar work environment, friendly people, and inspiring work, we have some sweet benefits and perks:
• Excellent healthcare package for you and your family
• Savings and Investment - 401(k) match
• Unlimited Paid Time Off
• Paid Parental Leave
• Online Legal Services (Rocket Lawyer)
• Financial Planning Services (Origin)
• Discounted Pet Insurance (Embrace Pet Insurance)
• Corporate Benefit Program (Working Advantage). This benefit offers you exclusive travel and entertainment offers and special discounts that are not available to the general public
• Health and Wellness Reimbursement Program
• Travel Discounts
• Educational Resources - Career & Personal Development Program
• Employee Referral Bonus Program
• We are a merit-based company - many opportunities to learn, excel and grow your career!

About SugarCRM

SugarCRM is a customer relationship management software company that provides a cloud-based platform for managing customer interactions and data. The company's platform includes sales automation, marketing automation, customer support, and analytics tools. SugarCRM's customers include small and medium-sized businesses in industries such as finance, healthcare, and manufacturing. The company was founded in 2004 and is headquartered in Cupertino, California.
Learn more about SugarCRM
Size
500 employees
Industry
Founded
2004

Similar Jobs

More Jobs at SugarCRM

More Enterprise Technology Jobs

Find similar Senior Data Engineer - Databricks jobs: