Veeva Systems Inc

Data Engineer

$100K - $175K *
Pharmaceuticals & Biotech
Less than 5 years of experience
Job Overview by Ladders

Qualifications

  • 4+ years of data engineering experience with a focus on production-grade data platforms
  • Expertise in Python and Apache Spark, including JVM tuning and memory management
  • Experience building on AWS or GCP, with knowledge of services like EMR or Databricks
  • Familiarity with orchestration tools such as Airflow or AWS Step Functions
  • Strong debugging skills for distributed systems and bottlenecks at scale
  • Proficient with CI/CD tools and processes
  • Excellent communication skills to convey technical architectures to stakeholders

Responsibilities

  • Architect and build resilient data processing systems using Python and Spark on AWS
  • Design and implement end-to-end ETL/ELT workflows from diverse sources
  • Lead the implementation of the Medallion Architecture for data management
  • Build reusable libraries for data quality validation and pipeline monitoring
  • Automate deployment and testing through CI/CD processes
  • Enforce data governance standards for security and compliance
  • Proactively monitor system health and resolve bottlenecks in distributed systems

Benefits

  • Medical, dental, vision, and basic life insurance
  • PTO and company paid holidays
  • Retirement programs
  • 1% charitable giving program

Full Job Description

The Role

Veeva OpenData supports the industry by providing real-time reference data across the complete healthcare ecosystem, to support commercial sales execution, compliance, and business analytics. We drive value to our customers through constant innovation, using cloud-based solutions and state-of-the-art technologies to deliver product excellence and customer success.

As a Data Engineer, you will own the end-to-end development lifecycle, collaborating with a high-performing engineering team to design, build, and deploy high-impact features. Operating within a fast-paced Agile environment, you will have a direct hand in engineering the data foundation for Veeva's life sciences customers.

What You'll Do
    • Architect and build resilient, distributed data processing systems using Python and Spark on AWS
    • Design and implement end-to-end ETL/ELT workflows that ingest and unify data from diverse sources, ranging from modern table formats like Iceberg and Delta to legacy business files such as Excel and CSV, ensuring a scalable and consistent single source of truth for the organization
    • Lead the implementation of the Medallion Architecture, managing data maturity through Bronze, Silver, and Gold layers. You will define how data is structured, classified, and stored to maximize business value while ensuring scalability and high availability.
    • Build reusable libraries and frameworks for data quality validation, metadata tracking, and pipeline monitoring
    • Build CI/CD processes that automate deployment and testing and maintain a high bar for engineering excellence
    • Enforce data governance standards, including security, privacy, and regulatory compliance
    • Proactively monitor system health, implement automated observability, and resolve complex bottlenecks in distributed systems to ensure peak resource efficiency and cost-effectiveness
    • Partner directly with Product Managers and Data Scientists to translate business requirements into innovative solutions
    • Own the full feature lifecycle, from initial whiteboarding to production deployment and long-term maintenance

Requirements
    • 4+ years of professional data engineering experience with a demonstrated ability to architect and deploy production-grade data platforms from scratch
    • Expert-level proficiency in Python and Apache Spark, with specific experience in JVM tuning, memory management, and optimizing execution plans for large-scale distributed workloads
    • Deep expertise in modern data architecture, software design patterns, and various data modeling techniques designed for scalability and performance
    • Proven track record of building on AWS (primary) or GCP, including hands-on experience with managed services like EMR or Databricks
    • Extensive experience designing and managing complex data lifecycles using orchestration tools such as Airflow, AWS Step Functions, or Prefect
    • Deep understanding of data cleansing, curation, and transformation strategies, coupled with experience implementing data governance, security, and lifecycle management policies
    • Strong background in building reusable libraries, frameworks, and internal tools that standardize data ingestion and automate ETL/ELT workflows
    • Exceptional debugging skills for distributed systems and resolving performance bottlenecks at scale
    • Proficiency with CI/CD tools and processes (e.g., Codefresh, Jenkins)
    • Excellent verbal and written communication skills in English, with the ability to translate complex technical architectures into actionable insights for stakeholders and cross-functional teams
    • Must be located in the EST or CST time zone
    • Applicants must have the unrestricted right to work in the United States. Veeva will not provide sponsorship at this time

Nice to Have
    • Relevant certifications (e.g., AWS, Spark, or similar)
    • Familiarity with streaming and distributed technologies such as Spark Streaming, EKS, Kinesis, or Apache Kafka
    • Experience implementing or managing modern cloud data warehouses or lakehouse architectures
    • Prior experience working in the Life Sciences industry

Perks & Benefits
    • Medical, dental, vision, and basic life insurance
    • PTO and company paid holidays
    • Retirement programs
    • 1% charitable giving program

Compensation
    • Base pay: $100,000 - $175,000 CAD
    • The salary range listed here has been provided to comply with local regulations and represents a potential base salary range for this role. Please note that actual salaries may fall within, above, or below this range, depending on experience and location. We look at compensation for each individual and base our offer on your unique qualifications, experience, and expected contributions. This position may also be eligible for other types of compensation in addition to base salary, such as variable bonus and/or stock bonus.

Veeva's headquarters is located in the San Francisco Bay Area with offices in more than 15 countries around the world.

Work Where It's Best for You

Work Anywhere means you can work in an office or at home on any given day. It's about getting the work done in the way and place that works best for each person. This applies across all locations and departments.

Work Anywhere does not mean work at any time. We have predictable core hours where employees are generally available for meetings and collaboration. Employees are focused and available during core hours.

We invest in our offices to make them places where our employees like to go. If you work in the office three or more days a week, you will have a dedicated office workspace. Our offices function as hubs that draw people in, create social bonds, and let random connections and the mixing of ideas happen. We're investing more in offices, culture, and offsite meetings, not less.

Product teams are organized in regional product hubs for optimal collaboration and live within a time zone of their hub. Our current product hubs are located in Pleasanton, Columbus, Boston, Kansas City, New York City, Raleigh, and Toronto. We create opportunities for teams to get together in person regularly.

Customer-facing roles, such as Sales and Professional Services, live near and/or travel to their customers.

When an employee moves within a country it does not cause a change in salary. Where you live impacts you and your family. Not knowing if your compensation will change if you move can cause stress and uncertainty for everyone. We wanted to eliminate that.

Work at Veeva. Work where it's best for you.

About Veeva Systems Inc

Veeva Systems Inc. provides cloud-based software for the life sciences industry in North America, Europe, the Asia Pacific, the Middle East, Africa, and Latin America. The company offers Veeva Commercial Cloud, a suite of commercial applications for sales and marketing executives, including Veeva CRM, a multichannel customer relationship management solution that enables pharmaceutical and biotechnology companies to identify and build relationships with healthcare professionals through various touch points; and Veeva Vault, a cloud-based enterprise content management platform and suite of applications for managing commercial functions, including medical, sales, and marketing, as well as research and development functions, such as clinical, regulatory, and quality. Veeva Systems Inc. was founded in 2007 and is headquartered in Pleasanton, California.
Size: 5,482 employees
Market Cap: $25.2 billion
Net Income: $380 million
Founded: 2007
5 Year Trend: +27.4%
Revenue: $1.4 billion
Stock exchange: NASDAQ