Full Job Description
Designs, implements, tests, deploys, and supports data pipelines using Cloud infrastructure and Azure services. Develops scalable, repeatable, and maintainable serverless solutions using Azure Data Factory, Azure Functions, and Databricks. Adopts best practices for data ingestion and extraction from multiple sources like RDBMS, NoSQL, Files, Kafka, and Big Data tools. Creates and maintains SQL code as necessary as part of data pipelines. Gathers project requirements by meeting with stakeholders and various operational and business teams. Works with Cloud administrators to implement and support enterprise security standards in the Cloud data infrastructure. Proposes and builds monitoring tools with Cloud administrators to optimize performance and ensure high availability. Collaborates with system and product teams to understand system requirements and necessary modifications to data flow, security, and retention. Creates solutions to improve product stability, scalability, and performance. Works with data warehouse, business intelligence, and advanced analytics teams to evaluate Big Data and cloud use cases. Escalates support issues with internal teams and vendors. Participates in rotational on-call duty to support the production environment. Develops, manages, and owns the full data lifecycle from raw data acquisition through transformation to end-user consumption. Leads the evaluation and adoption of AI capabilities within the Cloud Data Platform, with a focus on improving data engineering productivity, Snowflake development practices, data quality, metadata management, and platform automation. Researches and enables Snowflake and Azure AI features such as Snowflake Cortex, Cortex Code Assistant, Cortex AI functions, Cortex Search, Cortex Analyst, and Azure AI services.Establishes best practices and governance patterns to ensure AI is used appropriately, securely, and cost-effectively, while reinforcing strong data engineering principles such as data cleansing, curation, standardization, and reusable pipeline design.
Knowledge, Skills, and Abilities:
• See the big picture concerned data analytics landscape, tools, solutions, and business goals. [Required]• Proficiency with SQL. [Required]• Understanding of distributed computing paradigm and working knowledge in technologies like Spark, Impala, NiFi, Azure Data Factory, Azure Functions and Snowflake. [Required]• Expert in managing code and dependencies, building and maintaining CI/CD pipelines in Azure DevOps/ GitHub along with other Configuration management best practices. [Required]• Ensure alignment with our Cloud DevOps model based on 100% automation and adoption of repeatable patterns that can be leveraged across the organization. [Required]• Full stack design and development experience within the Azure ecosystem in combination with Snowflake including building platforms and frameworks to create consistent, verifiable, and automatic management of applications and infrastructure between non-production and production environments. [Required]• Good understanding of Big Data technologies like Spark, NiFi, Impala, Sqoop, Hive and File formats like Parquet, AVRO, ORC, CSV, JSON. [Required]
• Working knowledge of Snowflake Cortex, Cortex AI functions, Cortex Search, Cortex Analyst, Cortex Code Assistant, and emerging Snowflake AI capabilities. [Preferred]
• Understanding of AI cost drivers, including token usage, model selection, warehouse consumption, data volume, prompt design, and repeated processing patterns. [Preferred]
• Ability to design AI-ready data products, curated datasets, semantic layers, and metadata-driven frameworks that support reliable analytics and AI adoption. [Preferred]
• Ability to research, evaluate, and educate engineering teams on AI-enabled productivity tools for Data Engineering, pipeline optimization, documentation, testing, monitoring, and operational support. [Preferred]
• Ability to communicate complex information to internal and external audiences. [Required]• Assist team members with production issues and offer support, guidance, and assist in communicating issues with appropriate stakeholders when necessary. [Required]• Provides the technical expertise and/or direction for multiple complex projects of a development or technology group. - reword as a business partner to other groups. [Required]• Develops project plans including financials, resource management, and risks. [Required]• Partners with the management team in developing strategic plans and objectives for a modern data platform useful for the whole organization. [Preferred]• Actively leads design or process development in a broad scope. [Required]• Presentation skills to all levels of technical staff and leadership across the organization [Required]• Understanding of Azure Networking concepts like Express route, VNet, Private Networks, subnets etc. [Preferred]• Healthcare industry knowledge, HL7, Epic Data model. [Preferred]• Good working knowledge with Git, DevOps, CICD. [Preferred]• Creating testing frameworks for repeatable pipelines. [Preferred]• Data warehousing and Business Intelligence [Required]• Performance tuning in Big Data environments, ADF, Snowflake. [Preferred]
Education:
• Bachelor's [Required]• Master's [Preferred]
Field of Study:
• in Computer Science or a related field
• in Computer Science, Data Science or related field.
Work Experience:
• 10+ of experience with java and/or python. [Preferred]• 10+ of professional experience with software design and architecture using database, big data and cloud platforms. [Required]• 3+ of experience with azure and snowflake. [Preferred]• 3+ of experience with big data, hadoop and distributed systems, azure/aws, snowflake [Required]• 3+ of experience with cloudera hadoop or similar. [Preferred]• 5+ of experience with any programming language such as java, python and scala. [Required]• Experience in researching emerging technologies and trends, standards, and products and synthesizing into clear technology roadmaps and strategies. [Required]• Some experience in designing and implementing Infrastructure as Code (IaaC) solutions to manage Azure resources using Terraform. [Required]
Additional Information:
Azure Cloud and/or Snowflake/Databricks Certifications, or similar
Licenses and Certifications:
• SnowPro Advanced: Data Engineer (DEA-C02) [Preferred]• SnowPro Advanced: Architect (ARA-C01) [Preferred]• Hadoop/Big Data certifications (Hadoop/BD) [Preferred]• Databricks certifications (DC) [Preferred]• Microsoft Certified: Fundamentals series (Cloud, Azure, AI, Data, Security etc.) (FUNDAMENTALS) [Required]
Physical Requirements: (Please click the link below to view work requirements)
Physical Requirements - https://tinyurl.com/23km2677
Pay Range:
$96,266.14 - $179,045.63