GCP/Linux Data Engineer (Remote)

Da Vinci Software

$90K - $130K
Information Technology
5 - 7 years of experience
Job Overview by Ladders

Qualifications

  • Bachelor's or Associate's degree in Computer Science/Engineering or related field with relevant experience
  • 5 years (Bachelor's) or 7 years (Associate's) of experience in a similar role
  • In-depth software engineering knowledge with coding experience in languages like C, C++, Golang, Java, or C#
  • Strong GCP expertise and Linux system administration skills
  • Demonstrated problem-solving and time management abilities
  • Familiarity with Agile software development methodologies

Responsibilities

  • Develop and manage data replication scripts for data integrity
  • Construct and uphold data validation, processing, and ingestion pipelines
  • Automate deployment of data scripts in cloud environments
  • Catalog datasets and maintain detailed descriptions of their contents
  • Build and refine dashboards and reports showcasing ingested data
  • Design validation scripts/APIs to confirm dataset accuracy and integrity
  • Produce user documentation that includes dataset descriptions and tutorials

Benefits

  • Fully remote work option
  • W2 candidates only

Full Job Description

Overview:
Our client is seeking candidates to support an engineering team tasked with building a research data platform that will ingest research-generated data and make it discoverable.

Key Responsibilities:
Data Engineering Skills & Experience:
  • Create, verify, and maintain data replication scripts
  • Create, verify, and maintain data validation, processing, and ingestion pipelines
  • Deploy and automate the execution of data replication scripts and data pipelines in cloud infrastructure
  • Create and maintain data catalogs that describe datasets and their contents (e.g., files, file types, tables/views, columns, and fields)
  • Create, verify, and maintain dashboards and reports that characterize ingested datasets
  • Create, verify, and maintain data validation scripts/APIs that verify that the production dataset contains the correct number of samples/records, that expected values/fields/columns are populated, and that values are of the correct data type, format, and range
  • Deploy and automate the execution of data validation scripts/APIs
  • Create and maintain user documentation (dataset descriptions, tutorials, code examples, etc.)
  • Define the entitlements, user groups, roles, and permissions used to grant access to datasets
Position Requirements:
  • Strong GCP expertise
  • Linux system administration and scripting
  • Pro Desk API
  • Required qualifications for this position include: a Bachelor's degree in Computer Science/Engineering or a related field with 5 years of experience as noted below, OR an Associate's degree in Computer Science/Engineering or a related field with 7 years of experience.
  • In-depth knowledge of software engineering, with experience coding applications or services in a high-level language (C, C++, Golang, Java, C#, etc.), and a basic knowledge of related fields.
  • Demonstrated problem solving and time management skills.
  • Strong technical aptitude for designing and implementing software solutions, and experience with modern application development frameworks.
  • Knowledge of professional software engineering best practices for the full software development life cycle, including coding standards, code reviews, source control management, build processes, testing, and operations.
  • Deep hands-on technical expertise and excellent verbal and written communication skills.
  • Experience with Agile software development techniques.


Additional Information:
  • Fully Remote
  • W2 Candidates Only

