Data Engineer

Texas State Library and Archives Commision

• $90K — $120K *

Plano, TX 75094In-Person

Information Technology

Less than 5 years of experience

1 month ago

By clicking Apply, I agree with Ladders' Terms of Use and Privacy Policy

Job Overview by Ladders

Qualifications

5-7 years experience in technology development
Strong knowledge of Oracle, SQL, RDBMS
Proficient in Python, Hadoop, Hive, Spark, and PySpark
Experience in developing big data applications and ETL processes
Familiarity with statistical modeling in Python (Jupyter, SciPy, NumPy, pandas, Scikit-learn)
Experience in Agile methodology and Autosys is preferred
Ability to work independently and as part of a team

Responsibilities

Develop and enhance application components for ML/AI models
Lead a team of developers and collaborate with business partners
Ensure ML/AI models are production-ready and robust
Participate in operational issue analysis and troubleshooting
Conduct peer reviews of designs and code
Build workflows and schedule jobs using Autosys
Drive IT projects to ensure on-time delivery according to processes

Benefits

Work in a dynamic and fast-paced agile environment
Opportunity to lead and collaborate with developers
Gain experience with cutting-edge data management technologies
Participate in peer reviews to enhance collaboration
Develop skills in machine learning and big data applications

Full Job Description

The Enterprise Data Management Services team is looking for a strong, self-motivated Technology Developer to become an integral part of the Data Resiliency program.

Responsibilities include but are not limited to: Developing and enhancing application components for supporting ML/AI models and data ingestion processes, with focus on code resiliency and stability. Interacting & leading a team of developers, and interacting with business partners and develop processes to ensure that ML/AI models are production-ready Developing, enhancing, modifying and/or maintaining applications Working in a fast paced agile environment, under minimal supervision, with guidance from senior team members Participating in analysis on operational issues Participating in peer reviews for designs, code, and other work productsStrong knowledge of Oracle, SQL, RDBMS along with Python, Hadoop, Hive, Spark Experience in developing Hive & DBMS based applications Python programming background (scripting and object-oriented design) Coding experience with "big data" (Spark/PySpark, SQL, Hadoop, ETL development) Experience implementing statistical models in python (Jupyter notebooks, scipy, numpy, pandas, Scikit-learn) Machine learning experience or knowledge

Overview:
Assess requirement and evaluate existing solutions
Build Process to interact with HDFS and Oracle using Python/ PySpark and Oracle PL/ SQL
Create Workflows, jobs and schedule them using Autosys
Works across development teams to contribute to the story refinement and delivery of data requirements through the delivery life cycle
Leverages architecture components in solution development, codes solutions to integrate, clean, transform, and control data as per acceptance criteria
Develops and executes test plans to produce quantitative results, identifies test issues and errors, and triages underlying causes
Drives complex information technology projects to ensure on-time delivery and adheres to team delivery and release processes
Identifies, defines, and documents data engineering requirements, communicating required information for deployment, maintenance, support, and business functionality
Ability to work independently with solid analytical skills
Ability to work with the team; excellent team player with great attitude
Data Resiliency Capabilities

Top 3 skills:
1. Oracle & PL/SQL Knowledge (Expert level)
2. Hadoop ecosystem, Hive Tables
3. Python/PySpark

Preferred Skills:
1. Autosys
2. Agile

Other Required Skills:Strong knowledge of Oracle, SQL, RDBMS along with Python, Hadoop, Hive, Spark Experience in developing Hive & DBMS based applications Python programming background (scripting and object-oriented design) Coding experience with "big data" (Spark/PySpark, SQL, Hadoop, ETL development) Experience implementing statistical models in python (Jupyter notebooks, scipy, numpy, pandas, Scikit-learn) Machine learning experience or knowledge

* Ladders Estimates

Similar Jobs

Data Engineer - Healthcare
$97K — $133K *
Remote
Reposted Today
Data Engineer
$90K — $110K *
Ragle Inc
North Richland Hills, TX 76180 (Tarrant County)
Today
Associate Data Cloud Business Transformation Architect
$110K — $218K *
Deloitte
Houston, TX 77084 (Harris County)
Today
Associate Data Cloud Business Transformation Architect
$110K — $218K *
Deloitte
Dallas, TX 75217 (Dallas County)
Today
Associate Data Cloud Business Transformation Architect
$110K — $218K *
Deloitte
Austin, TX 78745 (Travis County)
Today
Senior Principal System Administrator, Storage & Data Protection- McKinney, TX
$107K — $204K *
Raytheon Technologies
Mckinney, TX 75070 (Collin County)
Yesterday

Get Ready For Your
Next Interview

More Information Technology Jobs

Client Partner - Banking / Financial Services / Capital Markets
$325K — $350K + $100K bonus *
Large IT Services Firm (client of TechLink Systems)
New York, NY 10001 (New York County)
1 week ago
Senior Marketing Specialist
$85K — $144K *
SailPoint Technologies
Remote
Reposted Today
Sales Services Advisor – Managed Services (AMS & Premium Support)
$137K — $172K *
Blue Yonder
Dallas, TX 75217 (Dallas County)
Reposted Today
QA Automation Engineer
$96K — $163K *
San Mateo, CA 94403 (San Mateo County)
Reposted Today
Software Engineer (Backend) - MTS
$117K — $176K *
Salesforce
Redwood City, CA 94061 (San Mateo County)
Reposted Today

Find similar Data Engineer jobs:

Nationwide Plano, TX

Data Engineer

Job Overview by Ladders

Full Job Description

Get Ready For Your Next Interview

Find similar Data Engineer jobs:

Get Ready For Your
Next Interview