Principal AI & Machine Learning Engineer, Spring, Texas, Onsite
This role has been designed as ''Onsite' with an expectation that you will primarily work from an HPE office.
Job Description:We are looking for an experienced Principal AI Engineer to drive the design, development, and deployment of AI/ML-powered applications. Candidate should have strong hands-on experience in application development, lead and mentor a team of AI developers, define best practices, and deliver scalable, production grade AI solutions aligned with business goals.
Location: Spring, TexasOnsite daily work required Key Responsibilities- Design, develop, and deploy AI applications, microservices, and APIs on Kubernetes-based infrastructure, ensuring scalability, reliability, and performance across development, staging, and production environments.
- Build and maintain end-to-end AI pipelines covering deployment, monitoring, versioning, and continuous improvement using modern MLOps/AIOps tools and practices.
- Lead and mentor a team of AI/ML engineers, conduct code reviews, and define best practices.
- Continuously evaluate and adopt emerging AI tools, frameworks, LLM technologies, and open-source solutions to enhance platform capabilities and team productivity.
- Collaborate closely with Business Analysts, Architect and technical teams to align AI engineering efforts with business objectives and ensure secure, compliant solutions.
- Establish and maintain technical documentation, deployment runbooks and SOPs
Required Qualifications- 10+ years of hands-on experience in software engineering, with a strong focus on AI/ML application development and deployment.
- Expertise in Kubernetes - container orchestration, Helm charts, pod management, scaling, and troubleshooting.
- Strong experience with MLOps/AIOps tools and practices (e.g., MLflow, Kubeflow, Airflow, model registries, monitoring frameworks).
- Hands-on experience with cloud platforms - Azure, AWS, or GCP, including their AI services.
- Strong programming skills in Python; familiarity with FastAPI, Flask, or similar frameworks is mandatory.
- Hands-on experience with CI/CD pipelines and tools such as GitOps, Docker, Jenkins, or GitHub Actions.
- Lead and mentor development teams, drive delivery, and manage technical priorities.
- Experience working with Agentic and GenAI frameworks and vector databases etc.
- Experience with observability and monitoring tools (Prometheus, Grafana, OpenTelemetry) for AI workloads.
- Good understanding of AI security, responsible AI principles, and governance frameworks.
Education- Bachelor's or Master's degree in Computer Science, Engineering, AI/ML, or a related field.
#unitedstates
What We Can Offer You:Health & WellbeingWe strive to provide our team members and their loved ones with a comprehensive suite of benefits that supports their physical, financial and emotional wellbeing.
Personal & Professional DevelopmentWe also invest in your career because the better you are, the better we all are. We have specific programs catered to helping you reach any career goals you have - whether you want to become a knowledge expert in your field or apply your skills to another division.
Unconditional InclusionWe are unconditionally inclusive in the way we work and celebrate individual uniqueness. We know varied backgrounds are valued and succeed here. We have the flexibility to manage our work and personal needs. We make bold moves, together, and are a force for good.
Let's Stay Connected:#unitedstates
#operations
Job:Engineering
Job Level:TCP_05
"The expected salary/wage range for this position is provided below. Actual offer may vary from this range based upon geographic location, work experience, education/training, and/or skill level.
- United States of America: Annual Salary USD 152,000 - 349,000 in Texas
The listed salary range reflects base salary. Variable incentives may also be offered."