Infosys

SRE Druid Support

Infosys$90K — $127K *
Information Technology
Less than 5 years of experience
Job Overview by Ladders

Qualifications

  • 5-7 years of experience in IT infrastructure management
  • Deep understanding of Apache Druid at scale
  • Familiarity with Kubernetes, AWS, Hadoop, and Docker
  • Experience in incident management and root cause analysis
  • Bachelor's degree or equivalent experience required

Responsibilities

  • Collaborate with teams to resolve incidents and analyze root causes
  • Evaluate IT infrastructure and prepare actionable reports
  • Design scalable, cost-effective IT infrastructure solutions
  • Manage deployment protocols and test post-deployment changes
  • Coordinate maintenance and emergency fixes while minimizing system disruption
  • Analyze performance data to support capacity planning
  • Implement security measures and conduct compliance audits

Benefits

  • Medical, dental, vision, and life insurance
  • Long-term and short-term disability coverage
  • Health and dependent care reimbursement accounts
  • Diverse insurance options including accident and critical illness
  • 401(k) plan with employer contributions
  • Paid holidays plus paid time off
Full Job Description
Job details

Job Role

Infrastructure Consultant 2

Career Role

Consultant - Infrastructure Management - US

Work Location

Austin, TX, Sunnyvale, CA

State / Region / Province

California, Texas

Country

USA

Domain

Delivery

Interest Group

Infosys Limited

Company

ITL USA

Requisition ID

147262BR

Technical Skills 1

Technology|DevOps|Site Reliability Engineering(SRE)

Technical Skills 2

Technology|Big Data - Hadoop|Hadoop

Technical Skills 3

Technology|Container Platform|Kubernetes

Technical Skills 4

Technology|Cloud Platform|AWS Container services

Technical Skills 5

Technology|Container Platform|Docker

In the assigned Job Role of Infrastructure Consultant 2, your Area Of Responsibility will be as below:
• Collaborate with internal and client teams to resolve complex incidents, conduct root cause analyses, and document findings with preventive recommendations
• Participate in evaluation of client IT infrastructure, prepare actionable assessment reports, and support due diligence to document infrastructure maturity and improvement opportunities
• Contribute to the design of scalable, cost-effective IT infrastructure solutions, review reusable components, and develop technical documentation for deployed systems
• Align release schedules and environment readiness, execute deployments as per protocols, perform post-deployment testing, and manage version control to track changes
• Co-ordinate maintenance schedules, emergency fixes, and technology upgrades while ensuring uninterrupted integration into existing systems and processes
• Facilitate performance data analysis across systems, coordinate insights on system behavior, and support capacity planning to optimize performance
• Conduct security checks, recovery drills, and compliance audits, implement security measures, and coordinate continuity plans to maintain adherence to standards
• Gather feedback to identify automation opportunities, analyze existing infrastructure processes, and propose enhancements for efficiency gains
• Act as liaison with onsite, offshore, and vendor teams to document project requirements, ensuring effective collaboration
• Develop a centralized repository of technical and procedural knowledge, leveraging insights from other projects to drive efficiency and retain organizational expertise

Your contribution to the team:
• A collaborative spirit and excellent communication skills.
• Ability to handle complex incidents and implement resolutions
• A knack for conducting IT infrastructure assessment and identifying key optimization opportunities
• Focused approach towards deployment management, system optimization, and process automation initiatives including sector specific focus
• The ability to work with cross-functional teams

Required Skill and Experience
• Deep understanding and experience in administration & usage of Apache Druid at scale.
• Deep understanding and experience in one or more of the following - Kubernetes, AWS, Hadoop, Flink, Docker, Spinnaker, Helm.
• Ensure 24x7 availability and stability of Druid and supporting platforms
Perform:
Cluster operations (start, stop, restart, rolling upgrades)
Capacity planning and infrastructure scaling
Performance tuning and resource optimization
Participate in incident management, root cause analysis (RCA), and problem management
Prepare and maintain SOPs, runbooks, and operational documentation
Support change management, patching, upgrades, and security compliance
• Understanding of SRE principles and goals along with good Oncall experience
• Experience and understanding on Scaling, Capacity Planning and Disaster Recovery
• This role involves close collaboration with systems and network engineers, DBAs, and monitoring and security teams.
• As the primary point of contact for ingestion and query services, with a particular focus on technologies like Druid running across diverse environments including AWS, Kubernetes, and baremetal, leveraging your expertise in systems like Kafka, Flink, and Hadoop to ensure adherence to Service Level Agreements (SLAs).
• Develop and maintain code and documentation to solve critical challenges within some of the world's largest systems, and improve the entire service lifecycle from design to decommissioning.

Preferred Skill and Experience
• Experience working on supporting Java applications is a plus.
• Experience using monitoring and logging solutions like Prometheus, Grafana, Splunk etc.
• Experience in AWS, Hadoop

Additional Required Qualifications
• Bachelor's degree or foreign equivalent required from an accredited institution. Will also consider three years of progressive experience in the specialty in lieu of every year of education.
• This position may require relocation and/or travel to work/project location.
• Candidates authorized to work for any employer in the United States without employer-based visa sponsorship are welcome to apply. Infosys is unable to provide immigration sponsorship for this role now or in the future.

Additional Details

Estimated annual compensation range for candidate based in the below locations will be
Sunnyvale, CA: $90751 to $127084

Benefits

Along with competitive pay, as a full-time Infosys employee you are also eligible for the following benefits:
  • Medical/Dental/Vision/Life Insurance
  • Long-term/Short-term Disability
  • Health and Dependent Care Reimbursement Accounts
  • Insurance (Accident, Critical Illness , Hospital Indemnity, Legal)
  • 401(k) plan and contributions dependent on salary level
  • Paid holidays plus Paid Time Off

About Infosys

Infosys Limited is an Indian multinational corporation that provides business consulting, information technology and outsourcing services. It has its headquarters in Bangalore, Karnataka, India. Infosys is the second-largest Indian IT company after Tata Consultancy Services by 2017 revenue figures and the 596th largest public company in the world based on revenue. On 31 March 2018, its market capitalisation was $37.32 billion. The credit rating of the company is A? (rating by Standard & Poor's).
Learn more about Infosys
Size
314,015 employees
Market Cap
$77.5 billion
Industry
Net Income
$178.5 billion
Founded
2004
5 Year Trend
+12.2%
Revenue
$945.9 billion
NASDAQ

Similar Jobs

More Jobs at Infosys

More Information Technology Jobs

Find similar SRE Druid Support jobs: