Infosys

Technology Lead - Spark Scala Developer

Infosys$92K — $123K *
Enterprise Technology
Less than 5 years of experience
Job Overview by Ladders

Qualifications

  • Eligible to work in Canada without employer-based visa sponsorship.
  • Located within commuting distance of Mississauga, Ontario, or willing to relocate.
  • Bachelor's degree or equivalent experience in IT.
  • Minimum of 4 years of IT experience.
  • 4+ years of experience with Big Data technologies, particularly Spark and Scala.
  • Strong expertise in Apache Spark (Core, SQL, DataFrames, RDDs) and PySpark.
  • Hands-on experience with Hadoop, Kafka, and NoSQL databases.

Responsibilities

  • Design and develop large-scale data processing pipelines using Apache Spark (Scala & PySpark).
  • Build and optimize data processing workflows using Spark, Kafka, and the Hadoop ecosystem.
  • Develop Spark applications for complex transformations using RDDs and DataFrames.
  • Implement streaming pipelines with Kafka and Spark Streaming/Structured Streaming.
  • Maintain and develop data lake solutions using HDFS, Hive, and NoSQL.
  • Optimize existing SQL/Hive workloads into Spark jobs for performance improvements.
  • Collaborate with cross-functional teams to translate business requirements into technical solutions.

Benefits

  • Work in a challenging and dynamic environment with cutting-edge technology.
  • Opportunity to work with a diverse, global team.
  • Access to continuous learning and professional development resources.
  • Engagement in innovative projects focused on big data and analytics.
  • Potential for career advancement in a large multinational company.
Full Job Description
Job details

Job Role

Technology Lead - CAN

Work Location

Mississauga

State / Region / Province

Ontario

Country

Canada

Skills

Technology|Big Data - Data Processing|Spark, Technology|Functional Programming|Scala

Domain

Delivery

Interest Group

Infosys Limited

Company

ITL Canada

Requisition ID

149066BR

Infosys is seeking an experienced Spark Scala Developer to design, develop, and optimize scalable bigdata solutions. The candidate will work on building high-performance batch and real-time data pipelines leveraging the Hadoop ecosystem and distributed computing frameworks(Spark). The role involves working closely with data engineers, architects, and business stakeholders to deliver robust, scalable, and efficient data processing systems.

Required Qualifications:
  • Candidates authorized to work for any employer in Canada without employer-based visa sponsorship are welcome to apply. Infosys is unable to provide immigration sponsorship for this role at this time.
  • Candidate must be located within commuting distance of Mississauga, Ontario or be willing to relocate to the areas.
  • Bachelor's degree or foreign equivalent required from an accredited institution. Will also consider three years of progressive experience in the specialty in lieu of every year of education.
  • At least 4 years of Information Technology experience
  • 4+ years of experience in Big Data technologies.
  • Strong expertise in:
    • Apache Spark (Core, SQL, DataFrames, RDDs)
    • Scala programming
    • PySpark
  • Hands-on experience with:
    • Kafka (real-time streaming)
    • Hadoop ecosystem (HDFS, Hive, Impala)
    • NoSQL Databases (HBase, MongoDB, Couchbase)
  • Strong understanding of distributed computing concepts and data processing frameworks.
  • Experience in building ETL/data pipelines for large-scale datasets.
  • Proficiency in SQL and data modeling.
Preferred Qualifications:
  • Hands-on experience with data lakes, data warehouses, and scalable ETL pipeline design, including batch and real-time processing architecture.
  • Strong understanding and practical exposure to Agile software development methodologies (Scrum) and SDLC practices.
  • Proven experience in Banking domain, supporting use cases such as fraud detection, risk analytics, regulatory reporting, and customer insights.
  • Excellent analytical, problem-solving, and communication skills, with the ability to translate business requirements into scalable technical solutions.
  • Demonstrated ability to work effectively in cross-functional, multi-stakeholder environments, collaborating with Business, Data Engineering, and Architecture teams.
  • Experience with real-time data streaming frameworks such as Kafka and Spark Streaming for low-latency processing.
  • Understanding data modeling concepts (dimensional modeling, snowflake schemas) to support analytics workloads.
  • Experience and desire to work in a global delivery environment.
Key Responsibilities:
  • Design and develop large-scale data processing pipelines using Apache Spark (Scala & PySpark)
  • Build and optimize batch and real-time data processing workflows using Spark, Kafka, and Hadoop ecosystem
  • Develop Spark applications using RDDs, DataFrames, and Spark SQL for complex transformations
  • Develop and optimize PySpark applications leveraging joins, Spark DAG execution flow, stage optimization, transformation techniques, and streaming with dynamic allocation and failover handling.
  • Implement streaming pipelines using Kafka and Spark Streaming / Structured Streaming
  • Develop and maintain HDFS, Hive, NoSql and Impala-based data lake solutions
  • Convert existing SQL/Hive workloads into optimized Spark jobs for improved performance
  • Work with ETL pipelines to ingest, cleanse, transform, and process large datasets
  • Optimize performance through partitioning, caching, serialization, and tuning techniques
  • Handle data formats such as Parquet, ORC, Avro, JSON
  • Integrate multiple data sources including streaming systems, flat files RDBMS, and APIs
  • Collaborate with cross-functional teams to understand business requirements and translate them into scalable technical solutions
  • Ensure data quality, reliability, and performance monitoring across pipelines
  • Participate in code reviews, design discussions, and best practices implementation
Key Skills:
  • Distributed Data Processing.
  • Spark Optimization & Performance Tuning.
  • Real-time Data Streaming.
  • Data Modeling & ETL Design.
  • Problem-solving and Analytical Thinking.
  • Strong Communication & Stakeholder Management.
Nice to Have:
  • Exposure to Machine Learning pipelines or MLOps workflows.
  • Experience with Databricks platform.
  • Experience with AWS/GCP.
Summary:
This role requires a highly skilled Spark Scala Developer with strong expertise in big data engineering, streaming systems, and distributed computation, capable of building scalable, high-performance data platforms supporting enterprise analytics.

Estimated annual compensation range for the candidate based in the below location will be:
Ontario: $ 92740 to $ 123375

The job entails sitting as well as working at a computer for extended periods of time. Should be able to communicate by telephone, email or face to face. Travel may be required as per the job requirements

About Infosys

Infosys Limited is an Indian multinational corporation that provides business consulting, information technology and outsourcing services. It has its headquarters in Bangalore, Karnataka, India. Infosys is the second-largest Indian IT company after Tata Consultancy Services by 2017 revenue figures and the 596th largest public company in the world based on revenue. On 31 March 2018, its market capitalisation was $37.32 billion. The credit rating of the company is A? (rating by Standard & Poor's).
Learn more about Infosys
Size
314,015 employees
Market Cap
$77.5 billion
Industry
Net Income
$178.5 billion
Founded
2004
5 Year Trend
+12.2%
Revenue
$945.9 billion
NASDAQ

Similar Jobs

More Jobs at Infosys

More Enterprise Technology Jobs

Find similar Technology Lead - Spark Scala Developer jobs: