Advanced Micro Devices, Inc

Sr. Software Development Engineer - Collectives and Network

$130K to $180K *
Information Technology
Less than 5 years of experience
Job Overview by Ladders

Qualifications

  • 5-7 years of experience in software optimization and performance analysis
  • Deep knowledge of Network, NIC, and GPU hardware architecture
  • Hands-on experience mapping model architecture to low-level software and hardware
  • Expertise in AI frameworks such as PyTorch, JAX, and vLLM
  • Strong leadership skills and experience working with cross-functional teams
  • Excellent communication skills, both written and verbal
  • PhD or master's degree in computer science, electrical engineering, or related field

Responsibilities

  • Help with strategy and roadmap for AMD Collectives and Network optimizations
  • Provide guidelines on network load-balancing, workload scheduling, and model sharding
  • Tune and analyze performance of large-scale models across various applications
  • Participate in hardware-software co-design for future optimizations on network systems
  • Develop tools and infrastructure for performance estimation and reporting
  • Communicate performance analysis to stakeholders and provide recommendations
  • Collaborate with teams to identify opportunities and develop strategies

Benefits

  • Comprehensive AMD benefits package

Full Job Description

THE ROLE:

This software engineer role will help drive AMD's strategy, architecture, optimization, and tooling to achieve industry-leading AI pre-training and distributed inference performance on AMD GPUs. You will partner across hardware architecture, AI frameworks, compilers, runtime, ROCm, developer tools, and models to scale performance analysis and optimization.

As an engineer on Collectives and Network performance, you will drive end-to-end performance attainment across the entire software stack, focusing on getting the best performance from multiple generations of AMD GPUs across a wide range of models, including the latest state-of-the-art AI models. You will help set the strategy and roadmap for general optimization, accelerated support for new models, and out-of-box performance.

If you are passionate about performance optimization, getting the best out of the hardware, and shaping the future of AI acceleration, then this role is for you.

THE PERSON:

The ideal candidate will have deep knowledge of network, NIC, and GPU hardware architecture, software optimization, performance modeling, AI frameworks, and the latest trends in inference and training optimization. The candidate should have hands-on experience mapping model architecture to low-level software and hardware, and should understand the impact of each layer of the stack on model performance. Strong knowledge of the latest generative model architectures, especially SoTA models, and of distributed inference and deployment at scale is crucial.

KEY RESPONSIBILITIES:
  • Help with strategy and roadmap for AMD Collectives and Network optimizations.
  • Provide guidelines to customers on efficient network load-balancing, workload scheduling and model sharding strategies.
  • Tune, profile, and analyze the performance of large-scale models for LLM, diffusion, multimodal, RecSys, and generative AI workloads, both single-node and distributed, and explore the associated tradeoffs and design decisions.
  • Participate in hardware-software co-design for future hardware optimizations, especially for scale-up networks, NICs, and scale-out networks.
  • Develop and improve frameworks, tools, and infrastructure for performance estimation, modeling, and reporting.
  • Communicate and present the results of performance analysis and modeling to stakeholders and senior leadership, and provide concrete recommendations.
  • Collaborate across teams and the wider organization to identify opportunities and develop strategies.


PREFERRED EXPERIENCE:
  • Multiple years of technical experience in performance optimization.
  • Strong technical expertise and experience in performance analysis, projection, and network hardware architecture.
  • Deep knowledge of and hands-on experience with AI frameworks such as PyTorch, JAX, vLLM, and SGLang.
  • Strong technical leadership skills and the ability to work collaboratively with cross-functional teams.
  • Experience mentoring, coaching, and inspiring a diverse and talented team of researchers and engineers.
  • Excellent written, verbal, and presentation skills, and the ability to coordinate internally and externally.


ACADEMIC CREDENTIALS:
  • A PhD or master's degree in computer science, electrical engineering, or a related field.

LOCATION:

San Jose, CA (hybrid)


Benefits offered are described in "AMD benefits at a glance."

About Advanced Micro Devices, Inc

Advanced Micro Devices, Inc. Careers

Join the innovative forefront of technology with a career at Advanced Micro Devices, Inc. (AMD), a leader in semiconductor development. As part of our global team, you will contribute to an organization renowned for its dedication to innovation, leadership, and diversity in the tech industry.

Work You’ll Do

At AMD, we offer job opportunities that push the boundaries of what is possible. Our team is composed of professionals who lead the way in microprocessor and graphics technology, driving industry standards and innovation. With AMD, you will be part of a culture that values growth and professional development, ensuring that every team member has the opportunity to excel.

Transform Your Career

AMD is not just about advancing technology, but also about advancing careers. Whether you are looking for an internship, a full-time position, or leadership roles, AMD provides the platform to propel your career to new heights. Our commitment to professional growth is matched by our dedication to diversity and inclusion, making AMD a place where everyone can thrive.

Innovative Work Environment

Join a team of over 12,000 dedicated professionals at the intersection of technology, industry expertise, and digital innovation. At AMD, you will work on groundbreaking projects that shape the future of computing and graphics. Our collaborative environment encourages networking and the sharing of ideas across teams and disciplines.

Career Development and Benefits

AMD is committed to the development of its employees. We offer robust training programs, including leadership development and diversity training, to ensure our team is equipped for both current challenges and future opportunities. Our benefits package is designed to support the well-being and financial security of our employees and their families.

Explore Job Opportunities

From engineering to marketing, AMD offers a range of career paths that cater to diverse skills and interests. Our hiring process is designed to be transparent and engaging, helping you to understand where you fit within our team and how you can contribute to our collective goals.

Stay Connected

Join our team: search open positions that match your skills and interests. We look for passionate, curious, creative, and solution-driven team players. Explore the opportunities to join a company that’s committed to your career growth and to innovation in the technology sector.


Interview and Resume Tips

Prepare for your future with AMD by accessing resources that help you craft your resume and excel in interviews. Our goal is to help you showcase your best professional self and align your skills with the needs of our dynamic team. At Advanced Micro Devices, Inc., we empower our employees to innovate, lead, and grow. Join us in driving the future of technology while building a rewarding and sustainable career.
Learn more about Advanced Micro Devices, Inc
Size: 15,500 employees
Market Cap: $100.9 billion
Industry
Net Income: $2.4 billion
Founded: 1969
5 Year Trend: +30.9%
Revenue: $9.7 billion
NASDAQ
