Senior Manager, AI Infrastructure Network Operations

Oracle Corporation • $118K — $251K *

Seattle, WA 98115 • In-Person

Information Technology

Less than 5 years of experience

Reposted Today

Job Overview by Ladders

Qualifications

7+ years of experience in network operations and engineering
Strong expertise in RDMA/RoCE network fabrics
Proven leadership in managing large-scale cloud environments
Experience with software architecture and development
Strong customer focus and communication skills
Ability to oversee a large global team of engineers
Familiarity with cloud-based technologies and infrastructure

Responsibilities

Direct the design and operation of high-performance network systems
Manage software development tasks related to OS and application design
Enhance existing software architectures and propose improvements
Lead technical initiatives in network operation for AI workloads
Oversee customer-facing network technology deployments
Coordinate with cross-functional teams to ensure system reliability
Monitor performance metrics and optimize network infrastructure

Benefits

Comprehensive medical, dental, and vision insurance
Short and long-term disability coverage
401(k) plan with company matching
Flexible vacation and paid time off policies
Paid parental leave and adoption assistance
Employee Stock Purchase Plan and financial planning services
Voluntary benefits including auto and pet insurance

Full Job Description

The OCI AI Infrastructure Network Operations team operates and improves the RDMA/RoCE network fabrics that power OCI's largest AI, GPU, and HPC workloads. These high-performance fabrics are the foundation for frontier AI customers and tier-0 workloads running on Oracle Cloud Infrastructure.
As a Senior Manager, you will lead a team responsible for the development, operation, and improvement of large-scale RDMA network fabrics and supporting systems. This role requires deep networking expertise, especially in RDMA/RoCE, Clos fabrics, congestion control, telemetry, and performance troubleshooting, combined with software engineering experience. You will help build and improve tools, automation, monitoring, and operational systems that make these fabrics more reliable, observable, and efficient at global cloud scale.
You will work closely with Network Availability, Network Automation, Network Monitoring, GNOC, hardware engineering, and service teams to resolve complex customer escalations, improve operational readiness, and drive engineering programs that increase performance and availability. The ideal candidate brings both hands-on technical depth and strong people leadership, with experience managing engineers who operate and build software for large-scale distributed infrastructure.

Responsibilities

As a Senior Manager in the AI Infrastructure Network Operations organization, you will:

Manage and develop a team of engineers responsible for RDMA/RoCE fabric operations, performance, automation, and troubleshooting.
Lead operational and software engineering efforts that improve the reliability, availability, observability, and performance of OCI AI/HPC networking fabrics.
Apply deep networking knowledge, including RDMA, RoCE, Ethernet fabrics, congestion control, QoS, telemetry, and large-scale troubleshooting.
Apply software architecture and development experience to guide the design, debugging, and enhancement of operational tools, automation platforms, monitoring systems, and infrastructure services.
Drive improvements within existing software and network architectures, and identify opportunities to simplify, automate, and scale operational workflows.
Support customer escalations, NOC events, and complex production incidents by coordinating technical investigation across networking, software, hardware, and operations teams.
Define and execute team roadmaps focused on engineering efficiency, operational excellence, network performance, and service availability.
Build and maintain data-driven metrics that represent fabric health, operational backlog, customer impact, performance trends, and business-critical service status.
Partner with Network Availability, Network Automation, Network Monitoring, GNOC, deployment, and hardware teams to deliver reliable service to OCI customers.
Ensure operational planning, staffing, readiness, and execution meet corporate and service-level expectations.
Participate in the manager on-call rotation and provide leadership during high-severity incidents.
Attract, mentor, and grow engineers with a mix of networking, software development, automation, and distributed systems experience.

Preferred Experience

Strong background in operating or building networks for large-scale cloud environments.

Experience with RDMA/RoCE, GPU/HPC networking, Clos fabrics, congestion management, telemetry, and performance debugging.

Disclaimer:

Certain US customer or client-facing roles may be required to comply with applicable requirements, such as immunization and occupational health mandates.

Range and benefit information provided in this posting is specific to the stated locations only.

US: Hiring Range in USD from: $118,300 to $251,600 per annum. May be eligible for bonus, equity, and compensation deferral.

Oracle maintains broad salary ranges for its roles in order to account for variations in knowledge, skills, experience, market conditions and locations, as well as reflect Oracle's differing products, industries and lines of business.
Candidates are typically placed into the range based on the preceding factors as well as internal peer equity.

Oracle US offers a comprehensive benefits package which includes the following:
1. Medical, dental, and vision insurance, including expert medical opinion
2. Short-term disability and long-term disability
3. Life insurance and AD&D
4. Supplemental life insurance (Employee/Spouse/Child)
5. Health care and dependent care Flexible Spending Accounts
6. Pre-tax commuter and parking benefits
7. 401(k) Savings and Investment Plan with company match
8. Paid time off: Flexible Vacation is provided to all eligible employees assigned to a salaried (non-overtime eligible) position. Accrued Vacation is provided to all other employees eligible for vacation benefits. For employees working at least 35 hours per week, the vacation accrual rate is 13 days annually for the first three years of employment and 18 days annually for subsequent years of employment. Vacation accrual is prorated for employees working between 20 and 34 hours per week. Employees working fewer than 20 hours per week are not eligible for vacation.
9. 11 paid holidays
10. Paid sick leave: 72 hours of paid sick leave upon date of hire. Refreshes each calendar year. Unused balance will carry over each year up to a maximum cap of 112 hours.
11. Paid parental leave
12. Adoption assistance
13. Employee Stock Purchase Plan
14. Financial planning and group legal
15. Voluntary benefits including auto, homeowner and pet insurance

The role will generally accept applications for at least three calendar days from the posting date or as long as the job remains posted.
Career Level - M3

About Oracle Corporation

Oracle Dyn Global Business Unit is a pioneer in managed DNS and a leader in cloud-based infrastructure that connects users with digital content and experiences across a global internet. Dyn's solution is powered by a global network that drives 40 billion traffic optimization decisions daily for more than 3,500 enterprise customers, including preeminent digital brands such as Netflix, Twitter, LinkedIn, and CNBC. Adding Dyn's best-in-class DNS and email services extends the Oracle cloud computing platform and provides enterprise customers with a one-stop shop for Infrastructure-as-a-Service (IaaS) and Platform-as-a-Service (PaaS). On January 31, 2017, Oracle completed the acquisition of Dyn, which now operates as an Oracle Infrastructure-as-a-Service (IaaS) global business unit (GBU).

Oracle Corporation Careers

Join Oracle Corporation, a global leader in technology and innovation, and be part of a team that values professional growth, leadership, and diversity. At Oracle, we offer unparalleled job opportunities in the tech industry, fostering a culture of innovation and continuous improvement.

Work You’ll Do

At Oracle, your work will directly impact the future of technology across industries. As part of our team, you will lead projects that redefine the way businesses operate, leveraging Oracle’s cutting-edge technology solutions. Our commitment to leadership in the tech community means you’ll be working at the forefront of innovation, enhancing your skills through hands-on experience and comprehensive diversity training.

Join Our Dynamic Team

Oracle is not just a technology company; we are a team of dedicated professionals committed to creating a supportive and inclusive environment. Here, every team member’s contribution is valued, and diversity is celebrated. With Oracle, you are not just accepting a job; you are joining a community that promotes personal and professional growth through constant learning and development opportunities.

Innovative Work and Career Advancement

Embrace the chance to do innovative work with Oracle Corporation, where we push the boundaries of what is possible. With over 130,000 dedicated professionals globally, Oracle offers a workplace where innovation and thought leadership thrive. This environment is perfect for those who are driven to explore new ideas and are eager for opportunities to advance their careers.

Explore Job Opportunities and Internships

Whether you’re a seasoned professional looking for your next career challenge or a student seeking a promising internship, Oracle provides a range of opportunities. Explore positions that match your skills and interests in areas such as cloud computing, enterprise software, and business analytics. Our hiring process is designed to find not just the right skills but also the right fit for Oracle’s unique culture.

Benefits and Culture

Oracle is committed to supporting our employees’ life and work ambitions. We offer competitive benefits, including health insurance, retirement plans, and wellness programs, all designed to support your career and well-being. Our culture of empowerment encourages networking and collaboration across teams and geographies, ensuring that innovation and creativity flourish.

Develop Your Skills Through Training and Networking

Prepare for your future with Oracle’s comprehensive training programs. From leadership development to technical skills enhancement, we provide the tools necessary to succeed in your career and stay ahead in the industry. Networking within Oracle’s global community will also open doors to collaborative opportunities and career advancement.

Stay Connected with Oracle Careers

Keep up to date with the latest from Oracle Corporation by following our careers blog. Gain insights from the experts and learn about new job openings as they become available. Personalize your job search and stay informed about Oracle’s career events and professional development opportunities.

Join Oracle Corporation—Where Careers Grow

At Oracle, we believe in nurturing the potential of our employees. The growth of our company is driven by the individual successes of our team members. We invite you to bring your unique talents to Oracle, join our mission to drive technological innovation, and help shape the future of the digital world.

Search Oracle Jobs

Ready to take the next step in your career? Search for open positions that align with your skills and passions. We are continuously looking for curious, creative, and motivated individuals to join our team. Explore the opportunities and find out how you can contribute to the success of Oracle Corporation.

Oracle Corporation: Leadership, Innovation, Opportunity.

Learn more about Oracle Corporation

Size: 143,000 employees
Market Cap: $217.3 billion
Industry: Enterprise Technology
Net Income: $12.8 billion
Founded: 1977
5 Year Trend: +2.3%
Revenue: $39.6 billion
NASDAQ: ORCL

* Ladders Estimates
