Lambda

Director, Data Center Operations - North America

$130K — $180K *
Information Technology
8 - 10 years of experience
Job Overview by Ladders

Qualifications

  • 10+ years in data center operations with 7+ years in leadership roles.
  • Experience supporting AI, HPC, or cloud infrastructure at scale.
  • Knowledge of power and cooling systems, networking, and facility automation tools.
  • Proven track record in improving operational efficiency and vendor management.
  • Bachelor's degree in Engineering, Computer Science, or related field preferred; Master's is a plus.
  • Strong communication and stakeholder management skills.
  • Willingness to travel up to 50% across North America.

Responsibilities

  • Develop and execute data center operations strategy for North America.
  • Drive continuous improvement emphasizing sustainability and efficiency.
  • Collaborate with engineering teams to forecast AI and GPU compute needs.
  • Lead multi-site operations to ensure 24/7 reliability across facilities.
  • Establish standardized procedures for maintenance and service delivery.
  • Monitor operational KPIs to ensure compliance with standards.
  • Implement AI-driven solutions for system performance and maintenance.

Benefits

  • Health, dental, and vision coverage for employees and dependents.
  • Wellness and Commuter stipends for select roles.
  • 401k Plan with 2% company match for USA employees.
  • Flexible Paid Time Off Plan that is actively encouraged.
Full Job Description
Lambda, The Superintelligence Cloud, builds Gigawatt-scale AI Factories for Training and Inference. Lambda's mission is to make compute as ubiquitous as electricity and give every person access to artificial intelligence. One person, one GPU.

If you'd like to build the world's best deep learning cloud, join us.

Lambda, Inc. is seeking a highly skilled and experienced Director of Data Center Operations to lead and support Lambda Data Center Operations in North America.

What You'll Do:

As Director of Data Center Operations for North America, you will lead and support large-scale AI and high-performance computing (HPC) infrastructure across all of Lambda's North American data centers. You will oversee all aspects of data center operations, including reliability, hardware break/fix, capacity planning, provider interface, team mentorship, and new data center setup, ensuring world-class uptime, customer response, and scalability to meet rapidly growing AI infrastructure demands.

Key Responsibilities:

Strategic Leadership
  • Develop and execute the North American data center operations strategy aligned with AI infrastructure goals and organizational growth.
  • Drive continuous improvement across facility operations, emphasizing sustainability, efficiency, and resilience.
  • Partner with Engineering, Capacity Planning, and Infrastructure teams to forecast and support future AI and GPU-based compute requirements, and provide operational feedback on designs and system improvements.
  • Oversee expansion projects, retrofits, and site selection in collaboration with Data Center Infrastructure Engineering and HPC Architecture teams.

Operational Excellence
  • Lead a multi-site operations team ensuring 24/7/365 reliability, availability, and SLA response across all facilities.
  • Establish standardized procedures, metrics, and best practices for preventive maintenance, incident management, and service delivery.
  • Monitor operational KPIs including uptime, PUE, safety, and compliance with corporate and regulatory standards.
  • Implement automation and AI-driven monitoring solutions to optimize system performance and predictive maintenance.
  • Coordinate data center provider maintenance windows and communicate them to customers and impacted teams.

Team Leadership and Development
  • Build, mentor, and scale a high-performing team of operations managers, technicians, and engineers across multiple regions.
  • Routinely visit all sites to maintain standards, develop relationships, and identify areas of efficiency.
  • Foster a culture of safety, accountability, and continuous learning, driving data center operations teams to take on more responsibility and work up the stack.
  • Assist in the build-out of new data center whitespace and the deployment of AI infrastructure.

Financial and Vendor Management
  • Develop and manage operating budgets, capital expenditures, and cost-optimization initiatives.
  • Oversee strategic vendor partnerships with numerous data center providers for power, cooling, maintenance, and critical infrastructure components.

Risk and Compliance
  • Ensure compliance with environmental, safety, and industry regulations (e.g., NFPA, OSHA, ISO standards).
  • Lead incident response and root cause analysis to drive preventive improvements for incidents related to data center operations or infrastructure.
  • Act as the primary point of contact for compliance audits related to data center operations, such as SOC 2 and ISO.

Qualifications:
  • 10+ years of experience in data center operations, with at least 7 years in a leadership role managing multi-site or hyperscale facilities.
  • Proven experience supporting AI, HPC, or cloud infrastructure at scale.
  • Deep understanding of power and cooling systems, networking, capacity planning, and facility automation tools (DCIM, BMS, etc.).
  • Strong track record of improving operational efficiency and managing relationships with data center providers.
  • Bachelor's degree in Engineering, Computer Science, or related field preferred; a Master's degree is a plus.
  • Exceptional communication, cross-functional collaboration, and stakeholder management skills. Ability to build relationships, consensus, and a positive team culture.
  • Willingness to travel (up to 50%) to data center sites across North America, including sites under construction.

Preferred Skills:
  • Experience with GPU clusters, AI infrastructure networking, and large-scale storage systems.
  • Familiarity with cloud-scale operational practices (e.g., AWS, Google, Microsoft data center standards).
  • Certifications such as CDCDP, CDCP, PMP, or PE are a plus.

Salary Range Information

The annual salary range for this position has been set based on market data and other factors. However, a salary higher or lower than this range may be appropriate for a candidate whose qualifications differ meaningfully from those listed in the job description.

About Lambda
  • Founded in 2012, ~400 employees (2025) and growing fast
  • We offer generous cash & equity compensation
  • Our investors include Andra Capital, SGW, Andrej Karpathy, ARK Invest, Fincadia Advisors, G Squared, In-Q-Tel (IQT), KHK & Partners, NVIDIA, Pegatron, Supermicro, Wistron, Wiwynn, US Innovative Technology, Gradient Ventures, Mercato Partners, SVB, 1517, Crescent Cove.
  • We are experiencing extremely high demand for our systems, with quarter-over-quarter and year-over-year profitability
  • Our research papers have been accepted into top machine learning and graphics conferences, including NeurIPS, ICCV, SIGGRAPH, and TOG
  • Health, dental, and vision coverage for you and your dependents
  • Wellness and Commuter stipends for select roles
  • 401k Plan with 2% company match (USA employees)
  • Flexible Paid Time Off Plan that we all actually use

A Final Note:

You do not need to match all of the listed expectations to apply for this position. We are committed to building a team with a variety of backgrounds, experiences, and skills.

Equal Opportunity Employer

Lambda is an Equal Opportunity employer. Applicants are considered without regard to race, color, religion, creed, national origin, age, sex, gender, marital status, sexual orientation and identity, genetic information, veteran status, citizenship, or any other factors prohibited by local, state, or federal law.

