Hardware Operations Technical Program Manager
OpenAI
About the Team
The Stargate team is responsible for building the physical infrastructure that powers OpenAI’s largest-scale AI systems. We design, deploy, and operate next-generation data center infrastructure across a rapidly expanding footprint, bringing together hardware, networking, facilities, supply chain, and deployment execution.
This work sits at the intersection of advanced AI hardware and real-world infrastructure delivery. Our team turns compute requirements into deployable, reliable, and scalable systems that support frontier AI workloads.
About the Role
We are seeking a Hardware Operations Technical Program Manager to drive execution across the lifecycle of AI infrastructure hardware programs.
In this role, you will own cross-functional program execution across hardware readiness, supplier coordination, deployment planning, rack-level integration, manufacturing operations, logistics, field deployment, and operational handoff. You will partner closely with hardware engineering, data center engineering, networking, supply chain, manufacturing, deployment, and operations teams to ensure critical infrastructure programs move from design intent to production readiness.
This role is ideal for someone who can operate at both the technical and programmatic levels: understanding hardware systems, identifying operational blockers, driving accountability across teams, and creating scalable processes for high-volume infrastructure deployment.
Key Responsibilities
- Drive end-to-end Hardware Operations readiness programs across AI infrastructure systems, including servers, racks, networking hardware, power and cooling interfaces, and related data center infrastructure.
- Develop and operationalize scalable hardware operations processes, workflows, and support models spanning deployment, repair operations, diagnostics, break/fix, escalation management, and sustaining operations.
- Lead cross-functional execution of Hardware Operations readiness initiatives, ensuring operational capabilities, tooling, documentation, staffing models, and workflows are established prior to production deployment and operational handoff.
- Partner across Hardware Engineering, Manufacturing, Supply Chain, Data Center Operations, Network Operations, Deployment, Reliability Engineering, and external suppliers to ensure alignment on operational requirements, supportability, and readiness milestones.
- Develop operational scorecards, reporting frameworks, and metric algorithms to measure hardware operational health, repair performance, deployment quality, readiness status, and execution efficiency.
- Identify operational, technical, supplier, tooling, and process risks early; drive mitigation plans, cross-functional alignment, and executive-level communication.
- Lead cross-functional issue resolution efforts during hardware deployment, validation, operational ramp, and sustaining operations, ensuring rapid containment, corrective action development, an