Technical Program Manager - Cloud Machine Learning and Compute Services
Google India Pvt LtdJob Description
Technical Program Manager, Cloud ML and Compute Services
Minimum qualifications:
- Bachelor's degree in a technical field, or equivalent practical experience.
- 5 years of experience in program management.
Preferred qualifications:
- 5 years of experience managing cross-functional or cross-team projects.
- Familiarity with network, compute and storage at scale.
- Proficiency in software and data center hardware.
- Knowledge of software development processes.
- Understanding of data center terminology and operations.
About the job
A problem isn’t truly solved until it’s solved for all. That’s why Googlers build products that help create opportunities for everyone, whether down the street or across the globe. As a Technical Program Manager at Google, you’ll use your technical expertise to lead complex, multi-disciplinary projects from start to finish. You’ll work with stakeholders to plan requirements, identify risks, manage project schedules, and communicate clearly with cross-functional partners across the company. You're equally comfortable explaining your team's analyses and recommendations to executives as you are discussing the technical tradeoffs in product development with engineers.
The Cloud ML Compute Services (MCS) team defines and drives the overall Cloud ML Compute IaaS and IaaS+ product offering and technical strategy. We enable our customers with the best AI/ML Infra in the world for talent powered by TPUs, GPUs.
As a Technical Program Manager, you will remove the bottleneck of complex cluster setup through automation, so AI/ML researchers and engineers, cloud architects, and IT admins can focus on their end goals rather than configuration. You will mitigate slowdowns in distributed training through optimized workload placement and understanding of physical data center networks. You will manage job queues and ensure efficient resource allocation through its managed Slurm environment. You will provide observability into all cluster components and an evolving topology view for cluster health.Google Cloud accelerates every organization’s ability to digitally transform its business and industry. We deliver enterprise-grade solutions that leverage Google’s cutting-edge technology, and tools that help developers build more sustainably. Customers in more than 200 countries and territories turn to Google Cloud as their trusted partner to enable growth and solve their most critical business problems.
Responsibilities
- Manage the Product Development Cycle (PLC) for products that simplify the deployment and management of high-performance computing (HPC) and AI/ML clusters on Google Cloud (e.g., cluster director and cluster toolkit).
- Manage project schedules, identify technical risks and clearly communicate them to project stakeholders.
- Collaborate with engineering manager and TL to define the software qualification and rollout strategy.
- Collaborate with engineering, product, and stakeholders to ensure alignment on operational priorities and customer needs.
- Manage escalations and critical incidents, ensuring timely resolution and effective communication.
Experience Level
Mid LevelJob role
Job requirements
About company
Similar jobs you can apply for
MarketingSales & Marketing Executive
Smartprime Management (OPC) Private Limited
Insurance Manager
Flexi InsuranceField Officer
Muthoot Finance
Retail Sales Executive
Tata CromaRetail Staff
Sowparnika Retail