Senior Infrastructure Automation Engineer
Gruve.ai
Bengaluru/Bangalore
Not disclosed
Job Details
Job Description
Senior SDE - Infrastructure Automation
Key Roles & Responsibilities:
- Automate infrastructure deployment using Terraform, Ansible, and Helm for VMware and cloud environments.
- Develop and implement VMware workload migration strategies, including vMotion, HCX, SRM (Site Recovery Manager), and lift-and-shift migrations.
- Migrate VMware-based workloads to public cloud (AWS, Azure, GCP) or hybrid cloud environments.
- Optimize and manage AI POD workloads on VMware and Kubernetes-based environments.
- Leverage VMware HCX for live and bulk workload migrations, ensuring minimal downtime and optimal performance.
- Automate virtual machine provisioning and lifecycle management using VMware vSphere APIs, PowerCLI, or vRealize Automation.
- Integrate VMware workloads with Kubernetes for containerized AI/ML workflows.
- Ensure workload high availability and disaster recovery post-migration using VMware SRM, vSAN, and backup strategies.
- Monitor and troubleshoot migration performance issues using vRealize Operations, Prometheus, Grafana, and ELK.
- Develop and optimize CI/CD pipelines to automate workload migration, deployment, and validation.
- Ensure security and compliance for workloads before, during, and after migration.
- Work closely with cloud architects to design hybrid cloud solutions supporting AI/ML workloads.
Basic Qualifications:
- 5-8 years of experience in infrastructure automation, VMware workload migration, and cloud integration.
- Expertise in VMware vSphere, ESXi, vMotion, HCX, SRM, vSAN, NSX-T.
- Hands-on experience with workload migration tools such as VMware HCX, CloudEndure, AWS Application Migration Service, Azure Migrate.
- Proficiency in Infrastructure-as-Code (Terraform, Ansible, PowerCLI, vRealize Automation).
- Strong experience with Kubernetes (EKS, AKS, GKE) and containerized AI/ML workloads.
- Experience in public cloud migration (AWS, Azure, GCP) for VMware-based workloads.
- Hands-on knowledge of CI/CD tools (Jenkins, GitLab CI/CD, Argo CD, Tekton).
- Strong scripting and automation skills in Python, Bash, or PowerShell.
- Familiarity with disaster recovery, backup, and business continuity planning in VMware environments.
- Performance tuning and troubleshooting experience for VMware-based workloads.
Preferred Qualifications:
- Experience with NVIDIA GPU orchestration (Kubeflow, Triton, RAPIDS, etc.).
- Familiarity with Packer for automated VM image creation.
- Familiarity with Edge AI deployments, federated learning, and AI inferencing at scale.
- Contributions to open-source infrastructure automation projects.
Job role
Work location: Bengaluru
Department: IT & Information Security
Role / Category: IT Security
Employment type: Full Time
Shift: Day Shift
Job requirements
Experience: Min. 5 years
This job has expired