Vice President - Site Reliability Engineer (AWS & Kubernetes)
NatWest Group
Job Description
Site Reliability Engineer (AWS & Kubernetes), VP
Join us as a Site Reliability Engineer
- In this key role, you’ll improve, drive, and embed non-functional and operational characteristics such as availability, performance, efficiency, change management, monitoring, security, incident response, and capacity planning of our products and services
- You’ll enjoy significant stakeholder interaction, working in collaboration with engineers to ensure a principled approach to delivering change in a safe and secure way
- This is a chance to join an inclusive team with a collaborative ethos and a commitment to innovation and professional development
- We're offering this role at vice president level
What you'll do
As a Senior Site Reliability Engineer, you’ll act as a hands‑on expert responsible for ensuring the reliability, availability and performance of critical production platforms.
You’ll lead the adoption of SRE practices, embedding resilience, observability and operational excellence into distributed systems running on AWS and Kubernetes. You’ll also take ownership of 24/7 production support models, ensuring systems are highly available and incidents are effectively managed and learned from.
In addition to this, you’ll be:
- Designing and operating highly resilient AWS-based Kubernetes platforms (EKS) aligned to enterprise standards
- Owning and improving production reliability, availability, and SLA/SLO frameworks
- Leading incident management, escalation and 24/7 on-call practices, including post-incident reviews
- Embedding SRE principles such as error budgets, toil reduction, and reliability engineering into delivery teams
- Implementing infrastructure and platform automation using Terraform and GitOps methodologies
- Driving self-healing, auto-scaling and failure recovery mechanisms using tools such as Karpenter
- Building secure, scalable networking and service communication (e.g. Cilium)
- Defining and operating observability platforms using Grafana, Prometheus, Loki, Tempo
- Partnering with DevOps and engineering teams to ensure production readiness and operational excellence
- Leading complex troubleshooting across distributed systems and cloud-native environments
- Developing reusable “golden paths”, operational runbooks and reliability patterns
- Ensuring platforms meet regulatory, security and operational risk requirements
- Using data, SLIs and metrics to drive continuous improvement and proactive reliability enhancements
The skills you'll need
We’re looking for a highly experienced SRE with a strong background in operating large-scale, business-critical platforms and a passion for reliability engineering.
We’re also looking for:
- Deep expertise managing production systems on AWS and Kubernetes (EKS)
- Strong experience in 24/7 support models, incident management and on-call leadership
- Advanced knowledge of SRE principles (SLIs, SLOs, error budgets, toil reduction)
- Proficiency in Terraform, GitOps, and cloud automation practices
- Hands-on experience with GitLab CI/CD and Argo CD
- Strong understanding of Kubernetes networking, security and service mesh technologies, ideally Cilium
- Experience scaling infrastructure using Karpenter and auto-scaling strategies
- Expertise in observability tooling (Grafana, Prometheus, Loki, Tempo)
- Proven ability to troubleshoot and resolve complex, cross-system production issues
- Experience operating in regulated or high-security environments
- Strong leadership, mentoring, and stakeholder engagement capabilities
- Ability to balance reliability, risk, and delivery in a fast-paced environment
Hours
45
Job Posting Closing Date:
16/06/2026
Experience Level
Executive Level
The salary offered will depend on your skills, experience and performance in the interview.
Candidates who have completed the required education and have 5 to 31 years of experience are eligible to apply for this job.
Candidates should have sound communication skills.
Both male and female candidates can apply for this job.
This job is based in Bengaluru/Bangalore. It is not a work-from-home job and can't be done online.
No work-related deposit needs to be made during your employment with the company.