Service Management Lead - Site Reliability Engineering
Accenture India Private Limited
Job Description
Service Management Lead
Project Role : Service Management Lead
Project Role Description : Lead the delivery of programs, projects, or managed services. Coordinate projects through contract management and shared service coordination. Develop and maintain relationships with key stakeholders and sponsors to ensure high levels of commitment and enable the strategic agenda.
Must have skills : Site Reliability Engineering
Good to have skills : Python (Programming Language), DevOps, Kubernetes
Minimum 7.5 year(s) of experience is required
Educational Qualification : 15 years full time education
Job Title: Infrastructure Engineering Site Reliability Engineering
Summary
A Site Reliability Engineer (SRE) ensures systems are stable, scalable, and highly available, bridging the gap between business application development and IT operations. This role combines automation, observability, incident response, and performance engineering to maintain continuous service reliability while accelerating delivery velocity. The Site Reliability Engineer designs and maintains production systems that meet defined Service Level Objectives (SLOs) and error budgets. Using software engineering principles, an SRE prevents downtime, automates operations, and improves platform performance through observability, fault tolerance, and system resilience.
Key Responsibilities:
- Reliability and Performance: Monitor and optimize system uptime, latency, and throughput to meet SLOs and SLIs.
- Incident Management: Lead incident response, manage escalations, perform root cause analysis (RCA), and drive postmortem reviews.
- Automation and Tooling: Develop CI/CD pipelines, automate infrastructure management, and eliminate manual toil through scripting and orchestration.
- Monitoring and Observability: Implement metrics, logging, and tracing frameworks (Prometheus, Grafana, ELK, Datadog) to gain real-time visibility into distributed systems.
- Capacity Planning: Conduct resource forecasting, design scalable infrastructure, and handle performance under surge conditions.
- Change & Release Management: Partner with developers to ensure safe, reliable rollout of new features with automated testing and rollback mechanisms.
- Disaster Recovery & Resilience Engineering: Implement multi-region resilience strategies, chaos tests, and failover automation for business continuity.
- Process Improvement: Use post-incident analytics to refine operational practices and improve reliability with data-driven improvements.
- Collaboration: Work with product, design, ML, and DevOps teams to build intelligent workflows and user experiences.
- Infrastructure as Code: Implement Infrastructure as Code (IaC) using tools like Terraform, CloudFormation, Azure DevOps, or Pulumi.
- Cloud Services: Apply expert knowledge of cloud IaaS and PaaS services.
Required Skills:
- Expertise in Python, Go, Bash, or JavaScript for automation and tooling.
- Hands-on experience with cloud environments (AWS, Azure, GCP) and orchestration tools such as Kubernetes and Terraform.
- Deep understanding of Linux systems, networking, and distributed architectures.
- Experience with observability solutions such as Prometheus, Grafana, Datadog, CloudWatch, or New Relic.
- Familiarity with incident management and alerting platforms (PagerDuty, xMatters).
- Proficiency in CI/CD frameworks such as Jenkins, GitHub Actions, or GitLab CI.
- Working knowledge of security, compliance, and performance optimization for highly available systems.
Certifications (Required / Preferred):
- AWS Certified Solutions Architect Professional
- Microsoft Certified: Azure Solutions Architect Expert
- Google Professional Cloud Architect
- Certified Kubernetes Administrator (CKA)
- HashiCorp Certified: Terraform Associate
- DevOps Engineer certification (AWS, Azure, or Google)
Additional Information:
- The candidate should have a minimum of 7.5 years of experience in Site Reliability Engineering.
- This position is based at our Pune office.
- 15 years of full-time education is required.
The salary offered will depend on your skills, experience, and performance in the interview.
Candidates who have completed the required education and have 7 to 10 years of experience are eligible to apply for this job.
The candidate should have sound communication skills for this job.
Both male and female candidates can apply for this job.
This is not a work-from-home job and cannot be done online.
No work-related deposit needs to be made during your employment with the company.
Go to the apna app and apply for this job. Click on the apply button and call HR directly to schedule your interview.