Site Reliability Engineer

Birlasoft Limited

Hyderabad

Not disclosed

Work from Office

Full Time

Min. 2 years

Job Details

Job Description

Site Reliability Engineer


We Are Hiring: Site Reliability Engineer (SRE)


Key Responsibilities

  • Reliability & Scalability: Ensure systems availability, stability, security, and performance.
  • DevOps Collaboration: Work in shared responsibility with application and platform teams.
  • Reliability Processes: Drive root cause analysis, systems validation, performance tuning, and capacity management.
  • Incident Response: Dedicate time to fixing production issues, improving reliability software, and handling on-call events.
  • Monitoring & Metrics: Design and implement monitoring, alerting, and dashboards to track SLOs/SLAs and operational efficiency.
  • Automation & Innovation: Simplify and automate infrastructure operations for efficiency.
  • Cross-Team Coordination: Collaborate with infrastructure, platform, and application SMEs to promote best practices.
  • Continuous Improvement: Support RCA programs to reduce downtime and increase resiliency.


Technology Skills

  • APM Tools: New Relic (service maps, tracing, dashboards, custom events/queries, JVM monitoring, alerting).
  • Spring Boot: Application development, monitoring, JVM/application parameter tuning.
  • Splunk: Queries, dashboards, and monitoring.
  • DevOps Pipelines: Bitbucket, CloudBees, AWS Cloud.
  • Distributed Tracing: Knowledge of frameworks like Jaeger.
  • This role is perfect for engineers passionate about system reliability, automation, and performance optimization.


Please share resume if interested

Job role

Work location

Hyderabad

Department

Software Engineering

Role / Category

Software Project Management

Employment type

Full Time

Shift

Day Shift

Job requirements

Experience

Min. 2 years

About company

Name

Birlasoft Limited

Job posted by Birlasoft Limited

Apply via email