Site Reliability Engineer
Oracle India Private Limited
Apply on company website
Site Reliability Engineer
Oracle India Private Limited
Bengaluru/Bangalore
Not disclosed
Job Details
Job Description
Site Reliability Developer 3
As a Site Reliability Engineer, you will be responsible for defining, deploying, and operating key services with a strong emphasis on system architecture, production operations, capacity planning, performance optimization, deployment, and release engineering. You will help deliver exceptional experiences for our customers and partners while ensuring our services meet reliability, scalability, and performance standards.
Responsibilities
Own the architecture, design, implementation, and production operations of core system and platform services
Improve system reliability through automation, self-healing mechanisms, and real-time monitoring and alerting
Identify and respond to production issues, driving root-cause analysis and implementing preventative solutions
Contribute to the design, development, and operation of platform services, including provisioning, configuration, deployment, and ongoing support
Partner with a globally distributed team to prototype, evaluate, and roll out new platform capabilities
Design, write, and deploy software to improve the availability, scalability, and operational efficiency of services
Develop and evolve standards, architectures, and best practices for large-scale distributed systems
Lead and support capacity planning, demand forecasting, performance analysis, and system tuning
Stay current with emerging technologies and apply innovative approaches to solving complex infrastructure and cloud-service challenges
Qualifications & Experience
5-8 years of experience in Site Reliability Engineering, DevOps, or a closely related role
Experience developing and/or operating large-scale, distributed systems and services
Hands-on experience with containerized environments using Kubernetes, Docker, Mesos, or similar technologies
Experience with infrastructure automation and Infrastructure-as-Code tools such as Terraform, Chef, Ansible, Puppet, or Packer
Familiarity with cloud orchestration frameworks and supporting them in an SRE or production environment
Experience building and maintaining CI/CD pipelines using tools such as Git (or other VCS), GitLab Runners, Jenkins, and Rundeck
Experience supporting production, test, and development environments at medium to large scale
Proficiency in scripting for automation and deployments using Bash, PowerShell, or similar
Knowledge of cloud compute platforms, networking, monitoring, logging, and data processing/analytics
Proficiency in at least one modern programming language such as Python, Go or Java
Experience operating fault-tolerant, highly available, high-throughput, and scalable systems
Hands-on experience with at least one major cloud provider (AWS, OCI, GCP, or equivalent)
As a Site Reliability Engineer, you will be responsible for defining, deploying, and operating key services with a strong emphasis on system architecture, production operations, capacity planning, performance optimization, deployment, and release engineering. You will help deliver exceptional experiences for our customers and partners while ensuring our services meet reliability, scalability, and performance standards.
Responsibilities
Own the architecture, design, implementation, and production operations of core system and platform services
Improve system reliability through automation, self-healing mechanisms, and real-time monitoring and alerting
Identify and respond to production issues, driving root-cause analysis and implementing preventative solutions
Contribute to the design, development, and operation of platform services, including provisioning, configuration, deployment, and ongoing support
Partner with a globally distributed team to prototype, evaluate, and roll out new platform capabilities
Design, write, and deploy software to improve the availability, scalability, and operational efficiency of services
Develop and evolve standards, architectures, and best practices for large-scale distributed systems
Lead and support capacity planning, demand forecasting, performance analysis, and system tuning
Stay current with emerging technologies and apply innovative approaches to solving complex infrastructure and cloud-service challenges
Qualifications & Experience
5-8 years of experience in Site Reliability Engineering, DevOps, or a closely related role
Experience developing and/or operating large-scale, distributed systems and services
Hands-on experience with containerized environments using Kubernetes, Docker, Mesos, or similar technologies
Experience with infrastructure automation and Infrastructure-as-Code tools such as Terraform, Chef, Ansible, Puppet, or Packer
Familiarity with cloud orchestration frameworks and supporting them in an SRE or production environment
Experience building and maintaining CI/CD pipelines using tools such as Git (or other VCS), GitLab Runners, Jenkins, and Rundeck
Experience supporting production, test, and development environments at medium to large scale
Proficiency in scripting for automation and deployments using Bash, PowerShell, or similar
Knowledge of cloud compute platforms, networking, monitoring, logging, and data processing/analytics
Proficiency in at least one modern programming language such as Python, Go or Java
Experience operating fault-tolerant, highly available, high-throughput, and scalable systems
Hands-on experience with at least one major cloud provider (AWS, OCI, GCP, or equivalent)
understanding of services and technologies.
Career Level - IC3
As a world leader in cloud solutions, Oracle uses tomorrow’s technology to tackle today’s challenges. We’ve partnered with industry-leaders in almost every sector—and continue to thrive after 40+ years of change by operating with integrity.
We know that true innovation starts when everyone is empowered to contribute. That’s why we’re committed to growing an inclusive workforce that promotes opportunities for all.
Oracle careers open the door to global opportunities where work-life balance flourishes. We offer competitive benefits based on parity and consistency and support our people with flexible medical, life insurance, and retirement options. We also encourage employees to give back to their communities through our volunteer programs.
We’re committed to including people with disabilities at all stages of the employment process. If you require accessibility assistance or accommodation for a disability at any point, let us know by emailing accommodation-request_mb@oracle.com or by calling +1 888 404 2494 in the United States.
Oracle is an Equal Employment Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, sexual orientation, gender identity, disability and protected veterans’ status, or any other characteristic protected by law. Oracle will consider for employment qualified applicants with arrest and conviction records pursuant to applicable law.
Experience Level
Senior LevelJob role
Work location
BENGALURU, KARNATAKA, India
Department
Software Engineering
Role / Category
DevOps
Employment type
Full Time
Shift
Day Shift
Job requirements
Experience
Min. 5 years
About company
Name
Oracle India Private Limited
Job posted by Oracle India Private Limited
Apply on company website