Job Description
Key Responsibilities
- Design, implement, and maintain highly available, scalable, and reliable production systems.
- Develop monitoring, alerting, and observability solutions to improve system reliability.
- Automate operational tasks and reduce manual intervention through scripting and tooling.
- Troubleshoot production incidents, perform root cause analysis, and drive long-term improvements.
- Collaborate with development teams to improve system performance, deployment processes, and operational efficiency.
- Participate in capacity planning, disaster recovery planning, and incident response activities.
- Create and maintain operational documentation and runbooks.
Qualifications
- Bachelor's degree in Computer Science, Computer Engineering, or a related field.
- 3+ years of experience in Site Reliability Engineering, DevOps, or Infrastructure Engineering.
- Strong experi...
Submit your application for Site Reliability Engineer (SRE) at DADACONSULTANTS PTE. LTD.