Found Description
As a Senior Site Reliability Developer (IC4), you will play a key role in ensuring the availability, scalability, and operational excellence of OCI's Japan Sovereign Cloud services. You will design and implement automation, drive service reliability improvements, lead complex incident investigations, and partner with development teams to improve operational readiness. You will own and prioritize an SRD operational improvement backlog based on shift feedback, incident reviews, alert quality reviews, and business reliability requirements.
The role combines software engineering expertise with large-scale cloud operations and requires participation in a 24x7 shift rotation supporting critical cloud infrastructure. You will translate operational and business requirements into reliability plans, then execute improvements through tooling, automation, runbook updates, process changes, and cross-team coordination. You will also serve as a technical mentor for less experienced engineer...