Found Description
Will be responsible for Eyes on glass Monitoring, Triage & Incident Ownership, Troubleshooting & Restoration, Cross-Team Collaboration, Platform & Application Stack Awareness and Service Quality & Process Excellence.
- Perform rapid intake, triage, and prioritization of alerts, tickets, and incidents.
- Act as Incident Owner during high-severity events, ensuring clear communication, timely updates, and swift restoration of service.
- Maintain accurate, real-time incident timelines and post-incident documentation.
- Execute root-cause isolation across application, middleware, APIs, data, and infrastructure layers.
- Use observability/monitoring tools (e.g. Kibana, Dynatrace, CloudWatch, Grafana) to correlate logs, metrics, and traces; identify anomalies, performance bottlenecks, and failure patterns.
- Perform targeted mitigations, rollbacks, config fixes, and coordinate hotfixes to restore service quickly.
- Engage with App...