Found Description
Enhance reliability initiatives at Confluent as a Senior Site Reliability Engineer. Lead engineering efforts, improve tooling, and manage incident responses across a multi-cloud streaming platform.
In this senior-level role, you'll spend 75% of your time engineering proactive solutions while the remaining 25% focuses on coaching teams and improving organizational reliability practices. Your expertise will be critical in minimizing incidents in Confluent's dynamic cloud environment.
Key Responsibilities:
• Analyze failure patterns for proactive reliability design
• Own configuration of Rootly and integrate with key tools
• Define and maintain SLO/SLA frameworks
• Edit customer-facing incident documents for quality
• Develop training programs and coach teams through post-mortems
Requirements:
• 10+ years in SRE, incident management, or reliability engineering
• Cloud experience with AWS, GCP, or Azure
• Expertise in incident management tools like Rootly
In this senior-level role, you'll spend 75% of your time engineering proactive solutions while the remaining 25% focuses on coaching teams and improving organizational reliability practices. Your expertise will be critical in minimizing incidents in Confluent's dynamic cloud environment.
Key Responsibilities:
• Analyze failure patterns for proactive reliability design
• Own configuration of Rootly and integrate with key tools
• Define and maintain SLO/SLA frameworks
• Edit customer-facing incident documents for quality
• Develop training programs and coach teams through post-mortems
Requirements:
• 10+ years in SRE, incident management, or reliability engineering
• Cloud experience with AWS, GCP, or Azure
• Expertise in incident management tools like Rootly
Ready to Apply?
Submit your application for Confluent Senior Reliability Engineer Role at IBM
Apply Now