Found Description
We're seeking a Site Reliability Engineer to enhance our real-time transcription and summarization platforms powered by Large Language Models. Your role will be vital in ensuring system reliability and performance.
As an SRE, you will confront the complexities inherent in handling diverse real-time communication formats. Your daily tasks will include monitoring system health, optimizing infrastructure, and managing incident responses. You will implement automation to maintain efficient, low-latency data pipelines across various platforms.
Key Responsibilities:
• Monitor and manage the health of LLM platforms
• Improve performance through enhanced observability
• Respond effectively to incidents when they arise
• Scale infrastructure to meet growing operational needs
• Automate processes for reliable data handling
Requirements:
• Expertise in Dynatrace along with RUM
o• Familiarity with ElasticSearch and Grafana
• Experience with OpenShift and OpenTeleme...
As an SRE, you will confront the complexities inherent in handling diverse real-time communication formats. Your daily tasks will include monitoring system health, optimizing infrastructure, and managing incident responses. You will implement automation to maintain efficient, low-latency data pipelines across various platforms.
Key Responsibilities:
• Monitor and manage the health of LLM platforms
• Improve performance through enhanced observability
• Respond effectively to incidents when they arise
• Scale infrastructure to meet growing operational needs
• Automate processes for reliable data handling
Requirements:
• Expertise in Dynatrace along with RUM
o• Familiarity with ElasticSearch and Grafana
• Experience with OpenShift and OpenTeleme...
Ready to Apply?
Submit your application for Site Reliability Engineer for LLM Platforms at Insight Global
Apply Now