C

System Reliability Engineer (Data Centre)

Centre for Strategic Infocomm Technologies

Singapore, Singapore, Singapore Full-time June 20, 2026

Found Description

Responsibilities

  • Oversee and manage IT operations within the data centre, including day-to-day monitoring, incident management, and problem management

  • Lead the end-to-end incident management lifecycle that encompass immediate troubleshooting, root cause identification, and resolution implementation to restore services, followed by comprehensive post-incident analysis

  • Develop and maintain documentation on IT infrastructure, operations, and procedures within the data centre

  • Perform capacity planning to ensure IT infrastructure is scalable for future demands

  • Collaborate and coordinate with Data Centre Facilities teams on matters related to power, cooling, and physical infrastructure

  • Design and implement robust observability platform alongside network monitoring tools for performance monitoring and real-time alerting of IT devices and networks

  • Implement and manage remote management tools for out...
  • Ready to Apply?

    Submit your application for System Reliability Engineer (Data Centre) at Centre for Strategic Infocomm Technologies

    Apply Now