Job Description
Key Responsibilities
- Design and develop compute cluster architectures optimized for performance, reliability, scalability, and serviceability within KLA systems.
- Define and validate server hardware configurations, including CPUs, GPUs, memory subsystems, storage, networking, and specialized accelerators.
- Analyze and optimize system-level performance across hardware and software layers, including CPU/GPU utilization, memory bandwidth, PCIe topology, NUMA architecture, and I/O performance.
- Collaborate with hardware, software, firmware, and systems engineering teams to ensure seamless integration of compute clusters into broader system architectures.
- Support server bring‑up, hardware integration, diagnostics, benchmarking, stress testing, and root‑cause analysis activities.
- Manage and troubleshoot enterprise server platforms, including BIOS/firmware configuration, BMC/IPMI management, thermal and power optimization, and ha...
Ready to Apply?
Submit your application for HPC System Engineer at DEXIAN SINGAPORE PTE. LTD.