Senior LLM Inference & Deployment Architect

Confidential

Singapore, Singapore | Full-time | June 20, 2026

Job Description

VIPKid is seeking a Senior LLM Deployment & Inference Optimization Engineer in Singapore to build and manage self-hosted inference infrastructure for various AI-driven applications. You will design cloud inference services and improve performance management systems to enhance real-time interactions.

The ideal candidate has more than 5 years of backend engineering experience, proven skill in optimizing inference, and proficiency with tools such as vLLM and TensorRT.
