Job Overview
We are seeking a skilled and passionate AI Engineer to join our team in developing and optimizing AI solutions, with a focus on large language models (LLMs) and retrieval-augmented generation (RAG) systems. The role spans both on-device and cloud environments and covers end-to-end model development, deployment, and performance optimization.
Responsibilities
- Training and tuning LLMs and RAG models (fine-tuning, LoRA, etc.)
- Optimizing and deploying models for on-device and cloud environments (ONNX, Qualcomm NPU, etc.)
- Building a RAG pipeline based on document embedding and vector retrieval
- Data preprocessing and pipeline automation
- Monitoring and improving model performance
Qualifications
- Candidate should possess a Bachelor's Degree or equivalent in Science & Technology, Mathematics, Computer Science / Information Technology, or Engineering (Computer ...