Found Description
RedHat seeks a Machine Learning Engineer specializing in model optimization algorithms to enhance the future of open-source AI. Join a team committed to simplifying GenAI deployments while driving innovative solutions for large model optimization.
In this senior role, you will work with the RedHat AI Inference team, contributing to state-of-the-art deep learning software. You will bridge research and production by optimizing LLMs and working on projects like LLM-compressor and vLLM. The ideal candidate enjoys mentoring and fostering a culture of learning while collaborating with product and research teams.
Key Responsibilities:
• Design and develop inference optimization algorithms
• Implement model compression using quantization and pruning
• Collaborate with research scientists on production-ready systems
• Profile LLM performance regarding latency and throughput
• Mentor team members while contributing to open-source projects
Requirements:
• Strong machine...
In this senior role, you will work with the RedHat AI Inference team, contributing to state-of-the-art deep learning software. You will bridge research and production by optimizing LLMs and working on projects like LLM-compressor and vLLM. The ideal candidate enjoys mentoring and fostering a culture of learning while collaborating with product and research teams.
Key Responsibilities:
• Design and develop inference optimization algorithms
• Implement model compression using quantization and pruning
• Collaborate with research scientists on production-ready systems
• Profile LLM performance regarding latency and throughput
• Mentor team members while contributing to open-source projects
Requirements:
• Strong machine...
Ready to Apply?
Submit your application for Machine Learning Engineer at RedHat at Red Hat
Apply Now