Vacancy Description
About the job
As a member of our AI model team, you will drive innovation in model serving and inference architectures for advanced AI systems. Your work will focus on optimizing model deployment and inference strategies to deliver highly responsive, efficient, and scalable performance across real‑world applications. You will work on a wide spectrum of systems, ranging from resource‑efficient models designed for limited hardware environments to complex, multi‑modal architectures that integrate data such as text, images, and audio.
We expect you to have deep expertise in designing and optimizing model serving pipelines and inference frameworks as well as a strong background in advanced model architectures. You will adopt a hands‑on, research‑driven approach to develop, test, and implement novel serving strategies and inference algorithms. Your responsibilities include engineering robust inference pipelines, establishing comprehensive performance metrics, and iden...
Submit your application for AI Research Engineer (Kernel & Inference Optimization) at Tether