AI Research Engineer (Model Compression & Quantization)

Tether.io

remote, romblon, Philippines Full-time May 30, 2026

Vacancy Description

About The Job

As a member of our AI research team, you will drive innovation in model compression and efficient deployment for advanced multimodal AI systems, including large language models (LLMs) and vision-language models (VLMs). Your work will focus on reducing model footprint and computational cost while preserving accuracy, enabling high-performance AI to run efficiently across resource‑constrained edge devices. You will apply and advance compression techniques such as quantization, knowledge distillation, and pruning to streamline complex multimodal architectures that integrate text, images, and audio.

Responsibilities

Apply low‑bit quantization to reduce model size and inference latency for generative AI models (LLMs, VLMs, multimodal) while maintaining accuracy and output quality.
Leverage knowledge distillation to transfer capabilities from larger teacher models to smaller student models, enabling efficient multimodal reasonin...

Ready to Apply?

अभी आवेदन करें

Submit your application for AI Research Engineer (Model Compression & Quantization) at Tether.io

Apply for this Position

Location remote, romblon

Country Philippines

Type Full-time

Category Engineering

Posted May 30, 2026

AI Research Engineer (Model Compression & Quantization)

Vacancy Description

About The Job

Responsibilities

Ready to Apply?

Vacancy Details

About Tether.io

Tether.io

Share This Vacancy