Full Stack LLM Engineer

Cerebras

Toronto, ON, Canada Full-time June 01, 2026

Vacancy Description

About the Role

We are seeking a versatile and experienced engineer to join our Inference Core Model Bringup team. This team is responsible for rapidly bringing up state-of-the-art open-source models (such as LLaMA and Qwen) or customer-provided proprietary models on our Cerebras CSX systems. Success in this role requires a system-minded generalist who thrives in fast-paced bring-up environments and is comfortable working across the entire Cerebras software stack. Your work will play a critical role in achieving unprecedented levels of performance, efficiency, and scalability for AI applications.

Responsibilities

Contribute to the end-to-end bring-up of ML models on Cerebras CSX systems.
Work across the stack: model architecture translation, graph lowering, compiler optimizations, runtime integration, and performance tuning.
Debug performance and correctness issues spanning model code, compiler IRs, runt...


Location Toronto, ON

Country Canada

Type Full-time

Category IT & Technology

Posted June 01, 2026
