
Associate Director, Software Engineering (Model Hosting/Inference Optimisation)

HSBC Global Services Limited

Shenzhen, Guangdong, China | Full-time | May 29, 2026

Vacancy Description

Some careers have more impact than others.

If you’re looking for a career where you can make a real impression, join HSBC and discover how valued you’ll be.

 

We are currently seeking an experienced professional to join our team in the role of Associate Director, Software Engineering (Model Hosting/Inference Optimisation).

 

Business: CTO Platforms (AI Platforms)

Location: Shenzhen / Guangzhou

Req ID: 44990

 

Principal responsibilities

  • Design, build, and operate scalable, reliable model hosting platforms for LLMs, embeddings, and STT/TTS across heterogeneous hardware. 
  • Drive inference optimisation for latency, throughput, and cost (quantisation, KV-cache optimisation, dynamic/continuous batching). 
  • Evaluate, integrate, and tailor inference frameworks (e.g., vLLM, TensorRT-LLM,...
