AI Inference Optimization Engineer

Persistent Systems

India | Full-time | June 30, 2026
Vacancy Description

About Position:


You will be the team's primary authority on NVIDIA's inference ecosystem: NIM (NVIDIA Inference Microservices), Triton Inference Server, TensorRT, and the BioNeMo platform. Your core mission is to take structural biology AI models, whether NIM-ready or research-grade Python scripts, and turn them into production-quality, API-accessible inference services.

Critical Requirement: Several target models (LigandMPNN, Boltz, custom AlphaFold2 variants) are not yet available as official NVIDIA NIM services. This role requires hands-on ability to build NIM-compliant containers from scratch and configure Triton model repositories for models that currently only have CLI or notebook interfaces.


  • Role: NVIDIA Engineer
  • Location: All Persistent Locations
  • Experience: 4 to 7 Years
  • Job Type: Full Time Employment
