AI Inference Optimization Engineer

Persistent Systems

India | Full-time | Posted June 30, 2026

Vacancy Description

About Position:

You will be the team's primary authority on NVIDIA's inference ecosystem: NIM (NVIDIA Inference Microservices), Triton Inference Server, TensorRT, and the BioNeMo platform. Your core mission is to take structural biology AI models, whether NIM-ready or research-grade Python scripts, and turn them into production-quality, API-accessible inference services.

Critical Requirement: Several target models (LigandMPNN, Boltz, custom AlphaFold2 variants) are not yet available as official NVIDIA NIM services. This role requires hands-on ability to build NIM-compliant containers from scratch and configure Triton model repositories for models that currently only have CLI or notebook interfaces.

Role: NVIDIA Engineer
Location: All Persistent Locations
Experience: 4 to 7 Years
Job Type: Full Time Employment

Country: India
Type: Full-time
Category: Computer Occupations
Posted: June 30, 2026
