Vacancy Description
We are looking for an AI/ML-focused Data Engineer who brings deep expertise in building intelligent data pipelines for unstructured content and is experienced in integrating with modern machine learning ecosystems.
The ideal candidate will have hands-on experience in PySpark and Python with a strong focus on document classification, cleansing, quality metrics, and the ability to work with LLMs, vector databases and Retrieval-Augmented Generation (RAG) frameworks.
Candidates will play a critical role in bridging data engineering and machine learning to enable the development of AI-first applications.
Responsibilities- Build robust, scalable data processing pipelines for unstructured documents (PDFs, emails, forms, etc.) using PySpark and document cleansing, classification, and enrichment techniques to prepare high-quality data for AI/ML applications.
- Develop and integrate data workflows that feed into LLM-based pipelines and support vector-bas...
Ready to Apply?
अभी आवेदन करें
Submit your application for AI/ML Engineer at Virtusa
Apply for this Position