Remote

24-MAG

New York, New York, United States Full-time June 18, 2026

Vacancy Description

We are sharing a specialised part-time consulting opportunity for professors, PhD students, and advanced academic researchers experienced in domain-specific problem design, Python-based evaluation, benchmark task development, and structured reasoning assessment.

This role supports current and upcoming remote consulting projects in academic benchmark task design, Python-based evaluation workflows, domain-specific problem development, golden solution preparation, and model behavior analysis, with an emphasis on high-quality project execution. Selected professionals will apply their academic expertise to create challenging real-world tasks, define precise expected outputs, develop executable tests, and evaluate reasoning and problem-solving performance across advanced subject areas.

Key Responsibilities

Professionals in this role may contribute to:

Academic Task Design & Development

  • ...
