Scientific AI Evaluation & Computational Problem Designer

Weekday AI • United States, United States • Posted June 01, 2026

About the Role

This role is for one of our clients

Compensation: $45-$100 per hour

We are building a large-scale evaluation benchmark to test advanced AI reasoning across scientific and engineering domains. This role focuses on designing rigorous, research-grade computational problems that assess how effectively AI systems can leverage real scientific software tools to solve complex challenges.

Unlike traditional annotation roles, this position requires creating original, graduate-level problems rooted in real-world scientific workflows. You will iteratively refine these problems through calibration against state-of-the-art AI models, ensuring the right balance of difficulty, depth, and reasoning complexity.

Requirements

What You’ll Do

  • Design advanced computational problems requiring the use of domain-specific scientific software
  • Create tasks that test both precise execution ...