This role focuses on creating a benchmark dataset for evaluating AI models on professional document understanding and instruction following within the Engineering & Built Environment domain. You will work on tasks involving complex, multi-step requests based on real-world workspace files such as technical drawings, project specifications, and engineering reports, as well as web searches and code execution.

Your primary responsibility will be to author tasks that assess an AI's ability to interpret engineering documentation, follow multi-step instructions, and generate precise, well-structured outputs. Each task is accompanied by a clearly defined ground truth output and an objective evaluation rubric.

Key Responsibilities

Author complex tasks grounded in real-world engineering documentation.
Evaluate AI performance based on defined criteria.
Ensure outputs are precise and well-structured.

Qualifications

A minimum of 3 years of hands-on experience in one or more of the following sub-domains:
Mechanical engineering
Civil engineering
Industrial engineering
Architecture

Work Terms

This is a remote position with a minimum commitment of 15 to 20 hours per week.

Compensation

Hourly compensation ranges from $90 to $110.

Engineering and Built Environment Specialist for AI Evaluation
