Engineering and Built Environment Specialist for AI Evaluation
$90–$110/hr
About this role
This role focuses on creating a benchmark dataset aimed at evaluating AI models in the context of professional document understanding and instruction following within the Engineering & Built Environment domain. You will engage in tasks that involve complex, multi-step requests based on real-world workspace files such as technical drawings, project specifications, and engineering reports, as well as web searches and code execution.
Your primary responsibility will be to author tasks that assess an AI''s capability to interpret engineering documentation, adhere to multi-step instructions, and generate precise, well-structured outputs, each accompanied by a clearly defined ground truth output and an objective evaluation rubric.
Key Responsibilities- Author complex tasks grounded in real-world engineering documentation.
- Evaluate AI performance based on defined criteria.
- Ensure outputs are precise and well-structured.
- A minimum of 3 years of hands-on experience in one or more of the following sub-domains:
- Mechanical engineering
- Civil engineering
- Industrial engineering
- Architecture
This is a remote position with a minimum commitment of 15, 20 hours per week.
CompensationHourly compensation ranges from $90 to $110.