Biology Subject Matter Expert for AI Model Evaluation
from $50/hour
About this role
Role Overview:
Join a dynamic team as a Biology Subject Matter Expert, where your analytical skills and deep understanding of biological concepts will play a crucial role in the development and evaluation of advanced Artificial Intelligence systems. In this position, you will contribute to training and assessing Large Language Models (LLMs), such as ChatGPT, by crafting and solving complex Biology problems. Your expertise will significantly impact the accuracy and educational value of next-generation AI technologies.
This role offers a unique opportunity to merge scientific knowledge with cutting-edge AI advancements while enhancing your skills in an AI-driven environment.
Key Responsibilities:
- Design challenging Biology questions that assess the reasoning, knowledge, and problem-solving capabilities of large language models.
- Develop accurate, detailed, and well-structured solutions with clear step-by-step explanations.
- Evaluate AI-generated responses for correctness, scientific rigor, clarity, and reasoning quality.
- Identify areas where AI systems struggle, including conceptual understanding, multi-step reasoning, data interpretation, and scientific analysis.
- Collaborate with AI researchers and project teams to enhance evaluation methodologies and model performance.
- Contribute to the development of Biology benchmarks covering topics from undergraduate to advanced graduate-level curricula.
- Provide detailed annotations, feedback, and quality assessments to support model improvement.
Qualifications:
- Excellent analytical, research, and problem-solving abilities.
- Strong English reading and comprehension skills.
- Ability to explain complex biological concepts in a clear, concise, and accessible manner.
- Exceptional attention to detail and commitment to accuracy.
- Strong written communication and documentation skills.
- Ability to work independently and manage tasks effectively in a remote environment.
- Comfortable using digital tools and online collaboration platforms.
Work Terms:
- Commitment Required: at least 4 hours per day and up to 40 hours per week, including 4 hours of overlap with PST.