About this role
Role Overview
This position offers a unique opportunity to work as a forward-deployed research partner within enterprise AI systems. You will engage directly with live workflows, identify real-world failure modes, and drive experimental cycles to enhance system performance.
Key Responsibilities- Collaborate with domain experts and client teams as a research partner embedded in enterprise AI workflows.
- Identify, formalize, and prioritize system failure modes in real-world deployments.
- Design high-signal datasets and evaluation protocols to address identified weaknesses.
- Conduct rapid experimental loops to validate hypotheses and measure improvements.
- Produce clear, decision-oriented analyses of system behavior and performance.
- Develop and benchmark agentic workflows, emphasizing robustness and scalability.
- Create lightweight tools to support evaluation, data curation, and rapid iteration.
- Contribute to internal and external research artifacts, including reports and benchmarks.
- Master’s degree in Computer Science, Machine Learning, Artificial Intelligence, or a related technical field.
- Strong judgment regarding research signal quality, including data selection and evaluation design.
- Experience in designing datasets and evaluation frameworks for machine learning systems.
- Ability to translate ambiguous operational issues into structured research problems.
- Familiarity with reinforcement learning environments and/or agentic system evaluation.
- Clear and concise communicator with a focus on actionable insights.
- Proven ability to execute in fast iteration cycles and high-ambiguity settings.
- Collaborative mindset with experience working across research, product, and domain teams.
- Strong client-facing experience, particularly in technical or research-driven environments.
- Experience in building internal research or evaluation tools.
- Contributions to benchmarks, research publications, or open research initiatives.
- Exposure to enterprise AI deployments or forward-deployed research models.
Full-time position with remote work flexibility.
CompensationAnnual salary ranging from $250,000 to $500,000.
EligibilityOpen to candidates with the required qualifications and experience.