About this role
As a Process & Operations Engineering Expert, you will play a pivotal role in advancing frontier agent evaluations in process and operations engineering. Your primary responsibility will be to create long-horizon operations engineering tasks that reflect real-world applications, each accompanied by a deterministic rubric to assess agent performance against verifiable ground truth. The tasks you develop will require checkable answers, avoiding open-ended questions and subjective evaluations.
Key Responsibilities- Develop process documentation including process flow diagrams and Standard Operating Procedures (SOPs) with necessary steps and control points.
- Conduct analyses focused on yield and quality, incorporating ground-truth defect rates and performing root-cause investigations with documented correct causes.
- Create Failure Mode and Effects Analysis (FMEA) and improvement documentation, including FMEAs against a failure-mode checklist and Six Sigma writeups with verifiable statistical outputs.
- Engage in challenging scenarios that demand extended periods of focused work.
- Bachelor''s or Master''s degree in Industrial, Manufacturing, or Mechanical Engineering with a minimum of 3 years of experience in process, operations, or manufacturing engineering.
- Proficiency in methodologies such as Lean, Six Sigma (Green or Black Belt), Statistical Process Control (SPC), FMEA, and root cause analysis techniques (5 Whys, fishbone, 8D).
- Ability to read and produce operations engineering artifacts, including process flows, SOPs, FMEAs, project charters, and kaizen reports.
- Strong written communication skills, with the ability to articulate reasoning clearly and encode it into deterministic rubrics.
- Must be located in the United States or Canada.
This is a remote position, and the role is compensated on an hourly basis.
CompensationThe hourly rate ranges from $75 to $100, depending on domain expertise and prior experience. High-performing contributors will have opportunities for promotion based on the quality and throughput of their tasks.