About this role
Join a pioneering initiative to create realistic enterprise environments for training and evaluating frontier AI agents. This role seeks experienced aerospace and defense professionals from Fortune 500 primes and major Tier-1 suppliers to recreate digital workspaces and design challenging tasks that reflect real-world scenarios in the aerospace and defense sectors.
Key Responsibilities
- Build a realistic digital workspace using daily tools, including program review decks, CDRLs/SOWs, engineering change proposals, V&V plans, supplier qualification packages, risk registers, and email threads, along with relevant platforms like ANSYS Fluent simulations, Siemens Opcenter MES, and Jira/Confluence.
- Design multi-step tasks based on actual workflows that require navigating multiple applications, files, and stakeholders, effectively challenging frontier AI agents.
- Collaborate with fellow aerospace and defense experts to design the environment, shape task scope, and review scenarios for realism and rigor.
- Work asynchronously with research teams to refine task designs and evaluation criteria for A&D agent benchmarks.
- Contribute to frontier AI research and benchmarking, directly influencing how leading labs train and evaluate the next generation of AI systems.
Qualifications
- 3+ years of full-time experience at a Fortune 500 A&D prime, major Tier-1 supplier, or federally-funded R&D center.
- Background in one or more areas such as:
  - Program/project management on DoD or NASA contracts (EVM, IMS, CDRLs).
  - Aerospace design or manufacturing engineering (structures, avionics, propulsion).
  - Supply-chain management under DFARS/ITAR/CMMC.
  - Quality and mission assurance (AS9100, NADCAP).
  - Systems engineering, V&V, or integrated test.
- US Person status required for most A&D work; active/prior clearance not required for this project, but familiarity with classified-program workflows is a plus.
- Day-to-day use of ANSYS Fluent/STAR-CCM+, Siemens Opcenter or Rockwell FactoryTalk, and Jira/Confluence.
- Strong analytical thinking and writing skills, capable of translating A&D workflows into structured task specifications.
Compensation for this project starts at an effective hourly rate, then transitions to a model based on the throughput of quality work rather than a flat hourly rate.
About Us
This opportunity is part of a talent marketplace that connects top experts with leading AI labs and research organizations, backed by prominent investors. Join thousands of professionals contributing to the advancement of the next generation of AI systems.