About this role
This role focuses on designing advanced computational problems that challenge AI systems to utilize scientific software tools effectively. You will create original, graduate-level tasks that reflect real scientific workflows in seismology and geophysics, ensuring that they are calibrated against cutting-edge AI models.
Key Responsibilities- Design complex problems that require the use of domain-specific scientific software libraries.
- Develop tasks that test the ability of solvers to implement intricate multi-step scientific workflows.
- Create sequences of queries or experiments aimed at uncovering hidden information, necessitating strategic reasoning.
- Engage in a calibration loop to refine problem designs based on performance against state-of-the-art AI models.
We are particularly interested in candidates with substantial hands-on experience in:
- Seismology & Geophysics: Proficiency with ObsPy or SPECFEM for seismic waveform analysis, travel-time tomography, moment tensor inversion, or synthetic seismogram generation.
The ideal candidate will possess:
- Graduate-level expertise (MS or PhD preferred) in seismology or geophysics, with practical experience using relevant software tools.
- Experience writing code that interacts with these libraries to tackle real research problems, along with an understanding of their limitations and complexities.
- A puzzle designer mindset, capable of constructing problems that emphasize reasoning strategies over mere computation.
- Graduate-level training in a relevant STEM domain (MS, PhD, or equivalent research experience).
- Proficiency with at least one of the specified scientific software libraries, demonstrated through research publications, open-source contributions, or professional experience.
- Strong Python programming skills for writing problem setups, oracle functions, and solution validators.
- Ability to work independently and iterate on problem designs based on feedback.
- Comfort in a Linux/terminal environment with remote compute sandboxes.
- Availability for at least 15, 20 hours per week.
- Experience across multiple listed domains or tools.
- Familiarity with benchmark or evaluation design.
- Background in scientific pedagogy or exam/problem-set design.
- Experience with computational reproducibility and containerized environments.