SaidGig

Senior Software Engineer for LLM Evaluation & Repository Validation

from $40/hour

Remote: Mexico, India or Nigeria | Non-remote: unknown | Contract | Technology | Updated Jun 3, 2026

About this role

Role Overview: This position focuses on developing and validating LLM evaluation datasets that reflect realistic software engineering challenges. The work centers on creating verifiable software engineering tasks from public repository histories, using a synthetic approach with human involvement to broaden dataset diversity across programming languages and difficulty levels.

Key Responsibilities:

  • Analyze and triage GitHub issues across trending open-source libraries.
  • Set up and configure code repositories, including Dockerization and environment setup.
  • Evaluate unit test coverage and quality.
  • Modify and run codebases locally to assess LLM performance in bug-fixing scenarios.
  • Collaborate with researchers to design and identify repositories and issues that challenge LLMs.
  • Lead a team of junior engineers on collaborative projects.

Qualifications:

  • Minimum of 3 years of overall experience.
  • Strong programming experience, specifically in Go.
  • Proficiency with Git, Docker, and basic software pipeline setup.
  • Able to understand and navigate complex codebases.
  • Comfortable running, modifying, and testing real-world projects locally.
  • Experience contributing to or evaluating open-source projects is a plus.

Preferred Qualifications:

  • Previous involvement in LLM research or evaluation projects.
  • Experience building or testing developer tools or automation agents.

Work Terms:

  • Commitment of at least 4 hours per day and 20 hours per week, with 4 hours of overlap with PST. Available commitment levels are 20, 30, or 40 hours per week.
  • Contractor assignment (no medical/paid leave).
  • Location: Open to candidates in India, Pakistan, Nigeria, Kenya, Egypt, Ghana, Bangladesh, Turkey, and Mexico.

Compensation: Competitive rates commensurate with experience.

Eligibility: Open to candidates who meet the required qualifications and the location restrictions specified above.

Evaluation Process: Two rounds of interviews: a 60-minute technical interview followed by a 30-minute technical and cultural discussion.