SaidGig

Senior Software Engineer for LLM Evaluation & Repository Validation

from $40/hour

Remote — Mexico, India or NigeriaContracttechnologyUpdated Jun 3, 2026
Apply Now

About this role

Role Overview: This position offers the opportunity to engage in the development of LLM evaluation and training datasets aimed at solving realistic software engineering challenges. You will contribute to building verifiable software engineering tasks based on public repository histories, utilizing a synthetic approach with human-in-the-loop methodologies, while expanding dataset coverage across various programming languages and difficulty levels.

Key Responsibilities:

  • Analyze and triage GitHub issues across trending open-source libraries.
  • Set up and configure code repositories, including Dockerization and environment setup.
  • Evaluate unit test coverage and quality.
  • Modify and run codebases locally to assess LLM performance in bug-fixing scenarios.
  • Collaborate with researchers to design and identify repositories and issues that present challenges for LLMs.
  • Lead a team of junior engineers on collaborative projects.

Qualifications:

  • Minimum of 3 years of overall experience.
  • Strong experience with at least one programming language, preferably Go.
  • Proficiency with Git, Docker, and basic software pipeline setup.
  • Ability to understand and navigate complex codebases.
  • Comfortable running, modifying, and testing real-world projects locally.
  • Experience contributing to or evaluating open-source projects is a plus.

Nice to Have:

  • Previous participation in LLM research or evaluation projects.
  • Experience building or testing developer tools or automation agents.

Work Terms:

  • Commitments Required: At least 4 hours per day and a minimum of 20 hours per week, with 4 hours of overlap with PST. Options for time commitment include 20 hrs/week, 30 hrs/week, or 40 hrs/week.
  • Employment Type: Contractor assignment (no medical/paid leave).
  • Location: Open to candidates from India, Pakistan, Nigeria, Kenya, Egypt, Ghana, Bangladesh, Turkey, and Mexico.

Compensation: Competitive compensation based on experience and commitment.

Eligibility:

  • Must be eligible to work in the specified locations.

Evaluation Process:

  • Two rounds of interviews: 60 minutes for technical assessment and 30 minutes for technical and cultural discussion.

Related Jobs