Agent Quality / Evals Engineer 1754

SOFTGIC

Colombia | Full-time | June 12, 2026

Job Description

This is a remote position.

This role owns the eval harness and quality gate from the beginning. It replaces the old late-stage “Evals Specialist” model with a standing owner for measurable agent quality.

Key Responsibilities

• Build and maintain the MVP eval harness: golden tasks, exception tasks, scorecard metrics, and regression packs.

• Wire evals into CI so quality regressions fail builds and releases.

• Define and maintain release-gate thresholds with Product and the Tech Lead.

• Lay the path for later adversarial and drift-testing expansion without overbuilding MVP scope.

Requirements

Must-Have Qualifications

• Experience evaluating ML, LLM, or non-deterministic systems.

• Strong tes...
