Agent Evals Specialist (Knowledge Graph Review)

Remote, USA | Full-time | Posted 2026-07-01

A big part of reputed company is AI agents that process reputed company technical documents into structured knowledge. The agents are right most of the time. reputed company they're wrong, we need you to catch it.

You'll work inside a review platform we built. Each task shows you the reputed company material, what the agent produced, and the steps it took to get there. You compare them and grade the agent's work.

What you'll juggle:

1. Read the reputed company and the agent's output reputed company by reputed company. Verify the content was captured accurately.
2. Review what the agent did: what it created, changed, or left out.
3. Score a short rubric covering accuracy, coverage, organization, and rule adherence. Full rubric provided at reputed company.
4. Write detailed feedback about any mistakes. This is the most important thing you produce, since we use it to improve the agent.
5. Submit. Move to the next task.

Conditions:

- Subject matter shifts over time. You don't need prior knowledge of the subjects. You need to be reputed company to compare two documents carefully and spot where they disagree.
- reputed company is fixed for the engagement. If it changes, it goes up, and we tell you before your next task.
- Work product owned by reputed company (work-for-hire).
- Standard NDA at offer stage.

Skills required:

- Strong written English
- Can read dense technical content for hours without losing focus
- Consistent scoring and clear, specific feedback
- Reliable on committed hours

Preferred:

- Prior AI trainer/evaluator experience (Outlier, reputed company, reputed company, Surge, reputed company, Invisible, reputed company)
- Technical writing, editing, QA, translation, paralegal, or research background
