• *About The Job
• *Mercor
connects elite creative and technical talent with leading AI research labs. Headquartered in San Francisco, our investors include
• *Benchmark**
,
• *General Catalyst**
,
• *Peter Thiel**
,
• *Adam D'Angelo**
,
• *Larry Summers**
, and
• *Jack Dorsey**
.
• *Position:**
AI Model Evaluation Specialist
• *Type:
• *Contract
• Compensation:
• $25–$35/hour
• *Commitment:
• *20 hours/week
• *Role Responsibilities
• Write realistic prompts reflecting professional and consumer domain-specific guidance.
• Evaluate AI-generated responses for factual accuracy and practical usefulness.
• Identify fabricated claims and misleading reasoning in model outputs.
• Score and rank model responses using structured rubrics.
• Provide written justifications with specific evidence for evaluations.
• *Qualifications
• *Must-Have
• Professional experience applying domain expertise in a practitioner or advisory capacity.
• Familiarity with industry-specific standards, regulations, or clinical guidelines.
• Strong written communication and critical reasoning skills.
• *Application Process (Takes 20–30 mins to complete)
• Submit your resume to begin.
• Complete the Model Response Evaluation assessment.
• *Resources & Support**
• For details about the interview process and platform information, please check: https://talent.docs.mercor.com/welcome/welcome
• For any help or support, reach out to:
[email protected]
• PS: Our team reviews applications daily. Please complete your AI interview and application steps to be considered for this opportunity.*
,