Back

Generalist Evaluator Expert

Worldwide Salaried Open

Mercor is seeking detail-oriented writing experts to contribute to a high-impact AI research project with a leading lab. Freelancers will author prompt–golden answer pairs that train and evaluate advanced language models. This is a short-term, flexible opportunity for professionals with strong academic backgrounds and a knack for instructional clarity. Ideal for those who enjoy distilling complex concepts into well-crafted text. * * * ### Job Details: - Design and Optimize Prompts: Create detailed prompts with multiple constraints and instructions. - Define and Document Evaluation Standards: Establish high-level expectations for correct responses in general consumer contexts, and develop comprehensive rubric. - Conduct Model Testing and Grading: Run prompts through models and assess preliminary outputs against expectations. - Support Benchmarking and Quality Assurance: Collaborate in QA review processes to ensure prompt tasks and rubrics meet rigor, maintaining consistency and reliability before integration into official benchmarks. ### Minimum Qualifications: - BS or BA from a reputable institution completed or in progress - Strong writing and critical thinking skills. - Ability to work independently and meet deadlines. - Significant familiarity with ChatGPT or similar tools for personal decision-making or hobbies / general interests. - US or Canada based. ### Preferred Qualifications: - Experience in teaching or research. ### Application & Onboarding Process: - Complete an AI-led interview, this should take around 15 minutes. - Complete a 45-minute written assessment that will guide you through writing rubrics. - If selected, you will be invited to work on the project. ### More Details About This Role: - This is a remote and asynchronous role — work on your own schedule. - Expect to contribute at least 20 hours per week. - Expect a commitment of around 1 month. - You’ll be working in a structured project environment with clear goals and tools. * * * ### About [Mercor](https://mercor.com/): - Our team is based in San Francisco, CA - We [specialize](https://www.forbes.com/sites/johnwerner/2024/03/20/this-ai-startup-wants-to-create-jobs-not-take-them-away/) in recruiting experts for top AI labs - Our investors include Benchmark, General Catalyst, Adam D’Angelo, Larry Summers, and Jack Dorsey

Apply To This Job

More jobs

AI Product Engineer

Worldwide Salaried

Key Account Manager

Worldwide Salaried

Program Associate

Worldwide Salaried

Remote Amazon Marketplace Content & Keyword Optimization Specialist – SEO‑Driven Product Listing Expert for High‑Volume E‑Commerce

Worldwide Salaried

Remote Luxury Fashion Customer Experience Specialist – Amazon Shopbop Full‑Time Work‑From‑Home Role

Worldwide Salaried

Remote Amazon Customer Experience Specialist – Work‑From‑Home Customer Care Center Representative (Full‑Time, Flexible Shifts)

Worldwide Salaried

Remote Amazon Customer Service Representative – Entry‑Level Full‑Time Role with Comprehensive Training, Competitive Pay, Flexible Hours, and Clear Career Advancement Path

Worldwide Salaried

Remote Amazon Virtual Customer Care Advisor – Full‑Time Work‑From‑Home Role Supporting Billing, Insurance, and Pharmacy Services (Arizona Residents)

Worldwide Salaried

Remote Amazon Customer Service Representative – Fully Remote Flexible Schedule, Immediate Openings, Competitive Pay & Comprehensive Benefits

Worldwide Salaried

Part-Time Remote Amazon Customer Experience Specialist – Flexible Home‑Based Chat Support Role (20‑30 hrs/week)

Worldwide Salaried

FULL TIME United Airlines No Experience $25/hr?ysmartpros | WFM

Worldwide Salaried

Experienced Freelance UI/UX Engineer - Remote On-Demand Design Expert for High-Stakes Software Development Projects

Worldwide Salaried

Experienced Customer Service Representative – Remote Healthcare Support

Worldwide Salaried

Associate Attorney

Worldwide Salaried

Virtual Data Entry Specialist - Flexible Remote Work Opportunity at arenaflex

Worldwide Salaried

Marketing Coordinator, CRM & Loyalty

Worldwide Salaried

IRB Analyst

Worldwide Salaried

Experienced Part-Time Evening Data Entry Specialist – Remote Opportunity with arenaflex

Worldwide Salaried

Corporate Product Cybersecurity Governance & Incident Response Leader (Remote - Secret Clearance Required)

Worldwide Salaried

Professional Telephone Receptionist- *Part Time*

Worldwide Salaried