Back

Research Scientist, LLM Evaluation – Post-Training

Worldwide Salaried Open

Job Description:

  • Define and execute a rigorous research agenda focused on LLM evaluation and post-training, with emphasis on evaluation-driven model improvement
  • Design experiments to study how evaluation methodologies impact fine-tuning and post-training outcomes
  • Develop and validate comprehensive evaluation frameworks for LLM and multimodal systems
  • Lead research on frontier evaluation domains including long-context, cross-modal, and dynamic multi-turn evaluations
  • Analyze model behavior and failure patterns; generate actionable recommendations for model improvement
  • Partner with Language Data Scientists to integrate human-in-the-loop and synthetic data/evaluation strategies

Requirements:

  • MS or PhD in Computer Science, Machine Learning, Statistics, Applied Mathematics, AI, or a related quantitative field (PhD strongly preferred)
  • 5+ years of relevant experience in applied ML research or research science, with substantial work in LLMs or foundation models (graduate research counts)
  • Demonstrated experience with LLM evaluation, benchmarking, alignment, post-training, or model quality research
  • Strong foundation in experimental design, statistical analysis, and scientific reasoning for ML systems
  • Strong Python coding skills for research experimentation, data processing, evaluation pipelines, statistical analysis, and visualization
  • Hands-on experience with modern ML frameworks (PyTorch, Hugging Face, JAX/TensorFlow)

Benefits:

  • Remote work options
  • Professional development opportunities

Apply tot his job Apply To this Job

More jobs

Public Health Research Scientist (REMOTE ROLE)

Worldwide Salaried

Sr. Blockchain Developer

Worldwide Salaried

Remote Blockchain Developer

Worldwide Salaried

Remote Blockchain Developer (Solidity / Smart Contract) – Quantum Studio

Worldwide Salaried

Blockchain Developer, Work from Home

Worldwide Salaried

AI/ML Research Scientist (Remote)

Worldwide Salaried

Molecular Biologist / Research Scientist – Remote ($60 –$80/hr)

Worldwide Salaried

Remote Research Scientist, Life Sciences (PhD)-932675

Worldwide Salaried

Remote AI Research Scientist - Machine Learning for Healthcare

Worldwide Salaried

AI Research Scientist (United States, Remote)

Worldwide Salaried

Remote Data Entry Technician – Full‑Time Work‑From‑Home Role Supporting Administrative Operations at arenaflex

Worldwide Salaried

Remote Data Entry Specialists (Male & Female) – Flexible Hours, $20‑$25 per Hour, Customer Interaction & Survey Support

Worldwide Salaried

Regional Account Manager - SFNC - Delaware/East Shore

Worldwide Salaried

Project Manager TD&S - Pennsylvania - Remote

Worldwide Salaried

Account Manager, Endoscopy, Ontario

Worldwide Salaried

Experienced Credit Customer Service Representative – Remote Opportunity at arenaflex

Worldwide Salaried

Crisis Triage Specialist - 988 National Suicide Prevention Line (Follow Up) - Monday-Friday 3PM-11:30PM

Worldwide Salaried

Experienced Remote Chat Agent – No Experience Needed – Work Remotely Worldwide

Worldwide Salaried

Recruiter (Temporary)

Worldwide Salaried

Experienced Full Stack Customer Service Representative – Nexpatha Online Chat Support (Part-Time)

Worldwide Salaried