Humanity's Last Exam

software

A challenging benchmark of 2,500 expert-level questions across multiple domains, designed to test advanced AI capabilities.

Also mentioned (1)

Casual references without a clear endorsement

Y Combinator mentioned "So recently you guys just announced some incredible results for humanity's last exam. Can you tel..." (at 6:39)