Skip to main content
v2026.1714 entries · CC-BY 4.0
CASRAI
Dictionary termTrack CStablev2026.2

BIG-bench

The Beyond the Imitation Game benchmark, a community-contributed collection of more than 200 tasks designed to probe capabilities of large language models that may be missed by narrower benchmarks.

ByCASRAI Editorial Board
· Last updated 21 May 2026

Examples

Worked examples

  • Is an instance

    A model technical report including BIG-bench Hard average accuracy across 23 tasks.

  • Is an instance

    A new benchmark paper using BIG-bench as a baseline distribution of LLM capability.

Counter-examples

Looks similar, but isn't

  • Not an instance

    MMLU (a different benchmark).

  • Not an instance

    A single dataset like SQuAD.

Editorial commentary

BIG-bench (Srivastava et al., 2023) emphasises task diversity: arithmetic, logic, multilingual translation, theory-of-mind, common-sense, code, and intentionally adversarial probes. The 'BIG-bench Hard' subset captures the hardest tasks where models showed substantial headroom; it has been heavily reused as a comparison set.

References

  • Srivastava et al., 'Beyond the Imitation Game' (Transactions on Machine Learning Research, 2023).

Also known as

Beyond the Imitation Game Benchmark · BBH (subset)

Machine-readable encodings

Use in your systems

JATS XML <role> element
xml
<role vocab="credit"
      vocab-identifier="https://casrai.org/dictionary/"
      vocab-term="BIG-bench"
      vocab-term-identifier="https://casrai.org/dictionary/term/big-bench" />
Schema.org DefinedTerm (JSON-LD)
json
{
  "@context": "https://schema.org",
  "@type": "DefinedTerm",
  "name": "BIG-bench",
  "identifier": "https://casrai.org/dictionary/term/big-bench",
  "description": "The Beyond the Imitation Game benchmark, a community-contributed collection of more than 200 tasks designed to probe capabilities of large language models that may be missed by narrower benchmarks.",
  "inDefinedTermSet": "https://casrai.org/dictionary/domain/ai-and-ml-research-outputs/",
  "url": "https://casrai.org/dictionary/term/big-bench",
  "sameAs": [
    "Beyond the Imitation Game Benchmark",
    "BBH (subset)"
  ],
  "license": "https://creativecommons.org/licenses/by/4.0/"
}
LAC

Partner Deal

LAC Health Supplies Mobile App

Referenced across the research world

University of Cambridge logoColumbia University logoUniversity of Edinburgh logoHarvard University logoUniversity of Oxford logoPrinceton University logoStanford School of Medicine logoUniversity College London logoORCID logoCrossref logoUniversity of Cambridge logoColumbia University logoUniversity of Edinburgh logoHarvard University logoUniversity of Oxford logoPrinceton University logoStanford School of Medicine logoUniversity College London logoORCID logoCrossref logo
  • University of Cambridge logo
  • Columbia University logo
  • University of Edinburgh logo
  • Harvard University logo
  • University of Oxford logo
  • Princeton University logo
  • Stanford School of Medicine logo
  • University College London logo
  • ORCID logo
  • Crossref logo

View CASRAI adoption →