Skip to main content
v2026.1714 entries · CC-BY 4.0
Dictionary termTrack AStablev2026.2

Large language model (LLM)

A neural-network model trained on large text corpora using self-supervised next-token prediction (or analogous objective), with parameter counts typically in the billions, capable of generating coherent text and performing a broad range of natural-language tasks without task-specific training.

ByCASRAI Editorial Board
· Last updated 21 May 2026

Examples

Worked examples

  • Is an instance

    GPT-4 (OpenAI)

  • Is an instance

    Claude 3 (Anthropic)

  • Is an instance

    Llama 3 (Meta)

Counter-examples

Looks similar, but isn't

  • Not an instance

    A BERT-based classifier fine-tuned only to label sentiment is not typically called an LLM in the generative sense

Editorial commentary

LLMs underpin most current generative AI writing tools (GPT family, Claude, Gemini, Llama, etc.). They differ from earlier task-specific NLP models in their scale and in-context learning ability. For disclosure purposes the relevant attributes are the model name, version, training-data cutoff, and whether retrieval augmentation or fine-tuning was applied.

References

  • Bommasani et al. 2021 ‘On the Opportunities and Risks of Foundation Models’
  • Brown et al. 2020 ‘Language Models are Few-Shot Learners’

Also known as

LLM · Foundation language model

Machine-readable encodings

Use in your systems

JATS XML <role> element
xml
<role vocab="credit"
      vocab-identifier="https://casrai.org/dictionary/"
      vocab-term="Large language model (LLM)"
      vocab-term-identifier="https://casrai.org/dictionary/term/large-language-model" />
Schema.org DefinedTerm (JSON-LD)
json
{
  "@context": "https://schema.org",
  "@type": "DefinedTerm",
  "name": "Large language model (LLM)",
  "identifier": "https://casrai.org/dictionary/term/large-language-model",
  "description": "A neural-network model trained on large text corpora using self-supervised next-token prediction (or analogous objective), with parameter counts typically in the billions, capable of generating coherent text and performing a broad range of natural-language tasks without task-specific training.",
  "inDefinedTermSet": "https://casrai.org/dictionary/domain/generative-ai-use-and-disclosure/",
  "url": "https://casrai.org/dictionary/term/large-language-model",
  "sameAs": [
    "LLM",
    "Foundation language model"
  ],
  "license": "https://creativecommons.org/licenses/by/4.0/"
}

Adopted by research universities worldwide

University of Cambridge logoColumbia University logoUniversity of Edinburgh logoHarvard University logoMassachusetts Institute of Technology logoUniversity of Oxford logoPrinceton University logoStanford School of Medicine logoUniversity College London logoUniversity of Cambridge logoColumbia University logoUniversity of Edinburgh logoHarvard University logoMassachusetts Institute of Technology logoUniversity of Oxford logoPrinceton University logoStanford School of Medicine logoUniversity College London logo
  • University of Cambridge logo
  • Columbia University logo
  • University of Edinburgh logo
  • Harvard University logo
  • Massachusetts Institute of Technology logo
  • University of Oxford logo
  • Princeton University logo
  • Stanford School of Medicine logo
  • University College London logo

View CASRAI adoption →