Dictionary domainTrack C
AI and ML research outputs
Model cards, system cards, datasheets, benchmarks, evaluation suites.
For implementers
Operational deployment checklist for AI and ML research outputs: prerequisites, five deploy steps, integration notes for Pure, Symplectic Elements, Worktribe, DSpace, and more, plus the pitfalls that recur in the field.
Terms in this domain
43 terms
Synthetic benchmark
A benchmark whose evaluation items are wholly or partially generated by another model or procedural method, rather than collected from natural human-produced sources, used to probe specific capabilities or to scale evaluation cheaply.
RLHF (Reinforcement Learning from Human Feedback)
A training methodology in which a language model is fine-tuned using a reward signal derived from human preferences over pairs (or larger sets) of candidate model outputs, typically by first training a reward model and then optimising the policy against it via PPO or a related algorithm.
Constitutional AI (concept)
A training methodology in which a model is trained to align its outputs with a written set of principles ('a constitution'), with the model itself used to critique and revise candidate responses against those principles in place of direct human feedback at scale.
Prompt injection
An attack on a language-model-based system in which adversarial instructions, embedded in untrusted input (a document, web page, tool output, image), cause the model to act in ways that diverge from its developer's or user's intent.
Jailbreak (LLM)
A prompt or interaction pattern that causes a language model to bypass its safety training and produce outputs the model was tuned to refuse, such as harmful instructions, restricted content, or violations of provider policy.
Red-teaming
The practice of deliberately adversarial testing of an AI system by skilled testers attempting to elicit failures, unsafe outputs, or policy violations, in order to discover weaknesses before deployment.
AI safety case
A structured, evidence-based argument that an AI system is acceptably safe to deploy in a defined context, modelled on safety cases from established engineering disciplines (nuclear, aviation, medical devices).
AI evaluation card
A structured documentation artefact specifically describing an evaluation of an AI system, separate from the model card, including the evaluation methodology, datasets, metrics, results, and known limitations of the evaluation itself.
Reproducible AI experiment
An AI experiment for which sufficient artefacts and metadata are released (data, code, seed, environment, hyperparameters, training procedure) that an independent investigator can re-run it and obtain numerically equivalent or statistically indistinguishable results.
Open-source model (criteria)
A model meeting the criteria articulated by the Open Source Initiative's Open Source AI Definition: open data information, open code, and open weights, with each released under terms compatible with the OSI's freedoms to use, study, modify, and share.
Open weights model
A model whose trained parameter values are publicly released and downloadable, typically under a named licence, distinct from but often described as 'open' even when training data and code are not released.
Model weight licence
The licence terms governing the use, modification, and redistribution of a model's trained weights, which may differ from the licence on the training code and the licence on the training data.
Model checkpoint
A saved snapshot of a model's parameters (and optionally optimiser state) at a specific point in training, identified by a step number or version tag and serialised to a file format such as safetensors or .pt.
Model evaluation suite
A defined collection of benchmarks, tasks, and metrics, with standardised prompting and decoding rules, used to characterise a model's capabilities and behaviour across a range of dimensions.
Model fine-tune lineage
The specific portion of model lineage that records the sequence of fine-tuning operations applied to a base model: dataset, method (SFT, DPO, RLHF, LoRA), hyperparameters, and resulting checkpoint identifier.
Model lineage
The chain of provenance for a model recording its base model, the fine-tuning datasets and procedures applied, and any further derivatives, such that any deployed model can be traced back to its constituent training operations.
Inference carbon footprint
The greenhouse-gas emissions associated with serving inference requests from a deployed model, typically expressed per-request (e.g., gCO2e per query) or in aggregate (kgCO2e per month).
Training carbon footprint
The total greenhouse-gas emissions, expressed in kilograms or tonnes of CO2-equivalent, attributable to training a machine-learning model, estimated from energy consumption and the carbon intensity of the electricity supply.
Compute (FLOPs estimate)
The total floating-point operations consumed by training a model, conventionally reported as a single number (e.g., 3.0 x 10^25 FLOPs) used as a regulatory and scientific proxy for training-run scale.
Training data composition
The mixture of data sources, by domain, language, modality, and provenance, used to train a model, including the proportions and any filtering or deduplication applied.
Parameter count
The total number of learnable scalar weights in a machine-learning model, conventionally reported as a count (e.g., 7B = 7 x 10^9 parameters) and disclosed as a basic model metadata field.
Mixture-of-experts (MoE)
A neural-network architecture in which a learned router directs each input (or token) to a small subset of specialist sub-networks ('experts'), so that the model has a large total parameter count but uses only a fraction per forward pass.
Frontier model
A foundation model whose capabilities meet or exceed the most advanced publicly known systems at the time of training, often defined operationally by training-compute thresholds or by performance on canonical benchmarks.
Foundation model
A large machine-learning model trained on broad data at scale and adaptable to a wide range of downstream tasks through fine-tuning, prompting, or retrieval augmentation.
MMLU benchmark
The Massive Multitask Language Understanding benchmark, a 57-subject multiple-choice test covering elementary, high-school, college, and professional knowledge, designed to probe broad-coverage language-model knowledge.
HELM benchmark
The Holistic Evaluation of Language Models benchmark, a multi-metric framework evaluating language models across accuracy, calibration, robustness, fairness, bias, toxicity, and efficiency on a fixed set of scenarios.
BIG-bench
The Beyond the Imitation Game benchmark, a community-contributed collection of more than 200 tasks designed to probe capabilities of large language models that may be missed by narrower benchmarks.
MLCommons benchmark
A benchmark published by the MLCommons consortium for measuring AI system performance under standardised workloads, datasets, and submission rules, with the principal suites being MLPerf Training, MLPerf Inference, and MLPerf HPC.
Hugging Face Hub (concept)
A web-based platform and ecosystem for sharing machine-learning models, datasets, and demonstration applications ('Spaces'), with conventions for model cards, dataset cards, and versioned repositories.
NIST AI RMF (Risk Management Framework)
The US National Institute of Standards and Technology's voluntary framework for managing risks associated with AI systems across the AI lifecycle, structured around the functions Govern, Map, Measure, and Manage.
ISO/IEC 42001 (AI management system)
An international standard, published in 2023, specifying requirements for establishing, implementing, maintaining, and continually improving an AI Management System within an organisation, structured analogously to ISO 9001 (quality) and ISO/IEC 27001 (information security).
AI conformance assessment
A formal evaluation, conducted by the AI system provider or a notified third-party body, demonstrating that an AI system meets the applicable regulatory or standard-based requirements before being placed on the market.
AI assurance
The process of measuring, evaluating, and communicating the trustworthiness of AI systems through evidence-based mechanisms such as audits, certifications, impact assessments, and conformity declarations.
Trustworthy AI
AI systems exhibiting properties (lawful, ethical, technically robust) that warrant the trust of users, affected parties, and society, as articulated in the EU High-Level Expert Group's framework and adopted in subsequent regulation.
Responsible AI
An umbrella term covering the design, development, deployment, and governance practices intended to ensure AI systems are ethical, fair, transparent, accountable, robust, secure, and respectful of privacy.
Use card
A documentation artefact recording an intended deployment context for a model or system, including the user population, the decisions the model informs, the supervision regime, and out-of-scope uses.
Algorithm card
A documentation artefact describing the algorithmic method or family (e.g., a particular gradient-boosting estimator, a clustering algorithm) independent of any particular trained instance, including inductive biases, assumptions, complexity, and intended use cases.
Bias audit (model)
An audit specifically focused on disparate model performance across demographic, geographic, or contextual sub-groups, including testing for direct, proxy, and intersectional disparities.
Model audit
A structured assessment of a machine-learning model by an independent party against pre-specified criteria covering performance, robustness, fairness, security, privacy, and conformance with stated policy.
Data statement (NLP)
A standardised description of an NLP dataset covering curation rationale, language variety, speaker and annotator demographics, speech situation, text characteristics, and recording quality.
Datasheet for datasets
A structured document accompanying a machine-learning dataset that records its motivation, composition, collection process, pre-processing, intended uses, distribution, and maintenance, modelled on electronic-component datasheets.
System card
A documentation artefact describing an AI-enabled system in its production configuration, including the constituent models, the pre- and post-processing pipeline, safety filters, monitoring, and operational guardrails.
Model card
A short, structured document accompanying a machine-learning model that records its intended use, training data, evaluation methodology, performance characteristics across population sub-groups, and known limitations.







