Laboratory & ResearchLab & research supplies.Reagents, consumables, PPE & instruments — documented, fast, chain-of-custody shipping.Shop lac.us lac.us

Dictionary domainTrack C

AI and ML research outputs

Model cards, system cards, datasheets, benchmarks, evaluation suites.

Implementation guide →Working group →Editorial in this domain →

For implementers

Operational deployment checklist for AI and ML research outputs: prerequisites, five deploy steps, integration notes for Pure, Symplectic Elements, Worktribe, DSpace, and more, plus the pitfalls that recur in the field.

View implementation checklist →

Terms in this domain

51 terms

Dictionary termProposed

EU AI Act Article 53 (General-Purpose AI Model Obligations)

Article 53 of the EU AI Act (Regulation (EU) 2024/1689) sets the baseline obligations that apply to every provider of a general-purpose AI (GPAI) model placed on the EU market -- a model trained on large amounts of data using self-supervision at scale, showing significant generality, and capable of competently performing a wide range of distinct tasks (Article 3(63)). A GPAI provider must, at minimum: (1) draw up and keep current technical documentation of the model per Annex XI, available on request to the AI Office and national authorities; (2) draw up and make available to downstream AI-system providers the information and documentation set out in Annex XII needed to understand the model's capabilities and limitations; (3) put in place a policy to comply with EU copyright law, including respecting rightsholders' text-and-data-mining opt-outs reserved under Article 4(3) of Directive (EU) 2019/790; and (4) draw up and publish a sufficiently detailed summary of the content used to train the model, following a template the AI Office provides. These obligations became applicable on 2 August 2025 (Article 113(b)). Models released under a genuinely free and open-source licence, with publicly available weights, architecture, and usage information, are exempt from obligations (1) and (2) above -- but never from the copyright-policy or training-data-summary duties, and never at all if the model is also classified as carrying 'systemic risk.'

AI and ML research outputs· Data & methods→

Dictionary termProposed

EU AI Act Article 6 (High-Risk Classification Rules)

Article 6 of the EU AI Act (Regulation (EU) 2024/1689) is the classification test that determines whether an AI system counts as 'high-risk.' An AI system is high-risk if either: (1) under Article 6(1), it is a safety component of, or is itself, a product covered by EU harmonisation legislation listed in Annex I and that product requires third-party conformity assessment; or (2) under Article 6(2), it falls into one of the eight functional use-case categories listed in Annex III (biometrics; critical infrastructure; education/vocational training; employment/worker management; essential private and public services; law enforcement; migration/asylum/border control; administration of justice and democratic processes). Article 6(3) provides a narrow, documented exception for Annex III systems that perform only a narrow procedural task, improve a completed human activity's result, detect patterns without replacing human assessment, or perform preparatory tasks -- unless the system profiles natural persons, in which case it is always high-risk. Classification under Article 6 is the trigger, not the substance: it determines whether the Title III, Chapter 2 obligations (risk management, data governance, technical documentation, human oversight, and more) apply at all.

AI and ML research outputs· Data & methods→

Dictionary termProposed

NEJM AI

NEJM AI is a peer-reviewed monthly journal published by NEJM Group (a division of the Massachusetts Medical Society, the same publisher as the flagship New England Journal of Medicine) that publishes original research, reviews, perspectives, and policy analysis on the application of artificial intelligence and machine learning to medicine and health care. It launched in December 2023, with its inaugural issue in January 2024. NEJM AI is editorially and operationally distinct from NEJM itself: it has its own submission portal (ai.nejm.org), its own editorial board and peer-review process, and a narrower topical scope than the flagship journal's general internal medicine remit. A manuscript is within NEJM AI's scope if its central contribution is an AI/ML method, dataset, or system applied to a biomedical or clinical problem, or an analysis of AI's technical, regulatory, ethical, or health-system implications — not simply a clinical study that happens to use software.

AI and ML research outputs· Data & methods→

Dictionary termProposed

National AI Research Resource (NAIRR)

The National AI Research Resource (NAIRR) is a US federal initiative -- currently operating as the NAIRR Pilot -- that provides researchers, educators, and students with shared access to AI computational infrastructure (supercomputing and cloud allocations), AI-ready datasets, pretrained models, and educational resources, rather than direct cash funding. It is led by the National Science Foundation (NSF) in partnership with more than a dozen other federal agencies, including the Department of Energy (DOE) and National Institutes of Health (NIH), who co-lead a security-focused 'NAIRR Secure' track for sensitive data. Eligible US-based researchers, educators, and graduate students (with a faculty sponsor) apply for time-limited resource allocations -- typically a 3-month Start-Up award or a 12-month Research award -- through a competitive, proposal-based process rather than an open-access account. A project 'uses NAIRR' when it draws on the pilot's pooled portfolio of partner-provided compute, storage, or model resources under an awarded allocation; a project that purchases or funds its own AI hardware, or that uses a single university's internal HPC cluster with no NAIRR allocation, is not a NAIRR activity even if the research itself is AI-related.

AI and ML research outputs· Data & methods→

Dictionary termProposed

Scite (Smart Citations)

Scite is a commercial citation-analysis platform whose core feature, Smart Citations, uses a trained natural-language-processing (deep learning) model to classify each individual citing statement of a paper as supporting, contrasting, or mentioning the claim it cites, and displays the actual citation-context sentence alongside that classification. A citation is a Scite Smart Citation, specifically, when it comes with (a) the excerpted sentence from the citing paper that references the cited work, and (b) a supporting/contrasting/mentioning label assigned to that specific citing statement -- not just a tally of how many times the work has been cited.

AI and ML research outputs· Data & methods→

Dictionary termProposed

EU AI Act Article 10 (Data and Data Governance)

Article 10 of the EU AI Act (Regulation (EU) 2024/1689), in Title III, Chapter 2, sets the data-quality and data-governance obligations for the training, validation, and testing datasets used to build high-risk AI systems. A system falls under Article 10 only after it has already met the Act's separate 'high-risk' classification test (an Annex III listed use case, or a safety component covered by Annex I sectoral product law). For a high-risk system built using techniques that involve training models on data, Article 10(2) requires documented data-governance practices covering collection and origin, preparation operations (annotation, labelling, cleaning, updating, enrichment), bias examination, bias-mitigation measures, and identification of data gaps; Article 10(3)-(4) require the training, validation, and testing datasets to be relevant, sufficiently representative, and to the best extent possible free of errors and complete for the system's intended purpose and deployment context. For a high-risk system that does not use such training techniques, Article 10(6) narrows the same requirements to the testing dataset only. Article 10(5) separately permits limited, safeguarded processing of special-category personal data solely to detect and correct bias. It does not apply to AI systems or models developed and used solely for scientific research and development prior to being placed on the market or put into service, per the Article 2(6)/2(8) research exemption.

AI and ML research outputs· Data & methods→

Dictionary termProposed

Consensus (AI Academic Search Engine)

Consensus is a named AI-powered academic search engine (built by the company Consensus, at consensus.app) that retrieves peer-reviewed papers relevant to a natural-language research question and generates a synthesis of what the retrieved literature says, rather than returning a plain ranked list of results. For questions phrased as a yes/no/maybe claim, it additionally displays a 'Consensus Meter' -- a visual indicator of how the retrieved papers' findings line up (agree, disagree, or mixed) on that specific claim, generated from the paper set Consensus itself retrieved and summarized. It is a specific product in the 'literature summarization and evidence-synthesis' category, distinct from general-purpose AI chatbots (which do not search a dedicated indexed academic corpus or cite retrieved papers by default) and from citation-graph or general-purpose scholarly search tools such as Semantic Scholar (which surface and rank papers but do not generate a claim-level agreement synthesis across them).

AI and ML research outputs· Data & methods→

Dictionary termProposed

AI Research Tool

An AI research tool is software that applies machine learning — typically a large language model (LLM), an embedding-based semantic search index, or both — to a specific stage of the research workflow: finding and screening literature, extracting or summarizing data from papers, mapping citation relationships, drafting or revising manuscript text, or analyzing research data. "AI research tool" is a category label, not the name of any single product: it covers named tools with genuinely different scopes (a citation-mapping tool like Connected Papers does not do what a writing-assistance tool like Paperpal does), and a page or citation that treats it as one interchangeable thing is usually mis-scoped. What makes a tool an instance of this category, rather than a general-purpose AI assistant that happens to get used for research, is that it is built or marketed specifically around a research task — an academic search index, a citation graph, a manuscript-formatting model trained on published literature — rather than being a general chat interface pointed at an arbitrary prompt.

AI and ML research outputs

Terms in this domain

EU AI Act Article 53 (General-Purpose AI Model Obligations)

EU AI Act Article 6 (High-Risk Classification Rules)

NEJM AI

National AI Research Resource (NAIRR)

Scite (Smart Citations)

EU AI Act Article 10 (Data and Data Governance)

Consensus (AI Academic Search Engine)

AI Research Tool

Synthetic benchmark

RLHF (Reinforcement Learning from Human Feedback)

Constitutional AI (concept)

Prompt injection

Jailbreak (LLM)

Red-teaming

AI safety case

AI evaluation card

Reproducible AI experiment

Open-source model (criteria)

Open weights model

Model weight licence

Model checkpoint

Model evaluation suite

Model fine-tune lineage

Model lineage

Inference carbon footprint

Training carbon footprint

Compute (FLOPs estimate)

Training data composition

Parameter count

Mixture-of-experts (MoE)

Frontier model

Foundation model

MMLU benchmark

HELM benchmark

BIG-bench

MLCommons benchmark

Hugging Face Hub (concept)

NIST AI RMF (Risk Management Framework)

ISO/IEC 42001 (AI management system)

AI conformance assessment

AI assurance

Trustworthy AI

Responsible AI

Use card

Algorithm card

Bias audit (model)

Model audit

Data statement (NLP)

Datasheet for datasets

System card

Model card