The AI safety institute network is a group of ten government-backed bodies — currently Australia, Canada, the EU, France, Japan, Kenya, the Republic of Korea, Singapore, the UK and the US — that coordinate technical methods for evaluating advanced AI models before and after deployment. It is not one institute; it is a forum for shared testing standards, joint exercises and information-sharing between national evaluators, formally renamed in February 2026 to reflect a wider science-of-evaluation mandate.
The AI safety institute network traces to the May 2024 Seoul AI Summit, where the UK, US, Japan, Singapore, the EU and several other governments agreed to establish an international forum for AI safety science. Since then it has run joint model-testing pilots, absorbed a new member (Kenya), and changed its own name — developments that most explainer content published in 2024–25 has not caught up with.
- What is the AI safety institute network?
- How does cross-border AI model testing actually work?
- What changed when the network renamed itself in 2026?
- How do the national institutes compare?
- Where can academic researchers plug into pre-deployment evaluation work?
- Answer-first questions on the AI safety institute network
- What happens next
What is the AI safety institute network?
The network is a technical forum, not a regulator. Each participating jurisdiction runs its own domestic AI safety or security institute — a government office, typically sitting inside a science or digital ministry — and the network is the channel through which those offices align methods rather than rules.
As of February 2026 its formal name is the International Network for Advanced AI Measurement, Evaluation and Science, having operated since its November 2024 founding as the “International Network of AI Safety Institutes.” The rename, announced on the UK AI Security Institute’s blog, signals a shift from a safety-branding forum toward a standing scientific body focused on measurement methodology.
- Established: November 2024, following the Seoul Statement of Intent signed at the May 2024 AI Seoul Summit
- Current members: Australia, Canada, the European Union, France, Japan, Kenya, the Republic of Korea, Singapore, the United Kingdom and the United States
- 2026 Network Coordinator: the United Kingdom, via the AI Security Institute (AISI)
- Mandate: build internationally recognised methods for measuring and evaluating advanced AI capabilities and risks
How does cross-border AI model testing actually work?
Cross-border testing runs on bilateral agreements and joint exercises rather than a single binding treaty. The clearest example is the UK–US Memorandum of Understanding signed on 1 April 2024, under which the two institutes committed to a shared approach to model evaluations, at least one joint test of a publicly accessible model, technical-research collaboration and staff exchanges.
Member institutes have already run several joint exercises. A pilot evaluation of an open foundation model tested interoperability between two toolkits — the UK’s open-source Inspect framework and Singapore’s Moonshot platform — checking whether results from different infrastructure could be compared directly.
- A multilingual evaluation led jointly by Singapore, Japan and the UK translated and validated benchmark questions into ten languages to test privacy, crime, intellectual-property and jailbreak-resistance risks across languages
- A joint exercise on “agentic” AI systems examined risks including sensitive-information leakage, fraud and cybersecurity in models capable of autonomous planning and task execution
- Information-sharing covers model limitations, capabilities and documented safety incidents, subject to each institute’s domestic law
No member institute has ceded sovereign testing authority to the network. Each government still decides what to evaluate and publish; the network’s value is methodological convergence, not enforcement.
What changed when the network renamed itself in 2026?
At a technical meeting held alongside the NeurIPS conference in San Diego in late 2025, members reported growing consensus on evaluation principles and surfaced open methodological questions in a February 2026 blog post — evidence the network has moved from launch-phase diplomacy to a working scientific programme.
Consensus areas reported include: evaluations must state clear objectives before they run; results must be transparent and reproducible; testing parameters should match the evaluation’s purpose (best-case elicitation versus realistic use); and reports should separate individual tests and stay legible to non-specialist policymakers.
Unresolved questions include how much methodological detail evaluators can publish without enabling replication of dangerous capabilities, how flexible reporting templates should be, and how to evaluate deployed AI systems — not just underlying models. The network’s next meeting is at the India AI Impact Summit, where members plan to test approaches against real-world use cases.
How do the national institutes compare?
Funding, remit and regulatory posture vary sharply between members — “the AI safety institute” is a misleading singular, since at least six materially different institutional models operate under one network.
| Jurisdiction | Institute | Parent body | Reported funding | Primary posture |
|---|---|---|---|---|
| United Kingdom | AI Security Institute (AISI, formerly AI Safety Institute) | Department for Science, Innovation and Technology | £100 million secured until 2030 | Voluntary evaluations, R&D, 2026 Network Coordinator |
| United States | Center for AI Standards and Innovation (CAISI, formerly US AISI) | NIST | $10 million for FY2024/25 | Voluntary evaluations and standards R&D |
| European Union | EU AI Office | DG Connect | €46.5 million setup funding | Binding regulation — enforces the EU AI Act |
| Singapore | Digital Trust Centre / AI Verify Foundation | IMDA | $37 million setup funding | Testing tooling and content-assurance standards |
| Japan | Japan AI Safety Institute | IPA | Not separately disclosed | Evaluation methodology and standards research |
| Canada | Canadian AI Safety Institute | ISED | Not separately disclosed | R&D and international coordination |
The EU is the outlier: its AI Office is the only member with statutory enforcement power, administering the EU AI Act’s obligations on providers of general-purpose models with systemic risk. Every other member’s work is voluntary — which is why joint testing protocols, not legislation, are the network’s main coordination tool.
Where can academic researchers plug into pre-deployment evaluation work?
Pre-deployment evaluation is not a closed government function. Several member institutes have built explicit, funded routes for academic researchers rather than treating testing as an internal civil-service exercise.
- Open evaluation tooling: the UK AISI’s Inspect framework is published as open-source software, so university research groups can build and run their own model evaluations using the same infrastructure member institutes use, rather than reverse-engineering proprietary test harnesses
- Grant-funded safety research: AISI’s Systemic Safety Grants programme has funded academic work on systemic AI risks, alongside The Alignment Project — a multi-funder grants and compute programme backed by AISI, UKRI, Schmidt Sciences and several frontier AI developers supporting alignment research beyond government labs
- Conference-linked workshops: the network’s 2025 technical meeting was held alongside NeurIPS to exchange methods with academic and industry researchers, not as a closed intergovernmental session
- Multilingual and cross-cultural evaluation gaps: the network has flagged that evaluation science for non-English, non-Western contexts is underdeveloped — a direct, citable entry point for university linguistics and computational social science groups
For institutional leaders, the practical implication is that AI safety evaluation now sits alongside conventional funder compliance work: grant terms from bodies co-funding alignment research carry reporting and reproducibility expectations that research administration teams already apply to other externally funded programmes.
Answer-first questions on the AI safety institute network
Which countries have AI safety institutes?
Ten members currently participate in the International Network for Advanced AI Measurement, Evaluation and Science: Australia, Canada, the European Union, France, Japan, Kenya, the Republic of Korea, Singapore, the United Kingdom and the United States, each running its own domestic evaluation body.
Did the AI Safety Institute change its name?
Yes. In 2025 the UK’s AI Safety Institute was renamed the AI Security Institute, and its US counterpart became the Center for AI Standards and Innovation (CAISI). In February 2026 the international network itself was renamed from the “International Network of AI Safety Institutes” to the “International Network for Advanced AI Measurement, Evaluation and Science.”
Are the network’s testing protocols legally binding?
No single binding treaty governs the whole network. Cooperation runs through bilateral instruments such as the UK–US Memorandum of Understanding and voluntary joint exercises; the EU AI Office is the sole member with statutory enforcement power, via the EU AI Act.
Who coordinates the network in 2026?
The United Kingdom, through its AI Security Institute, holds the 2026 Network Coordinator role and has committed to converting the group’s evaluation-science consensus into detailed best-practice documentation before the network’s next meeting at the India AI Impact Summit.
What happens next
The network’s trajectory — from a 2024 diplomatic statement to a 2026 renamed scientific body with a rotating coordinator and a published consensus-and-open-questions agenda — suggests consolidation around evaluation methodology rather than a drift into national silos. Kenya’s addition also signals a deliberate widening beyond the original G7-plus-Singapore core.
For universities, funders and publishers tracking AI governance, the signal to watch is the best-practice documentation the UK has committed to during its 2026 coordinator term, and whether evaluating deployed AI systems — not just underlying models — gets resolved before the India AI Impact Summit.