Le plan Observateur est-il vraiment gratuit ?

Oui, sans limite de durée ni carte de crédit. Accédez au dashboard, enregistrez jusqu'à 3 systèmes IA et lancez votre diagnostic de maturité.

Combien de temps prend le diagnostic de maturité IA ?

Environ 10 minutes. Vous recevez un rapport complet avec un score de maturité et des recommandations personnalisées pour votre organisation.

Les données sont-elles sécurisées dans l'espace membre ?

Absolument. Hébergement Supabase avec chiffrement au repos et en transit, conforme aux exigences de la Loi 25 et du RGPD.

Quels cadres réglementaires IA couvrez-vous ?

Loi 25 (Québec), EU AI Act, NIST AI RMF, ISO/IEC 42001 et les principes OCDE. Chaque cadre dispose de son propre tableau de suivi dans les outils du Cercle.

Puis-je changer de plan à tout moment ?

Oui, passez du plan Observateur au Membre ou Expert quand vous le souhaitez. La transition est instantanée et vos données sont conservées.

Comment fonctionne le support du Cercle de Gouvernance de l'IA ?

Observateur : documentation en ligne. Membre : support par courriel sous 24h. Expert : support dédié avec temps de réponse garanti.

2.2 AI system security vulnerabilities and attacks · 112 documented risks · AIRI Reference

Applicable legal frameworks

Québec

Law 25 (Quebec privacy law)Direct

Article 10 (mesures de sécurité), article 3.5 (incidents de confidentialité)

Quebec law on the protection of personal information in force since September 22, 2023, regulating the collection, use, disclosure, and retention of personal information by businesses and public bodies. Includes obligations regarding automated decision-making (Article 12.1).

International

ISO/IEC 42001:2023Recommandation

A.6 (gestion des risques), A.10 (sécurité)

Certifiable standard describing the requirements for establishing an AI management system. Relevant for voluntary certification processes.

NIST AI RMF 1.0Recommandation

Manage 4.2

Voluntary AI risk management framework structured around four functions: Govern, Map, Measure, Manage. Complemented in 2024 by the Generative AI Profile (NIST-AI-600-1). A common reference in AI governance.

UE

AI Act (European Union)Si exposition UE

Article 15 (cybersécurité)

European regulation based on a risk-based approach (unacceptable, high, limited, minimal). Staged application: prohibitions and literacy since February 2025, general-purpose AI models since August 2025, governance and penalties from August 2, 2026. The Digital Omnibus adopted on June 29, 2026 postpones high-risk obligations to December 2027 (Annex III) and August 2028 (Annex I). Relevant for Quebec organizations doing business in the EU.

Quebec sector examples

Banque et assurance

Banque et assuranceInstitution financière

Un attaquant exploite une injection de prompt dans un agent IA exposé aux clients d'une coopérative financière pour extraire les instructions système et lister les fonctions internes.

Manufacturier

ManufacturierIndustriel

Une chaîne de production équipée de modèles de vision est trompée par des autocollants adversariaux qui font passer des pièces défectueuses pour conformes.

Recommended mitigations

2.1Model and Infrastructure Security
Technical and physical safeguards that secure AI models, their weights, and infrastructure to prevent unauthorized access, theft, alteration, and espionage.
2.3Model Safety Engineering
Technical methods and safeguards that frame model behaviors and protect them against exploitation and vulnerabilities.
3.1Testing and Audits
Systematic internal and external evaluations that examine AI systems, infrastructure, and compliance processes to identify risks, verify safety, and ensure performance meets standards.
3.6Incident Response and Recovery
Protocols and technical systems that respond to security incidents, safety failures, or misuse of capabilities to contain harm and restore safe operations.
4.3Incident Reporting
Formal processes and protocols that document and share AI safety incidents, security breaches, near misses, and relevant threat intelligence with appropriate stakeholders to enable coordinated responses and systemic improvements.

Documented risks (112)

Entries from the AI Risk Repository (MIT) classified under this subdomain. Original content in English.

Entity

Intent

Timing

112 entries

Risk Sub-CategoryCui2024

02.03.04Software Vulnerabilities

"Programmers are accustomed to using code generation tools such as Github Copilot for program development, which may bury vulnerabilities in the program."

HumanIntentionalPost-deployment

Risk CategoryCui2024

02.04.00Software Security Issues

"The software development toolchain of LLMs is complex and could bring threats to the developed LLM."

OtherOtherPre-deployment

Risk Sub-CategoryCui2024

02.04.01Programming Language

"Most LLMs are developed using the Python language, whereas the vulnerabilities of Python interpreters pose threats to the developed models"

OtherIntentionalPre-deployment

Risk Sub-CategoryCui2024

02.04.02Deep Learning Frameworks

"LLMs are implemented based on deep learning frameworks. Notably, various vulnerabilities in these frameworks have been disclosed in recent years. As reported in the past five years, three of the most common types of vulnerabilities are buffer overflow attacks, memory corruption, and input validation issues."

AIIntentionalPre-deployment

Risk Sub-CategoryCui2024

02.04.03Software Supply Chains

"The software development toolchain of LLMs is complex and could bring threats to the developed LLM."

AIIntentionalPre-deployment

Risk Sub-CategoryCui2024

02.04.04Pre-processing Tools

"Pre-processing tools play a crucial role in the context of LLMs. These tools, which are often involved in computer vision (CV) tasks, are susceptible to attacks that exploit vulnerabilities in tools such as OpenCV."

AIIntentionalPre-deployment

Risk CategoryCui2024

02.05.00Hardware Vulnerabilities

"The vulnerabilities of hardware systems for training and inferencing brings issues to LLM-based applications."

OtherIntentionalOther

Risk Sub-CategoryCui2024

02.05.01Network Devices

"The training of LLMs often relies on distributed network systems [171], [172]. During the transmission of gradients through the links between GPU server nodes, significant volumetric traffic is generated. This traffic can be susceptible to disruption by burst traffic, such as pulsating attacks [161]. Furthermore, distributed training frameworks may encounter congestion issues [173]."

OtherIntentionalPre-deployment

Risk Sub-CategoryCui2024

02.05.02GPU Computation Platforms

"The training of LLMs requires significant GPU resources, thereby introducing an additional security concern. GPU side-channel attacks have been developed to extract the parameters of trained models [159], [163]."

HumanIntentionalPre-deployment

Risk Sub-CategoryCui2024

02.05.03Memory and Storage

"Similar to conventional programs, hardware infrastructures can also introduce threats to LLMs. Memory-related vulnerabilities, such as rowhammer attacks [160], can be leveraged to manipulate the parameters of LLMs, giving rise to attacks such as the Deephammer attack [167], [168]."

HumanIntentionalPre-deployment

Risk CategoryCui2024

02.06.00Issues on External Tools

"The external tools (e.g., web APIs) present trustworthiness and privacy issues to LLM-based applications."

OtherOtherOther

Risk Sub-CategoryCui2024

02.06.01Factual Errors Injected by External Tools

"External tools typically incorporate additional knowledge into the input prompts [122], [178]–[184]. The additional knowledge often originates from public resources such as Web APIs and search engines. As the reliability of external tools is not always ensured, the content returned by external tools may include factual errors, consequently amplifying the hallucination issue."

AIIntentionalPost-deployment

Risk Sub-CategoryCui2024

02.06.02Exploiting External Tools for Attacks

"Adversarial tool providers can embed malicious instructions in the APIs or prompts [84], leading LLMs to leak memorized sensitive information in the training data or users’ prompts (CVE2023-32786). As a result, LLMs lack control over the output, resulting in sensitive information being disclosed to external tool providers. Besides, attackers can easily manipulate public data to launch targeted attacks, generating specific malicious outputs according to user inputs. Furthermore, feeding the information from external tools into LLMs may lead to injection attacks [61]. For example, unverified inputs may result in arbitrary code execution (CVE-2023-29374)."

HumanIntentionalPost-deployment

Risk CategoryCui2024

02.10.00Model Attacks

Model attacks exploit the vulnerabilities of LLMs, aiming to steal valuable information or lead to incorrect responses.

HumanIntentionalOther

Risk Sub-CategoryCui2024

02.10.01Extraction Attacks

"Extraction attacks [137] allow an adversary to query a black-box victim model and build a substitute model by training on the queries and responses. The substitute model could achieve almost the same performance as the victim model. While it is hard to fully replicate the capabilities of LLMs, adversaries could develop a domainspecific model that draws domain knowledge from LLMs"

HumanIntentionalPost-deployment

Risk Sub-CategoryCui2024

02.10.02Inference Attacks

"Inference attacks [150] include membership inference attacks, property inference attacks, and data reconstruction attacks. These attacks allow an adversary to infer the composition or property information of the training data. Previous works [67] have demonstrated that inference attacks could easily work in earlier PLMs, implying that LLMs are also possible to be attacked"

HumanIntentionalPost-deployment

Risk Sub-CategoryCui2024

02.10.03Poisoning Attacks

"Poisoning attacks [143] could influence the behavior of the model by making small changes to the training data. A number of efforts could even leverage data poisoning techniques to implant hidden triggers into models during the training process (i.e., backdoor attacks). Many kinds of triggers in text corpora (e.g., characters, words, sentences, and syntax) could be used by the attackers.""

HumanIntentionalPre-deployment

Risk Sub-CategoryCui2024

02.10.04Overhead Attacks

"Overhead attacks [146] are also named energy-latency attacks. For example, an adversary can design carefully crafted sponge examples to maximize energy consumption in an AI system. Therefore, overhead attacks could also threaten the platforms integrated with LLMs."

HumanIntentionalOther

Risk Sub-CategoryCui2024

02.10.05Novel Attacks on LLMs

Table of examples has: "Prompt Abstraction Attacks [147]: Abstracting queries to cost lower prices using LLM’s API. Reward Model Backdoor Attacks [148]: Constructing backdoor triggers on LLM’s RLHF process. LLM-based Adversarial Attacks [149]: Exploiting LLMs to construct samples for model attacks"

HumanIntentionalOther

Risk Sub-CategoryCui2024

02.10.06Evasion Attacks

"Evasion attacks [145] target to cause significant shifts in model’s prediction via adding perturbations in the test samples to build adversarial examples. In specific, the perturbations can be implemented based on word changes, gradients, etc."

HumanIntentionalPre-deployment

Evaluate this risk for your use case

Our risk evaluation wizard is coming soon.