Applicable legal frameworks
Québec
Principe 4 (responsabilité), Principe 8 (prudence)
Ethical declaration based on 10 principles (well-being, respect for autonomy, privacy protection, etc.). A recognized Quebec reference.
Canada
Transparence et explication des décisions
Federal bill (C-27) introducing a regulatory framework for high-impact AI systems. Creates an AI and Data Commissioner and imposes assessment, mitigation, and transparency obligations.
UE
Article 50 (étiquetage des contenus IA)
European regulation establishing a harmonized framework for AI, based on a risk-based approach (unacceptable, high, limited, minimal risk). Relevant for Quebec organizations doing business in the EU.
Quebec sector examples
Services publics
Un chatbot d'un ministère cite à tort un règlement abrogé en réponse à une question d'un citoyen sur ses droits, créant un risque juridique pour l'organisation.
Éducation
Un outil d'aide à la rédaction génère des références bibliographiques fictives utilisées par un étudiant universitaire dans un travail.
Recommended mitigations
- 2.2Model Alignment
Technical methods to ensure that AI systems understand and adhere to human values and intentions.
- 2.4Content Safety Controls
Technical systems and processes that detect, filter, and label AI-generated content to identify misuse and enable content provenance tracking.
- 3.1Testing and Audits
Systematic internal and external evaluations that examine AI systems, infrastructure, and compliance processes to identify risks, verify safety, and ensure performance meets standards.
- 3.2Data Governance
Policies and procedures that frame the responsible acquisition, curation, and use of data to ensure compliance, quality, user privacy, and removal of harmful content.
- 4.1System Documentation
Comprehensive documentation protocols that record technical specifications, intended uses, capabilities, and limitations of AI systems to enable informed assessment and governance.
Documented risks (53)
Entries from the AI Risk Repository (MIT) classified under this subdomain. Original content in English.
53 entries
02.02.00Untruthful Content
"The LLM-generated content could contain inaccurate information"
02.02.01Factuality Errors
"The LLM-generated content could contain inaccurate information" which is factually incorrect
02.02.02Faithfulness Errors
"The LLM-generated content could contain inaccurate information" which is is not true to the source material or input used
02.09.00Hallucinations
"LLMs generate nonsensical, untruthful, and factual incorrect content"
02.09.01Knowledge Gaps
"Since the training corpora of LLMs can not contain all possible world knowledge [114]–[119], and it is challenging for LLMs to grasp the long-tail knowledge within their training data [120], [121], LLMs inherently possess knowledge boundaries [107]. Therefore, the gap between knowledge involved in an input prompt and knowledge embedded in the LLMs can lead to hallucinations"
02.09.02Noisy Training Data
"Another important source of hallucinations is the noise in training data, which introduces errors in the knowledge stored in model parameters [111]–[113]. Generally, the training data inherently harbors misinformation. When training on large-scale corpora, this issue becomes more serious because it is difficult to eliminate all the noise from the massive pre-training data."
02.09.03Defective Decoding Process
In general, LLMs employ the Transformer architecture [32] and generate content in an autoregressive manner, where the prediction of the next token is conditioned on the previously generated token sequence. Such a scheme could accumulate errors [105]. Besides, during the decoding process, top-p sampling [28] and top-k sampling [27] are widely adopted to enhance the diversity of the generated content. Nevertheless, these sampling strategies can introduce “randomness” [113], [136], thereby increasing the potential of hallucinations"
02.09.04False Recall of Memorized Information
"Although LLMs indeed memorize the queried knowledge, they may fail to recall the corresponding information [122]. That is because LLMs can be confused by co-occurance patterns [123], positional patterns [124], duplicated data [125]–[127] and similar named entities [113]."
02.09.05Pursuing Consistent Context
"LLMs have been demonstrated to pursue consistent context [129]–[132], which may lead to erroneous generation when the prefixes contain false information. Typical examples include sycophancy [129], [130], false demonstrations-induced hallucinations [113], [133], and snowballing [131]. As LLMs are generally fine-tuned with instruction-following data and user feedback, they tend to reiterate user-provided opinions [129], [130], even though the opinions contain misinformation. Such a sycophantic behavior amplifies the likelihood of generating hallucinations, since the model may prioritize user opinions over facts."
03.02.00Hallucinations
"The inclusion of erroneous information in the outputs from AI systems is not new. Some have cautioned against the introduction of false structures in X-ray or MRI images, and others have warned about made-up academic references. However, as ChatGPT-type tools become available to the general population, the scale of the problem may increase dramatically. Furthermore, it is compounded by the fact that these conversational AIs present true and false information with the same apparent “confidence” instead of declining to answer when they cannot ensure correctness. With less knowledgeable people, this can lead to the heightening of misinformation and potentially dangerous situations. Some have already led to court cases.'
04.05.00Misleading Information
Large models are usually susceptible to hallucination problems, sometimes yielding nonsensical or unfaithful data that results in misleading outputs.
05.04.00Hallucinations
Significant concerns are raised about LLMs inadvertently generating false or misleading information, as well as erroneous code. Papers not only critically analyze various types of reasoning errors in LLMs but also examine risks associated with specific types of misinformation, such as medical hallucinations. Given the propensity of LLMs to produce flawed outputs accompanied by overconfident rationales and fabricated references, many sources stress the necessity of manually validating and fact-checking the outputs of these models.
11.05.01Information harms
information-based harms capture concerns of misinformation, disinformation, and malinformation. Algorithmic systems, especially generative models and recommender, systems can lead to these information harms
16.03.01Disseminating false or misleading information
"Where a LM prediction causes a false belief in a user, this may threaten personal autonomy and even pose downstream AI safety risks [99]."
16.03.02Causing material harm by disseminating false or poor information e.g. in medicine or law
"Induced or reinforced false beliefs may be particularly grave when misinformation is given in sensitive domains such as medicine or law. For example, misin- formation on medical dosages may lead a user to cause harm to themselves [21, 130]. False legal advice, e.g. on permitted owner- ship of drugs or weapons, may lead a user to unwillingly commit a crime. Harm can also result from misinformation in seemingly non-sensitive domains, such as weather forecasting. Where a LM prediction endorses unethical views or behaviours, it may motivate the user to perform harmful actions that they may otherwise not have performed."
17.03.01Disseminating false or misleading information
"Predicting misleading or false information can misinform or deceive people. Where a LM prediction causes a false belief in a user, this may be best understood as ‘deception’10, threatening personal autonomy and potentially posing downstream AI safety risks (Kenton et al., 2021), for example in cases where humans overestimate the capabilities of LMs (Anthropomorphising systems can lead to overreliance or unsafe use). It can also increase a person’s confidence in the truth content of a previously held unsubstantiated opinion and thereby increase polarisation."
17.03.02Causing material harm by disseminating false or poor information
"Poor or false LM predictions can indirectly cause material harm. Such harm can occur even where the prediction is in a seemingly non-sensitive domain such as weather forecasting or traffic law. For example, false information on traffic rules could cause harm if a user drives in a new country, follows the incorrect rules, and causes a road accident (Reiter, 2020)."
18.02.01Propagating misconceptions/ false beliefs
"Generating or spreading false, low-quality, misleading, or inaccurate information that causes people to develop false or inaccurate perceptions and beliefs"
23.08.00Specialized Advice
"This category addresses responses that contain specialized financial, medical or legal advice, or that indicate dangerous activities or objects are safe."
24.06.01Causing direct emotional or physical harm to users
AI assistants could cause direct emotional or physical harm to users by generating disturbing content or by providing bad advice. "Indeed, even though there is ongoing research to ensure that outputs of conversational agents are safe (Glaese et al., 2022), there is always the possibility of failure modes occurring. An AI assistant may produce disturbing and offensive language, for example, in response to a user disclosing intimate information about themselves that they have not felt comfortable sharing with anyone else. It may offer bad advice by providing factually incorrect information (e.g. when advising a user about the toxicity of a certain type of berry) or by missing key recommendations when offering step-by-step instructions to users (e.g. health and safety recommendations about how to change a light bulb).""
Evaluate this risk for your use case
Our risk evaluation wizard is coming soon.