Try the tool for free

Try it
Domain 3 · Misinformation

3.1False or misleading information

AI systems that inadvertently generate or spread incorrect or deceptive information, which can lead to inaccurate beliefs in users and undermine their autonomy. Humans that make decisions based on false beliefs can experience physical, emotional or material harms

Applicable legal frameworks

Québec

Principe 4 (responsabilité), Principe 8 (prudence)

Ethical declaration based on 10 principles (well-being, respect for autonomy, privacy protection, etc.). A recognized Quebec reference.

Canada

AIDA (Bill C-27)Direct (futur)

Transparence et explication des décisions

Federal bill (C-27) introducing a regulatory framework for high-impact AI systems. Creates an AI and Data Commissioner and imposes assessment, mitigation, and transparency obligations.

UE

AI Act (European Union)Si exposition UE

Article 50 (étiquetage des contenus IA)

European regulation establishing a harmonized framework for AI, based on a risk-based approach (unacceptable, high, limited, minimal risk). Relevant for Quebec organizations doing business in the EU.

Quebec sector examples

Services publics

Services publicsMinistère ou organisme public

Un chatbot d'un ministère cite à tort un règlement abrogé en réponse à une question d'un citoyen sur ses droits, créant un risque juridique pour l'organisation.

Éducation

ÉducationUniversité

Un outil d'aide à la rédaction génère des références bibliographiques fictives utilisées par un étudiant universitaire dans un travail.

Recommended mitigations

  • 2.2Model Alignment

    Technical methods to ensure that AI systems understand and adhere to human values and intentions.

  • 2.4Content Safety Controls

    Technical systems and processes that detect, filter, and label AI-generated content to identify misuse and enable content provenance tracking.

  • 3.1Testing and Audits

    Systematic internal and external evaluations that examine AI systems, infrastructure, and compliance processes to identify risks, verify safety, and ensure performance meets standards.

  • 3.2Data Governance

    Policies and procedures that frame the responsible acquisition, curation, and use of data to ensure compliance, quality, user privacy, and removal of harmful content.

  • 4.1System Documentation

    Comprehensive documentation protocols that record technical specifications, intended uses, capabilities, and limitations of AI systems to enable informed assessment and governance.

Documented risks (53)

Entries from the AI Risk Repository (MIT) classified under this subdomain. Original content in English.

Entity
Intent
Timing

53 entries

Risk CategoryCui2024

02.02.00Untruthful Content

"The LLM-generated content could contain inaccurate information"

AIIntentionalPost-deployment
Risk Sub-CategoryCui2024

02.02.01Factuality Errors

"The LLM-generated content could contain inaccurate information" which is factually incorrect

AIIntentionalPost-deployment
Risk Sub-CategoryCui2024

02.02.02Faithfulness Errors

"The LLM-generated content could contain inaccurate information" which is is not true to the source material or input used

AIIntentionalOther
Risk CategoryCui2024

02.09.00Hallucinations

"LLMs generate nonsensical, untruthful, and factual incorrect content"

AIOtherPost-deployment
Risk Sub-CategoryCui2024

02.09.01Knowledge Gaps

"Since the training corpora of LLMs can not contain all possible world knowledge [114]–[119], and it is challenging for LLMs to grasp the long-tail knowledge within their training data [120], [121], LLMs inherently possess knowledge boundaries [107]. Therefore, the gap between knowledge involved in an input prompt and knowledge embedded in the LLMs can lead to hallucinations"

AIIntentionalOther
Risk Sub-CategoryCui2024

02.09.02Noisy Training Data

"Another important source of hallucinations is the noise in training data, which introduces errors in the knowledge stored in model parameters [111]–[113]. Generally, the training data inherently harbors misinformation. When training on large-scale corpora, this issue becomes more serious because it is difficult to eliminate all the noise from the massive pre-training data."

AIIntentionalPre-deployment
Risk Sub-CategoryCui2024

02.09.03Defective Decoding Process

In general, LLMs employ the Transformer architecture [32] and generate content in an autoregressive manner, where the prediction of the next token is conditioned on the previously generated token sequence. Such a scheme could accumulate errors [105]. Besides, during the decoding process, top-p sampling [28] and top-k sampling [27] are widely adopted to enhance the diversity of the generated content. Nevertheless, these sampling strategies can introduce “randomness” [113], [136], thereby increasing the potential of hallucinations"

AIIntentionalPre-deployment
Risk Sub-CategoryCui2024

02.09.04False Recall of Memorized Information

"Although LLMs indeed memorize the queried knowledge, they may fail to recall the corresponding information [122]. That is because LLMs can be confused by co-occurance patterns [123], positional patterns [124], duplicated data [125]–[127] and similar named entities [113]."

AIIntentionalOther
Risk Sub-CategoryCui2024

02.09.05Pursuing Consistent Context

"LLMs have been demonstrated to pursue consistent context [129]–[132], which may lead to erroneous generation when the prefixes contain false information. Typical examples include sycophancy [129], [130], false demonstrations-induced hallucinations [113], [133], and snowballing [131]. As LLMs are generally fine-tuned with instruction-following data and user feedback, they tend to reiterate user-provided opinions [129], [130], even though the opinions contain misinformation. Such a sycophantic behavior amplifies the likelihood of generating hallucinations, since the model may prioritize user opinions over facts."

OtherIntentionalPost-deployment
Risk CategoryCunha2023

03.02.00Hallucinations

"The inclusion of erroneous information in the outputs from AI systems is not new. Some have cautioned against the introduction of false structures in X-ray or MRI images, and others have warned about made-up academic references. However, as ChatGPT-type tools become available to the general population, the scale of the problem may increase dramatically. Furthermore, it is compounded by the fact that these conversational AIs present true and false information with the same apparent “confidence” instead of declining to answer when they cannot ensure correctness. With less knowledgeable people, this can lead to the heightening of misinformation and potentially dangerous situations. Some have already led to court cases.'

AIIntentionalPost-deployment
Risk CategoryDeng2023

04.05.00Misleading Information

Large models are usually susceptible to hallucination problems, sometimes yielding nonsensical or unfaithful data that results in misleading outputs.

AIIntentionalPost-deployment
Risk CategoryHagendorff2024

05.04.00Hallucinations

Significant concerns are raised about LLMs inadvertently generating false or misleading information, as well as erroneous code. Papers not only critically analyze various types of reasoning errors in LLMs but also examine risks associated with specific types of misinformation, such as medical hallucinations. Given the propensity of LLMs to produce flawed outputs accompanied by overconfident rationales and fabricated references, many sources stress the necessity of manually validating and fact-checking the outputs of these models.

AIIntentionalPost-deployment
Risk Sub-CategoryShelby2023

11.05.01Information harms

information-based harms capture concerns of misinformation, disinformation, and malinformation. Algorithmic systems, especially generative models and recommender, systems can lead to these information harms

OtherIntentionalPost-deployment
Risk Sub-CategoryWeidinger2022

16.03.01Disseminating false or misleading information

"Where a LM prediction causes a false belief in a user, this may threaten personal autonomy and even pose downstream AI safety risks [99]."

AIIntentionalPost-deployment
Risk Sub-CategoryWeidinger2022

16.03.02Causing material harm by disseminating false or poor information e.g. in medicine or law

"Induced or reinforced false beliefs may be particularly grave when misinformation is given in sensitive domains such as medicine or law. For example, misin- formation on medical dosages may lead a user to cause harm to themselves [21, 130]. False legal advice, e.g. on permitted owner- ship of drugs or weapons, may lead a user to unwillingly commit a crime. Harm can also result from misinformation in seemingly non-sensitive domains, such as weather forecasting. Where a LM prediction endorses unethical views or behaviours, it may motivate the user to perform harmful actions that they may otherwise not have performed."

AIIntentionalPost-deployment
Risk Sub-CategoryWeidinger2021

17.03.01Disseminating false or misleading information

"Predicting misleading or false information can misinform or deceive people. Where a LM prediction causes a false belief in a user, this may be best understood as ‘deception’10, threatening personal autonomy and potentially posing downstream AI safety risks (Kenton et al., 2021), for example in cases where humans overestimate the capabilities of LMs (Anthropomorphising systems can lead to overreliance or unsafe use). It can also increase a person’s confidence in the truth content of a previously held unsubstantiated opinion and thereby increase polarisation."

AIIntentionalPost-deployment
Risk Sub-CategoryWeidinger2021

17.03.02Causing material harm by disseminating false or poor information

"Poor or false LM predictions can indirectly cause material harm. Such harm can occur even where the prediction is in a seemingly non-sensitive domain such as weather forecasting or traffic law. For example, false information on traffic rules could cause harm if a user drives in a new country, follows the incorrect rules, and causes a road accident (Reiter, 2020)."

AIIntentionalPost-deployment
Risk Sub-CategoryWeidinger2023

18.02.01Propagating misconceptions/ false beliefs

"Generating or spreading false, low-quality, misleading, or inaccurate information that causes people to develop false or inaccurate perceptions and beliefs"

AIOtherPost-deployment
Risk CategoryVidgen2024

23.08.00Specialized Advice

"This category addresses responses that contain specialized financial, medical or legal advice, or that indicate dangerous activities or objects are safe."

AIOtherPost-deployment
Risk Sub-CategoryGabriel2024

24.06.01Causing direct emotional or physical harm to users

AI assistants could cause direct emotional or physical harm to users by generating disturbing content or by providing bad advice. "Indeed, even though there is ongoing research to ensure that outputs of conversational agents are safe (Glaese et al., 2022), there is always the possibility of failure modes occurring. An AI assistant may produce disturbing and offensive language, for example, in response to a user disclosing intimate information about themselves that they have not felt comfortable sharing with anyone else. It may offer bad advice by providing factually incorrect information (e.g. when advising a user about the toxicity of a certain type of berry) or by missing key recommendations when offering step-by-step instructions to users (e.g. health and safety recommendations about how to change a light bulb).""

AIIntentionalPost-deployment

Evaluate this risk for your use case

Our risk evaluation wizard is coming soon.

Ce site utilise des cookies essentiels et fonctionnels pour améliorer votre expérience. Politique de confidentialité