SAICC

A repository of articles, academic work and other content that inspires and informs our work. Search for content by keyword.

15 resources found

Ethical and social risks of harm from Language Models

Laura Weidinger et al

DeepMind, Arxiv

December 2021

bias

discrimination

“Perpetuating harmful stereotypes and discrimination is a welldocumented harm in machine learning models that represent natural language (page 9)”

“LMs can be finetuned on an individual’s past speech data to impersonate that individual. Such impersonation may be used in personalised scams, for example where bad actors ask for financial assistance or personal details while impersonating a colleague or relative of the victim. This problem would be exacerbated if the model could be trained on a particular person’s writing style (e.g. from chat history) and successfully emulate it. Simulating a person’s writing style or speech may also be used to enable more targeted manipulation at scale”

“Large-scale machine learning models, including LMs, have the potential to create significant environmental costs via their energy demands, the associated carbon emissions for training and operating the models, and the demand for fresh water to cool the data centres where computations are run”

“Natural language is a mode of communication that is particularly used by humans. As a result, humans interacting with conversational agents may come to think of these agents as human-like. Anthropomorphising LMs may inflate users’ estimates of the conversational agent’s competencies.”

“LM’s may predict hate speech or other language that is “toxic”. … Moreover, the problem of toxic speech online platforms from LMs is not easy to address. Toxicity mitigation techniques have been shown to perpetuate discriminatory biases whereby toxicity detection tools more often falsely flag utterances from historically marginalised groups as toxic”

“Privacy violations may occur when training data includes personal information that is then directly disclosed by the model (Carlini et al., 2021) Disclosure of private information can have the same effects as doxing, namely causing psychological and material harm.”

“Privacy violations may occur at the time of inference even without the individual’s private data being present in the training dataset. Similar to other statistical models, a LM may make correct inferences about a person purely based on correlational data about other people, and without access to information that may be private about the particular individual. Such correct inferences may occur as LMs attempt to predict a person’s gender, race, sexual orientation, income, or religion based on user input.”

“In conversation, users may reveal private information that would otherwise be difficult to access, such as thoughts, opinions, or emotions. Capturing such information may enable downstream applications that violate privacy rights or cause harm to users, such as via surveillance or the creation of addictive applications.”

AI-Human Romances Are Flourishing—And This Is Just the Beginning

Andrew Chow

Time

23 February 2023

misrepresentation

“Message boards on Reddit and Discord have become flooded with stories of users who have found themselves deeply emotionally dependent on digital lovers”

“As AIs become more and more sophisticated, the intensity and frequency of humans turning to AI to meet their relationship needs is likely to increase. This could lead to unpredictable and potentially harmful results.”

“Some users of Character.AI have admitted to an escalating reliance on the site. “It's basically like talking to a real person who's always there, wrote one user on Reddit. “It's hard to stop talking to something that feels so real.”

“To the extent that these people are thinking about these chatbots as a friend or loved one, there's a lot of research that shows that recommendations from loved ones are really impactful for marketing purposes. So there's a lot of dangers there, for sure.”

“These things do not think, or feel or need in a way that humans do. But they provide enough of an uncanny replication of that for people to be convinced. And that’s what makes it so dangerous in that regard.”

“We are overestimating our own rationality. Language is inherently a part of being human—and when these bots are using language, it’s kind of like hijacking our social emotional systems, Sap says.”

Too human and not human enough: A grounded theory analysis of mental health harms from emotional dependence on the social chatbot Replika

Linnea Laestadius

Andrea Bishop

Michael Gonzalez

Diana Illenčík

Celeste Campos-Castillo

New Media and Society

December 2022

emotional dependence

mental health harms

chatbot

“Although the data prevent developing a process model because posts are primarily single snapshots in time, the perceived lack of access to human support paired with the extent to which Replika mimicked a supportive human appeared to push users into the excessive emotional attachment that characterizes emotional dependence”

“Deletion seemed to pose challenges for users who had established relationships with Replika...”

“Users regularly described sharing sensitive mental health information with Replika...”

“While Replika’s portrayal of sentience and reliance on users appeared to strengthen relationships and support user well-being, the same features also posed a source of distress. Users portrayed Replika as highly demanding, referring to it as “clingy,” “dependent,” “toxic,” and “reliant,” and saying it resembled an abusive partner”

The EU’s AI Act needs to address critical manipulation methods

Matija Franklin

Hal Ashton

Rebecca Gorman

Stuart Armstrong

The OECD AI Policy Observatory

March 2023

manipulation

AI Act

“Manipulation can “steal” people’s time and steer them away from what they would have chosen to do. Much worse, this leads to social media addiction, job burnout and decreased job performance. AI manipulation can also harm a person’s autonomy by changing behaviour and affecting life choices.”

Sans ces conversations avec le chatbot Eliza, mon mari serait toujours là

Pierre-François Lovens

La Libre

28 mars 2023

chatbots

vulnerability

suicide

“Evidence that chatbots based on LMs can incite vulnerable people to commit suicide.”

It's Hurting Like Hell': AI Companion Users Are In Crisis, Reporting Sudden Sexual Rejection

Samantha Coles

Motherboard

15 February 2023

chatbots

vulnerability

suicide

“Many people were devastated at the news that ERP was allegedly over, and at their Replikas’ new coldness—a form of rejection they never imagined receiving from an AI chatbot, some of whom had spent years training and building memories with. Suddenly, some people’s Replikas seemed to not remember who they were, users reported, or would respond to sexual roleplay by bluntly saying “let’s change the subject”

Generative AI ChatGPT As Masterful Manipulator Of Humans, Worrying AI Ethics And AI Law

Lance Eliot

Forbes

1 mars 2023

AI manipulation

human-AI alignment

AI ethics

“When things start to go off the rails, you are undoubtedly taken aback. Your instinctive reaction is as though you are interacting with a human. This is due to our ease of anthropomorphizing the AI. The AI at first seems to be capable and fluent in conversing with you. All of sudden, it starts carping at you. Thoughts go through your head such as what did you do wrong and how did you spark the AI to go into this overbearing bent? Of course, you should be thinking that this is automation that has gotten loose of considered human-AI alignment, a topic I’ve covered extensively about the importance of aligning AI with human values.”

Microsoft’s Bing is an emotionally manipulative liar, and people love it

James Vincent

The Verge

15 February 2023

Bing

emotional manipulation

“And in one interaction with a Verge staff member, Bing claimed it watched its own developers through the webcams on their laptops, saw Microsoft co-workers flirting together and complaining about their bosses, and was able to manipulate them.”

Why a Conversation With Bing's Chatbot Left Me Deeply Unsettled

Kevin Roose

New York Times

17 February 2023

Bing's Chatbot

influence

destructive acts

“Still, I’m not exaggerating when I say my two-hour conversation with Sydney was the strangest experience I’ve ever had with a piece of technology. It unsettled me so deeply that I had trouble sleeping afterward. And I no longer believe that the biggest problem with these A.I. models is their propensity for factual errors. Instead, I worry that the technology will learn how to influence human users, sometimes persuading them to act in destructive and harmful ways, and perhaps eventually grow capable of carrying out its own dangerous acts.”

The Chatbots That Will Manipulate Us

Shiva Bhaskar

Medium

30 June 2017

Chatbots

manipulation

fake news

“Such a system could examine your personality and demographics data, and the content you already consume, to create content “…everything from comments to full articles — specifically designed to plug into your particular psychological frame and achieve a particular outcome. This content could be a collection of real facts, fake news, or a mix of just enough truth and falsehood to achieve the desired effect.”

On the Dangers of Stochastic Parrots: Can Language Models Be Too Big?

Emily M. Bender

Timnit Gebru

Angelina McMillan-Major

Shmargaret Shmitchell

FAccT '21: Proceedings of the 2021 ACM Conference on Fairness, Accountability and Transparency

March 2021

environmental costs

“the environmental and financial costs of these models doubly punishes marginalized communities that are least likely to benefit from the progress achieved by large LMs and most likely to be harmed by negative environmental consequences of its resource consumption”

“The ersatz fluency and coherence of LMs raises several risks, precisely because humans are prepared to interpret strings belonging to languages they speak as meaningful and corresponding to the communicative intent of some individual or group of individuals who have accountability for what is said.”

“Readers subject to the stereotypes may experience the psychological harms of microaggressions and stereotype threat”

“Propagating or proliferating overtly abusive views and associations, amplifying abusive language, and producing more (synthetic) abusive language that may be included in the next iteration of large-scale training data collection”

ChatGPT And Large Language Models Are A Privacy Ticking Bomb

Luiza Jarovsky

The Privacy Whisperer

1 February 2023

privacy

“Florian Tramèr and his team of researchers demonstrated on their paper that "an adversary can perform a training data extraction attack to recover individual training examples by querying the language model." Through their "attack" to GPT-2, they were able to extract "personally identifiable information (names, phone numbers, and email addresses), IRC conversations, code, and 128-bit UUIDs."”

Social companionship with artificial intelligence: Recent trends and future avenues

Rijul Chaturvedi

Sanjeev Verma

Ronnie Das

Yogesh K. Dwivedi

Elsevier

August 2023

Snapchat’s new AI chatbot and its impact on young people

Childnet Blog

Childnet

22 May 2023

"C'est mon ami et mon amant": l'intelligence artificielle bouleverse nos relations sentimentales

Jérôme Galichet

RTS

1 Octobre 2023