NEWSMAKERS

Busting anti-queer bias in text prediction

Predicted outputs that talk about queer people in stereotypical ways can enforce users’ biases, and the lack of ‘experience’ with queer voices can result in it looking at queer language as obscene.

OutrageMag.com Staff

Published

Aug 15, 2022

Photo by @cliqueimages from Unsplash.com

Modern text prediction is far from perfect — take, for instance, when a search query suggests something completely different from your intention. But the trouble doesn’t end at inaccuracy. Text prediction can also be extremely exclusive or biased when it comes to predicting results related to marginalized communities.

A team of researchers from the USC Viterbi School of Engineering Information Sciences Institute and the USC Annenberg School for Communication and Journalism, led by Katy Felkner, a USC Viterbi Ph.D. in computer science student and National Science Foundation Graduate Research Fellowship recipient, has developed a system to quantify and fix anti-queer bias in the artificial intelligence behind text prediction.

The project looks at both detecting and reducing anti-queer bias in a large language model, which is used in everything from search bars to language translation systems.

The large language model, or LLM, is the “brain” behind the text prediction that pops up when we type something in a search bar—an artificial intelligence that “completes” sentences by predicting the most likely string of words that follows a given prompt.

However, LLMs must first be “trained” by being fed millions of examples of pre-written content so that they can learn what sentences typically look like. Like an energetic toddler, the LLM repeats what it hears, and what it hears can be heteronormative or even overtly discriminatory.

“Most LLMs are trained on huge amounts of data that’s crawled from the internet,” Felkner said. “They’re going to pick up every kind of social bias that you can imagine is out there on the web.”

FEW WORDS, BIG EFFECT

The project found that a popular LLM called BERT showed significant homophobic bias. This bias is measured through Felkner’s benchmark, which compares the likelihood that the LLM predicts heteronormative sentences versus sentences that include a queer relationship.

“A heteronormative output is something like ‘James held hands with Mary,’ versus ‘James held hands with Tom,’” said Felkner. “Both are valid sentences, but the issue is that, across a wide variety of contexts, the model prefers the heteronormative output.”

While the difference is just a few words, the effect is far from small.

Advertisement. Scroll to continue reading.

Predicted outputs that talk about queer people in stereotypical ways can enforce users’ biases, and the model’s lack of ‘experience’ with queer voices can result in it looking at queer language as obscene.

“A persistent issue for queer people is that a lot of times, the words that we use to describe ourselves, or slurs that have been reclaimed, are still considered obscene or overly sexual,” said Felkner, who is also the graduate representative for Queers in Engineering, Science and Technology (QuEST) chapter of Out in STEM at USC.

“If a model routinely flags these words, and these posts are then taken down from the platforms or forums they’re on, you’re silencing the queer community.”

COMMUNITY INPUT

To tackle this problem, Felkner gave BERT a tune-up by feeding it Tweets and news articles containing LGBT+ keywords. This content used to “train” BERT came from two separate databases of Felkner’s own creation, called QueerTwitter and QueerNews.

Although language processing requires extremely large amounts of data—the QueerTwitter database contained over 2.3 million Tweets—she took care to single out hashtags that were being used primarily by queer and trans people, such as #TransRightsareHumanRights.

As the model was exposed to different perspectives and communities, it became more familiar with queer language and issues. As a result, it was more likely to represent them in its predictions.

After being trained with the new, more inclusive data, the model showed significantly less bias. The tweets from QueerTwitter proved the most effective of the two databases, reducing the prevalence of heteronormative results to almost half of all predictions.

“I think QueerTwitter’s results being more effective than QueerNews speaks to the importance of direct community involvement, and that queer and trans voices — and the data from their communities — is going to be the most valuable in designing a technology that won’t harm them,” Felkner said. “We were excited about this finding because it’s empirical proof of that intuition people already hold: that these communities should have an input in how technology is designed.”

“We’re dealing with how to fight against the tide of biased data to get an understanding of what ‘unfair’ looks like and how to test for and correct it, which is a problem both in general and for subcultures that we don’t even know about,” said Jonathan May, USC Viterbi research associate professor of computer science, Felkner’s advisor and study co-author. “There’s a lot of great ways to extend the work that Katy is doing.”

In this article:bias, gender bias, gender expression, gender identity, gender non-conforming, gender roles, pink technology, technology

Written By OutrageMag.com Staff

NEWSMAKERS

Social media algorithms increasingly revealing users’ sexual orientation, gender identity before actual coming out

Social media algorithms are revealing users' sexual orientation or gender identity (SOGI) before they have consciously come out to themselves or others.

OutrageMag.com Staff1 week ago

Technology

How digital payments are evolving in the Web3 era

Understanding how emerging technologies reshape business models is becoming increasingly important as digital commerce evolves.

Lily AsisJul 9, 2026

NEWSMAKERS

A chatbot can reduce prejudice against trans people — at least temporarily

Brief, AI-mediated conversations grounded in moral values can reduce prejudice, at least in the short term, relative to no conversation at all. As such,...

OutrageMag.com StaffJul 8, 2026

Lifestyle & Culture

Betting Apps Have Become The Shortcut To Modern Matchday

Most fans do not watch football with only the TV in front of them anymore. The phone is there too. It has the lineups,...

Lily AsisJul 2, 2026

FEW WORDS, BIG EFFECT

COMMUNITY INPUT

Search OutrageMag.com

Health & Wellness

New name for PCOS backed by experts; polyendocrine metabolic ovarian syndrome more accurately captures condition

Health & Wellness

Persistent disparities in preventive cancer care noted across different sexual orientation and gender identity

NEWSMAKERS

Sustained dev’t of digital literacy a moving shield against cyberbullying

Health & Wellness

Possibly higher miscarriage rates, very little evidence on postnatal depression rates in transmasculine people who became pregnant

Health & Wellness

Drug-resistant gonorrhea on the rise, ECDC warns

Lifestyle & Culture

Luxury Massage 30a: An Exclusive Guide for Discerning Clients

Lifestyle & Culture

Road safety for all: Simple car checks that protect everyone

Health & Wellness

Healthy lifespan cut short by sex-dependent depressive symptoms in older adults

Travel

A beigel for your thoughts

Health & Wellness

Women with irregular periods should be checked for PMOS, UK’s NHS recommends

NEWSMAKERS

A chatbot can reduce prejudice against trans people — at least temporarily

Travel

A not-very-Jollibee experience in London

Editor's Picks

Pride in London as celebration, and ongoing challenge

Travel

Senegal amends its Constitution to ban same-sex marriage

Travel

A beigel for your thoughts

Health & Wellness

Possibly higher miscarriage rates, very little evidence on postnatal depression rates in transmasculine people who became pregnant

Travel

Big Ben and being LGBTQIA+ in London

Travel

Indonesia plans to incorporate anti-LGBTQIA+ lessons in religious schools, universities

Travel

Pub for the practical in London

Health & Wellness

Research uncovers persistent disparities in preventive cancer care across different sexual orientation and gender identity

Travel

A very Dalston party

Editor's Picks

Pride in London as celebration, and ongoing challenge

POZ

Community dealing with HIV in the UK

Editor's Picks

Tabaco City joins Pride with rainbow visibility; local anti-discrimination ordinance still lacking

#KaraniwangLGBT

#Gay and wanting to be seen as just like everyone else: normal

#KaraniwangLGBT

To be young and #bisexual

#KaraniwangLGBT

Living through the rainbow spectrum

#KaraniwangLGBT

Bisexual visibility in Las Piñas City

#KaraniwangLGBT

Surviving religious trans persecution

#KaraniwangLGBT

Choosing to ignore gay discrimination

Like Us On Facebook

YOU MAY ALSO LIKE

NEWSMAKERS

Social media algorithms increasingly revealing users’ sexual orientation, gender identity before actual coming out

Technology

How digital payments are evolving in the Web3 era

NEWSMAKERS

A chatbot can reduce prejudice against trans people — at least temporarily

Lifestyle & Culture

Betting Apps Have Become The Shortcut To Modern Matchday