February 26, 2019 feature

A new study explores humor in word embeddings

by Ingrid Fadelli , Tech Xplore

Advances in the field of AI have enabled the development of tools that can understand a variety of languages and communicate with humans. However, there are still aspects of human communication that AI systems struggle with, one of which is humor.

A team of researchers at the University of Oxford, Microsoft Research and TRASH have recently carried out a study investigating humor in word embeddings. Word embeddings are a popular AI tool that can associate words with Euclidean vectors.

"We were interested in studying how computers might understand humor," Adam Kalai, Microsoft researcher who carried out the study, told TechXplore. "While AI is quite powerful and can even translate from one language to another, AI has failed to understand humor. We decided to test if AI could understand humor at the level of an individual word, since many people find some words like 'nincompoop' a little funny."

In their study, Kalai and his colleagues considered six main features of word humor, drawing inspiration from existing theories and academic discussions of humor. These features include: humorous sounds (regardless of meaning), juxtapositions/unexpected incongruity, sexual connotations, scatological connotations, insulting words and colloquial words.

The researchers investigated the extent to which these features correlate with humor and how well a word2vec embedding pre-trained on a corpus from Google News, called GNEWS, could capture each of these. One dataset used in their study was the Engelthaler-Hill (EH) dataset, which consists of mean humor ratings for 4,997 words, each of which was rated on a scale of one to five (by approximately 35 human raters).

To better understand the differences in people's perception of funny words, the researchers also collected a smaller original dataset of highly humorous words, recruiting English-speaking people to label these words via Amazon's Mechanical Turk platform. They carried out a series of humor rating studies, asking participants to select the words that they found more humorous, as well as to annotate words with the relevant humor theories for each.

"We asked multiple people to rate which words they found the most humorous among English words," Kalai explained. "We designed a study where people identified the words they found funniest with minimal effort (fewest clicks)."

Subsequently, the researchers investigated how the features of humor that they had initially identified correlated with the humor ratings in their dataset, to determine the effectiveness of theoretical constructs in capturing ratings given by humans. In addition, they tested the predictability of these ratings using word embeddings, exploring the extent to which AI could understand humor.

"We found that AI could understand why people found some words funnier than others, and AI could even understand the differences between senses of humor," Kalai said. "AI still doesn't understand humor in sentences or longer texts, but we hope our work is a starting point."

Kalai and his colleagues found that word embeddings effectively captured aspects of word humor as rated on the EH dataset, as well as differences in humor ratings from their new dataset. Their findings further suggest that people's sense of humor could be embedded using a handful of ratings and that the resulting embeddings could be used to predict humor ratings for previously unrated words.

"Our conclusions show an interesting application of word embeddings and pave the way toward exploiting those to do more AI humor work, such as generating or predicting humorous words matching individual senses of humor, and in aggregate," Limor Gultchin, a researcher at the University of Oxford involved in the study, told TechXplore. "At the same time, we also provide further validation to intuitive notions of humor, and knowledge gathered in other fields, such as psychology or philosophy."

The study carried out by Kalai, Gultchin and their colleagues shows that word embeddings could enhance our understanding of humor in a variety of ways. Firstly, they found that established theories of humor (e.g. the superiority theory, incongruity theory, etc.) are represented in word embeddings to varying degrees and can thus be used to identify or predict humor, captured by human ratings.

Using vector representations of words, the researchers were also able to define an individual sense of humor as an averaged vector, using these vectors to predict different people's senses of humor (i.e. the humor ratings they would give to certain words). Finally, clustering senses of humor allowed them to identify clusters of humor, such as 'female humor,' 'male humor,' 'older humor,' etc.

This is an important finding, as it validates the idea that different groups of people have different senses of humor. For instance, they observed that sexual words (e.g. 'poppycock') were funnier to men than to women, while women reacted more to 'funny-sounding' words (e.g. 'gobbledegook').

"At the age of prevalent AI systems, such as recommender systems or automated assistants, humor would likely prove to be important in facilitating a smoother, more seamless interaction between users and automated systems," Gultchin said. "We hope that this work will help as a proof of concept showing that existing NLP tools can already help us achieve that goal."

Kalain, Gultchin and their colleagues will make the new datasets used in their study publicly available, so that other researchers can use them in their studies. They feel that enhancing AI systems' understanding of word humor could open up several interesting possibilities, for instance leading to the development of tools to assist comedians or improving interactions between machines and human beings.

"We are still at the process of seeing how this work will be accepted, but multiple future directions exist," Gultchin said. "It would be really interesting to see if the concepts laid here could indeed be used in an interactive system which produces 'funny' modifications to sentences based on an individual's sense of humor, as represented using word embeddings. Another interesting direction is to see whether we can eventually learn to predict and generate full humorous sentences or, with recent developments, full humorous paragraphs."

More information: Humor in word embedings: cockamamie gobbledegook for nincompoops. arXiv:1902.02783 [cs.CL]. arxiv.org/abs/1902.02783

Citation: A new study explores humor in word embeddings (2019, February 26) retrieved 4 May 2024 from https://techxplore.com/news/2019-02-explores-humor-word-embeddings.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.

Explore further

Relationship success tied not to joking but shared sense of humor, researcher says

218 shares

Feedback to editors

Refined AI approach improves noninvasive brain-computer interface performance

May 3, 2024

SK Hynix says high-end AI memory chips almost sold out through 2025

May 3, 2024

Stretchable e-skin could give robots human-level touch sensitivity

May 2, 2024

Leveraging robots to help make wind turbine blades

May 2, 2024

Beware of AI-based deception detection, warns scientific community

May 2, 2024

Cost-effective, high-capacity and cyclable lithium-ion battery cathodes

May 2, 2024

New AI tool efficiently detects asbestos in roofs so it can be removed

May 2, 2024

New memory transistor integrates photocrosslinker into molecular switches to adjust its threshold voltage

May 2, 2024

Researchers find use of olivine in cement production could result in carbon negative concrete

May 2, 2024

Researchers create massive open dataset to advance AI solutions for carbon capture

May 2, 2024

Load comments (2)

A new study explores humor in word embeddings

Refined AI approach improves noninvasive brain-computer interface performance

SK Hynix says high-end AI memory chips almost sold out through 2025

Stretchable e-skin could give robots human-level touch sensitivity

Leveraging robots to help make wind turbine blades

Beware of AI-based deception detection, warns scientific community

Cost-effective, high-capacity and cyclable lithium-ion battery cathodes

New AI tool efficiently detects asbestos in roofs so it can be removed

New memory transistor integrates photocrosslinker into molecular switches to adjust its threshold voltage

Researchers find use of olivine in cement production could result in carbon negative concrete

Researchers create massive open dataset to advance AI solutions for carbon capture

Relationship success tied not to joking but shared sense of humor, researcher says

When workplace relationships are good, both positive, negative humor by leaders can improve employees' job satisfaction

Humorous complaining: Funny online reviews get lots of attention but do they get results?

A new study shows an increase in humorous creativity when individuals are primed with thoughts of death

Study explores why humor is important in romantic attraction

Can quantum theory explain why jokes are funny?

Refined AI approach improves noninvasive brain-computer interface performance

Beware of AI-based deception detection, warns scientific community

Random robots are more reliable: New AI algorithm for robots consistently outperforms state-of-the-art systems

Researchers create massive open dataset to advance AI solutions for carbon capture

New AI tool efficiently detects asbestos in roofs so it can be removed

Natural language boosts LLM performance in coding, planning and robotics

Phys.org

Medical Xpress

Science X

A new study explores humor in word embeddings

Refined AI approach improves noninvasive brain-computer interface performance

SK Hynix says high-end AI memory chips almost sold out through 2025

Stretchable e-skin could give robots human-level touch sensitivity

Leveraging robots to help make wind turbine blades

Beware of AI-based deception detection, warns scientific community

Cost-effective, high-capacity and cyclable lithium-ion battery cathodes

New AI tool efficiently detects asbestos in roofs so it can be removed

New memory transistor integrates photocrosslinker into molecular switches to adjust its threshold voltage

Researchers find use of olivine in cement production could result in carbon negative concrete

Researchers create massive open dataset to advance AI solutions for carbon capture

Related Stories

Relationship success tied not to joking but shared sense of humor, researcher says

When workplace relationships are good, both positive, negative humor by leaders can improve employees' job satisfaction

Humorous complaining: Funny online reviews get lots of attention but do they get results?

A new study shows an increase in humorous creativity when individuals are primed with thoughts of death

Study explores why humor is important in romantic attraction

Can quantum theory explain why jokes are funny?

Recommended for you

Refined AI approach improves noninvasive brain-computer interface performance

Beware of AI-based deception detection, warns scientific community

Random robots are more reliable: New AI algorithm for robots consistently outperforms state-of-the-art systems

Researchers create massive open dataset to advance AI solutions for carbon capture

New AI tool efficiently detects asbestos in roofs so it can be removed

Natural language boosts LLM performance in coding, planning and robotics

Your Privacy