February 26, 2019 feature

A new study explores humor in word embeddings

by Ingrid Fadelli , Tech Xplore

Advances in the field of AI have enabled the development of tools that can understand a variety of languages and communicate with humans. However, there are still aspects of human communication that AI systems struggle with, one of which is humor.

A team of researchers at the University of Oxford, Microsoft Research and TRASH have recently carried out a study investigating humor in word embeddings. Word embeddings are a popular AI tool that can associate words with Euclidean vectors.

"We were interested in studying how computers might understand humor," Adam Kalai, Microsoft researcher who carried out the study, told TechXplore. "While AI is quite powerful and can even translate from one language to another, AI has failed to understand humor. We decided to test if AI could understand humor at the level of an individual word, since many people find some words like 'nincompoop' a little funny."

In their study, Kalai and his colleagues considered six main features of word humor, drawing inspiration from existing theories and academic discussions of humor. These features include: humorous sounds (regardless of meaning), juxtapositions/unexpected incongruity, sexual connotations, scatological connotations, insulting words and colloquial words.

The researchers investigated the extent to which these features correlate with humor and how well a word2vec embedding pre-trained on a corpus from Google News, called GNEWS, could capture each of these. One dataset used in their study was the Engelthaler-Hill (EH) dataset, which consists of mean humor ratings for 4,997 words, each of which was rated on a scale of one to five (by approximately 35 human raters).

To better understand the differences in people's perception of funny words, the researchers also collected a smaller original dataset of highly humorous words, recruiting English-speaking people to label these words via Amazon's Mechanical Turk platform. They carried out a series of humor rating studies, asking participants to select the words that they found more humorous, as well as to annotate words with the relevant humor theories for each.

"We asked multiple people to rate which words they found the most humorous among English words," Kalai explained. "We designed a study where people identified the words they found funniest with minimal effort (fewest clicks)."

Subsequently, the researchers investigated how the features of humor that they had initially identified correlated with the humor ratings in their dataset, to determine the effectiveness of theoretical constructs in capturing ratings given by humans. In addition, they tested the predictability of these ratings using word embeddings, exploring the extent to which AI could understand humor.

"We found that AI could understand why people found some words funnier than others, and AI could even understand the differences between senses of humor," Kalai said. "AI still doesn't understand humor in sentences or longer texts, but we hope our work is a starting point."

Kalai and his colleagues found that word embeddings effectively captured aspects of word humor as rated on the EH dataset, as well as differences in humor ratings from their new dataset. Their findings further suggest that people's sense of humor could be embedded using a handful of ratings and that the resulting embeddings could be used to predict humor ratings for previously unrated words.

"Our conclusions show an interesting application of word embeddings and pave the way toward exploiting those to do more AI humor work, such as generating or predicting humorous words matching individual senses of humor, and in aggregate," Limor Gultchin, a researcher at the University of Oxford involved in the study, told TechXplore. "At the same time, we also provide further validation to intuitive notions of humor, and knowledge gathered in other fields, such as psychology or philosophy."

The study carried out by Kalai, Gultchin and their colleagues shows that word embeddings could enhance our understanding of humor in a variety of ways. Firstly, they found that established theories of humor (e.g. the superiority theory, incongruity theory, etc.) are represented in word embeddings to varying degrees and can thus be used to identify or predict humor, captured by human ratings.

Using vector representations of words, the researchers were also able to define an individual sense of humor as an averaged vector, using these vectors to predict different people's senses of humor (i.e. the humor ratings they would give to certain words). Finally, clustering senses of humor allowed them to identify clusters of humor, such as 'female humor,' 'male humor,' 'older humor,' etc.

This is an important finding, as it validates the idea that different groups of people have different senses of humor. For instance, they observed that sexual words (e.g. 'poppycock') were funnier to men than to women, while women reacted more to 'funny-sounding' words (e.g. 'gobbledegook').

"At the age of prevalent AI systems, such as recommender systems or automated assistants, humor would likely prove to be important in facilitating a smoother, more seamless interaction between users and automated systems," Gultchin said. "We hope that this work will help as a proof of concept showing that existing NLP tools can already help us achieve that goal."

Kalain, Gultchin and their colleagues will make the new datasets used in their study publicly available, so that other researchers can use them in their studies. They feel that enhancing AI systems' understanding of word humor could open up several interesting possibilities, for instance leading to the development of tools to assist comedians or improving interactions between machines and human beings.

"We are still at the process of seeing how this work will be accepted, but multiple future directions exist," Gultchin said. "It would be really interesting to see if the concepts laid here could indeed be used in an interactive system which produces 'funny' modifications to sentences based on an individual's sense of humor, as represented using word embeddings. Another interesting direction is to see whether we can eventually learn to predict and generate full humorous sentences or, with recent developments, full humorous paragraphs."

More information: Humor in word embedings: cockamamie gobbledegook for nincompoops. arXiv:1902.02783 [cs.CL]. arxiv.org/abs/1902.02783

Citation: A new study explores humor in word embeddings (2019, February 26) retrieved 29 June 2024 from https://techxplore.com/news/2019-02-explores-humor-word-embeddings.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.

Explore further

Relationship success tied not to joking but shared sense of humor, researcher says

218 shares

Feedback to editors

Researchers develop novel 3D printing strategy with controllable gradients porous structures

19 hours ago

Researchers develop the fastest possible flow algorithm

22 hours ago

Real-time modeling of 3D temperature distributions within nuclear microreactors to improve safety systems

Jun 28, 2024

Is ChatGPT the key to stopping deepfakes? Study asks LLMs to spot AI-generated images

Jun 27, 2024

Wireless receiver blocks interference for better mobile device performance

Jun 27, 2024

Researchers successfully develop domestic 6G antenna measurement system

Jun 27, 2024

Research shows how common plastics could passively cool and heat buildings with the seasons

Jun 27, 2024

Researchers suggest smart solution to harness waste heat from industry

Jun 27, 2024

Robotic hand with tactile fingertips achieves new dexterity feat

Jun 27, 2024

Help or hindrance? ER robots have potential to aid health care workers

Jun 27, 2024

Load comments (2)

A new study explores humor in word embeddings

Researchers develop novel 3D printing strategy with controllable gradients porous structures

Researchers develop the fastest possible flow algorithm

Real-time modeling of 3D temperature distributions within nuclear microreactors to improve safety systems

Is ChatGPT the key to stopping deepfakes? Study asks LLMs to spot AI-generated images

Wireless receiver blocks interference for better mobile device performance

Researchers successfully develop domestic 6G antenna measurement system

Research shows how common plastics could passively cool and heat buildings with the seasons

Researchers suggest smart solution to harness waste heat from industry

Robotic hand with tactile fingertips achieves new dexterity feat

Help or hindrance? ER robots have potential to aid health care workers

Relationship success tied not to joking but shared sense of humor, researcher says

When workplace relationships are good, both positive, negative humor by leaders can improve employees' job satisfaction

Humorous complaining: Funny online reviews get lots of attention but do they get results?

A new study shows an increase in humorous creativity when individuals are primed with thoughts of death

Study explores why humor is important in romantic attraction

Can quantum theory explain why jokes are funny?

Is ChatGPT the key to stopping deepfakes? Study asks LLMs to spot AI-generated images

Robotic hand with tactile fingertips achieves new dexterity feat

Sony introduces AI for single-instrument accompaniment generation in music production

New work explores optimal circumstances for reaching a common goal with humanoid robots

New tool detects AI-generated videos with 93.7% accuracy

Software engineers develop a way to run AI language models without matrix multiplication

Phys.org

Medical Xpress

Science X

A new study explores humor in word embeddings

Researchers develop novel 3D printing strategy with controllable gradients porous structures

Researchers develop the fastest possible flow algorithm

Real-time modeling of 3D temperature distributions within nuclear microreactors to improve safety systems

Is ChatGPT the key to stopping deepfakes? Study asks LLMs to spot AI-generated images

Wireless receiver blocks interference for better mobile device performance

Researchers successfully develop domestic 6G antenna measurement system

Research shows how common plastics could passively cool and heat buildings with the seasons

Researchers suggest smart solution to harness waste heat from industry

Robotic hand with tactile fingertips achieves new dexterity feat

Help or hindrance? ER robots have potential to aid health care workers

Related Stories

Relationship success tied not to joking but shared sense of humor, researcher says

When workplace relationships are good, both positive, negative humor by leaders can improve employees' job satisfaction

Humorous complaining: Funny online reviews get lots of attention but do they get results?

A new study shows an increase in humorous creativity when individuals are primed with thoughts of death

Study explores why humor is important in romantic attraction

Can quantum theory explain why jokes are funny?

Recommended for you

Is ChatGPT the key to stopping deepfakes? Study asks LLMs to spot AI-generated images

Robotic hand with tactile fingertips achieves new dexterity feat

Sony introduces AI for single-instrument accompaniment generation in music production

New work explores optimal circumstances for reaching a common goal with humanoid robots

New tool detects AI-generated videos with 93.7% accuracy

Software engineers develop a way to run AI language models without matrix multiplication

Your Privacy