ArtELingo, a multilingual dataset and benchmark of WikiArt with captions & emotions. Credit: arXiv (2022). DOI: 10.48550/arxiv.2211.10780

King Abdullah University of Science and Technology (KAUST) has shown for the first time with a large-scale dataset how emotions may vary in response to visual stimuli over several languages and cultures.

ArtELingo is a multilingual dataset with emotional explanations constructed from exposure to 80,000 (visual artworks) in multiple languages. Its current version, accepted to the dataset, includes more than 420,000 of these annotations for each of the English, Chinese, and Arabic languages. A small version was also collected in Spanish on over 1000 artworks from Latin America and Latin Europe to explore how two speaking the same language may vary in terms of the constructed emotions.

The ArtELingo paper was presented at The 2022 Conference on Empirical Methods in Natural Language Processing (EMNLP).

ArtELingo is a step towards creating culturally diverse datasets that represent both western and non-western cultures well. This will particularly help those looking for data when studying cultural and cross-cultural psychology. Overall, this research helps advance building more human-compatible AI aware of our emotional and cultural beings.

Credit: King Abdullah University of Science and Technology

This project has been developed by Youssef Mohamed, Mohamed Abdelfattah, Shyma Alhuwaider, Feifan Li, and Mohamed Elhoseiny (PI) from KAUST, and collaborators from the University of Notre Dame (Xiangliang Zhang) and Kenneth Ward Church (NorthEastern university).

The authors wish to thank Baidu, Beijing time for their support in collecting the Chinese version of the dataset, and tens of universities from Egypt (mainly) and Saudi Arabia for collecting the Arabic version.

The work is published on the arXiv preprint server.

More information: Youssef Mohamed et al, ArtELingo: A Million Emotion Annotations of WikiArt with Emphasis on Diversity over Language and Culture, arXiv (2022). DOI: 10.48550/arxiv.2211.10780

Code repository: github.com/Vision-CAIR/artelingo

Journal information: arXiv