Study identifies a universal property for efficient communication
Words categorize the semantic fields they refer to in ways that maximize communication accuracy while minimizing complexity. Recent studies have shown that human languages are optimally balanced between accuracy and complexity. For example, many languages have a word that denotes the color red, but no language has individual words to distinguish ten different shades of the color. These additional words would complicate the vocabulary and rarely would they be useful to achieve precise communication.
A study published on 23 March in the journal Proceedings of the National Academy of Sciences analyzed how artificial neural networks develop spontaneous systems to name colors. A study by Marco Baroni, ICREA research professor at the UPF Department of Translation and Language Sciences (DTCL), conducted with members of Facebook AI Research (France).
For this study, the researchers formed two artificial neural networks trained with two generic deep learning methods. As Baroni explains: "we made the networks play a color-naming game in which they had to communicate about color chips from a continuous color space. We did not limit the "language" they could use, however, when they learned to play the game successfully, we observed the color-naming terms these artificial neural networks had developed spontaneously."
The authors found that the emerging color vocabulary has exactly the same property of optimizing the complexity/accuracy trade-off found in human languages. Furthermore, this result is only maintained while the systems communicate via a discrete channel: when they are allowed to use continuous signals (such as whistles or non-linguistic hand gestures), their language loses efficiency.
From the point of view of cognitive science, the results suggest that optimal trade-offs between complexity and accuracy may be a universal property that arises in discrete communication systems, not related to specific features of human biology. Baroni adds: "the results show that modern AI systems naturally adopt similar behaviors to humans, which is nonetheless surprising."
This suggests that an efficient categorization of colors (and possibly other semantic domains) in natural languages is not dependent on specific human biological constraints, but is a general property of discrete communication systems.
More information: Rahma Chaabouni et al. Communicating artificial neural networks develop efficient color-naming systems. PNAS. (2021) doi.org/10.1073/pnas.2016569118