January 7, 2021 feature

A framework to evaluate the cognitive capabilities of machine learning agents

by Ingrid Fadelli, Tech Xplore

Over the past decade or so, computer scientists have developed machine learning (ML) techniques that perform remarkably well on a variety of tasks. While these algorithms are designed to artificially replicate human cognitive skills, there is still a lack of tools to compare their capabilities with those of humans.

With this in mind, two researchers at Savitribai Phule Pune University (SPPU) in India have recently created a framework to perform cognitive evaluations of machine-learning agents. This framework, outlined in a paper published in Elsevier's Cognitive Systems Research journal, draws parallels between human cognition, as described by psychological theories, and machine cognition.

"When I started working on my core research about few shot learning (FSL), my advisor and I contemplated on how humans can easily learn to classify objects visually and why it is so difficult for machines," Suvarna Kadam, one of the researchers who carried out the study, told TechXplore. "Humans can generalize, but machines find it quite challenging. A quick analysis of state-of-the-art FSL methods made us realize that it is not just hard to assess 'how much is learned' with performance metrics, but often, we also have no idea if a machine is truly comprehending the task at hand or merely mimicking."

Once they realized that there is a lack of reliable methods to evaluate the cognition of ML techniques, Kadam and her supervisor Vinay Vaidya started asking themselves fundamental questions about machine cognition and how it could be effectively assessed. Eventually, they decided to devise a structured approach that could help researchers understand how machines acquire new skills and assess how much they have actually learned. The framework they created offers a simple way of thinking about machine cognition, drawing parallels with human cognition.

"We decided to use humanity's collective wisdom about how humans learn and how they measure learning," Kadam explained. "Our framework uses human cognitive theories to provide stepwise guidelines to assess a machine's learning in any domain. It advocates that we list a domain's tasks and check whether they are simple or challenging to implement, which then allows us to arrange tasks in a taxonomy based on their cognitive difficulty."

The framework created by Kadam and Vaidya is designed to prompt reflection about what makes a task harder or easier to tackle than another. Human learning is generally evaluated based on how well a learner did on a specific task. The framework proposed by the researchers can be used to evaluate a machine's task-specific cognition, utilizing a concept referred to as task taxonomy.
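
As a rough illustration of the task taxonomy idea, the sketch below assigns a domain's tasks to difficulty levels and reports an agent's average score per level. It is not taken from the paper: the level names, the example tasks, the scores and the `score_by_level` helper are all hypothetical, chosen only to show how per-level results might reveal more than a single aggregate metric.

```python
from collections import defaultdict
from statistics import mean

# Hypothetical difficulty levels, ordered from easiest to hardest.
LEVELS = ["recall", "apply", "generalize"]

# Hypothetical taxonomy: each task in a domain is assigned one level.
TAXONOMY = {
    "classify_seen_class": "recall",
    "classify_new_view": "apply",
    "classify_unseen_class": "generalize",
    "classify_unseen_domain": "generalize",
}


def score_by_level(task_scores):
    """Average an agent's per-task scores within each difficulty level."""
    grouped = defaultdict(list)
    for task, score in task_scores.items():
        grouped[TAXONOMY[task]].append(score)
    # Report levels in taxonomy order, skipping levels with no scored tasks.
    return {level: mean(grouped[level]) for level in LEVELS if grouped[level]}


# Made-up scores for one agent (fraction of correct answers per task).
agent_scores = {
    "classify_seen_class": 1.0,
    "classify_new_view": 0.75,
    "classify_unseen_class": 0.5,
    "classify_unseen_domain": 0.25,
}

for level, avg in score_by_level(agent_scores).items():
    print(f"{level}: {avg:.3f}")
```

In this made-up example the agent's overall average would hide the fact that its scores drop sharply at the hardest level, which is the kind of gap a difficulty-ordered taxonomy is meant to expose.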

"Since humans are very good at generalizing and quickly adapting to a new task, we also demonstrated how to quantify the generalization potential of machines," Kadam said. "For the first time, our study highlighted the fact that machines are displaying higher intelligence and we must move beyond performance metrics to measure it."

In their recent paper, Kadam and Vaidya used their framework to compare two state-of-the-art ML techniques. The framework could thus also prove useful for other research teams trying to identify the 'best' ML model for a specific task among several options.

In the future, the same framework could also help to better understand the processes behind a machine's predictions or actions. This could ultimately improve the reliability of AI systems, giving developers greater insight into their cognitive capabilities.

"With this framework, we explored how cognition and learning are intertwined, and learning is greatly influenced by cognition," Kadam said. "However, learning is also greatly affected by the skills a learner possesses and attitude she/he carries. It would be really interesting to see if we can extend our work to assess physical and emotional skills of machines. Though emotional skills of machines look distant and unrealized, machines are already being used in close human interactions (e.g., chatbots, robots for caregiving or companionship, etc.), so we feel they should also be tested on their emotional quotient."

More information: Cognitive evaluation of machine learning agents. Cognitive Systems Research (2020). DOI: 10.1016/j.cogsys.2020.11.003.

Citation: A framework to evaluate the cognitive capabilities of machine learning agents (2021, January 7) retrieved 19 April 2024 from https://techxplore.com/news/2021-01-framework-cognitive-capabilities-machine-agents.html

