March 16, 2018 weblog

Artificial intelligence: ARC test focus goes beyond factoid questions

by Nancy Owano, Tech Xplore

"Common sense" is a phrase everyone hears at one time or another, usually from an angry bystander who thinks you don't have any. What is "common sense"?

"Humans use common sense to fill in the gaps of any question they are posed, delivering answers within an understood but non-explicit context," Swapna Krishna wrote in Engadget.

Give a young child a few years of developmental growth and he or she acquires common sense, but AI still struggles with it. Calling out the challenge in AI research is Dr. Oren Etzioni, researcher and professor, who leads the Allen Institute for Artificial Intelligence, or AI2, in Seattle, Washington.

To get at the fluidity that people have, their natural ability to move from one thing to the next, the programs need what every ten-year-old has in spades, he said, and that is called common sense: a set of facts, heuristics, observations, all the things that we can bring to the table, but the computer does not. "Here at the Allen Institute for Artificial Intelligence, Paul Allen has tasked us with the goal of going after this problem."

They really are. It is now reported that they have come up with a new test as part of their push to imbue AI systems with such an understanding of the world.

The new test is called ARC, which stands for AI2 Reasoning Challenge. The researchers described it in a paper, "Think you have Solved Question Answering? Try ARC, the AI2 Reasoning Challenge," by Peter Clark, Isaac Cowhey, Oren Etzioni, Tushar Khot, Ashish Sabharwal, Carissa Schoenick, and Oyvind Tafjord.

Will Knight in MIT Technology Review explained that the test "will pose elementary-school-level multiple-choice science questions. Each question will require some understanding of how the world works."

The AI2 site said the questions were assembled to encourage research in advanced question-answering.

Knight quoted Gary Marcus, a professor at NYU. "I think this is a great antidote to the kind of superficial benchmarks that have become so common in the field of machine learning," he said. "It should really force AI researchers to up their game."

The authors in the paper said, "Can your model perform better? We pose ARC as a challenge to the community."

Common sense is generally regarded as the holy grail for artificial intelligence.

The authors in their paper wrote that, "Datasets have become highly influential in driving the direction of research. Recent datasets for QA have led to impressive advances, but have focused on factoid questions where surface-level cues alone are sufficient to find an answer, discouraging progress on questions requiring reasoning or other advanced methods."

That is where their ARC comes in, to help the field move to more difficult tasks.

"We present a new question set, text corpus, and baselines assembled to encourage AI research in advanced question answering," said the authors in their paper, which is on arXiv.

The questions are multiple-choice. Here's one: "Which item below is not made from a material grown in nature?" The possible answers are a cotton shirt, a wooden chair, a plastic spoon and a grass basket. The answer taps into a common-sense picture of the world and, said Knight, "It is this common sense that the AI behind voice assistants, chatbots, and translation software lacks. And it's one reason they are so easily confused."

What contribution might this test make to the field of artificial intelligence? "If machine learning can successfully pass the Arc Reasoning Challenge, it would mean that the system has a grasp of the common sense that no AI currently possesses," wrote Krishna. "It would be a huge step forward."

More information: Think you have Solved Question Answering? Try ARC, the AI2 Reasoning Challenge (PDF)

Citation: Artificial intelligence: ARC test focus goes beyond factoid questions (2018, March 16) retrieved 17 July 2024 from https://techxplore.com/news/2018-03-artificial-intelligence-arc-focus-factoid.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.
