December 16, 2020 feature

A memory-augmented, artificial neural network-based architecture

by Ingrid Fadelli , Tech Xplore

Over the past decade or so, researchers have developed a variety of computational models based on artificial neural networks (ANNs). While many of these models have been found to perform well on specific tasks, they are not always able to identify iterative, sequential or algorithmic strategies that can be applied to new problems.

Past studies have found that the addition of an external memory component can improve a neural network's ability to acquire these strategies. Even with an external memory, however, they can remain prone to errors, are sensitive to changes in the data presented to them and require large amounts of training data to perform well.

Researchers at Technische Universität Darmstadt have recently introduced a new memory-augmented ANN-based architecture that can learn abstract strategies for solving problems. This architecture, presented in a paper published in Nature Machine Intelligence, separates algorithmic computations from data-dependent manipulations, dividing the flow of information processed by algorithms into two distinct 'streams'.

"Extending neural networks with external memories has increased their capacities to learn such strategies, but they are still prone to data variations, struggle to learn scalable and transferable solutions, and require massive training data," the researchers wrote in their paper. "We present the neural Harvard computer, a memory-augmented network-based architecture that employs abstraction by decoupling algorithmic operations from data manipulations, realized by splitting the information flow and separated modules."

The neural Harvard computer or NHC, divides the flow of information fed to an algorithm into two different streams, namely a data stream (containing data-specific manipulations) and control stream (containing algorithmic operations). This ultimately allows it to differentiate between modules related to data and algorithmic modules, creating two separate and yet coupled memories.

The NHC has three main algorithmic modules, which are referred to as the controller, the memory and the bus. These three components have distinct functions but interact with each other to acquire abstractions that can be applied to future tasks.

"This abstraction mechanism and evolutionary training enable the learning of robust and scalable algorithmic solutions," the researchers explained in their paper.

The team at Technische Universität Darmstadt evaluated the NHC by using it to train and run 11 different algorithms. They then tested the performance of these algorithms, along with their generalization and abstraction capabilities. The researchers found that the NHC could reliably run all 11 algorithms, while also allowing them to perform well on tasks that were more complex than those they were originally trained to complete.

"On a diverse set of 11 algorithms with varying complexities, we show that the NHC reliably learns algorithmic solutions with strong generalization and abstraction, achieves perfect generalization and scaling to arbitrary task configurations and complexities far beyond those seen during training, and independent of the data representation and the task domain," the researchers wrote in their paper.

The recent study carried out by this team of researchers confirms the potential of using external memory components to augment the performance and generalizability of neural network-based architectures across tasks of varying complexities. In the future, the NHC architecture could be used to combine and improve the capabilities of different ANNs, aiding the development of models that can identify useful strategies to make accurate predictions based on new data.

More information: Evolutionary training and abstraction yields algorithmic generalization of neural computers. Nature Machine Intelligence(2020). DOI: 10.1038/s42256-020-00255-1.

Journal information: Nature Machine Intelligence

Citation: A memory-augmented, artificial neural network-based architecture (2020, December 16) retrieved 1 July 2024 from https://techxplore.com/news/2020-12-memory-augmented-artificial-neural-network-based-architecture.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.

Explore further

Engineers offer smart, timely ideas for AI bottlenecks

243 shares

Feedback to editors

Researchers develop novel 3D printing strategy with controllable gradients porous structures

Jun 28, 2024

Researchers develop the fastest possible flow algorithm

Jun 28, 2024

Real-time modeling of 3D temperature distributions within nuclear microreactors to improve safety systems

Jun 28, 2024

Is ChatGPT the key to stopping deepfakes? Study asks LLMs to spot AI-generated images

Jun 27, 2024

Wireless receiver blocks interference for better mobile device performance

Jun 27, 2024

Researchers successfully develop domestic 6G antenna measurement system

Jun 27, 2024

Research shows how common plastics could passively cool and heat buildings with the seasons

Jun 27, 2024

Researchers suggest smart solution to harness waste heat from industry

Jun 27, 2024

Robotic hand with tactile fingertips achieves new dexterity feat

Jun 27, 2024

Help or hindrance? ER robots have potential to aid health care workers

Jun 27, 2024

Load comments (2)

A memory-augmented, artificial neural network-based architecture

Researchers develop novel 3D printing strategy with controllable gradients porous structures

Researchers develop the fastest possible flow algorithm

Real-time modeling of 3D temperature distributions within nuclear microreactors to improve safety systems

Is ChatGPT the key to stopping deepfakes? Study asks LLMs to spot AI-generated images

Wireless receiver blocks interference for better mobile device performance

Researchers successfully develop domestic 6G antenna measurement system

Research shows how common plastics could passively cool and heat buildings with the seasons

Researchers suggest smart solution to harness waste heat from industry

Robotic hand with tactile fingertips achieves new dexterity feat

Help or hindrance? ER robots have potential to aid health care workers

Engineers offer smart, timely ideas for AI bottlenecks

HAMLET: A platform to simplify AI research and development

Study reveals that methods to infer the connectivity of neural circuits are affected by systematic errors

A ferroelectric ternary content-addressable memory to enhance deep learning models

Researchers measure reliability, confidence for next-gen AI

Photon-based processing units enable more complex machine learning

Researchers develop the fastest possible flow algorithm

Is ChatGPT the key to stopping deepfakes? Study asks LLMs to spot AI-generated images

Robotic hand with tactile fingertips achieves new dexterity feat

Sony introduces AI for single-instrument accompaniment generation in music production

Mechanical computer relies on kirigami cubes, not electronics

New work explores optimal circumstances for reaching a common goal with humanoid robots

Phys.org

Medical Xpress

Science X

A memory-augmented, artificial neural network-based architecture

Researchers develop novel 3D printing strategy with controllable gradients porous structures

Researchers develop the fastest possible flow algorithm

Real-time modeling of 3D temperature distributions within nuclear microreactors to improve safety systems

Is ChatGPT the key to stopping deepfakes? Study asks LLMs to spot AI-generated images

Wireless receiver blocks interference for better mobile device performance

Researchers successfully develop domestic 6G antenna measurement system

Research shows how common plastics could passively cool and heat buildings with the seasons

Researchers suggest smart solution to harness waste heat from industry

Robotic hand with tactile fingertips achieves new dexterity feat

Help or hindrance? ER robots have potential to aid health care workers

Related Stories

Engineers offer smart, timely ideas for AI bottlenecks

HAMLET: A platform to simplify AI research and development

Study reveals that methods to infer the connectivity of neural circuits are affected by systematic errors

A ferroelectric ternary content-addressable memory to enhance deep learning models

Researchers measure reliability, confidence for next-gen AI

Photon-based processing units enable more complex machine learning

Recommended for you

Researchers develop the fastest possible flow algorithm

Is ChatGPT the key to stopping deepfakes? Study asks LLMs to spot AI-generated images

Robotic hand with tactile fingertips achieves new dexterity feat

Sony introduces AI for single-instrument accompaniment generation in music production

Mechanical computer relies on kirigami cubes, not electronics

New work explores optimal circumstances for reaching a common goal with humanoid robots

Your Privacy