December 16, 2020 feature

A memory-augmented, artificial neural network-based architecture

by Ingrid Fadelli , Tech Xplore

Over the past decade or so, researchers have developed a variety of computational models based on artificial neural networks (ANNs). While many of these models have been found to perform well on specific tasks, they are not always able to identify iterative, sequential or algorithmic strategies that can be applied to new problems.

Past studies have found that the addition of an external memory component can improve a neural network's ability to acquire these strategies. Even with an external memory, however, they can remain prone to errors, are sensitive to changes in the data presented to them and require large amounts of training data to perform well.

Researchers at Technische Universität Darmstadt have recently introduced a new memory-augmented ANN-based architecture that can learn abstract strategies for solving problems. This architecture, presented in a paper published in Nature Machine Intelligence, separates algorithmic computations from data-dependent manipulations, dividing the flow of information processed by algorithms into two distinct 'streams'.

"Extending neural networks with external memories has increased their capacities to learn such strategies, but they are still prone to data variations, struggle to learn scalable and transferable solutions, and require massive training data," the researchers wrote in their paper. "We present the neural Harvard computer, a memory-augmented network-based architecture that employs abstraction by decoupling algorithmic operations from data manipulations, realized by splitting the information flow and separated modules."

The neural Harvard computer or NHC, divides the flow of information fed to an algorithm into two different streams, namely a data stream (containing data-specific manipulations) and control stream (containing algorithmic operations). This ultimately allows it to differentiate between modules related to data and algorithmic modules, creating two separate and yet coupled memories.

The NHC has three main algorithmic modules, which are referred to as the controller, the memory and the bus. These three components have distinct functions but interact with each other to acquire abstractions that can be applied to future tasks.

"This abstraction mechanism and evolutionary training enable the learning of robust and scalable algorithmic solutions," the researchers explained in their paper.

The team at Technische Universität Darmstadt evaluated the NHC by using it to train and run 11 different algorithms. They then tested the performance of these algorithms, along with their generalization and abstraction capabilities. The researchers found that the NHC could reliably run all 11 algorithms, while also allowing them to perform well on tasks that were more complex than those they were originally trained to complete.

"On a diverse set of 11 algorithms with varying complexities, we show that the NHC reliably learns algorithmic solutions with strong generalization and abstraction, achieves perfect generalization and scaling to arbitrary task configurations and complexities far beyond those seen during training, and independent of the data representation and the task domain," the researchers wrote in their paper.

The recent study carried out by this team of researchers confirms the potential of using external memory components to augment the performance and generalizability of neural network-based architectures across tasks of varying complexities. In the future, the NHC architecture could be used to combine and improve the capabilities of different ANNs, aiding the development of models that can identify useful strategies to make accurate predictions based on new data.

More information: Evolutionary training and abstraction yields algorithmic generalization of neural computers. Nature Machine Intelligence(2020). DOI: 10.1038/s42256-020-00255-1.

Journal information: Nature Machine Intelligence

Citation: A memory-augmented, artificial neural network-based architecture (2020, December 16) retrieved 26 April 2024 from https://techxplore.com/news/2020-12-memory-augmented-artificial-neural-network-based-architecture.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.

Explore further

Engineers offer smart, timely ideas for AI bottlenecks

243 shares

Feedback to editors

How much energy can offshore wind farms in the U.S. produce? New study sheds light

9 hours ago

Engineers uncover key to efficient and stable organic solar cells

13 hours ago

Adobe's VideoGigaGAN uses AI to make blurry videos sharp and clear

14 hours ago

Mask-inspired perovskite smart windows enhance weather resistance and energy efficiency

14 hours ago

Researchers increase storage, efficiency and durability of capacitors

14 hours ago

Study explores why human-inspired machines can be perceived as eerie

16 hours ago

High-energy-density capacitors with 2D nanomaterials could significantly enhance energy storage

Apr 24, 2024

Study shows potential of super grids when hurricanes overshadow solar panels

Apr 24, 2024

Rubber-like stretchable energy storage device fabricated with laser precision

Apr 24, 2024

On the trail of deepfakes, researchers identify 'fingerprints' of AI-generated video

Apr 24, 2024

Load comments (2)

A memory-augmented, artificial neural network-based architecture

How much energy can offshore wind farms in the U.S. produce? New study sheds light

Engineers uncover key to efficient and stable organic solar cells

Adobe's VideoGigaGAN uses AI to make blurry videos sharp and clear

Mask-inspired perovskite smart windows enhance weather resistance and energy efficiency

Researchers increase storage, efficiency and durability of capacitors

Study explores why human-inspired machines can be perceived as eerie

High-energy-density capacitors with 2D nanomaterials could significantly enhance energy storage

Study shows potential of super grids when hurricanes overshadow solar panels

Rubber-like stretchable energy storage device fabricated with laser precision

On the trail of deepfakes, researchers identify 'fingerprints' of AI-generated video

Engineers offer smart, timely ideas for AI bottlenecks

HAMLET: A platform to simplify AI research and development

Study reveals that methods to infer the connectivity of neural circuits are affected by systematic errors

A ferroelectric ternary content-addressable memory to enhance deep learning models

Researchers measure reliability, confidence for next-gen AI

Photon-based processing units enable more complex machine learning

Study explores why human-inspired machines can be perceived as eerie

Adobe's VideoGigaGAN uses AI to make blurry videos sharp and clear

Emulating neurodegeneration and aging in artificial intelligence systems

Microsoft claims that small, localized language models can be powerful as well

Scientists pioneer new X-ray microscopy method for data analysis 'on the fly'

New tech could help traveling VR gamers experience 'ludicrous speed' without motion sickness

Phys.org

Medical Xpress

Science X

A memory-augmented, artificial neural network-based architecture

How much energy can offshore wind farms in the U.S. produce? New study sheds light

Engineers uncover key to efficient and stable organic solar cells

Adobe's VideoGigaGAN uses AI to make blurry videos sharp and clear

Mask-inspired perovskite smart windows enhance weather resistance and energy efficiency

Researchers increase storage, efficiency and durability of capacitors

Study explores why human-inspired machines can be perceived as eerie

High-energy-density capacitors with 2D nanomaterials could significantly enhance energy storage

Study shows potential of super grids when hurricanes overshadow solar panels

Rubber-like stretchable energy storage device fabricated with laser precision

On the trail of deepfakes, researchers identify 'fingerprints' of AI-generated video

Related Stories

Engineers offer smart, timely ideas for AI bottlenecks

HAMLET: A platform to simplify AI research and development

Study reveals that methods to infer the connectivity of neural circuits are affected by systematic errors

A ferroelectric ternary content-addressable memory to enhance deep learning models

Researchers measure reliability, confidence for next-gen AI

Photon-based processing units enable more complex machine learning

Recommended for you

Study explores why human-inspired machines can be perceived as eerie

Adobe's VideoGigaGAN uses AI to make blurry videos sharp and clear

Emulating neurodegeneration and aging in artificial intelligence systems

Microsoft claims that small, localized language models can be powerful as well

Scientists pioneer new X-ray microscopy method for data analysis 'on the fly'

New tech could help traveling VR gamers experience 'ludicrous speed' without motion sickness

Your Privacy