share this!
2
4
Share
Email

November 9, 2020

A continuous data supply ensures data-intensive simulations can run at maximum speed

by King Abdullah University of Science and Technology

Keep the data coming — Orchestrating the movement of data between more remote storage layers and main memory can speed up simulations by 2.5 times. Credit: KAUST

A pre-emptive memory management system developed by KAUST researchers can speed up data-intensive simulations by 2.5 times by eliminating delays due to slow data delivery. The development elegantly and transparently addresses one of the most stubborn bottlenecks in modern supercomputing—delivering data from memory fast enough to keep up with computations.

"Reducing the movement of data while keeping it close to the computing hardware is one of the most daunting challenges facing computational scientists handling big data," explains Hatem Ltaief from the research team. "This is exacerbated by the widening gap between computational speed and memory transmission capacity, and the need to store high-volume data on remote storage media."

The key challenge in processing big data is the cost and scale of storing the data in memory. The faster the memory, the more expensive it is, and the faster the data need to be moved between computing elements. Because only relatively small capacities of the fastest memory are available on even the most powerful supercomputing platforms, system engineers add successively larger, slower and more remote layers of memory to hold the tera- and petabytes of data typical of big data sets.

"It is in this hostile landscape that our system comes into play by reducing the overhead of moving data in and out of remote storage hardware," says Ltaief.

Ltaief with colleagues David Keyes and Tariq Alturkestani developed their multilayer buffer system (MLBS) to work proactively to maintain the data as close as possible to the computing hardware by orchestrating data movement among memory layers.

"MLBS relies on a multilevel buffering technique that outsmarts the simulation by making it 'see' all the hundreds of petabytes of data as being in fast memory," says Alturkestani. "The buffering mechanism prevents the application from stalling when it would have needed to access data located on remote storage, allowing the application to proceed at full speed with asynchronous computing operations."

This synergism provided by MLBS achieved a speedup of 2.5 times for a three-dimensional seismic exploration simulation involving hundreds of petabytes of data movements using KAUST's Shaheen-2 supercomputer.

"This approach also reduces the energy required to move data to and from remote storage media, which can be hundreds of times higher than the energy to perform a single computation on local memory," says Ltaief. "Using MLBS, we can mitigate the energy overhead of data movement, which is one of the main goals of our center."

More information: Tariq Alturkestani et al. Maximizing I/O Bandwidth for Reverse Time Migration on Heterogeneous Large-Scale Systems, Euro-Par 2020: Parallel Processing (2020). DOI: 10.1007/978-3-030-57675-2_17

Provided by King Abdullah University of Science and Technology

Citation: A continuous data supply ensures data-intensive simulations can run at maximum speed (2020, November 9) retrieved 16 August 2024 from https://techxplore.com/news/2020-11-data-intensive-simulations-maximum.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.

Explore further

Multi-state data storage leaving binary behind

6 shares

Feedback to editors

Engineers design tiny batteries for powering cell-sized robots

12 hours ago

Leaf-like solar concentrators promise major boost in solar efficiency

12 hours ago

Why does AI beat humans at the strategy game Diplomacy?

13 hours ago

New technique prints metal oxide thin film circuits at room temperature

14 hours ago

Studies highlight challenges and solutions in making large language models trustworthy

15 hours ago

Finding security flaws in Android ahead of malicious hackers

15 hours ago

Robot planning tool accounts for human carelessness

16 hours ago

From shrimp to steel: Introducing nature-inspired metalworking

16 hours ago

'AI Scientist' model designed to conduct scientific research autonomously

17 hours ago

Global AI adoption is outpacing risk understanding, researchers warn

17 hours ago

Load comments (0)

A continuous data supply ensures data-intensive simulations can run at maximum speed

Engineers design tiny batteries for powering cell-sized robots

Leaf-like solar concentrators promise major boost in solar efficiency

Why does AI beat humans at the strategy game Diplomacy?

New technique prints metal oxide thin film circuits at room temperature

Studies highlight challenges and solutions in making large language models trustworthy

Finding security flaws in Android ahead of malicious hackers

Robot planning tool accounts for human carelessness

From shrimp to steel: Introducing nature-inspired metalworking

'AI Scientist' model designed to conduct scientific research autonomously

Global AI adoption is outpacing risk understanding, researchers warn

Multi-state data storage leaving binary behind

Graphene-based memory resistors show promise for brain-based computing

'Hiding' network latency for fast memory in data centers

'Memory disaggregation' for large-scale computing made practical

Superior seismic imaging for better drilling

Fully exploiting the potential of supercomputers

Robot planning tool accounts for human carelessness

'AI Scientist' model designed to conduct scientific research autonomously

Detecting machine-generated text: An arms race with the advancements of large language models

Are emergent abilities in large language models just in-context learning?

When AI aids decisions, when should humans override?

Cracking the code of life: New AI model learns DNA's hidden language

Phys.org

Medical Xpress

Science X

A continuous data supply ensures data-intensive simulations can run at maximum speed

Engineers design tiny batteries for powering cell-sized robots

Leaf-like solar concentrators promise major boost in solar efficiency

Why does AI beat humans at the strategy game Diplomacy?

New technique prints metal oxide thin film circuits at room temperature

Studies highlight challenges and solutions in making large language models trustworthy

Finding security flaws in Android ahead of malicious hackers

Robot planning tool accounts for human carelessness

From shrimp to steel: Introducing nature-inspired metalworking

'AI Scientist' model designed to conduct scientific research autonomously

Global AI adoption is outpacing risk understanding, researchers warn

Related Stories

Multi-state data storage leaving binary behind

Graphene-based memory resistors show promise for brain-based computing

'Hiding' network latency for fast memory in data centers

'Memory disaggregation' for large-scale computing made practical

Superior seismic imaging for better drilling

Fully exploiting the potential of supercomputers

Recommended for you

Robot planning tool accounts for human carelessness

'AI Scientist' model designed to conduct scientific research autonomously

Detecting machine-generated text: An arms race with the advancements of large language models

Are emergent abilities in large language models just in-context learning?

When AI aids decisions, when should humans override?

Cracking the code of life: New AI model learns DNA's hidden language

Your Privacy