July 28, 2023

Computer system based on light could jumpstart power of ChatGPT-type machine-learning programs

by Elizabeth A. Thomson, Materials Research Laboratory, Massachusetts Institute of Technology

ChatGPT has made headlines around the world with its ability to write well-done essays, emails, and computer code based on a few questions from a user.

Now an MIT-led team reports a system that could lead to machine-learning programs several orders of magnitude more powerful than the one behind ChatGPT. Plus, the system they developed could use several orders of magnitude less energy than the state-of-the-art supercomputers behind the machine-learning models of today.

In the July 17 issue of Nature Photonics, the researchers report the first experimental demonstration of the new system, which does its computations based on the movement of light rather than electrons using hundreds of micron-scale lasers. With the new system, the team reports a greater than 100-fold improvement in energy efficiency and a 25-fold improvement in compute density, a measure of the power of a system, over state-of-the-art digital computers for machine learning.

In the paper, the team also cites "substantially several more orders of magnitude for future improvement." As a result, the authors continue, the technique "opens an avenue to large-scale optoelectronic processors to accelerate machine-learning tasks from data centers to decentralized edge devices." In other words, cell phones and other small devices could become capable of running programs that can currently only be computed at large data centers.

Further, because the components of the system can be created using fabrication processes already in use today, "we expect that it could be scaled for commercial use in a few years. For example, the laser arrays involved are widely used in cell-phone face ID and data communication," says Zaijun Chen, first author, who conducted the work while a postdoctoral associate at MIT in the Research Laboratory of Electronics and is now an assistant professor at the University of Southern California.

Says Dirk Englund, an associate professor in MIT's Department of Electrical Engineering and Computer Science (EECS) and leader of the work, "ChatGPT is limited in its size by the power of today's supercomputers. It's just not economically viable to train models that are much bigger. Our new technology could make it possible to leapfrog to machine-learning models that otherwise would not be reachable in the near future."

He continues, "We don't know what capabilities the next-generation ChatGPT will have if it is 100 times more powerful, but that's the regime of discovery that this kind of technology can allow." Englund is also leader of MIT's Quantum Photonics Laboratory and is affiliated with the Research Laboratory of Electronics (RLE) and the Materials Research Laboratory.

A drumbeat of progress

The current work is the latest achievement in a drumbeat of progress over the last few years by Englund and many of the same colleagues. For example, in 2019 an Englund team reported the theoretical work that led to the current demonstration. The first author of that paper, Ryan Hamerly, now of RLE and NTT Research Inc, is also an author of the current paper.

Additional co-authors of the current Nature Photonics paper are Alexander Sludds, Ronald Davis, Ian Christen, Liane Bernstein, and Lamia Ateshian, all of RLE; and Tobias Heuser, Niels Heermeier, James A. Lott, and Stephan Reitzensttein of Technische Universitat Berlin.

Deep neural networks (DNNs) like the one behind ChatGPT are based on huge machine-learning models that simulate how the brain processes information. However, the digital technologies behind today's DNNs are reaching their limits even as the field of machine learning is growing. Further, they require huge amounts of energy and are largely confined to large data centers. That is motivating the development of new computing paradigms.

The advantages of light

Using light rather than electrons to run DNN computations has the potential to break through the current bottlenecks. Computations using optics, for example, have the potential to use far less energy than those based on electronics. Further, with optics, "you can have much larger bandwidths," or compute densities, says Chen. Light can transfer much more information over a much smaller area.

But current optical neural networks (ONNs) have significant challenges. For example, they use a great deal of energy because they are inefficient at converting incoming data based on electrical energy into light. Further, the components involved are bulky and take up significant space. And while ONNs are quite good at linear calculations like adding, they are not great at nonlinear calculations like multiplication and "if" statements.

In the current work the researchers introduce a compact architecture that, for the first time, solves all of these challenges and two more simultaneously. That architecture is based on state-of-the-art arrays of vertical surface-emitting lasers (VCSELs), a relatively new technology used in applications including LiDAR remote sensing and laser printing.

The particular VCELs reported in the Nature Photonics paper were developed by the Reitzenstein group at Technische Universitat Berlin. "This was a collaborative project that would not have been possible without them," Hamerly says.

Logan Wright is an assistant professor at Yale University who was not involved in the current research. Wright says, "The work by Zaijun Chen et al. is inspiring, encouraging me and likely many other researchers in this area that systems based on modulated VCSEL arrays could be a viable route to large-scale, high-speed optical neural networks."

"Of course, the state-of-the-art here is still far from the scale and cost that would be necessary for practically useful devices, but I am optimistic about what can be realized in the next few years, especially given the potential these systems have to accelerate the very large-scale, very expensive AI systems like those used in popular textual 'GPT' systems like ChatGPT."

Chen, Hamerly, and Englund have filed for a patent on the work.

More information: Zaijun Chen et al, Deep learning with coherent VCSEL neural networks, Nature Photonics (2023). DOI: 10.1038/s41566-023-01233-w

Journal information: Nature Photonics

Provided by Materials Research Laboratory, Massachusetts Institute of Technology

Citation: Computer system based on light could jumpstart power of ChatGPT-type machine-learning programs (2023, July 28) retrieved 29 June 2024 from https://techxplore.com/news/2023-07-based-jumpstart-power-chatgpt-type-machine-learning.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.

Explore further

Deep learning with light: Components of machine learning model encoded onto light waves

121 shares

Feedback to editors

Researchers develop novel 3D printing strategy with controllable gradients porous structures

Jun 28, 2024

Researchers develop the fastest possible flow algorithm

Jun 28, 2024

Real-time modeling of 3D temperature distributions within nuclear microreactors to improve safety systems

Jun 28, 2024

Is ChatGPT the key to stopping deepfakes? Study asks LLMs to spot AI-generated images

Jun 27, 2024

Wireless receiver blocks interference for better mobile device performance

Jun 27, 2024

Researchers successfully develop domestic 6G antenna measurement system

Jun 27, 2024

Research shows how common plastics could passively cool and heat buildings with the seasons

Jun 27, 2024

Researchers suggest smart solution to harness waste heat from industry

Jun 27, 2024

Robotic hand with tactile fingertips achieves new dexterity feat

Jun 27, 2024

Help or hindrance? ER robots have potential to aid health care workers

Jun 27, 2024

Load comments (1)

Computer system based on light could jumpstart power of ChatGPT-type machine-learning programs

A drumbeat of progress

The advantages of light

Researchers develop novel 3D printing strategy with controllable gradients porous structures

Researchers develop the fastest possible flow algorithm

Real-time modeling of 3D temperature distributions within nuclear microreactors to improve safety systems

Is ChatGPT the key to stopping deepfakes? Study asks LLMs to spot AI-generated images

Wireless receiver blocks interference for better mobile device performance

Researchers successfully develop domestic 6G antenna measurement system

Research shows how common plastics could passively cool and heat buildings with the seasons

Researchers suggest smart solution to harness waste heat from industry

Robotic hand with tactile fingertips achieves new dexterity feat

Help or hindrance? ER robots have potential to aid health care workers

Deep learning with light: Components of machine learning model encoded onto light waves

Breaking the scaling limits of analog computing

Optical memristors review: Shining a light on neuromorphic computing

Simple data gets the most out of quantum machine learning

Chip design dramatically reduces energy needed to compute with light

A deep belief neural network based on silicon memristive synapses

Researchers develop novel 3D printing strategy with controllable gradients porous structures

Is ChatGPT the key to stopping deepfakes? Study asks LLMs to spot AI-generated images

Robotic hand with tactile fingertips achieves new dexterity feat

Light-controlled artificial maple seeds could monitor the environment even in hard-to-reach locations

Sony introduces AI for single-instrument accompaniment generation in music production

New work explores optimal circumstances for reaching a common goal with humanoid robots

Phys.org

Medical Xpress

Science X

Computer system based on light could jumpstart power of ChatGPT-type machine-learning programs

A drumbeat of progress

The advantages of light

Researchers develop novel 3D printing strategy with controllable gradients porous structures

Researchers develop the fastest possible flow algorithm

Real-time modeling of 3D temperature distributions within nuclear microreactors to improve safety systems

Is ChatGPT the key to stopping deepfakes? Study asks LLMs to spot AI-generated images

Wireless receiver blocks interference for better mobile device performance

Researchers successfully develop domestic 6G antenna measurement system

Research shows how common plastics could passively cool and heat buildings with the seasons

Researchers suggest smart solution to harness waste heat from industry

Robotic hand with tactile fingertips achieves new dexterity feat

Help or hindrance? ER robots have potential to aid health care workers

Related Stories

Deep learning with light: Components of machine learning model encoded onto light waves

Breaking the scaling limits of analog computing

Optical memristors review: Shining a light on neuromorphic computing

Simple data gets the most out of quantum machine learning

Chip design dramatically reduces energy needed to compute with light

A deep belief neural network based on silicon memristive synapses

Recommended for you

Researchers develop novel 3D printing strategy with controllable gradients porous structures

Is ChatGPT the key to stopping deepfakes? Study asks LLMs to spot AI-generated images

Robotic hand with tactile fingertips achieves new dexterity feat

Light-controlled artificial maple seeds could monitor the environment even in hard-to-reach locations

Sony introduces AI for single-instrument accompaniment generation in music production

New work explores optimal circumstances for reaching a common goal with humanoid robots

Your Privacy