December 21, 2022 report

OpenAI announces Point-E, a machine learning system that quickly creates 3D images from a text prompt

by Bob Yirka , Tech Xplore

OpenAI announces Point-E, a machine learning system that creates 3Ds images quickly given a text prompt — A high-level overview of the pipeline. First, a text prompt is fed into a GLIDE model to produce a synthetic rendered view. Next, a point cloud diffusion stack conditions on this image to produce a 3D RGB point cloud. Credit: *arXiv* (2022). DOI: 10.48550/arxiv.2212.08751

A team of researchers at San Francisco-based OpenAI, has announced the development of a machine-learning system that can create 3D images from text much more quickly than other systems. The group has published a paper describing their new system, called Point-E, on the arXiv preprint server.

Over the past year, several groups have announced products or systems that can generate a 3D-modeled image based on a text prompt, e.g., "a blue chair on a red floor," or "a young boy wearing a green hat and riding a purple bicycle." Such systems generally have two parts. The first reads the text and tries to make sense of it. The second, trained on internet searches, renders the desired image.

Because of the complexity of the task, these systems can take a long time to return a model, ranging from hours to days. In this new effort, the researchers built a similar system that returns results within minutes, though they readily acknowledge that the results "fall short of the state-of-the-art in terms of sample quality."

To create images more quickly, the researchers adopted an approach somewhat different than others. Their system does not even create images in the traditional sense. Instead, it generates point clouds, which, when viewed together, resemble the desired image. The team took this approach because generating point clouds is far easier than generating actual images. To create the results, the system routes images it finds through another AI system they developed that converts what it receives to meshes, which produce the 3D point cloud model of the intended object.

The first part of the system was made using two modules—the first converts the text into an image idea and the second part finds images that are used to generate a generic image. In operation, the system runs very much the same as others of its kind—a user inputs a descriptive text prompt and the system returns an image model. They note that while the visual quality is not comparable to other systems, it might be more suitable to other applications, such as fabricating real-world objects via a 3D printer.

The researchers have made the system open access—users who wish to work with it can access the code on GitHub.

More information: Alex Nichol et al, Point-E: A System for Generating 3D Point Clouds from Complex Prompts, arXiv (2022). DOI: 10.48550/arxiv.2212.08751

Journal information: arXiv

Citation: OpenAI announces Point-E, a machine learning system that quickly creates 3D images from a text prompt (2022, December 21) retrieved 25 April 2024 from https://techxplore.com/news/2022-12-openai-point-e-machine-quickly-3d.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.

Explore further

Exploring text-to-audio models to make music from scratch

143 shares

Feedback to editors

High-energy-density capacitors with 2D nanomaterials could significantly enhance energy storage

10 hours ago

Study shows potential of super grids when hurricanes overshadow solar panels

10 hours ago

Rubber-like stretchable energy storage device fabricated with laser precision

10 hours ago

On the trail of deepfakes, researchers identify 'fingerprints' of AI-generated video

11 hours ago

New tech could help traveling VR gamers experience 'ludicrous speed' without motion sickness

12 hours ago

Why can't robots outrun animals?

13 hours ago

Virtual sensors help aerial vehicles stay aloft when rotors fail

13 hours ago

New insights lead to better next-gen solar cells

14 hours ago

Scientists pioneer new X-ray microscopy method for data analysis 'on the fly'

14 hours ago

Going with the flow: Research dives into electrodes on energy storage batteries

14 hours ago

Load comments (0)

OpenAI announces Point-E, a machine learning system that quickly creates 3D images from a text prompt

High-energy-density capacitors with 2D nanomaterials could significantly enhance energy storage

Study shows potential of super grids when hurricanes overshadow solar panels

Rubber-like stretchable energy storage device fabricated with laser precision

On the trail of deepfakes, researchers identify 'fingerprints' of AI-generated video

New tech could help traveling VR gamers experience 'ludicrous speed' without motion sickness

Why can't robots outrun animals?

Virtual sensors help aerial vehicles stay aloft when rotors fail

New insights lead to better next-gen solar cells

Scientists pioneer new X-ray microscopy method for data analysis 'on the fly'

Going with the flow: Research dives into electrodes on energy storage batteries

Exploring text-to-audio models to make music from scratch

A novel multi-modal image retrieval system

T2CI GAN: A deep learning model that generates compressed images from text

Researchers fine-tune control over AI image generation

AI system makes image generator models like DALL-E 2 more creative

A model to generate artistic images based on text descriptions

Emulating neurodegeneration and aging in artificial intelligence systems

Scientists pioneer new X-ray microscopy method for data analysis 'on the fly'

Microsoft claims that small, localized language models can be powerful as well

On the trail of deepfakes, researchers identify 'fingerprints' of AI-generated video

New tech could help traveling VR gamers experience 'ludicrous speed' without motion sickness

A new framework to generate human motions from language prompts

Phys.org

Medical Xpress

Science X

OpenAI announces Point-E, a machine learning system that quickly creates 3D images from a text prompt

High-energy-density capacitors with 2D nanomaterials could significantly enhance energy storage

Study shows potential of super grids when hurricanes overshadow solar panels

Rubber-like stretchable energy storage device fabricated with laser precision

On the trail of deepfakes, researchers identify 'fingerprints' of AI-generated video

New tech could help traveling VR gamers experience 'ludicrous speed' without motion sickness

Why can't robots outrun animals?

Virtual sensors help aerial vehicles stay aloft when rotors fail

New insights lead to better next-gen solar cells

Scientists pioneer new X-ray microscopy method for data analysis 'on the fly'

Going with the flow: Research dives into electrodes on energy storage batteries

Related Stories

Exploring text-to-audio models to make music from scratch

A novel multi-modal image retrieval system

T2CI GAN: A deep learning model that generates compressed images from text

Researchers fine-tune control over AI image generation

AI system makes image generator models like DALL-E 2 more creative

A model to generate artistic images based on text descriptions

Recommended for you

Emulating neurodegeneration and aging in artificial intelligence systems

Scientists pioneer new X-ray microscopy method for data analysis 'on the fly'

Microsoft claims that small, localized language models can be powerful as well

On the trail of deepfakes, researchers identify 'fingerprints' of AI-generated video

New tech could help traveling VR gamers experience 'ludicrous speed' without motion sickness

A new framework to generate human motions from language prompts

Your Privacy