March 20, 2024 feature

Training artificial neural networks to process images from a child's perspective

by Ingrid Fadelli , Tech Xplore

Psychology studies have demonstrated that by the age of 4–5, young children have developed intricate visual models of the world around them. These internal visual models allow them to outperform advanced computer vision techniques on various object recognition tasks.

Researchers at New York University recently set out to explore the possibility of training artificial neural networks on these models without domain-specific inductive biases. Their paper, published in Nature Machine Intelligence, ultimately addresses one of the oldest philosophical questions, namely the "nature vs. nurture" dilemma.

The nature vs. nurture dilemma disputes whether humans possess innate inductive biases influencing how they perceive objects, people and the world around them overall, or whether they are initially a "blank slate," developing biases as a result of their experiences. Some of the hypothesized innate biases are related to the ability to categorize and label objects.

The team at New York University set out to investigate this dilemma from a modern standpoint. To do this, they trained state-of-the-art self-supervised deep neural networks on a large dataset containing videos taken from young children's perspective using headcams (cameras attached to a hat or helmet).

"Young children develop sophisticated internal models of the world based on their visual experience," A. Emin Orhan and Brenden M. Lake wrote in their paper. "Can such models be learned from a child's visual experience without strong inductive biases? To investigate this, we train state-of-the-art neural networks on a realistic proxy of a child's visual experience without any explicit supervision or domain-specific inductive biases."

Orhan and Lake trained two types of deep learning techniques, namely embedding and generative models, on approximately 200 hours of headcam video footage collected from a single child over a two-year period. After pre-training more than 70 of these models, they tested their performance on a series of computer vision and object recognition tasks, comparing it with other state-of-the-art computer vision models.

"On average, the best embedding models perform at a respectable 70% of a high-performance ImageNet-trained model, despite substantial differences in training data," Orhan and Lake wrote. "They also learn broad semantic categories and object localization capabilities without explicit supervision, but they are less object-centric than models trained on all of ImageNet.

"Generative models trained with the same data successfully extrapolate simple properties of partially masked objects, like their rough outline, texture, color or orientation, but struggle with finer object details."

To validate their findings, the researchers carried out further experiments involving two other young children. Their results were consistent with those gathered during their first experiment, suggesting that higher-level visual representations can be learned from a child's unique visual experiences without integrating strong inductive biases.

The findings of this recent work by Orhan and Lake could serve as an inspiration for psychologists and neuroscientists, informing further studies exploring the nature vs. nurture dilemma using computational tools. Overall, the team suggests that object categorization biases depend on the unique characteristics of the human visual system, which result in different images from those typically used to train deep learning models.

"We hope that our work will inspire new collaborations between machine learning and developmental psychology, as the impact of modern deep learning on developmental psychology has been relatively limited thus far," Orhan and Lake conclude in their paper.

"Future algorithmic advances, combined with richer and larger developmental datasets, can be evaluated through the same approach, further enriching our understanding of what can be learned from a child's experience with minimal inductive biases."

More information: A. Emin Orhan et al, Learning high-level visual representations from a child's perspective without strong inductive biases, Nature Machine Intelligence (2024). DOI: 10.1038/s42256-024-00802-0

Journal information: Nature Machine Intelligence

Citation: Training artificial neural networks to process images from a child's perspective (2024, March 20) retrieved 5 May 2024 from https://techxplore.com/news/2024-03-artificial-neural-networks-images-child.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.

Explore further

New research shows how child-like language learning is possible using AI tools

54 shares

Feedback to editors

Refined AI approach improves noninvasive brain-computer interface performance

May 3, 2024

SK Hynix says high-end AI memory chips almost sold out through 2025

May 3, 2024

Stretchable e-skin could give robots human-level touch sensitivity

May 2, 2024

Leveraging robots to help make wind turbine blades

May 2, 2024

Beware of AI-based deception detection, warns scientific community

May 2, 2024

Cost-effective, high-capacity and cyclable lithium-ion battery cathodes

May 2, 2024

New AI tool efficiently detects asbestos in roofs so it can be removed

May 2, 2024

New memory transistor integrates photocrosslinker into molecular switches to adjust its threshold voltage

May 2, 2024

Researchers find use of olivine in cement production could result in carbon negative concrete

May 2, 2024

Researchers create massive open dataset to advance AI solutions for carbon capture

May 2, 2024

Load comments (0)

Training artificial neural networks to process images from a child's perspective

Refined AI approach improves noninvasive brain-computer interface performance

SK Hynix says high-end AI memory chips almost sold out through 2025

Stretchable e-skin could give robots human-level touch sensitivity

Leveraging robots to help make wind turbine blades

Beware of AI-based deception detection, warns scientific community

Cost-effective, high-capacity and cyclable lithium-ion battery cathodes

New AI tool efficiently detects asbestos in roofs so it can be removed

New memory transistor integrates photocrosslinker into molecular switches to adjust its threshold voltage

Researchers find use of olivine in cement production could result in carbon negative concrete

Researchers create massive open dataset to advance AI solutions for carbon capture

New research shows how child-like language learning is possible using AI tools

When it comes to AI, can we ditch the datasets?

Research team develops an AI model for effectively removing biases in a dataset

Zeroing in on the origins of bias in large language models

Revolutionizing plant disease diagnosis: Pre-trained models outperform traditional methods

Synthetic imagery sets new bar in AI training efficiency

Refined AI approach improves noninvasive brain-computer interface performance

Random robots are more reliable: New AI algorithm for robots consistently outperforms state-of-the-art systems

Beware of AI-based deception detection, warns scientific community

Researchers create massive open dataset to advance AI solutions for carbon capture

New AI tool efficiently detects asbestos in roofs so it can be removed

Science has an AI problem: Research group says they can fix it

Phys.org

Medical Xpress

Science X

Training artificial neural networks to process images from a child's perspective

Refined AI approach improves noninvasive brain-computer interface performance

SK Hynix says high-end AI memory chips almost sold out through 2025

Stretchable e-skin could give robots human-level touch sensitivity

Leveraging robots to help make wind turbine blades

Beware of AI-based deception detection, warns scientific community

Cost-effective, high-capacity and cyclable lithium-ion battery cathodes

New AI tool efficiently detects asbestos in roofs so it can be removed

New memory transistor integrates photocrosslinker into molecular switches to adjust its threshold voltage

Researchers find use of olivine in cement production could result in carbon negative concrete

Researchers create massive open dataset to advance AI solutions for carbon capture

Related Stories

New research shows how child-like language learning is possible using AI tools

When it comes to AI, can we ditch the datasets?

Research team develops an AI model for effectively removing biases in a dataset

Zeroing in on the origins of bias in large language models

Revolutionizing plant disease diagnosis: Pre-trained models outperform traditional methods

Synthetic imagery sets new bar in AI training efficiency

Recommended for you

Refined AI approach improves noninvasive brain-computer interface performance

Random robots are more reliable: New AI algorithm for robots consistently outperforms state-of-the-art systems

Beware of AI-based deception detection, warns scientific community

Researchers create massive open dataset to advance AI solutions for carbon capture

New AI tool efficiently detects asbestos in roofs so it can be removed

Science has an AI problem: Research group says they can fix it

Your Privacy