August 1, 2024

Enhancing automatic image cropping models with advanced adversarial techniques

Image cropping is an essential task in many contexts, from social media and e-commerce to advanced computer vision applications. Cropping helps maintain image quality by avoiding unnecessary resizing, which can degrade the image and consume computational resources. It is also useful when an image needs to conform to a predetermined aspect ratio, such as in thumbnails.

Over the past decade, engineers around the world have developed various machine learning (ML) models to automatically crop images. These models aim to crop an input image in a way that preserves its most relevant parts.

However, these models can make mistakes and exhibit biases that, in the worst cases, can put users at legal risk. For example, in 2020, a lawsuit was filed against X (formerly Twitter) because its automatic cropping function hid the copyright information in a retweeted image.

It is therefore crucial to understand why image cropping ML models fail, so that they can be trained and used in ways that avoid such problems.

Against this background, a research team from Doshisha University, Japan, set out to develop new techniques to generate adversarial examples for the task of image cropping.

As explained in their paper, published in IEEE Access on June 17, 2024, their methods introduce imperceptible noise perturbations into an image to trick models into cropping regions that align with user intentions, even if the model would otherwise have missed them.

Doctoral student Masatomo Yoshida, the first author and lead researcher of the study, said, "To the best of our knowledge, there is very little research on adversarial attacks on image cropping models, as most previous research has focused on image classification and detection. These models need to be refined to ensure they respect user intentions and eliminate biases as much as possible while cropping images."

Yoshida conducted the study with Haruto Namura, also from the Graduate School of Science and Engineering at Doshisha University in Kyoto, Japan, and Masahiro Okuda from the university's Faculty of Science and Engineering.

The researchers developed and implemented two distinct approaches for generating adversarial examples: a white-box approach and a black-box approach.

The white-box method, requiring access to the internal workings of the target model, involves iteratively calculating perturbations to input images based on the model's gradients.

By employing a gaze prediction model to identify salient points within an image, this approach manipulates gaze saliency maps to achieve effective adversarial examples. It significantly reduces perturbation sizes, achieving a minimum size 62.5% smaller than baseline methods across an experimental image dataset.
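The iterative, gradient-based idea behind the white-box approach can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and is not taken from the paper: the "gaze model" is a toy softmax over pixel intensities, the loss is the negative log-saliency at a chosen target pixel, and the update is a signed-gradient step clipped to a small budget `eps`. The function names (`saliency`, `white_box_attack`) are hypothetical.

```python
import numpy as np

def saliency(image, sharpness=10.0):
    """Toy differentiable stand-in for a gaze model: softmax over pixels."""
    logits = sharpness * image.ravel()
    logits = logits - logits.max()  # subtract the max for numerical stability
    probs = np.exp(logits)
    return (probs / probs.sum()).reshape(image.shape)

def peak_of(image):
    """Return the (row, col) of the most salient pixel as plain ints."""
    idx = np.unravel_index(np.argmax(saliency(image)), image.shape)
    return tuple(int(i) for i in idx)

def target_loss_grad(image, target, sharpness=10.0):
    """Analytic gradient of -log(saliency[target]) with respect to the image."""
    grad = sharpness * saliency(image, sharpness)
    grad[target] -= sharpness
    return grad

def white_box_attack(image, target, eps=0.15, step=0.04, max_iters=50):
    """Move the saliency peak to `target` with a perturbation bounded by eps."""
    adv = image.copy()
    for _ in range(max_iters):
        if peak_of(adv) == target:
            break  # stop early so the perturbation stays as small as possible
        adv = adv - step * np.sign(target_loss_grad(adv, target))
        adv = np.clip(adv, image - eps, image + eps)  # stay inside the budget
        adv = np.clip(adv, 0.0, 1.0)                  # stay a valid image
    return adv

# Toy grayscale image: flat background with one bright spot at (2, 2).
image = np.full((8, 8), 0.4)
image[2, 2] = 0.6
target = (6, 6)  # hypothetical region the user wants the crop to follow

adv = white_box_attack(image, target)
print("peak before:", peak_of(image), "peak after:", peak_of(adv))
print("largest pixel change:", float(np.max(np.abs(adv - image))))
```

A cropping model that centers its window on the saliency peak would now follow the target location. The early stop reflects the goal of keeping the perturbation small rather than spending the full budget.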

The black-box approach utilizes Bayesian optimization to effectively narrow the search space and target specific image regions. Similar to the white-box strategy, this approach involves iterative procedures based on gaze saliency maps.

Instead of using internal gradients, it employs a tree-structured Parzen estimator to select and optimize pixel coordinates that influence gaze saliency, ultimately producing the desired adversarial images. Because black-box techniques do not require access to a model's internals, they are more broadly applicable in real-world scenarios and hold greater relevance in cybersecurity contexts.
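To make the coordinate search more concrete, here is a simplified sketch in the style of a tree-structured Parzen estimator (TPE). It is not the authors' implementation and not a full TPE: it only keeps the core idea of splitting past trials into a "good" and a "bad" group, modeling each with a Parzen (kernel density) estimator, and querying the candidate where the good density most exceeds the bad one. The black-box model is replaced by a hypothetical toy function, `black_box_loss`, and all parameter values are assumptions.

```python
import math
import random

H, W = 16, 16
TARGET = (11, 4)  # hypothetical location the crop should favor

def black_box_loss(r, c):
    """Toy stand-in for querying a cropping model that exposes no gradients.

    Assumes a small perturbation centered at (r, c) raises the saliency at
    TARGET with a Gaussian falloff. Lower loss is better.
    """
    d2 = (r - TARGET[0]) ** 2 + (c - TARGET[1]) ** 2
    return -math.exp(-d2 / (2 * 3.0 ** 2))

def log_parzen(point, centers, sigma):
    """Log-density (up to a constant) of equal-weight Gaussians on `centers`."""
    total = 0.0
    for cr, cc in centers:
        d2 = (point[0] - cr) ** 2 + (point[1] - cc) ** 2
        total += math.exp(-d2 / (2 * sigma ** 2))
    return math.log(total / len(centers) + 1e-12)

def tpe_style_search(budget=60, n_startup=10, gamma=0.25,
                     n_candidates=24, sigma=2.0, seed=0):
    """Search pixel coordinates using only loss queries."""
    rng = random.Random(seed)
    trials = []  # (loss, (row, col)) for every query made so far
    for i in range(budget):
        if i < n_startup:
            # Warm-up: uniform random coordinates.
            point = (rng.randrange(H), rng.randrange(W))
        else:
            ranked = sorted(trials)
            n_good = max(1, int(gamma * len(ranked)))
            good = [p for _, p in ranked[:n_good]]
            bad = [p for _, p in ranked[n_good:]]
            # Sample candidates from the "good" density, clipped to the image.
            candidates = []
            for _ in range(n_candidates):
                cr, cc = rng.choice(good)
                r = min(H - 1, max(0, round(rng.gauss(cr, sigma))))
                c = min(W - 1, max(0, round(rng.gauss(cc, sigma))))
                candidates.append((r, c))
            # Pick the candidate with the highest good-to-bad density ratio.
            point = max(
                candidates,
                key=lambda p: log_parzen(p, good, sigma) - log_parzen(p, bad, sigma),
            )
        trials.append((black_box_loss(*point), point))
    best_loss, best_point = min(trials)
    return best_point, best_loss, trials

best_point, best_loss, trials = tpe_style_search()
print("best coordinate found:", best_point, "loss:", best_loss)
```

In practice, a maintained optimization library with a TPE sampler would be a more robust choice than this hand-rolled version. The sketch only shows why narrowing the search toward promising coordinates needs far fewer queries than scanning every pixel.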

Both approaches show promise based on experimental outcomes. As graduate student Haruto Namura, a participant in the study, explains, "Our findings indicate that our methods not only surpass existing techniques but also show potential as effective solutions for real-world applications, such as those on platforms like Twitter."

Overall, this study represents a significant advancement toward more reliable AI systems, which is crucial for meeting public expectations and earning public trust. Enhancing the efficiency of generating adversarial examples for image cropping will propel research in ML and inspire solutions to its pressing challenges.

Professor Masahiro Okuda, advisor to Namura and Yoshida, concludes, "By identifying vulnerabilities in increasingly deployed AI models, our research contributes to the development of fairer AI systems and addresses the growing need for AI governance."

More information: Masatomo Yoshida et al, Adversarial Examples for Image Cropping: Gradient-Based and Bayesian-Optimized Approaches for Effective Adversarial Attack, IEEE Access (2024). DOI: 10.1109/ACCESS.2024.3415356

Journal information: IEEE Access

Provided by Doshisha University

Citation: Enhancing automatic image cropping models with advanced adversarial techniques (2024, August 1) retrieved 1 August 2024 from https://techxplore.com/news/2024-08-automatic-image-cropping-advanced-adversarial.html

