Vicarious AI team reveals how it defeated CAPTCHA

Brain-inspired vision model cracks website protection system, with much less training compared to deep learning
A representation of the letter A. This material relates to a paper that appeared in the 27 Oct.2017, issue of Science, published by AAAS. The paper, by D. George at Vicarious AI in Union City, CA, and colleagues was titled, "A generative vision model that trains with high data efficiency and breaks text-based CAPTCHAs." Credit: Vicarious AI

A group of researchers at Vicarious AI has revealed for the first time the new and innovative method they used to defeat CAPTCHA. In their paper published in the journal Science, the team describes their neural network and how it was used to crack the software that was created to prevent bots from accessing websites.

Back in 2000, groups hosting websites and users alike had grown quite irritated with bots creating havoc on websites. To prevent such access, a system called the "Completely Automated Public Turing test to tell Computers and Humans Apart" (CAPTCHA) was developed. It required users to type in text that had been distorted—a task deemed easy enough for humans, but impossible for bots. Unfortunately, as it turned out, defeating CAPTCHA was possible, as several research teams showed. They used to learn what a CAPTCHA was and then to foil systems that used them. But such methods required the system to crunch thousands or millions of examples to become reasonably proficient at cracking CAPTCHA. But then, four years ago, a team at Vicarious AI announced that they had come up with a modified neural network that could crack CAPTCHA after studying just a few examples. The company did not publish its work, however, because they realized it would allow bot makers to run rampant again. In recent times, CAPTCHA has changed—quite often, images are used instead of text, requiring users to identify something unique in them. Because of that, Vicarious AI has decided it is now safe to reveal its novel CAPTCHA-cracking technology.

The system is called a recursive cortical network—a name that offers insight into how it works. In a traditional neural network, nodes are created to hold new information—a network is built from the nodes and it is used to judge how to deal with new data. This is how it learns. The team at Vicarious AI used a neural network, too, but they added something new—recursion. Recursion is a technique whereby data is used to learn something new. As that new process is learned, the results go back into the software, as well. This process is used over and over until a solution is reached. It is a technique that has long been used to solve mazes. By applying recursion to a neural network, the researchers found they were able to reduce the learning curve of their software dramatically—it needed just five training steps, for example, to crack Google's reCAPTCHA 67 percent of the time.

Explore further

reCAPTCHA eases up on the human eye

More information: D. George et al. A generative vision model that trains with high data efficiency and breaks text-based CAPTCHAs, Science (2017). DOI: 10.1126/science.aag2612

Learning from few examples and generalizing to dramatically different situations are capabilities of human visual intelligence that are yet to be matched by leading machine learning models. By drawing inspiration from systems neuroscience, we introduce a probabilistic generative model for vision in which message-passing based inference handles recognition, segmentation and reasoning in a unified way. The model demonstrates excellent generalization and occlusion-reasoning capabilities, and outperforms deep neural networks on a challenging scene text recognition benchmark while being 300-fold more data efficient. In addition, the model fundamentally breaks the defense of modern text-based CAPTCHAs by generatively segmenting characters without CAPTCHA-specific heuristics. Our model emphasizes aspects like data efficiency and compositionality that may be important in the path toward general artificial intelligence.

Journal information: Science

© 2017 Tech Xplore

Citation: Vicarious AI team reveals how it defeated CAPTCHA (2017, October 27) retrieved 23 September 2019 from
This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.

Feedback to editors

User comments

Oct 28, 2017
And thanks to them we will now start having bots spam every forum around again unless they change over that picture system. WHY is there always some bunch of jerks that have nothing better to do that make life annoying for everyone else. I would like to take every one of those clowns behind the woodshed with a BIG stick and teach them some bleeding common sense and respect for others since they evidently don't have any.

Oct 30, 2017
Anyone who uses CAPTCHA to prevent bots from spamming them deserves all they get. It is a stupid system devised by a bunch of drunken students after a party. Use 2 phase data entry; they enter the data, you validate it (with server code-behind or client Javascript) and only if it passes the validation, is the submit button then enabled, one time.

Please sign in to add a comment. Registration is free, and takes less than a minute. Read more