New research reveals AI has a confidence problem
Large language models (LLMs) sometimes lose confidence when answering questions and abandon correct answers, according to a new study by researchers at Google DeepMind and University College London.
Large language models are high-capacity neural sequence models trained on massive text and multimodal corpora to perform language understanding, generation, and reasoning. Current work examines their internal representations, cognitive and social behavior analogies to humans, and limitations in mathematical, causal, and strategic reasoning. Research also addresses alignment with human values and brain activity, safety and security vulnerabilities, privacy and de-anonymization risks, cross-lingual and sociocultural biases, scaling and efficiency laws, and frameworks for tool use, multi-agent interaction, and domain-specific deployment.
Machine learning & AI
Large language models (LLMs) sometimes lose confidence when answering questions and abandon correct answers, according to a new study by researchers at Google DeepMind and University College London.
Machine learning & AI
Assessing the progress of new AI language models can be as challenging as training them. Stanford researchers offer a new approach.
Jul 15, 2025
0
50
Machine learning & AI
Artificial intelligence (AI) is infamous for its resource-heavy training, but a new study may have found a solution in a novel communications system, called ZEN, that markedly improves the way large language models (LLMs) ...
Jul 11, 2025
0
44
Computer Sciences
This summer, EPFL and ETH Zurich will release a large language model (LLM) developed on public infrastructure. Trained on the Alps supercomputer at the Swiss National Supercomputing Center (CSCS), the new LLM marks a milestone ...
Jul 9, 2025
0
55
Computer Sciences
ChatGPT works by analyzing vast amounts of text, identifying patterns and synthesizing them to generate responses to users' prompts. Color metaphors like "feeling blue" and "seeing red" are commonplace throughout the English ...
Jul 8, 2025
1
118
Computer Sciences
For all their impressive capabilities, large language models (LLMs) often fall short when given challenging new tasks that require complex reasoning skills.
Jul 8, 2025
0
93
Computer Sciences
Researchers have developed a technique that significantly improves the performance of large language models without increasing the computational power necessary to fine-tune the models. The researchers demonstrated that their ...
Jul 7, 2025
0
127
Computer Sciences
The language capabilities of today's artificial intelligence systems are astonishing. We can now engage in natural conversations with systems like ChatGPT, Gemini, and many others, with a fluency nearly comparable to that ...
Jul 7, 2025
0
69
Machine learning & AI
When teaching a Photoshop class at a children's summer camp, Stevens undergraduate student Gursimran Vasir noticed something strange.
Jun 26, 2025
0
33
Computer Sciences
A research team at the University of Barcelona (UB) has shown how artificial intelligence (AI) models can detect personality traits from written texts, and for the first time has managed to analyze in detail how these systems ...
Jun 25, 2025
0
123