Page 25: Research news on Large language models

Large language models are high-capacity neural sequence models trained on massive text and multimodal corpora to perform language understanding, generation, and reasoning. Current work examines their internal representations, cognitive and social behavior analogies to humans, and limitations in mathematical, causal, and strategic reasoning. Research also addresses alignment with human values and brain activity, safety and security vulnerabilities, privacy and de-anonymization risks, cross-lingual and sociocultural biases, scaling and efficiency laws, and frameworks for tool use, multi-agent interaction, and domain-specific deployment.

Machine learning & AI

UAE unveils new Arabic-language AI model

The United Arab Emirates on Wednesday announced a new Arabic-language artificial intelligence model, describing it as the best-performing in the region.

Machine learning & AI

AI overconfidence mirrors a human language disorder

Agents, chatbots and other tools based on artificial intelligence (AI) are increasingly used in everyday life by many. So-called large language model (LLM)-based agents, such as ChatGPT and Llama, have become impressively ...

Machine learning & AI

Energy and memory: A new neural network paradigm

Listen to the first notes of an old, beloved song. Can you name that tune? If you can, congratulations—it's a triumph of your associative memory, in which one piece of information (the first few notes) triggers the memory ...

Machine learning & AI

Key units in AI models mirror human brain's language system

EPFL researchers have discovered key "units" in large AI models that seem to be important for language, mirroring the brain's language system. When these specific units were turned off, the models got much worse at language ...

page 25 from 28