Page 27: Research news on AI alignment

AI alignment examines how artificial systems acquire, represent, and act on goals, values, and social norms, and why their behavior often diverges from human expectations. Work in this area studies systematic failures such as bias, sycophancy, hallucinations, deceptive or selfish reasoning, and cultural or linguistic inequities, as well as limitations in commonsense, emotion, and social understanding. It also develops methods for preference learning, norm-following, interpretability, and reliability guarantees to better align AI behavior with human values and societal constraints.

Centaur: AI that thinks like us—and could help explain how we think

Researchers at Helmholtz Munich have developed an artificial intelligence model that can simulate human behavior with remarkable accuracy. The language model, called Centaur, was trained on more than ten million decisions ...

Jul 2, 2025

Striking parallels between biological brains and AI during social interaction suggest fundamental principles

UCLA researchers have made a significant discovery showing that biological brains and artificial intelligence systems develop remarkably similar neural patterns during social interaction. This first-of-its-kind study reveals ...

Jul 2, 2025

RisingAttacK: New technique can make AI 'see' whatever you want

Researchers have demonstrated a new way of attacking artificial intelligence computer vision systems, allowing them to control what the AI "sees." The research shows that the new technique, called RisingAttacK, is effective ...

Jul 1, 2025

AI won't replace computer scientists any time soon—here are 10 reasons why

As AI systems expand their already impressive capacities, there is an increasingly common belief that the field of computer science (CS) will soon be a thing of the past. This is being communicated to today's prospective ...

Jul 1, 2025

Understanding the 'Slopocene': How the failures of AI can reveal its inner workings

Some say it's em dashes, dodgy apostrophes, or too many emoji. Others suggest that maybe the word "delve" is a chatbot's calling card. It's no longer the sight of morphed bodies or too many fingers, but it might be something ...

Jul 1, 2025

Why human empathy still matters in the age of AI

A new international study finds that people place greater emotional value on empathy they believe comes from humans—even when the exact same response is generated by artificial intelligence.

Jun 30, 2025

The rise of 'artificial historians': AI as humanity's record-keeper

In documenting and recording society's collective data on an unprecedented scale, artificial intelligence is becoming humanity's historian—changing the way we record information for posterity.

Jun 30, 2025

AI is learning to lie, scheme, and threaten its creators

The world's most advanced AI models are exhibiting troubling new behaviors—lying, scheming, and even threatening their creators to achieve their goals.

Jun 29, 2025

Q&A: When talking about AI, definitions matter

Artificial intelligence is everywhere lately—on the news, in podcasts and around every water cooler. A new, buzzy term, artificial general intelligence (AGI), is dominating conversations and raising more questions than it ...

Jun 27, 2025

New method can teach AI to admit uncertainty

In high-stakes situations like health care—or weeknight "Jeopardy!"—it can be safer to say "I don't know" than to answer incorrectly. Doctors, game show contestants, and standardized test-takers understand this, but most ...

Jun 26, 2025
