Page 3: Research news on AI alignment

AI alignment examines how artificial systems acquire, represent, and act on goals, values, and social norms, and why their behavior often diverges from human expectations. Work in this area studies systematic failures such as bias, sycophancy, hallucinations, deceptive or selfish reasoning, and cultural or linguistic inequities, as well as limitations in commonsense, emotion, and social understanding. It also develops methods for preference learning, norm-following, interpretability, and reliability guarantees to better align AI behavior with human values and societal constraints.

End of black box AI? Scientists develop blueprint for transparent system that reveals how it learns and makes decisions

Apr 30, 2026

Evolvable AI: Are we on the brink of the next major evolutionary transition?

Apr 30, 2026

Evolving AI may arrive before AGI and create hard-to-control risks

Apr 29, 2026

The friendlier AI gets, the more it can backfire

Apr 29, 2026

Can AI quantify beauty? New study suggests it can't

Apr 28, 2026

Sharper bias tests could help stop ChatGPT from amplifying hidden stereotypes

Apr 22, 2026

How AI bias can creep into online content moderation

Apr 22, 2026

How do generative AI tools reshape the software engineering workforce?

Apr 22, 2026

AI works best with humans—not instead of them

Apr 21, 2026

From Siri to scams, AI voice clones now beat human speech in noisy settings

Apr 21, 2026
