Page 34: Research news on AI alignment

AI alignment examines how artificial systems acquire, represent, and act on goals, values, and social norms, and why their behavior often diverges from human expectations. Work in this area studies systematic failures such as bias, sycophancy, hallucinations, deceptive or selfish reasoning, and cultural or linguistic inequities, as well as limitations in commonsense, emotion, and social understanding. It also develops methods for preference learning, norm-following, interpretability, and reliability guarantees to better align AI behavior with human values and societal constraints.

'Periodic table of machine learning' framework unifies AI models to accelerate innovation

MIT researchers have created a periodic table that shows how more than 20 classical machine-learning algorithms are connected. The new framework sheds light on how scientists could fuse strategies from different methods to ...

Apr 23, 2025

371

Social networks are vulnerable to relatively simple AI manipulation and polarization

It seems that no matter the topic of conversation, online opinion around it will be split into two seemingly irreconcilable camps.

Apr 15, 2025

Study cracks the code behind why AI behaves as it does

AI models like ChatGPT have amazed the world with their ability to write poetry, solve equations and even pass medical exams. But they can also churn out harmful content, or promote disinformation.

Apr 15, 2025

A weird phrase is plaguing scientific papers—and we traced it back to a glitch in AI training data

Earlier this year, scientists discovered a peculiar term appearing in published papers: "vegetative electron microscopy."

Apr 15, 2025

We need to stop pretending AI is intelligent. Here's how

We are constantly fed a version of AI that looks, sounds and acts suspiciously like us. It speaks in polished sentences, mimics emotions, expresses curiosity, claims to feel compassion, even dabbles in what it calls creativity.

Apr 14, 2025

Getting AIs working toward human goals: Study shows how to measure misalignment

Ideally, artificial intelligence agents aim to help humans, but what does that mean when humans want conflicting things? My colleagues and I have come up with a way to measure the alignment of the goals of a group of humans ...

Apr 14, 2025

AI with, for and by everyone can help maximize its benefits

Humans' ability to learn from one another across cultures over generations drives our success as a species as much as our individual intelligence. This collective cultural brain has led to new innovations and developed bodies ...

Apr 9, 2025

104

page 34 from 37

Page 34: Research news on AI alignment

'Periodic table of machine learning' framework unifies AI models to accelerate innovation

Social networks are vulnerable to relatively simple AI manipulation and polarization

Study cracks the code behind why AI behaves as it does

A weird phrase is plaguing scientific papers—and we traced it back to a glitch in AI training data

We need to stop pretending AI is intelligent. Here's how

Getting AIs working toward human goals: Study shows how to measure misalignment

AI with, for and by everyone can help maximize its benefits

Is this AI or a journalist? Research reveals stylistic differences in news articles

User-centered app development: Experts suggest balancing feedback with developer intuition

An expert's take on why we should not fear AI

Phys.org

Medical Xpress

Science X

Page 34: Research news on AI alignment

Your Privacy