Page 29: Research news on AI alignment

AI alignment examines how artificial systems acquire, represent, and act on goals, values, and social norms, and why their behavior often diverges from human expectations. Work in this area studies systematic failures such as bias, sycophancy, hallucinations, deceptive or selfish reasoning, and cultural or linguistic inequities, as well as limitations in commonsense, emotion, and social understanding. It also develops methods for preference learning, norm-following, interpretability, and reliability guarantees to better align AI behavior with human values and societal constraints.

Does AI understand?

Imagine an ant crawling in sand, tracing a path that happens to look like Winston Churchill. Would you say the ant created an image of the former British prime minister? According to the late Harvard philosopher Hilary Putnam, ...

Jul 17, 2025

New method makes AI language model evaluations faster, fairer, and less costly

Assessing the progress of new AI language models can be as challenging as training them. Stanford researchers offer a new approach.

Jul 15, 2025

Tool devised for detecting AI that scores high on accuracy, low on false accusations

Detecting writing via artificial intelligence is a tricky dance: Doing it right means being effective at identifying it while being careful not to falsely accuse a human of employing it. And few tools strike the right balance.

Jul 10, 2025

Can ChatGPT actually 'see' red? New study results are nuanced

ChatGPT works by analyzing vast amounts of text, identifying patterns and synthesizing them to generate responses to users' prompts. Color metaphors like "feeling blue" and "seeing red" are commonplace throughout the English ...

Jul 8, 2025

118

What makes a good AI prompt? Here are 4 expert tips

"And do you work well with AI?"

Jul 8, 2025

Pilot program integrates AI-generated notes with human community notes on X platform

X (formerly Twitter) launched its "Community Notes" program in 2021 to combat misinformation by allowing users to add contextual notes on posts that might be deceptive or lead to misinterpretation. An example would be users ...

Jul 4, 2025 report

Young children outperform state-of-the-art AI in visual object recognition

As artificial intelligence (AI) rapidly grows—a recent UN Trade and Development report projects the global AI market soaring to $4.8 trillion by 2033—the technology seems equipped to handle any task. Driving cars. Analyzing ...

Jul 3, 2025

Key biases in AI models used for detecting depression on social media

Artificial intelligence models used to detect depression on social media are often biased and methodologically flawed, according to a study led by Northeastern University computer science graduates.

Jul 3, 2025

Centaur: AI that thinks like us—and could help explain how we think

Researchers at Helmholtz Munich have developed an artificial intelligence model that can simulate human behavior with remarkable accuracy. The language model, called Centaur, was trained on more than ten million decisions ...

Jul 2, 2025

190

Striking parallels between biological brains and AI during social interaction suggest fundamental principles

UCLA researchers have made a significant discovery showing that biological brains and artificial intelligence systems develop remarkably similar neural patterns during social interaction. This first-of-its-kind study reveals ...

Jul 2, 2025

162

page 29 from 40

Page 29: Research news on AI alignment

Does AI understand?

New method makes AI language model evaluations faster, fairer, and less costly

Tool devised for detecting AI that scores high on accuracy, low on false accusations

Can ChatGPT actually 'see' red? New study results are nuanced

What makes a good AI prompt? Here are 4 expert tips

Pilot program integrates AI-generated notes with human community notes on X platform

Young children outperform state-of-the-art AI in visual object recognition

Key biases in AI models used for detecting depression on social media

Centaur: AI that thinks like us—and could help explain how we think

Striking parallels between biological brains and AI during social interaction suggest fundamental principles

Phys.org

Medical Xpress

Science X

Page 29: Research news on AI alignment

Your Privacy