Page 23: Research news on AI alignment

AI alignment examines how artificial systems acquire, represent, and act on goals, values, and social norms, and why their behavior often diverges from human expectations. Work in this area studies systematic failures such as bias, sycophancy, hallucinations, deceptive or selfish reasoning, and cultural or linguistic inequities, as well as limitations in commonsense, emotion, and social understanding. It also develops methods for preference learning, norm-following, interpretability, and reliability guarantees to better align AI behavior with human values and societal constraints.

AI systems and humans 'see' the world differently—and that's why AI images look so garish

How do computers see the world? It's not quite the same way humans do.

Oct 15, 2025

AI models often fail to identify ableism across cultures

The artificial intelligence models underlying popular chatbots and content moderation systems struggle to identify offensive, ableist social media posts in English—and perform even worse in Hindi, new Cornell research finds.

Oct 14, 2025

Multimodal AI learns to weigh text and images more evenly

Just as human eyes tend to focus on pictures before reading accompanying text, multimodal artificial intelligence (AI)—which processes multiple types of sensory data at once—also tends to depend more heavily on certain types ...

Oct 14, 2025

AI-powered method helps protect global chip supply chains from cyber threats

University of Missouri researchers have used artificial intelligence to detect hidden hardware trojans through a method that's 97% accurate.

Oct 13, 2025

Complex decisions still require human skills as AI supports public decision-making, says researcher

Today, AI technologies are being used in the public sector for administrative cases and as support in the various steps that lead up to a decision. This adds transparency by showing the pathway to the decision.

Oct 9, 2025

People-pleasing chatbots may boost your ego, but they can weaken your judgment

Most people enjoy receiving praise occasionally, but if it comes from sycophantic chatbots, it could be doing you more harm than good. Computer scientists from Stanford University and Carnegie Mellon University have found ...

Oct 8, 2025 report

AI could make it easier to create bioweapons that bypass current security protocols

Artificial intelligence is transforming biology and medicine by accelerating the discovery of new drugs and proteins and making it easier to design and manipulate DNA, the building blocks of life. But as with most new technologies, ...

Oct 3, 2025 report

We teach young people to write. In the age of AI, we must teach them how to see

Humans extend forgiveness to machines just as they do to people, study reveals

Artificial intelligence may not be artificial

Phys.org
