Page 12: Research news on AI alignment

AI alignment examines how artificial systems acquire, represent, and act on goals, values, and social norms, and why their behavior often diverges from human expectations. Work in this area studies systematic failures such as bias, sycophancy, hallucinations, deceptive or selfish reasoning, and cultural or linguistic inequities, as well as limitations in commonsense, emotion, and social understanding. It also develops methods for preference learning, norm-following, interpretability, and reliability guarantees to better align AI behavior with human values and societal constraints.

Security

AI education could be crucial in tackling rising voice scams

A new study from Abertay University reveals that the most effective way to protect people from AI voice scams is not through traditional warning messages, but by educating them about how advanced and authentic AI voices have ...

Machine learning & AI

Is this your AI? ZEN framework cracks AI black box

Artificial intelligence (AI) systems power everything from chatbots to security cameras, yet many of the most advanced models operate as "black boxes." Companies can use them, but outsiders can't see how they were built, ...

Computer Sciences

Don't panic: 'Humanity's last exam' has begun

When artificial intelligence systems began acing long-standing academic assessments, researchers realized they had a problem: the tests were too easy. Popular evaluations, such as the Massive Multitask Language Understanding ...

Computer Sciences

How AI could help make society less selfish

The Care Bears taught a generation of kids that sharing is caring, but not everyone has carried this principle into adulthood. Researchers at Michigan State University have found a new angle to promote cooperation: artificial ...

page 12 from 40