Research news on AI alignment

AI alignment examines how artificial systems acquire, represent, and act on goals, values, and social norms, and why their behavior often diverges from human expectations. Work in this area studies systematic failures such as bias, sycophancy, hallucinations, deceptive or selfish reasoning, and cultural or linguistic inequities, as well as limitations in commonsense, emotion, and social understanding. It also develops methods for preference learning, norm-following, interpretability, and reliability guarantees to better align AI behavior with human values and societal constraints.

Computer Sciences

Why AI can't understand a flower the way humans do

Even with all its training and computing power, an artificial intelligence (AI) tool like ChatGPT can't represent the concept of a flower the way a human does, according to a new study.

Machine learning & AI

Top scientist wants to prevent AI from going rogue

Concerned about the rapid spread of generative AI, a pioneering researcher is developing software to keep tabs on a technology that is increasingly taking over human tasks.

Computer Sciences

Beyond translation: Multilingual benchmark makes AI multicultural

Imagine asking a conversational bot like Claude or ChatGPT a legal question in Greek about local traffic regulations. Within seconds, it replies in fluent Greek with an answer based on UK law. The model understood the language, ...

Machine learning & AI

How trustworthy is AI?

Artificial intelligence is everywhere—writing emails, recommending movies and even driving cars—but what about the AI you don't see? Who (or what) is behind the scenes developing the algorithms that go unnoticed? And can ...

Computer Sciences

AI approach developed with human decision-makers in mind

As artificial intelligence takes off, how do we efficiently integrate it into our lives and our work? Bridging the gap between promise and practice, Jann Spiess, an associate professor of operations, information, and technology ...
