Research news on AI alignment

AI alignment examines how artificial systems acquire, represent, and act on goals, values, and social norms, and why their behavior often diverges from human expectations. Work in this area studies systematic failures such as bias, sycophancy, hallucinations, deceptive or selfish reasoning, and cultural or linguistic inequities, as well as limitations in commonsense, emotion, and social understanding. It also develops methods for preference learning, norm-following, interpretability, and reliability guarantees to better align AI behavior with human values and societal constraints.

Machine learning & AI

Study cracks the code behind why AI behaves as it does

AI models like ChatGPT have amazed the world with their ability to write poetry, solve equations and even pass medical exams. But they can also churn out harmful content or promote disinformation.

Machine learning & AI

We need to stop pretending AI is intelligent. Here's how

We are constantly fed a version of AI that looks, sounds and acts suspiciously like us. It speaks in polished sentences, mimics emotions, expresses curiosity, claims to feel compassion, even dabbles in what it calls creativity.

Computer Sciences

AI with, for and by everyone can help maximize its benefits

Humans' ability to learn from one another across cultures over generations drives our success as a species as much as our individual intelligence. This collective cultural brain has led to new innovations and developed bodies ...

Machine learning & AI

An expert's take on why we should not fear AI

Movies like "The Terminator," in which an artificial intelligence system goes rogue and tries to wipe out humanity, depict our worst fears about AI.
