Research news on AI alignment

AI alignment examines how artificial systems acquire, represent, and act on goals, values, and social norms, and why their behavior often diverges from human expectations. Work in this area studies systematic failures such as bias, sycophancy, hallucinations, deceptive or selfish reasoning, and cultural or linguistic inequities, as well as limitations in commonsense, emotion, and social understanding. It also develops methods for preference learning, norm-following, interpretability, and reliability guarantees to better align AI behavior with human values and societal constraints.

Machine learning & AI

Are we giving AI a pulse through language?

Think, know, understand, remember. These are just a few of the mental verbs we use every day to describe what happens in a person's mind. But when using these same words to talk about artificial intelligence, we can unintentionally ...

Machine learning & AI

Can we prevent AI from acting like a sociopath?

Artificial intelligence boosters predict that AI will transform life on Earth for the better. Yet there's a major problem: artificial intelligence's alarming propensity for sociopathic behavior.

Computer Sciences

Generative AIs fail at the game of visual 'telephone'

Generative AIs may not be as creative as we assume. Publishing in the journal Patterns, researchers show that when image-generating and image-describing AIs pass the same descriptive scene back and forth, they quickly veer ...
