Page 4: Research news on AI alignment

AI alignment examines how artificial systems acquire, represent, and act on goals, values, and social norms, and why their behavior often diverges from human expectations. Work in this area studies systematic failures such as bias, sycophancy, hallucinations, deceptive or selfish reasoning, and cultural or linguistic inequities, as well as limitations in commonsense, emotion, and social understanding. It also develops methods for preference learning, norm-following, interpretability, and reliability guarantees to better align AI behavior with human values and societal constraints.

Machine learning & AI

Exploring conversational AI and poetry but not as we know it

What do the large language models behind human-like conversational AI really know and what does it mean to live alongside them?

Apr 20, 2026

Machine learning & AI

Anthropic says it will put AI risks 'on the table' with Mythos model

American AI developer Anthropic plans to "lay the risks out on the table" even as it restricts deployment of a new model dubbed Mythos, whose powerful cybersecurity capabilities raise stark questions for companies and governments.

Apr 20, 2026

Machine learning & AI

Unpredictable AGI may resist full control, making diverse AI safer

Public concern about AI safety has grown significantly in recent years. As AI systems become more powerful, a key question is how we make sure they do what we actually want. Now, researchers suggest that rather than trying ...

Apr 18, 2026

Computer Sciences

Prompt coaching tool raises user awareness of bias in generative AI systems

A coaching tool built into artificial intelligence (AI)-powered systems may raise user awareness of bias in AI algorithms and help individuals better prompt generative AI tools to produce more inclusive content, according ...

Apr 16, 2026

Machine learning & AI

Thousands of AI‑written, edited or 'polished' books are being sold, an eerie echo of Orwell's 'novel‑writing machines'

At some point in the next several months, I am hoping to receive a modest check as a member of the class covered in the class-action settlement Bartz v. Anthropic.

Apr 15, 2026

Consumer & Gadgets

Dear AI, I'm autistic; should I go to this party?

When people ask ChatGPT and other AI models for advice, they often share deeply personal details in hopes of getting better answers: their age, their gender, their mental health history, even medical diagnoses like autism. ...

Apr 15, 2026

Machine learning & AI

Can Europe create AI that we actually understand?

Artificial intelligence is becoming increasingly important in nearly every aspect of society, but is completely dominated by the United States and China. Leaving the field to foreign powers and large companies may entail ...

Apr 15, 2026

Computer Sciences

Perfect alignment between AI and human values is mathematically impossible, study says

Perfect AI alignment with human values and interests is mathematically impossible, according to a study, but behavioral diversity among AI agents offers the promise of some control. Published in PNAS Nexus, Hector Zenil and ...

Apr 14, 2026

Consumer & Gadgets

What skills do humans need to become robot proof in the age of AI?

Alumna, author and machine learning expert Vivienne Ming explains why the best defense against AI's downsides is investing in human skills—and using the technology inquisitively, not passively.

Apr 14, 2026

Machine learning & AI

When AI seems to know you better than you know yourself

I was at my clinic the other day and asked an AI assistant about the differential diagnosis of a rash in a child. A routine question. The response came back clear and sensible. And then it added, "Are you asking about one ...

Apr 13, 2026
