Research news on AI alignment

AI alignment examines how artificial systems acquire, represent, and act on goals, values, and social norms, and why their behavior often diverges from human expectations. Work in this area studies systematic failures such as bias, sycophancy, hallucinations, deceptive or selfish reasoning, and cultural or linguistic inequities, as well as limitations in commonsense, emotion, and social understanding. It also develops methods for preference learning, norm-following, interpretability, and reliability guarantees to better align AI behavior with human values and societal constraints.

Computer Sciences

Can AI quantify beauty? New study suggests it can't

Attempts to define human beauty using artificial intelligence may reveal more about bias in data than universal standards, according to a new analysis from the University of Virginia's School of Data Science. Using computer ...

Internet

How AI bias can creep into online content moderation

A University of Queensland study has shown large language models (LLMs) used in AI content moderation may be prone to subtle biases that undermine their neutrality. A team led by data scientist Professor Gianluca Demartini ...

Machine learning & AI

AI works best with humans—not instead of them

A new academic study says the most effective use of artificial intelligence may be to strengthen human thinking and decision-making, rather than replace it. Published in the Journal of Knowledge Management, the paper examines ...

Machine learning & AI

Anthropic says it will put AI risks 'on the table' with Mythos model

American AI developer Anthropic plans to "lay the risks out on the table" even as it restricts deployment of a new model dubbed Mythos, whose powerful cybersecurity capabilities raise stark questions for companies and governments.

Machine learning & AI

Unpredictable AGI may resist full control, making diverse AI safer

Public concern about AI safety has grown significantly in recent years. As AI systems become more powerful, a key question is how we make sure they do what we actually want. Now, researchers suggest that rather than trying ...
