Page 5 - AI alignment — Research News & Scientific Publications

AI alignment examines how artificial systems acquire, represent, and act on goals, values, and social norms, and why their behavior often diverges from human expectations. Work in this area studies systematic failures such as bias, sycophancy, hallucinations, deceptive or selfish reasoning, and cultural or linguistic inequities, as well as limitations in commonsense, emotion, and social understanding. It also develops methods for preference learning, norm-following, interpretability, and reliability guarantees to better align AI behavior with human values and societal constraints.

Page 5: Research news on AI alignment

AI models can fake visual understanding of images that don't exist

Mythos AI alarm bells: Fair warning or marketing hype?

From 'BuddhaBot' to $1.99 chats with AI Jesus, the faith-based tech boom is here

Explainability is a must for older adults to trust AI, study shows

AI companions can comfort lonely users but may deepen distress over time

'More is Different': Research shows scale alone does not explain AI's power—specialization and cooperation do

New research could empower people without AI expertise to help create trustworthy AI applications

New AI testing method flags fairness risks in autonomous systems

Fair decisions, clear reasons: Creating fuzzy AI with fairness built in from the start

'Moltbook' risks: The dangers of AI-to-AI interactions in health care

Get the latest tech news for free