Page 8: Research news on AI alignment

AI alignment examines how artificial systems acquire, represent, and act on goals, values, and social norms, and why their behavior often diverges from human expectations. Work in this area studies systematic failures such as bias, sycophancy, hallucinations, deceptive or selfish reasoning, and cultural or linguistic inequities, as well as limitations in commonsense, emotion, and social understanding. It also develops methods for preference learning, norm-following, interpretability, and reliability guarantees to better align AI behavior with human values and societal constraints.

Consumer & Gadgets

Dear AI, I'm autistic; should I go to this party?

When people ask ChatGPT and other AI models for advice, they often share deeply personal details in hopes of getting better answers: their age, their gender, their mental health history, even medical diagnoses like autism. ...

Machine learning & AI

Can Europe create AI that we actually understand?

Artificial intelligence is becoming increasingly important in nearly every aspect of society, but is completely dominated by the United States and China. Leaving the field to foreign powers and large companies may entail ...

Machine learning & AI

When AI seems to know you better than you know yourself

I was at my clinic the other day and asked an AI assistant about the differential diagnosis of a rash in a child. A routine question. The response came back clear and sensible. And then it added, "Are you asking about one ...

Machine learning & AI

Mythos AI alarm bells: Fair warning or marketing hype?

Anthropic postponing the release of its new AI model Claude Mythos, said to be so skilled at coding it could be a wicked weapon for hackers, has encountered a mix of alarm and skepticism.

Consumer & Gadgets

Explainability is a must for older adults to trust AI, study shows

Voice-activated, conversational artificial intelligence (AI) agents must provide clear explanations for their suggestions, or older adults aren't likely to trust them. That's one of the main findings from a study by AI Caring ...

page 8 from 40