Page 24: Research news on Large language models

Large language models are high-capacity neural sequence models trained on massive text and multimodal corpora to perform language understanding, generation, and reasoning. Current work examines their internal representations, cognitive and social behavior analogies to humans, and limitations in mathematical, causal, and strategic reasoning. Research also addresses alignment with human values and brain activity, safety and security vulnerabilities, privacy and de-anonymization risks, cross-lingual and sociocultural biases, scaling and efficiency laws, and frameworks for tool use, multi-agent interaction, and domain-specific deployment.

Machine learning & AI

Why AI can't take over creative writing

In 1948, the founder of information theory, Claude Shannon, proposed modeling language in terms of the probability of the next word in a sentence given the previous words. These types of probabilistic language models were ...

Business

Researchers teach LLMs to solve complex planning challenges

Imagine a coffee company trying to optimize its supply chain. The company sources beans from three suppliers, roasts them at two facilities into either dark or light coffee, and then ships the roasted coffee to three retail ...

Machine learning & AI

This AI model is more certain about uncertainty

Artificial intelligence (AI) plays a role in virtually every aspect of our lives, from self-driving cars to smart vacuum cleaners, to computer models that can predict the course of an epidemic. No matter how advanced these ...

Machine learning & AI

Generative AI rivals racing to the future

Since ChatGPT burst onto the scene in late 2022, generative artificial intelligence (GenAI) models have been vying for the lead—with the US and China hotbeds for the technology.

page 24 from 24