Research news on Machine learning methodologies

Machine learning methodologies encompass algorithmic frameworks and architectures for training, optimizing, and deploying models such as neural networks, transformers, diffusion models, and reinforcement learning agents. Work in this area develops new training objectives, curriculum schemes, speculative and efficient decoding, pruning and communication-reduction strategies, and biologically inspired or physics-informed architectures. The domain also includes safety preservation, unlearning, scaling laws, and specialized methods for vision, language, control, and scientific computing, aiming to improve performance, efficiency, robustness, and controllability of complex AI systems.

Computer Sciences

What flocking birds can teach AI about reducing noise

Among the primary concerns surrounding artificial intelligence is its tendency to yield erroneous information when summarizing long documents. These "hallucinations" are problematic not only because they convey falsehoods, ...

Consumer & Gadgets

New deep learning framework solves the cold-start problem

Recommender systems suggest potentially relevant content by evaluating user preferences and are essential in reducing information overload. However, when users join a new online platform, recommendation systems often struggle ...

Computer Sciences

The AI that taught itself: How AI can learn what it never knew

For years, the guiding assumption of artificial intelligence has been simple: an AI is only as good as the data it has seen. Feed it more, train it longer, and it performs better. Feed it less, and it stumbles. A new study ...

Computer Sciences

Deep AI training gets more stable by predicting its own errors

Artificial intelligence now plays Go, paints pictures, and even converses like a human. However, there remains a decisive difference: AI requires far more electricity than the human brain to operate. Scientists have long ...

Computer Sciences

Improving AI models' ability to explain their predictions

In high-stakes settings like medical diagnostics, users often want to know what led a computer vision model to make a certain prediction, so they can determine whether to trust its output. Concept bottleneck modeling is one ...

page 1 from 25