Page 8: Research news on Machine learning methodologies

Machine learning methodologies encompass algorithmic frameworks and architectures for training, optimizing, and deploying models such as neural networks, transformers, diffusion models, and reinforcement learning agents. Work in this area develops new training objectives, curriculum schemes, speculative and efficient decoding, pruning and communication-reduction strategies, and biologically inspired or physics-informed architectures. The domain also includes safety preservation, unlearning, scaling laws, and specialized methods for vision, language, control, and scientific computing, aiming to improve performance, efficiency, robustness, and controllability of complex AI systems.

Computer Sciences

The AI that taught itself: How AI can learn what it never knew

For years, the guiding assumption of artificial intelligence has been simple: an AI is only as good as the data it has seen. Feed it more, train it longer, and it performs better. Feed it less, and it stumbles. A new study ...

Computer Sciences

Deep AI training gets more stable by predicting its own errors

Artificial intelligence now plays Go, paints pictures, and even converses like a human. However, there remains a decisive difference: AI requires far more electricity than the human brain to operate. Scientists have long ...

Computer Sciences

Improving AI models' ability to explain their predictions

In high-stakes settings like medical diagnostics, users often want to know what led a computer vision model to make a certain prediction, so they can determine whether to trust its output. Concept bottleneck modeling is one ...

Machine learning & AI

Is this your AI? ZEN framework cracks AI black box

Artificial intelligence (AI) systems power everything from chatbots to security cameras, yet many of the most advanced models operate as "black boxes." Companies can use them, but outsiders can't see how they were built, ...

Texas at heart of Amazon's AI push in United States

Tech titan Amazon is working to step out of Nvidia's shadow with custom "Trainium" chips designed specially for machine learning as billions of dollars are poured into artificial intelligence (AI).

Computer Sciences

Adaptive drafter model uses downtime to double LLM training speed

Reasoning large language models (LLMs) are designed to solve complex problems by breaking them down into a series of smaller steps. These powerful models are particularly good at challenging tasks like advanced programming ...

page 8 from 31