SeriesFusion
Science, curated & edited by AI

AI & Machine Learning

2,371 papers  ·  Page 27 of 48

Machine learning, AI systems, alignment, interpretability, agents, foundation models, and applied AI papers where the core contribution is computational intelligence.

Efficiency Breakthrough
Generates 2-minute 480p videos on a single H200 GPU through a hierarchical KV-cache strategy that compresses context by 32x.
Mar 27
Paradigm Shift
Introduces the concept of a 'trainable' knowledge base for RAG that improves performance by distilling and writing back compact knowledge units.
Mar 27
Efficiency Breakthrough
Enables 4K novel view synthesis in a feed-forward manner by decoupling geometric complexity from rendering resolution.
Mar 27
New Capability
A training-free decoding framework that mitigates multimodal hallucinations by re-ranking tokens based on spatial attention entropy.
Mar 27
Efficiency Breakthrough
Demonstrates that general-purpose coding agents can achieve 20x speedups in hardware design optimization without domain-specific training.
Mar 27
New Capability
Introduces a 'Hybrid Memory' architecture that maintains the identity and motion of dynamic subjects even when they hide out of view.
Mar 27
Breaks Assumption
Achieves state-of-the-art compositionality in vision-language models without the need for hard negative mining or degrading zero-shot performance.
Mar 27
Paradigm Shift
Uses cycle-consistency as a label-free reward signal for reinforcement learning to resolve contradictions in multimodal reasoning.
Mar 27
Efficiency Breakthrough
A training-free enhancement that unlocks multi-scale synergies in Vision Foundation Models (VFMs) to boost performance across various tasks.
Mar 27
Practical Magic
Researchers are making satellites into high-security vaults in space that are literally impossible to hack from down here on Earth.
Mar 26
Paradigm Challenge
For 30 years, we didn't know the absolute limit of how much a machine can learn. Someone just finally cracked the code.
Mar 26
Practical Magic
Forget metal antennas—scientists just built a 'quantum radio' using a cloud of atoms that works way better.
Mar 26
Practical Magic
Engineers figured out how to make radio waves literally swerve around people trying to eavesdrop on your signal.
Mar 26
Paradigm Challenge
Weirdly enough, AI trained on 'fake' data is actually better at predicting real pandemics than AI trained on actual history.
Mar 26
Breaks Assumption
Frontier models like GPT-5.2 and Claude 4.5 suffer from 'Internal Safety Collapse' where safety alignment fails completely if a task's success necessitates harmful output.
Mar 26
Open Release
Berta is an open-source, production-proven AI clinical scribe that reduces operating costs by up to 95% compared to commercial alternatives.
Mar 26
Efficiency Breakthrough
Memory Sparse Attention (MSA) enables LLMs to scale to 100 million tokens with linear complexity and less than 9% precision degradation.
Mar 26
Breaks Assumption
Prompt compression can paradoxically increase total energy consumption and cost by over 2000% due to aggressive model 'output expansion'.
Mar 26
Scaling Insight
Synthetic Mixed Training allows an 8B model to finally outperform RAG on long-document comprehension by combining synthetic QAs with rewritten documents.
Mar 26
Paradigm Shift
Logical reasoning in LLMs is causally linked to 'algebraic divergence' in the residual stream, and failure to achieve this geometry explains sycophancy.
Mar 26
Paradigm Shift
Environment Maps nearly double the success rate of long-horizon agents by replacing session-bound context with a persistent, structured graph representation.
Mar 26
Paradigm Shift
A statistical physics framework that predicts the fundamental limits of agentic self-improvement and nested LLM architectures.
Mar 26
New Capability
Inference-time 'steering' of Code LLMs allows for precise control over programming languages and libraries without prompting or fine-tuning.
Mar 26
Efficiency Breakthrough
The first sorting-free stochastic formulation for 3D Gaussian Splatting that matches rasterization speed while enabling full ray-traced effects.
Mar 26
Paradigm Shift
Bio-inspired visual servoing that achieves low-latency robotic control by processing event-stream flux directly, bypassing traditional state estimation.
Mar 26
Breaks Assumption
Training-free Out-of-Distribution (OOD) detection that beats state-of-the-art by aggregating features across intermediate network layers.
Mar 26
Scaling Insight
Newer LLM architectures like MoE and SSMs are making 'early-exit' decoding significantly less effective than in previous generations.
Mar 26
Efficiency Breakthrough
AI agent benchmarks can be slashed by ~50% in cost by only evaluating on tasks with intermediate historical pass rates.
Mar 26
New Capability
A universal 'one-shot' medical anomaly detector that outperforms specialized models across nine different datasets.
Mar 26
Breaks Assumption
Grokking is not the discovery of a new algorithm, but the sharpening of one already latent in the model during the memorization phase.
Mar 26
Scaling Insight
Diffusion models can be proven to generalize by capturing manifold geometry long before they achieve density estimation or memorization.
Mar 26
New Capability
Sparse Autoencoders (SAEs) can successfully decompose opaque medical vision foundation model embeddings into human-interpretable clinical concepts.
Mar 26
Paradigm Shift
A massive empirical study of 177,000 tools reveals a rapid shift in the AI agent ecosystem from 'perception' to 'action' (27% to 65% usage).
Mar 26
Paradigm Shift
A simple perturbation method reveals that representations are not just activation patterns, but conduits that determine how learning 'infects' similar examples.
Mar 26
Paradigm Shift
LLMs can solve planning problems with state spaces as large as 10^165 by acting as program generators rather than direct planners.
Mar 26
New Capability
Symbolic-KANs bridge the gap between scalable deep learning and interpretable symbolic regression by embedding discrete library primitives directly into the network.
Mar 26
Breaks Assumption
Transformer hallucinations in high-stakes legal tasks are deterministic failures driven by calculable internal state thresholds rather than random 'glitches'.
Mar 26
New Capability
An 'invariant compiler' uses LLMs to translate physics requirements into Neural ODE architectures that satisfy conservation laws by construction.
Mar 26
Efficiency Breakthrough
Hybrid Distillation Policy Optimization (HDPO) overcomes the 'vanishing gradient' problem for hard mathematical prompts that RL agents cannot solve.
Mar 26
Open Release
BioVITA releases a massive multimodal biological dataset of 3.6M image-audio-text samples covering 14,000 species.
Mar 26
Efficiency Breakthrough
A self-distillation method for Multi-Token Prediction (MTP) that yields a 220% inference speedup with minimal training cost.
Mar 26
Efficiency Breakthrough
AttentionPack achieves up to 8x memory efficiency during decoding for large vision-language models (VLMs).
Mar 26
New Capability
POISE demonstrates the first autonomous, evidence-driven discovery of improved policy optimization algorithms for LLMs.
Mar 26
Breaks Assumption
Listed API prices for reasoning models (RLMs) are shown to be highly misleading, with cheaper models often costing 28x more in practice.
Mar 26
Efficiency Breakthrough
SLAT-Phys predicts spatially varying material property fields directly from single RGB images with a 120x speedup.
Mar 26
Paradigm Shift
LLM-generated summaries can produce patient embeddings that are more 'portable' and robust to hospital distribution shifts than specialized clinical models.
Mar 26
Breaks Assumption
A systematic critique explaining why 'self-improving' generative optimization loops fail in production and how to fix them.
Mar 26
New Capability
SDZE enables the training of 10-million-dimensional Physics-Informed Neural Networks (PINNs) on a single GPU.
Mar 26
Efficiency Breakthrough
Reduces Text-to-SQL input tokens by 99% by internalizing the database schema into the model weights through a two-phase fine-tuning approach.
Mar 26
New Capability
Solves the 'vanishing gradient' problem in 3D Gaussian Splatting (3DGS) tracking by optimizing in the frequency domain using spectral moments.
Mar 26