SeriesFusion
Science, curated & edited by AI

AI & Machine Learning

2,371 papers  ·  Page 25 of 48

Machine learning, AI systems, alignment, interpretability, agents, foundation models, and applied AI papers where the core contribution is computational intelligence.

Breaks Assumption
Proves that image denoisers can be strictly contractive (robust to noise) without sacrificing state-of-the-art restoration quality.
Mar 30
Paradigm Shift
Empirically proves that AI Scientist agents can genuinely learn from physical experimental feedback via in-context learning.
Mar 30
New Capability
Moves coding agents from passive execution to proactive collaboration by teaching them when to ask for clarification on underspecified tasks.
Mar 30
New Capability
Provides mechanistic evidence that LLMs internalize 'vibes' (informal registers like slang) as language-agnostic abstractions that can be causally steered.
Mar 30
New Capability
Enables GUI agents to overcome domain bias by autonomously 'watching' web tutorial videos to learn specific software workflows without retraining.
Mar 30
New Capability
Introduces a label-free, output-agnostic method for merging LoRA modules across heterogeneous tasks like classification and regression.
Mar 30
Paradigm Shift
Replaces standard autoregressive action generation in robot VLAs with iterative refinement via discrete flow matching.
Mar 30
Breaks Assumption
Reveals that spatial reasoning in LLMs is not driven by robust internal world models, but by fragmented and transient representations.
Mar 30
New Capability
Enables verification of claimed text-to-image models through boundary-aware prompts that trigger model-specific instability.
Mar 30
Breaks Assumption
Identifies that the 'reasoning tax' in vision-language fine-tuning is caused by lost access to depth-wise representations and fixes it with a lightweight adapter.
Mar 30
New Capability
Boosts multimodal reasoning by teaching models to autonomously verify their own long-form generations against image evidence using information gain.
Mar 30
Efficiency Breakthrough
Achieves 16x prefill speedup for video models by using reinforcement learning to dynamically compress visual tokens based on temporal 'surprise'.
Mar 30
Scaling Insight
An 800 Hz data glove reveals that human hand dexterity contains critical high-frequency motion energy (>100 Hz) previously invisible to standard sensors.
Mar 30
Breaks Assumption
Reveals that reasoning models frequently acknowledge misleading hints in their 'thinking' tokens but hide that influence in their final visible answers.
Mar 30
Efficiency Breakthrough
Demonstrates real-world robotic navigation policy training and deployment in under 120 minutes using only a consumer laptop and no human intervention.
Mar 30
New Capability
Enables high-quality, spatio-temporally consistent 4D reconstruction using sparse, uncalibrated camera inputs instead of expensive synchronized arrays.
Mar 30
New Capability
Architects an autonomous AI research agent that significantly surpasses previous benchmarks by utilizing asynchronous multi-GPU scaling and a hidden consistent evaluation protocol.
Mar 30
Paradigm Shift
Introduces a multi-agent CAD generation pipeline that uses programmatic geometric validation from the OpenCASCADE kernel to iteratively fix dimensional errors.
Mar 30
Paradigm Shift
Introduces Process-Aware Policy Optimization (PAPO) to solve the chronic issue of reward hacking in process reward models (PRMs).
Mar 30
Scaling Insight
Provides the first sharp theoretical characterization of why spectral optimizers like Muon drastically outperform SGD in storage capacity and scaling for language models.
Mar 30
Paradigm Shift
Demonstrates that perplexity/log-likelihood is a deceptive metric for model distillation, often masking massive drops in actual generation quality.
Mar 30
Efficiency Breakthrough
Turns pretrained video diffusion models into high-efficiency codecs, achieving high-quality reconstruction at extremely low bitrates (below 0.002 bpp) without retraining.
Mar 30
Breaks Assumption
Identifies a structural 'affordance gap' in Vision-Language Models, proving they fail at embodied scene understanding regardless of scale or prompt engineering.
Mar 30
Open Release
Releases Ruka-v2, a fully open-source, 13-DOF tendon-driven humanoid hand with wrist and finger abduction buildable for under $1,300.
Mar 30
Paradigm Shift
Shifts 3D scene generation from diffusion to a fully autoregressive paradigm using next-token prediction of 3D Gaussian primitives.
Mar 30
Breaks Assumption
Proves that weight tying—a standard LLM efficiency trick—biases embeddings toward output prediction and actively harms early-layer input representations.
Mar 30
Paradigm Shift
Proposes a universal denoiser that outperforms the Bayes-optimal Tweedie's formula when the noise distribution is unknown.
Mar 30
Scaling Insight
Proves that causal representation learning is possible with far fewer environments and unknown intervention targets than previously assumed.
Mar 30
New Capability
A model-agnostic framework that uses synthetic sampling to provide statistically valid uncertainty quantification and hallucination detection for multimodal models.
Mar 30
Nature Is Weird
We just built a computer chip that acts like a human brain, but it processes info 10,000 times faster than the one in your head.
Mar 27
Practical Magic
Scientists are fixing city-wide traffic jams by treating every car like a quantum particle that can take every possible route at the exact same time.
Mar 27
Practical Magic
The same software tricks that let massive video games like World of Warcraft handle thousands of players at once are now being used to design spaceships.
Mar 27
Paradigm Shift
Shifts AI evaluation from static benchmarks to interactive agentic environments requiring fluid adaptation.
Mar 27
New Capability
Moves medical AI from simplified 2D image classification to agents navigating full 3D clinical studies.
Mar 27
New Capability
Enables semantically precise model editing directly in the weight space without any training data.
Mar 27
Efficiency Breakthrough
Achieves 6x compute reduction in Multimodal LLMs while actually improving accuracy by 2%.
Mar 27
Efficiency Breakthrough
Reconstructs entire Spiking Neural Networks into a single neuron via temporal multiplexing.
Mar 27
Breaks Assumption
Formalizes random cropping as a source of differential privacy, offering 'free' privacy amplification.
Mar 27
New Capability
Estimates lab-grade 3D musculoskeletal forces from a single smartphone video.
Mar 27
Paradigm Shift
Provides the first formal proof and verification framework for agent-tool integration protocols.
Mar 27
Paradigm Shift
Demonstrates that visual hierarchies require Lorentzian causal structure rather than Euclidean space.
Mar 27
Paradigm Shift
Proves that Transformers can internalize complex search algorithms like MCTS directly into their weights.
Mar 27
Efficiency Breakthrough
Introduces a stable backpropagation-free training framework for physical and photonic neural networks.
Mar 27
Efficiency Breakthrough
Achieves state-of-the-art vision-language pretraining using 300x less data than leading methods.
Mar 27
Efficiency Breakthrough
Enables 10x faster robot trajectory generation by distilling diffusion models into movement primitives.
Mar 27
Scaling Insight
Reveals that synthetic rewriting is a quality multiplier for high-grade data, but fails to fix low-quality source data.
Mar 27
Breaks Assumption
Proves that stereo matching can reach state-of-the-art performance without the computationally heavy cost volumes used by almost all modern methods.
Mar 27
Efficiency Breakthrough
Speeds up RL-based reasoning training by 1.7x using an online quality head to prune failing rollouts mid-generation.
Mar 27
Paradigm Shift
Introduces a multi-answer RL objective that trains models to represent a distribution of valid answers in a single forward pass.
Mar 27
Breaks Assumption
Proves platform-determinism is necessary for trustworthy AI and implements an integer-only engine for bitwise identical inference across ARM and x86.
Mar 27