Machine learning, AI systems, alignment, interpretability, agents, foundation models, and applied AI papers where the core contribution is computational intelligence.
Efficiency Breakthrough
ImplicitRM enables unbiased reward modeling from 'messy' implicit feedback (clicks/copies), drastically reducing the cost of RLHF data collection.
Efficiency Breakthrough
Introduces custom CUDA kernels and a sparse packing format that enable Transformers to maintain performance with over 99% feedforward sparsity.
Paradigm Shift
Enables 3D medical image segmentation pre-training using only mathematical formulas and implicit functions, requiring zero real-world data or expert annotations.
New Capability
Develops a collaborative memory framework that distills agent-agnostic reasoning trajectories, allowing different LLMs to share a single memory system.
New Capability
Identifies functionally complete safety circuits in LLMs via differentiable binary masks, allowing for near-surgical removal of backdoors and jailbreaks.
New Capability
Uses Sparse Autoencoders (SAEs) to identify and steer cultural representations in LLMs, eliciting rare cultural concepts that prompting alone misses.
Efficiency Breakthrough
Upgrades video Diffusion Transformers to ultra-high-resolution synthesis using a two-stage 'Relay LoRA' adaptation on pure images.
Paradigm Shift
A dual-path architecture that combines speculative speech-to-speech prefixes with cascaded LLM continuations for zero-latency, high-quality dialogue.
Efficiency Breakthrough
Challenges the dominance of on-policy RL for LLMs by introducing a practical off-policy value-based framework that enables data reuse.
Paradigm Shift
A biology-native transformer architecture that mirrors cellular transcription and translation, enabling interpretable predictions across DNA, RNA, and protein.
New Capability
A unified framework that decomposes monolithic 3D meshes into 'sim-ready' interactive articulated assets using a sparse 3D VQ-VAE.
Breaks Assumption
Exposes 'shortcut learning' in differentiable simulators where models non-causally exploit future information to 'regret' past mistakes rather than learning to recover.
New Capability
A generative framework for graphs that closes the fidelity gap between energy-based models and discrete diffusion.
Paradigm Shift
Introduces a 'geospatial model foundry' that learns unified representations from the weights of existing models rather than raw data.
Efficiency Breakthrough
An online length-aware scheduling strategy that eliminates training 'bubbles' during the rollout phase of LLM reinforcement learning.
New Capability
A bilevel framework where an outer LLM loop meta-optimizes an inner autoresearch loop by autonomously generating and injecting Python code at runtime.
New Capability
Integrates tactile perception into video-action models to enable high-fidelity force modulation in contact-rich robotic tasks.
Paradigm Shift
Enables training of monocular novel-view synthesis models using entirely unpaired, in-the-wild internet images.
Efficiency Breakthrough
Leverages human gaze tracking to assign non-uniform token density in diffusion models, creating perceptually perfect images with significantly less compute.
Efficiency Breakthrough
Replaces visual token compression with sparse, dynamically selected vision-language interactions in VLLMs.
New Capability
A unified reinforcement learning framework that jointly optimizes reasoning (text) and synthesis (image) for interleaved multimodal generation.
Efficiency Breakthrough
Introduces on-the-fly quantization that calibrates to individual prompts during inference, solving the 'domain shift' problem where standard quantization fails on unseen data.
Paradigm Shift
Provides a statistically rigorous framework to evaluate model performance and reliability after cherry-picking or selecting models based on the same test data.
New Capability
Develops a differentially private RLHF pipeline that decouples private reward learning from policy optimization, achieving strong alignment on Gemma-2B-IT with privacy guarantees.
Paradigm Challenge
AI is actually the most confident when it's completely making stuff up.
Practical Magic
Future phones might have 'liquid' antennas that literally swim around inside the device to hunt down a better signal.
Paradigm Challenge
A massive study found women do way more innovative science than men, but they still get robbed when it's time for the credit.
Practical Magic
Scientists found a way to make a basic home computer screw up math exactly like a super-expensive AI chip does.
Paradigm Challenge
A core rule of tech just got an update, and it turns out those fancy AI chips might eventually be totally useless.
Practical Magic
New 360-degree video treats things on screen like they have gravity, just so it can predict exactly where you're gonna look next.
Practical Magic
Your future phone might have antennas that physically slide along tracks to 'pinch' the best Wi-Fi signal possible.
Paradigm Challenge
An AI just 'figured out' how to lock down its own code using high-level math without a human ever telling it how.
Nature Is Weird
Engineers built 'invisible' backdoors into computer chips that are so well-hidden, even the most powerful microscopes can't find them.
Nature Is Weird
Scientists found one single math formula that explains why everything from stock market crashes to earthquakes actually happens.
Practical Magic
Researchers built an AI sensor that 'thinks' using light ripples, letting it spot objects in about 25 billionths of a second.
Paradigm Challenge
Researchers found one 'master' math trick that can recreate every single function on your old scientific calculator.
Nature Is Weird
There's a new AI that can tell you an animal's whole lifestyle and what it looks like just by listening to it make a sound.
Practical Magic
A new voting system lets you check if a national election was legit using just basic math and zero computers.
Practical Magic
New math can spot life-threatening internal bleeding in patients before doctors can even see it.
Paradigm Challenge
Those single scores we use to rank people on things like intelligence might actually be mathematical illusions.
Practical Magic
AI can now map out the secret relationships between terrorist groups that they try to keep hidden.
Efficiency Breakthrough
Achieves over 10x faster sampling for diffusion language models by shifting the process into continuous semantic space.
Efficiency Breakthrough
Integrates fast scalar rewards with slow generative CoT reasoning to reduce reward model token consumption by 20%.
Efficiency Breakthrough
Enables precise prompt routing by predicting the expected reward of a model before any response is generated.
Paradigm Shift
Introduces a training strategy where Transformers 'think' in latent space before committing to discrete tokens.
New Capability
Composes pre-trained unimanual robotic policies into complex bimanual tasks without requiring bimanual demonstration data.
New Capability
Sets a new state of the art for intracortical speech decoding with a 14.3% phoneme error rate using a multitask Transformer.
Breaks Assumption
Proves mathematically that AI text detectors face structural limits that will always result in false positives against diverse student populations.
Paradigm Shift
The first foundation model for zero-shot prediction of joint probability distributions in coupled time series.
Efficiency Breakthrough
Reduces Tree of Thought (ToT) computational overhead by up to 75% using plug-and-play predictors for pruning.