SeriesFusion
Science, curated & edited by AI

Paradigm Shift

329 papers  ·  Page 5 of 7
AI
A geometric fix for Rotary Positional Embeddings (RoPE) allows Transformers to generalize to long inputs out-of-the-box by preserving 'sink token' functionality.
Mar 20
AI
A synthesizable RTL implementation of Predictive Coding allows for fully distributed, non-backprop learning directly in hardware.
Mar 20
AI
Dynamic constraints using an 'online refiner' resolve the conflict between stability and performance in Reinforcement Learning Fine-Tuning (RFT).
Mar 20
AI
Uses Pearl's do-operator to automatically discover and mask irrelevant state dimensions in Reinforcement Learning.
Mar 20
AI
Fine-tunes Vision-Language Models using raw images alone by using a text-to-image model as a cycle-consistency reward.
Mar 20
AI
PowerFlow uses GFlowNets to replace heuristic rewards in unsupervised fine-tuning, allowing practitioners to explicitly tune models for either logic or creativity.
Mar 20
AI
AS2 achieves a fully differentiable neuro-symbolic bridge by replacing discrete solvers with a soft, continuous approximation of the Answer Set Programming operator.
Mar 20
AI
Standard decoding strategies (top-k, nucleus) create a 'truncation blind spot' by systematically excluding human-like, low-probability token choices.
Mar 20
AI
SINDy-KANs combine Kolmogorov-Arnold Networks with Sparse Identification of Non-linear Dynamics to create parsimonious, interpretable models.
Mar 20
AI
REST transforms the zero-shot object-navigation problem from simple waypoint selection to a tree-of-paths reasoning process.
Mar 20
AI
A linear-time attention mechanism that is weight-compatible with standard pretrained Transformers, allowing for direct knowledge transfer.
Mar 20
AI
A system where agents autonomously design, refine, and store task-specific skills as 'stateful prompts' to achieve non-parametric continual learning.
Mar 20
AI
Shifts concept unlearning in diffusion models from fragile keyword-based removal to a distributional framework using contextually diverse prompts.
Mar 20
AI
Eliminates the need for expensive process reward models by propagating terminal rewards across state-space graphs to generate dense, state-level rewards for agentic RL.
Mar 20
AI
Introduces 'intentional interventions' and Structural Final Models (SFMs) to detect and infer agent goals within causal frameworks.
Mar 20
AI
Uses Sparse Autoencoders (SAEs) to disentangle and modulate bias-relevant features in Vision-Language Models without retraining.
Mar 20
AI
Incorporates the physics of forward dynamics directly into a GNN architecture for articulated robot control.
Mar 20
AI
Argues that standard ML efficiency metrics (FLOPs, throughput) are poorly correlated with actual robot performance in Vision-Language-Action (VLA) models.
Mar 20
AI
Reframes GPU kernel optimization by benchmarking against hardware 'Speed-of-Light' limits rather than software baselines.
Mar 20
AI
Repurposes pre-trained video diffusion models as 'Latent World Simulators' to give Multimodal LLMs 3D spatial awareness without explicit 3D data.
Mar 20
AI
Introduces Capability-Priced Micro-Markets (CPMM), a micro-economic framework for autonomous AI agent transactions over HTTP 402.
Mar 19
AI
Proposes Modulated Hazard-aware Policy Optimization (MHPO) to solve the instability and mode collapse common in GRPO-based reinforcement learning.
Mar 19
AI
Mathematically proves that the Transformer architecture is functionally equivalent to a Bayesian Network performing loopy belief propagation.
Mar 19
AI
Achieves high-performance online continual learning without the massive memory overhead of traditional experience replay buffers.
Mar 19
AI
A formal, graph-native memory architecture that treats agent memory as a versioned asset, dramatically outperforming Gemini 2.5 Pro on complex recall.
Mar 19
AI
Shifts retrieval from static contrastive vector alignment to dynamic reasoning trajectories using a generative model (T1) and GRPO.
Mar 19
AI
Provides a sheaf-theoretic proof that local causal consistency in generative models does not guarantee global counterfactual coherence.
Mar 19
AI
Unifies large-scale search, recommendation, and reasoning into a single self-contained LLM by treating item IDs as a distinct modality.
Mar 19
AI
Edit-As-Act reframes 3D scene editing as a goal-regressive planning problem using symbolic action languages rather than purely generative pixel manipulation.
Mar 19
AI
A new self-refining surrogate framework enables neural models to simulate complex dynamical systems over arbitrarily long horizons without the standard failure mode of compounding error.
Mar 19
AI
The 'consensus trap' in label-free RL—where models reinforce their own systematic errors—can be broken by co-evolving the model in alternating generator and verifier roles.
Mar 19
AI
LLMs compute and cache confidence scores automatically during answer generation, well before they are prompted to verbalize them.
Mar 19
AI
Measuring the distance between human languages can now be done quantitatively using the attention mechanisms of multilingual transformers.
Mar 19
AI
AgentFactory shifts agent evolution from unreliable textual 'reflections' to a library of verifiable, executable Python subagents.
Mar 19
AI
DAPS++ reinterprets diffusion inverse problems as a decoupled EM-style initialization, significantly increasing restoration speed and stability.
Mar 19
AI
Alternating Reinforcement Learning with Rubric Rewards (ARL-RR) replaces brittle scalar reward aggregation with a semantic meta-class optimization framework.
Mar 18
AI
Atlas introduces 'Compiled Memory,' which rewrites an agent's system prompt with distilled task experience rather than using RAG or fine-tuning.
Mar 18
AI
Transition Flow Matching learns a global transition flow rather than local velocity fields, enabling single-step generation and transfer to arbitrary future time points.
Mar 18
AI
Simulation Distillation (SimDist) enables rapid sim-to-real adaptation by transferring reward and value models directly into a latent world model.
Mar 18
AI
Introduces a privacy-preserving ML framework that achieves strong non-invertibility without the utility loss of Differential Privacy or the cost of Homomorphic Encryption.
Mar 18
AI
Analyses over 10,000 experiments to prove that LLM agents are capable of genuine architectural discovery rather than just hyperparameter tuning.
Mar 18
AI
Introduces per-token adapter routing, allowing a single sequence to dynamically utilize multiple specialized LoRA experts.
Mar 18
AI
Finds that filtering knowledge at 'write-time' (ingestion) maintains 100% RAG accuracy under noise levels where standard 'read-time' filtering completely collapses.
Mar 18
AI
Proposes a protocol that replaces complex multi-agent coding frameworks with a simple, interpretable filesystem structure.
Mar 18
AI
Establishes a duality between sequence-axis attention and depth-wise residual connections, treating layer depth as an ordered variable.
Mar 18
AI
Proves that compositional generalization failure in neural networks is an architectural issue and provides a category-theoretic framework to fix it.
Mar 18
AI
Formulates Hierarchical Instruction Following as a Constrained Markov Decision Process to ensure LLMs prioritize system prompts over user instructions.
Mar 18
AI
Introduces modular, composable safety alignment via learnable control tokens rather than static parameter-level tuning.
Mar 18
AI
Decouples perceptual failures from logical errors in Vision-Language reward models to enable more reliable test-time scaling.
Mar 18
AI
Researchers identified a 'critique vector' in the latent space of Large Reasoning Models that can be steered to improve self-correction and test-time scaling.
Mar 18