PARADIGM_SHIFT PARADIGM_SHIFT
329 papers · Page 2 of 4
Unifies large-scale search, recommendation, and reasoning into a single self-contained LLM by treating item IDs as a distinct modality.
AI & ML arxiv | Mar 19
Edit-As-Act reframes 3D scene editing as a goal-regressive planning problem using symbolic action languages rather than purely generative pixel manipulation.
AI & ML arxiv | Mar 19
A new self-refining surrogate framework enables neural models to simulate complex dynamical systems over arbitrarily long horizons without the standard failure mode of compounding error.
AI & ML arxiv | Mar 19
The 'consensus trap' in label-free RL—where models reinforce their own systematic errors—can be broken by co-evolving the model in alternating generator and verifier roles.
AI & ML arxiv | Mar 19
LLMs compute and cache confidence scores automatically during answer generation, well before they are prompted to verbalize them.
AI & ML arxiv | Mar 19
Measuring the distance between human languages can now be done quantitatively using the attention mechanisms of multilingual transformers.
AI & ML arxiv | Mar 19
AgentFactory shifts agent evolution from unreliable textual 'reflections' to a library of verifiable, executable Python subagents.
AI & ML arxiv | Mar 19
DAPS++ reinterprets diffusion inverse problems as a decoupled EM-style initialization, significantly increasing restoration speed and stability.
AI & ML arxiv | Mar 19
A geometric fix for Rotary Positional Embeddings (RoPE) allows Transformers to generalize to long inputs out-of-the-box by preserving 'sink token' functionality.
AI & ML arxiv | Mar 20
A synthesizable RTL implementation of Predictive Coding allows for fully distributed, non-backprop learning directly in hardware.
AI & ML arxiv | Mar 20
Dynamic constraints using an 'online refiner' resolve the conflict between stability and performance in Reinforcement Learning Fine-Tuning (RFT).
AI & ML arxiv | Mar 20
Uses Pearl's do-operator to automatically discover and mask irrelevant state dimensions in Reinforcement Learning.
AI & ML arxiv | Mar 20
Fine-tunes Vision-Language Models using raw images alone by using a text-to-image model as a cycle-consistency reward.
AI & ML arxiv | Mar 20
PowerFlow uses GFlowNets to replace heuristic rewards in unsupervised fine-tuning, allowing practitioners to explicitly tune models for either logic or creativity.
AI & ML arxiv | Mar 20
AS2 achieves a fully differentiable neuro-symbolic bridge by replacing discrete solvers with a soft, continuous approximation of the Answer Set Programming operator.
AI & ML arxiv | Mar 20
Standard decoding strategies (top-k, nucleus) create a 'truncation blind spot' by systematically excluding human-like, low-probability token choices.
AI & ML arxiv | Mar 20
SINDy-KANs combine Kolmogorov-Arnold Networks with Sparse Identification of Non-linear Dynamics to create parsimonious, interpretable models.
AI & ML arxiv | Mar 20
REST transforms the zero-shot object-navigation problem from simple waypoint selection to a tree-of-paths reasoning process.
AI & ML arxiv | Mar 20
A linear-time attention mechanism that is weight-compatible with standard pretrained Transformers, allowing for direct knowledge transfer.
AI & ML arxiv | Mar 20
A system where agents autonomously design, refine, and store task-specific skills as 'stateful prompts' to achieve non-parametric continual learning.
AI & ML arxiv | Mar 20
Shifts concept unlearning in diffusion models from fragile keyword-based removal to a distributional framework using contextually diverse prompts.
AI & ML arxiv | Mar 20
Eliminates the need for expensive process reward models by propagating terminal rewards across state-space graphs to generate dense, state-level rewards for agentic RL.
AI & ML arxiv | Mar 20
Introduces 'intentional interventions' and Structural Final Models (SFMs) to detect and infer agent goals within causal frameworks.
AI & ML arxiv | Mar 20
Uses Sparse Autoencoders (SAEs) to disentangle and modulate bias-relevant features in Vision-Language Models without retraining.
AI & ML arxiv | Mar 20
Incorporates the physics of forward dynamics directly into a GNN architecture for articulated robot control.
AI & ML arxiv | Mar 20
Argues that standard ML efficiency metrics (FLOPs, throughput) are poorly correlated with actual robot performance in Vision-Language-Action (VLA) models.
AI & ML arxiv | Mar 20
Reframes GPU kernel optimization by benchmarking against hardware 'Speed-of-Light' limits rather than software baselines.
AI & ML arxiv | Mar 20
Repurposes pre-trained video diffusion models as 'Latent World Simulators' to give Multimodal LLMs 3D spatial awareness without explicit 3D data.
AI & ML arxiv | Mar 20
Introduces a statistical alternative to the standard frequency-based BPE tokenization used in nearly all modern LLMs.
AI & ML arxiv | Mar 23
Formally proves that a causal Transformer is mathematically equivalent to a stateless Differentiable Neural Computer.
AI & ML arxiv | Mar 23
Solves the compositional generalization failure of neural networks (0% to 100% accuracy) by embedding algebraic semiring constraints.
AI & ML arxiv | Mar 23
Challenges the 80-year-old assumption that neurons must use weighted summation as their primary aggregation mechanism.
AI & ML arxiv | Mar 23
Introduces Hyperagents: self-referential systems where the meta-level modification logic is itself an editable program.
AI & ML arxiv | Mar 23
Fine-tunes Large Vision Language Models for medical tasks using only image-description pairs, bypassing the need for expensive expert-curated instructions.
AI & ML arxiv | Mar 23
Formalizes the 'Neural Uncertainty Principle,' linking adversarial vulnerability in vision and hallucinations in LLMs to a shared geometric and information-theoretic origin.
AI & ML arxiv | Mar 23
A massive field study (9,000+ users) proves that algorithmic shifts can reduce affective polarization without sacrificing user engagement.
AI & ML arxiv | Mar 23
Enables zero-shot humanoid robot interaction by generating robot-centric 'dream' videos instead of relying on human-to-robot motion retargeting.
AI & ML arxiv | Mar 23
Replaces fixed context compression ratios with a performance-floor constraint to ensure reliable LLM deployment.
AI & ML arxiv | Mar 23
FIPO overcomes reasoning length stagnation in LLMs by using Future-KL divergence to create dense rewards, extending Chain-of-Thought lengths to over 10,000 tokens.
AI & ML arxiv | Mar 23
Breaking the 'capability ceiling' in LLM post-training by replacing full-history dependencies with explicit Markov states.
AI & ML arxiv | Mar 23
Identifies 'critical times' in diffusion generation where targeted guidance pulses significantly improve image control.
AI & ML arxiv | Mar 23
Derives a variational ELBO for the Joint-Embedding Predictive Architecture (JEPA), unifying it with generative modeling.
AI & ML arxiv | Mar 23
Integrates Kolmogorov-Arnold Networks (KANs) into causal generative modeling to produce human-readable symbolic structural equations.
AI & ML arxiv | Mar 23
Introduces a training strategy where Transformers 'think' in latent space before committing to discrete tokens.
AI & ML arxiv | Mar 24
The first foundation model for zero-shot prediction of joint probability distributions in coupled time series.
AI & ML arxiv | Mar 24
Formalizes 'Introspection' in LLMs and proves they have privileged access to their own policy logic beyond mere self-simulation.
AI & ML arxiv | Mar 24
Reason-to-Transmit introduces deliberative communication for multi-agent systems, where agents reason about *why* a message benefits the receiver rather than just broadcasting features.
AI & ML arxiv | Mar 24
This paper demonstrates that Model Context Protocol (MCP) can outperform traditional RAG for quantitative financial Q&A by interacting directly with structured data APIs.
AI & ML arxiv | Mar 24
Leum-VL-8B introduces a structural 'grammar' for video parsing by decomposing content into six film-production-style dimensions like camera language and editing.
AI & ML arxiv | Mar 24
This paper shows that pretrained monocular models can perform multi-view human mesh recovery without camera calibration or multi-view training data.
AI & ML arxiv | Mar 24
Latent representations of reasoning survive cross-architecture translation, allowing student models to inherit teacher capabilities without training.
AI & ML arxiv | Mar 24
Coding agents navigating a file system outperform SOTA long-context LLMs and RAG systems on massive datasets.
AI & ML arxiv | Mar 24
Distilling the internal process of expert systems into natural language allows small models to outperform proprietary LLMs in complex domains like Chess.
AI & ML arxiv | Mar 24
ReBOL replaces standard top-k vector retrieval with an iterative Bayesian Optimization process over document relevance.
AI & ML arxiv | Mar 24
Delightful Policy Gradient uses 'delight' (advantage x surprisal) to fix learning from stale or buggy data in distributed RL.
AI & ML arxiv | Mar 24
Continued Fraction Neural Networks (CFNN) introduce a rational inductive bias that handles singularities with 10-100x fewer parameters than standard MLPs.
AI & ML arxiv | Mar 24
Network-of-Thought (NoT) moves LLM reasoning from linear chains and trees to complex directed graphs, significantly improving multi-hop QA.
AI & ML arxiv | Mar 24
Proposes 'semantic sections' as a replacement for global feature vectors to interpret LLMs in complex, non-linear representation spaces.
AI & ML arxiv | Mar 24
Introduces Bayesian scattering as a mathematically grounded, non-learned baseline for image uncertainty quantification.
AI & ML arxiv | Mar 24
A red-teaming protocol that uses RL-driven 'profit' objectives to find structural exploits in AI agents instead of just prompt-injection vulnerabilities.
AI & ML arxiv | Mar 24
Pretrained Diffusion Transformers (DiTs) possess an intrinsic 'synchronization gap' where different features commit at specific, depth-localized layers.
AI & ML arxiv | Mar 24
The 'routing paradox' proves that selective attention requires the very pairwise computations it aims to replace, explaining why pure recurrent models fail at associative recall.
AI & ML arxiv | Mar 24
VAE tokenizers in Latent Diffusion Models create 'overly compact' manifolds that cause variance collapse, leading to unstable generative sampling.
AI & ML arxiv | Mar 24
CounterScene endows generative world models with explicit counterfactual reasoning for safety-critical driving evaluation.
AI & ML arxiv | Mar 24
Proposes multi-cluster memory for test-time adaptation, proving that a single unstructured memory pool is fundamentally insufficient for non-i.i.d. data streams.
AI & ML arxiv | Mar 24
Reframes plasticity loss in Reinforcement Learning as an optimization problem where networks get trapped in local optima of previous tasks.
AI & ML arxiv | Mar 24
Repurposes a 2B-parameter latent video transformer as a differentiable physics simulator for urban wind flow optimization.
AI & ML arxiv | Mar 24
Proposes replacing flat conversation histories with a tree-based architecture to solve 'logical context poisoning.'
AI & ML arxiv | Mar 24
Replaces self-attention with Reaction-Diffusion PDEs as the predictive engine for world models.
AI & ML arxiv | Mar 24
Reconceptualizes human-agent interaction as dynamically generated software rather than just chat.
AI & ML arxiv | Mar 24
ADARUBRIC generates task-specific evaluation rubrics on the fly, significantly outperforming static rubrics in human correlation and agent training outcomes.
AI & ML arxiv | Mar 24
DSPA performs preference alignment at inference time by steering Sparse Autoencoder (SAE) features, bypassing the need for expensive weight-update training.
AI & ML arxiv | Mar 24
Introduces a counterfactual framework for precise individual credit assignment in collaborative multi-agent LLM systems.
AI & ML arxiv | Mar 24
Provides the first unified theoretical formalism for hierarchical memory systems used by long-context language agents.
AI & ML arxiv | Mar 24
Rule-State Inference (RSI) inverts the standard ML paradigm by treating known regulatory rules as priors and inferring the latent state of compliance and drift, rather than approximating rules from noisy data.
AI & ML arxiv | Mar 24
GSB-PPO lifts proximal policy optimization from discrete action steps to full generation trajectories by framing it as a Generalized Schrödinger Bridge.
AI & ML arxiv | Mar 24
PRM-as-a-Judge shifts robotic evaluation from binary success/failure to a dense, potential-based progress metric system.
AI & ML arxiv | Mar 24
FIM-Merging provides a theoretical framework for layer-adaptive model merging using the Fisher Information Matrix to bound merging error.
AI & ML arxiv | Mar 24
Hypothesizes and demonstrates a unified Gaussian latent geometry connecting vision encoders and generative models.
AI & ML arxiv | Mar 24
Solves the structural redundancy problem in symbolic regression by collapsing expression DAG isomorphisms.
AI & ML arxiv | Mar 24
Synergizes prompt optimization with policy optimization to overcome the 'sparse reward' problem in complex reasoning tasks.
AI & ML arxiv | Mar 24
Identifies the 'golden subspace' for test-time adaptation, enabling extreme efficiency in online model updates.
AI & ML arxiv | Mar 24
Decouples high-level reasoning from low-level motor control in robotics using a visual prompting interface.
AI & ML arxiv | Mar 24
Proposed a test-time scaling paradigm for image restoration that allows compute-to-quality trade-offs during inference.
AI & ML arxiv | Mar 24
Identifies that the direction of log-probability change is more critical than magnitude for improving LLM reasoning via RL.
AI & ML arxiv | Mar 24
Identifies 'Visual Anchor Collapse' in DPO-aligned VLMs and introduces an asymmetric constraint to prevent models from ignoring visual evidence in favor of language priors.
AI & ML arxiv | Mar 24
Bypasses Reinforcement Learning during the exploration phase by using uncertainty-guided tree search to discover informative data.
AI & ML arxiv | Mar 24
UNITE enables single-stage joint training of the tokenizer and the diffusion model from scratch, removing the need for frozen VAEs.
AI & ML arxiv | Mar 24
LassoFlexNet matches or beats leading tree-based models on tabular data while maintaining Lasso-like interpretability through per-feature embeddings and a group Lasso mechanism.
AI & ML arxiv | Mar 24
Mechanistic interpretability reveals that LLMs possess 'affect reception' circuits that detect emotional content even when explicit keywords are removed.
AI & ML arxiv | Mar 25
Gradient boosting exhibits a 'first-mover bias' where correlated features selected early in the tree sequence gain an artificial, self-reinforcing importance in SHAP rankings.
AI & ML arxiv | Mar 25
Establishes a formal mathematical equivalence between Classifier-Free Guidance (CFG) and alignment-based objectives, allowing for CFG-like quality without inference-time overhead.
AI & ML arxiv | Mar 25
Shifts symbolic regression from discrete genetic search to a continuous, embedding-driven optimization paradigm.
AI & ML arxiv | Mar 25
Replaces standard autoregressive document OCR with a parallel diffusion-based denoising framework.
AI & ML arxiv | Mar 25
Demonstrates that Hebbian plasticity can induce emergent attractor dynamics in robot controllers, enabling rapid adaptation without backpropagation.
AI & ML arxiv | Mar 25
Instead of using top-activating examples, this method steers Sparse Autoencoder (SAE) features in Vision-Language Models to let the model describe its own internal visual features.
AI & ML arxiv | Mar 25
DeIllusionLLM introduces task-level autoregressive reasoning to prevent LLMs from hallucinating answers to ill-posed or faulty scientific questions.
AI & ML arxiv | Mar 25
Inter-Layer Structural Encoders (ILSE) use Cayley graphs to aggregate features from all internal LLM layers, improving accuracy by up to 44% over final-layer-only predictions.
AI & ML arxiv | Mar 25
Introduces Dual Q-DM, the first non-adversarial imitation learning method theoretically guaranteed to eliminate compounding errors.
AI & ML arxiv | Mar 25
Moving beyond coarse reward signals, this paper introduces token-level policy optimization for multimodal reasoning.
AI & ML arxiv | Mar 25