AI & Machine Learning

2,557 papers · Page 30 of 52

Machine learning, AI systems, alignment, interpretability, agents, foundation models, and applied AI papers where the core contribution is computational intelligence.

Filter by category: Paradigm Challenge Breaks Assumption First Ever Nature Is Weird Practical Magic Cosmic Scale Life Origin Open Release Efficiency Leap New Capability Scaling Insight

Efficiency Breakthrough

GIFT is a training-free frame selection framework that uses 'Directed Diversity' to boost Video-LLM performance by up to 12.5%.

Z-Erase introduces the first concept erasure method for single-stream diffusion transformers, preventing generation collapse in new unified architectures.

Breaks Assumption

This paper demonstrates that Sparse Autoencoder (SAE) features in multimodal models are not modular, challenging the core assumption of intervention-based steering.

Pixelis shifts VLM reasoning from static description to a 'reasoning in pixels' agentic paradigm that learns via an executable tool grammar.

The AE4E paradigm proposes a 'Social Contract' for multi-agent economies, replacing individual model alignment with an institutional 'Separation of Power'.

Scaling Insight

MSRL scales multimodal reward modeling by transferring reasoning capabilities from text to vision-language tasks without requiring new multimodal preference data.

SEVerA enables the synthesis of self-evolving agents with formal guarantees by combining LLM planning with first-order logic rejection samplers.

Using Signal Detection Theory, this work proves that LLM calibration and 'metacognitive efficiency' (knowing what you know) are distinct, dissociable capacities.

Efficiency Breakthrough

Photon enables efficient 3D medical volume understanding through adaptive token scheduling and a novel 'gradient restoration' backpropagation rule.

Vision Hopfield Memory Networks (V-HMN) present a brain-inspired alternative to Transformers and Mamba using hierarchical associative memory mechanisms.

Trace2Skill distills lessons from across a 'parallel fleet' of execution trajectories into a unified, conflict-free skill directory for LLM agents.

Efficiency Breakthrough

Pruning low-utility prompts before RL rollouts allows for 10x more efficient training of large reasoning models.

Breaks Assumption

Safety alignment does not have to be a 'tax' on performance; it can actually improve mathematical reasoning accuracy.

Enable long video generation from short-video diffusion models without any additional training or fine-tuning.

Training-free 6D pose estimation for unseen surgical instruments using only a CAD model as prior knowledge.

Offline Decision Transformers can now synthesize strategies that surpass the classical heuristics they were trained on for the Traveling Salesman Problem.

Efficiency Breakthrough

Simple image sharpening serves as a surrogate-free, zero-cost preemptive defense against adversarial attacks.

Representing GPS trajectories as hyperspectral images enables multi-month dense anomaly detection that was previously computationally intractable.

A foundation model for gait transforms 3D skeletal motion into a systemic biosignal for multi-system health monitoring.

Efficiency Breakthrough

A new tokenization architecture reduces the 'Token Tax' for complex non-Latin scripts by over 60%.

Breaks Assumption

Sparse Autoencoder analysis reveals that weight pruning counter-intuitively preserves rare features better than frequent ones.

LLMs can be fine-tuned to act as their own 'Z-token' compressors, achieving 18x text reduction without losing reconstruction fidelity.

Efficiency Breakthrough

GlowQ introduces group-shared low-rank approximations to speed up quantized LLM inference by up to 37%.

Defines 'Reasoning Safety' as a new security dimension and introduces a real-time monitor to detect logic-chain hijackings.

Breaks Assumption

Cross-model disagreement (CMP/CME) provides a highly effective, label-free signal for detecting confident hallucinations.

Introduces a training-free pipeline for pixel-level video anomaly detection that achieves a 5x improvement in object-level accuracy.

A model-agnostic framework to extract the model-implied causal structure from any trained temporal predictor.

Efficiency Breakthrough

Reduces LLM inference energy by 40% (and up to 81%) using a distillation-based router to skip unnecessary reasoning steps.

Detects when object detectors fail to see safety-critical objects by measuring semantic misalignment with foundation model embeddings.

Breaks Assumption

Challenges the 'Golden Data' requirement for video generation by showing that imbalanced data can outperform high-quality data through timestep-aware training.

Efficiency Breakthrough

Unlocks full-body musculoskeletal humanoid training by achieving order-of-magnitude speedups via massively parallel GPU simulation.

Fixes the inherent instability of on-policy distillation in LLMs using local support matching and top-p rollout sampling.

Efficiency Breakthrough

Achieves 45% performance gains in robotics using 5-10x fewer real-world demonstrations through high-dimensional factorization.

Enables LMMs to 'think' using compact latent visual representations rather than verbalizing everything into text.

Translates a single natural language sentence into a validated, hardware-specific computational imaging system design.

Efficiency Breakthrough

Achieves up to 4.7x speedup for Diffusion LLMs using a training-free self-speculative decoding framework.

Efficiency Breakthrough

Generates 2-minute 480p videos on a single H200 GPU through a hierarchical KV-cache strategy that compresses context by 32x.

Introduces the concept of a 'trainable' knowledge base for RAG that improves performance by distilling and writing back compact knowledge units.

Efficiency Breakthrough

Enables 4K novel view synthesis in a feed-forward manner by decoupling geometric complexity from rendering resolution.

A training-free decoding framework that mitigates multimodal hallucinations by re-ranking tokens based on spatial attention entropy.

Efficiency Breakthrough

Demonstrates that general-purpose coding agents can achieve 20x speedups in hardware design optimization without domain-specific training.

Introduces a 'Hybrid Memory' architecture that maintains the identity and motion of dynamic subjects even when they hide out of view.

Breaks Assumption

Achieves state-of-the-art compositionality in vision-language models without the need for hard negative mining or degrading zero-shot performance.

Uses cycle-consistency as a label-free reward signal for reinforcement learning to resolve contradictions in multimodal reasoning.

Efficiency Breakthrough

A training-free enhancement that unlocks multi-scale synergies in Vision Foundation Models (VFMs) to boost performance across various tasks.

Practical Magic

Researchers are making satellites into high-security vaults in space that are literally impossible to hack from down here on Earth.

Paradigm Challenge

For 30 years, we didn't know the absolute limit of how much a machine can learn. Someone just finally cracked the code.

Practical Magic

Forget metal antennas—scientists just built a 'quantum radio' using a cloud of atoms that works way better.

Practical Magic

Engineers figured out how to make radio waves literally swerve around people trying to eavesdrop on your signal.

Paradigm Challenge

Weirdly enough, AI trained on 'fake' data is actually better at predicting real pandemics than AI trained on actual history.