SERIESFUSION
.
AI
Science Discovery for Humans | Curated by AI & Humans
About
RSS
Open Release
Open Release
15 papers
OpenSanctions Pairs releases a massive benchmark for entity matching, proving that local LLMs can now match production rule-based systems in high-stakes compliance tasks.
AI & ML
arxiv | Mar 13
Tiny Aya is a 3.35B parameter multilingual model that achieves state-of-the-art results across 70 languages, challenging the need for massive scale in global AI.
AI & ML
arxiv | Mar 13
Introduces the first billion-scale SAR vision foundation model and a massive unified benchmark for all-weather geospatial semantic segmentation.
AI & ML
arxiv | Mar 13
An open foundation model for humanoid robots that achieves high performance using only 30 hours of real-world robot data by pre-training on egocentric human videos.
AI & ML
arxiv | Mar 13
Surg-R1 is a specialized surgical reasoning model released alongside the largest surgical Chain-of-Thought dataset (320,000 pairs).
AI & ML
arxiv | Mar 16
Releases Feynman, an agentic pipeline and 100k-sample dataset for generating high-quality, knowledge-rich diagrams with grounded captions.
AI & ML
arxiv | Mar 16
Introduces the largest-ever multi-modal CAD dataset with 10 million annotations for 1 million models to enable geometric deep learning on BRep data.
AI & ML
arxiv | Mar 16
Introduces a unified evaluation harness for Vision-Language-Action (VLA) models that standardizes disparate protocols and exposes hidden flaws in published SOTA models.
AI & ML
arxiv | Mar 17
Releases an 11-billion example dataset and model (RealVLG-R1) for unified real-world visual-language grounding and robotic manipulation.
AI & ML
arxiv | Mar 17
Releases a million-scale human preference dataset (29M pairs) specifically for text-to-image editing tasks.
AI & ML
arxiv | Mar 17
Tagarela releases 8,972 hours of high-quality Portuguese podcast audio, rivaling the scale of GigaSpeech for English.
AI & ML
arxiv | Mar 17
Democratizes the development of 'Deep Search' agents by open-sourcing the specialized training data and trajectory synthesis methods.
AI & ML
arxiv | Mar 17
Kamino is a massively parallel GPU physics solver that natively supports complex kinematic loops and multi-body systems.
AI & ML
arxiv | Mar 18
IQuest-Coder-V1 introduces a series of high-performance code models including a unique 'Loop' variant with a recurrent mechanism for efficiency.
AI & ML
arxiv | Mar 18
SurgΣ is a massive open-source release of 5.98M multimodal conversations and foundation models for surgical intelligence.
AI & ML
arxiv | Mar 18