Web2BigTable: A Bi-Level Multi-Agent LLM System for Internet-Scale Information Search and Extraction Paper • 2604.27221 • Published 10 days ago • 37
OpenSearch-VL: An Open Recipe for Frontier Multimodal Search Agents Paper • 2605.05185 • Published 3 days ago • 91
privacy-filter Collection OpenAI's privacy-filter fine0tuned models • 6 items • Updated 3 days ago • 9
Mario: Multimodal Graph Reasoning with Large Language Models Paper • 2603.05181 • Published Mar 5 • 10
AgentSearchBench: A Benchmark for AI Agent Search in the Wild Paper • 2604.22436 • Published 15 days ago • 14
Prism-Reranker: Beyond Relevance Scoring -- Jointly Producing Contributions and Evidence for Agentic Retrieval Paper • 2604.23734 • Published 13 days ago • 3
view article Article DeepSeek-V4: a million-token context that agents can actually use 15 days ago • 42
view article Article Training and Finetuning Multimodal Embedding & Reranker Models with Sentence Transformers 23 days ago • 69
DR^{3}-Eval: Towards Realistic and Reproducible Deep Research Evaluation Paper • 2604.14683 • Published 23 days ago • 36
ClawEnvKit: Automatic Environment Generation for Claw-Like Agents Paper • 2604.18543 • Published 19 days ago • 27
GLiNER-PII Collection PII detection models developed in collaboration with Wordcab • 5 items • Updated Jan 29 • 23
Toward Autonomous Long-Horizon Engineering for ML Research Paper • 2604.13018 • Published 25 days ago • 34
Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe Paper • 2604.13016 • Published 25 days ago • 90
KnowRL: Boosting LLM Reasoning via Reinforcement Learning with Minimal-Sufficient Knowledge Guidance Paper • 2604.12627 • Published 25 days ago • 100