-
SWE-Master: Unleashing the Potential of Software Engineering Agents via Post-Training
Paper • 2602.03411 • Published • 36 -
Learning Query-Specific Rubrics from Human Preferences for DeepResearch Report Generation
Paper • 2602.03619 • Published • 26 -
WorldVQA: Measuring Atomic World Knowledge in Multimodal Large Language Models
Paper • 2602.02537 • Published • 5 -
Accelerating Scientific Research with Gemini: Case Studies and Common Techniques
Paper • 2602.03837 • Published • 5
Collections including paper arxiv:2602.03837
-
Algorithm of Thoughts: Enhancing Exploration of Ideas in Large Language Models
Paper • 2308.10379 • Published -
Graph of Thoughts: Solving Elaborate Problems with Large Language Models
Paper • 2308.09687 • Published • 7 -
Skeleton-of-Thought: Large Language Models Can Do Parallel Decoding
Paper • 2307.15337 • Published • 38 -
Tab-CoT: Zero-shot Tabular Chain of Thought
Paper • 2305.17812 • Published • 2
-
LoRA+: Efficient Low Rank Adaptation of Large Models
Paper • 2402.12354 • Published • 7 -
The FinBen: An Holistic Financial Benchmark for Large Language Models
Paper • 2402.12659 • Published • 23 -
TofuEval: Evaluating Hallucinations of LLMs on Topic-Focused Dialogue Summarization
Paper • 2402.13249 • Published • 15 -
TrustLLM: Trustworthiness in Large Language Models
Paper • 2401.05561 • Published • 69
-
RoFormer: Enhanced Transformer with Rotary Position Embedding
Paper • 2104.09864 • Published • 17 -
Attention Is All You Need
Paper • 1706.03762 • Published • 111 -
Direct Nash Optimization: Teaching Language Models to Self-Improve with General Preferences
Paper • 2404.03715 • Published • 62 -
Zero-Shot Tokenizer Transfer
Paper • 2405.07883 • Published • 5
-
ChipNeMo: Domain-Adapted LLMs for Chip Design
Paper • 2311.00176 • Published • 9 -
Language Models can be Logical Solvers
Paper • 2311.06158 • Published • 20 -
JARVIS-1: Open-World Multi-task Agents with Memory-Augmented Multimodal Language Models
Paper • 2311.05997 • Published • 37 -
Lumos: Learning Agents with Unified Data, Modular Design, and Open-Source LLMs
Paper • 2311.05657 • Published • 30