Less is More: Recursive Reasoning with Tiny Networks Paper β’ 2510.04871 β’ Published Oct 6, 2025 β’ 501
view article Article Train 400x faster Static Embedding Models with Sentence Transformers Jan 15, 2025 β’ 222
LongCodeZip: Compress Long Context for Code Language Models Paper β’ 2510.00446 β’ Published Oct 1, 2025 β’ 106
The Prompt Report: A Systematic Survey of Prompting Techniques Paper β’ 2406.06608 β’ Published Jun 6, 2024 β’ 68
Evaluating D-MERIT of Partial-annotation on Information Retrieval Paper β’ 2406.16048 β’ Published Jun 23, 2024 β’ 35
view article Article Jack of All Trades, Master of Some, a Multi-Purpose Transformer Agent +2 Apr 22, 2024 β’ 81
In-context Learning and Gradient Descent Revisited Paper β’ 2311.07772 β’ Published Nov 13, 2023 β’ 2
π Interpretability & Analysis of LMs Collection Outstanding research in LM interpretability and evaluation, summarized β’ 135 items β’ Updated 16 days ago β’ 116
Model Merging Papers Collection Collection of relevant papers about model merging β’ 13 items β’ Updated Apr 2, 2024 β’ 6
π« StarCoder2 Collection StarCoder2 models and datasets! β’ 8 items β’ Updated Mar 1, 2024 β’ 89