breezedeus 's Collections
ChatAnything: Facetime Chat with LLM-Enhanced Personas
Paper
• 2311.06772
• Published
• 35
Fine-tuning Language Models for Factuality
Paper
• 2311.08401
• Published
• 30
Unifying the Perspectives of NLP and Software Engineering: A Survey on
Language Models for Code
Paper
• 2311.07989
• Published
• 26
Instruction-Following Evaluation for Large Language Models
Paper
• 2311.07911
• Published
• 22
Prompt Engineering a Prompt Engineer
Paper
• 2311.05661
• Published
• 23
Mirasol3B: A Multimodal Autoregressive model for time-aligned and
contextual modalities
Paper
• 2311.05698
• Published
• 11
JARVIS-1: Open-World Multi-task Agents with Memory-Augmented Multimodal
Language Models
Paper
• 2311.05997
• Published
• 37
Florence-2: Advancing a Unified Representation for a Variety of Vision
Tasks
Paper
• 2311.06242
• Published
• 95
The ART of LLM Refinement: Ask, Refine, and Trust
Paper
• 2311.07961
• Published
• 11
Pre-training Small Base LMs with Fewer Tokens
Paper
• 2404.08634
• Published
• 36
Scaling (Down) CLIP: A Comprehensive Analysis of Data, Architecture, and
Training Strategies
Paper
• 2404.08197
• Published
• 29
Multi-Head Mixture-of-Experts
Paper
• 2404.15045
• Published
• 60
OpenELM: An Efficient Language Model Family with Open-source Training
and Inference Framework
Paper
• 2404.14619
• Published
• 126
Pegasus-v1 Technical Report
Paper
• 2404.14687
• Published
• 33
MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training
Paper
• 2403.09611
• Published
• 129
TokenPacker: Efficient Visual Projector for Multimodal LLM
Paper
• 2407.02392
• Published
• 23
InternLM-XComposer-2.5: A Versatile Large Vision Language Model
Supporting Long-Contextual Input and Output
Paper
• 2407.03320
• Published
• 94
Building and better understanding vision-language models: insights and
future directions
Paper
• 2408.12637
• Published
• 133
OpenGVLab/InternVL2_5-38B
Image-Text-to-Text
• 38B • Updated
• 3.79k
• 49
Expanding Performance Boundaries of Open-Source Multimodal Models with
Model, Data, and Test-Time Scaling
Paper
• 2412.05271
• Published
• 160