A GPT-4V Level Multimodal LLM on Your Phone
chongyi
yuzaa
AI & ML interests
multimodal large language models
Recent Activity
new activity 7 days ago
openbmb/MiniCPM-V-4.6:Update README.md updated a model 8 days ago
openbmb/MiniCPM-V-4.6 authored a paper 12 days ago
LLaVA-UHD v4: What Makes Efficient Visual Encoding in MLLMs?