The BERDS Benchmark aims to measure retrieval diversity for questions that are opinionated or invite diverse perspectives.
Hung-Ting Chen
timchen0618
·
AI & ML interests
NLP
Recent Activity
upvoted
a
paper
about 4 hours ago
Calibrate-Then-Act: Cost-Aware Exploration in LLM Agents
updated
a dataset
over 1 year ago
timchen0618/BERDS