Tokenizer used by the submitted model
AI & ML interests
Large Language Models
Datasets (26)
geniacllm/livedoor_news_corpus • 2.77k • 16 • 1
geniacllm/wikipedia_v2 • 168
geniacllm/made_by_llm_and_human • 2.64k • 17
geniacllm/hanrei • 2.9M • 127
geniacllm/gsm8k • 1.03M • 46
geniacllm/aozora_bunko • 10.2k • 13
geniacllm/kokkai_v2 • 26
geniacllm/dataset_from_other_team • 27.1k • 22
geniacllm/wiki40b • 1.2M • 8
geniacllm/CulturaX_default_filtered_ja_10b • 4