roshniramesh
's Collections
fp8 llm
updated
nvidia/Llama-3.1-8B-Instruct-FP8
Text Generation
•
8B
•
Updated
•
30.7k
•
•
31
amd/Llama-3.1-8B-Instruct-FP8-KV
8B
•
Updated
•
7.91k
•
6
amd/Mixtral-8x7B-Instruct-v0.1-FP8-KV
3B
•
Updated
•
1.99k
•
3
amd/Meta-Llama-3-8B_fp8_quark
Text Generation
•
8B
•
Updated
•
43
ibm-ai-platform/Bamba-9B-2T-fp8
Text Generation
•
10B
•
Updated
•
10
•
2
ibm-ai-platform/Bamba-9B-fp8
Text Generation
•
10B
•
Updated
•
36
•
2
ibm-ai-platform/Bamba-9B-1.8T-fp8
Text Generation
•
10B
•
Updated
•
11
•
2
RedHatAI/Meta-Llama-3-8B-Instruct-FP8
Text Generation
•
8B
•
Updated
•
2.19k
•
•
24
RedHatAI/Meta-Llama-3-8B-Instruct-FP8-KV
Text Generation
•
8B
•
Updated
•
5.13k
•
•
8
RedHatAI/Qwen2-7B-Instruct-FP8
Text Generation
•
8B
•
Updated
•
10.7k
•
•
2
RedHatAI/Qwen2-1.5B-Instruct-FP8
Text Generation
•
2B
•
Updated
•
7.21k
RedHatAI/Mistral-7B-Instruct-v0.3-FP8
Text Generation
•
7B
•
Updated
•
581
•
3
RedHatAI/Llama-2-7b-chat-hf-FP8
Text Generation
•
7B
•
Updated
•
54
RedHatAI/gemma-2-9b-it-FP8
Text Generation
•
9B
•
Updated
•
249
•
5
RedHatAI/DeepSeek-Coder-V2-Lite-Instruct-FP8
Text Generation
•
16B
•
Updated
•
21.2k
•
9
FriendliAI/Llama-2-13b-chat-hf-fp8
Text Generation
•
Updated
•
11
•
8
FriendliAI/Meta-Llama-3-8B-Instruct-fp8
Text Generation
•
8B
•
Updated
•
15
•
2
FriendliAI/Meta-Llama-3-8B-fp8
Text Generation
•
Updated
•
16
•
3
FriendliAI/Meta-Llama-3.1-8B-Instruct-fp8
Text Generation
•
8B
•
Updated
•
2.06k
amd/Llama-3.2-3B-Instruct-FP8-KV
3B
•
Updated
•
37
amd/Llama-3.2-1B-Instruct-FP8-KV
1B
•
Updated
•
1.67k
3B
•
Updated
•
34
1B
•
Updated
•
120
amd/Meta-Llama-3.1-8B-Instruct-fp8-quark-vllm