
angt posted an update 5 days ago
I'm excited to share that https://installama.sh is up and running! πŸš€

On Linux / macOS / FreeBSD, getting started is easier than ever:
curl https://installama.sh | sh


And Windows just joined the party πŸ₯³
irm https://installama.sh | iex

Stay tuned for new backends on Windows!
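If the Windows installer honors the same MODEL environment variable as the Unix script (an assumption; the posts below only demonstrate it with sh), pre-selecting a model might look like:

# Assumption: the Windows installer reads MODEL like the Unix one does.
$env:MODEL = "unsloth/Qwen3-4B-GGUF:Q4_0"
irm https://installama.sh | iex
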
angt posted an update 10 days ago
πŸš€ installama.sh update: Vulkan & FreeBSD support added!

The fastest way to install and run llama.cpp has just been updated!

We are expanding hardware and OS support to make local AI even more accessible. This includes:

πŸŒ‹ Vulkan support for Linux on x86_64 and aarch64.
😈 FreeBSD support (CPU backend) on x86_64 and aarch64 too.
✨ Lots of small optimizations and improvements under the hood.

Give it a try right now:
curl angt.github.io/installama.sh | MODEL=unsloth/Qwen3-4B-GGUF:Q4_0 sh
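Once it's installed, here is a quick sanity check of the local server (a minimal sketch, assuming installama.sh launches llama-server on its default port, 8080):

# Assumes llama-server is listening on its default port (8080).
curl http://127.0.0.1:8080/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{"messages": [{"role": "user", "content": "Say hi in one word."}]}'
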
angt posted an update 19 days ago
One command line is all you need...

...to launch a local llama.cpp server on any Linux box or any Metal-powered Mac πŸš€

curl angt.github.io/installama.sh | MODEL=unsloth/gpt-oss-20b-GGUF sh


Learn more: https://github.com/angt/installama.sh
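To check that the server actually came up (a minimal sketch, again assuming llama-server's default port 8080):

# Assumes the default llama-server port; returns a status once the model is loaded.
curl http://127.0.0.1:8080/health
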
hlarcher posted an update 4 months ago
GH200 cooking time πŸ§‘β€πŸ³πŸ”₯!

We just updated GPU-fryer 🍳 to run on the Grace Hopper Superchip (GH200), fully optimized for ARM-based systems!
With this release, we switched to cuBLASLt to support FP8 benchmarks. You can monitor GPU throttling, TFLOPS outliers, and HBM memory health, and make sure you get the most out of your hardware setup.
Perfect for stress testing and tuning datacenter GPUs.

Check it out on GitHub πŸ‘‰ https://github.com/huggingface/gpu-fryer
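Building it from source might look like this (a minimal sketch, assuming a standard Cargo layout since GPU-fryer is written in Rust; see the README for actual run options):

git clone https://github.com/huggingface/gpu-fryer
cd gpu-fryer
cargo build --release
# Binary name and run options are assumptions; check the project README for real usage.
./target/release/gpu-fryer
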
hlarcher posted an update 11 months ago
We are introducing multi-backend support in Hugging Face Text Generation Inference!
With the new TGI architecture, we can now plug in new modeling backends to get the best performance for the selected model and available hardware. This first step will soon be followed by the integration of new backends (TRT-LLM, llama.cpp, vLLM, Neuron, and TPU).

We are polishing the TensorRT-LLM backend, which achieves impressive performance on NVIDIA GPUs. Stay tuned πŸ€—!

Check out the details: https://huggingface.co/blog/tgi-multi-backend
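For reference, spinning up TGI with the default backend looks like this (a minimal sketch; the model id is just an example, swap in any supported model):

# Launch TGI via Docker; the OpenAI-style API is then served on port 8080.
docker run --gpus all --shm-size 1g -p 8080:80 \
  -v $PWD/data:/data \
  ghcr.io/huggingface/text-generation-inference:latest \
  --model-id Qwen/Qwen2.5-7B-Instruct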