Jeremy Udit

jcudit

jcudit

AI & ML interests

None yet

Recent Activity

published a Space about 6 hours ago

jcudit/Qwen-Image-Edit-2509-LoRAs-Fast

updated a Space about 7 hours ago

jcudit/Qwen-Image-Edit-2509-LoRAs-Fast

liked a Space 1 day ago

multimodalart/wan-2-2-first-last-frame

View all activity

Organizations

upvoted 3 articles 12 days ago

Article

From Files to Chunks: Improving HF Storage Efficiency

Nov 20, 2024

•

Article

From Chunks to Blocks: Accelerating Uploads and Downloads on the Hub

Feb 12

•

Article

Xet is on the Hub

Mar 18

•

upvoted a changelog 3 months ago

Changelog

Introducing HF Jobs: Run scalable compute jobs on Hugging Face

Jul 30

• 201

upvoted an article 3 months ago

Article

Inference Endpoints Changelog 🚀

Oct 11, 2024

•

upvoted 6 articles 4 months ago

Article

From Zero to GPU: A Guide to Building and Scaling Production-Ready CUDA Kernels

Aug 18

•

Article

Announcing the Synthetic Online Conversations Dataset (SOC)

Aug 12

•

Article

Building an Open Floor Parrot Agent in Python

Jul 19

•

Article

LLMGameHub: How We Won the Gradio Agents & MCP Hackathon 2025

Jul 28

•

Article

The Complete AI Architecture Landscape

Jun 8

•

Article

How to Train Your LLM Web Agent: A Statistical Diagnosis

Jul 8

•

upvoted 9 articles 5 months ago

Article

Efficient Request Queueing – Optimizing LLM Performance

Apr 2

•

Article

Prefill and Decode for Concurrent Requests - Optimizing LLM Performance

Apr 16

•

Article

How Long Prompts Block Other Requests - Optimizing LLM Performance

Jun 12

•

Article

What's Software 3.0? (Spoiler: You're Already Using It)

Jun 19

•

Article

What Coding Agent Wins?

Jun 26

•

Article

MCP is at a Tipping Point: Here's Why You Should Care

Jun 10

•

Article

ScreenEnv: Deploy your full stack Desktop Agent

Jul 10

•

Article

Nano-vLLM meets Inference Endpoints

Jun 25

•

Article

Transformers Are Getting Old: Variants and Alternatives Exist!

Jul 5

•

Jeremy Udit

AI & ML interests

Recent Activity

Organizations

jcudit's activity

From Files to Chunks: Improving HF Storage Efficiency

From Chunks to Blocks: Accelerating Uploads and Downloads on the Hub

Xet is on the Hub

Introducing HF Jobs: Run scalable compute jobs on Hugging Face

Inference Endpoints Changelog 🚀

From Zero to GPU: A Guide to Building and Scaling Production-Ready CUDA Kernels

Announcing the Synthetic Online Conversations Dataset (SOC)

Building an Open Floor Parrot Agent in Python

LLMGameHub: How We Won the Gradio Agents & MCP Hackathon 2025

The Complete AI Architecture Landscape

How to Train Your LLM Web Agent: A Statistical Diagnosis

Efficient Request Queueing – Optimizing LLM Performance

Prefill and Decode for Concurrent Requests - Optimizing LLM Performance

How Long Prompts Block Other Requests - Optimizing LLM Performance

What's Software 3.0? (Spoiler: You're Already Using It)

What Coding Agent Wins?

MCP is at a Tipping Point: Here's Why You Should Care

ScreenEnv: Deploy your full stack Desktop Agent

Nano-vLLM meets Inference Endpoints

Transformers Are Getting Old: Variants and Alternatives Exist!