Add gguf_file parameter to tokenizer loading and introduce diagnostic script for GGUF validation 6f81ff7 ndc8 committed on Aug 11, 2025
Fix GGUF filename in environment variable and update comment in requirements a2a4e98 ndc8 committed on Aug 11, 2025
Refactor application to implement GGUF backend with native transformers support; update requirements and add GGUF-specific entry point 6e96e6e ndc8 committed on Aug 11, 2025
Refactor model loading to utilize accelerate for device management; add test script to verify loading fix and prevent device conflicts 8a3c5dd ndc8 committed on Aug 11, 2025
Refactor application to use lightweight backend; update requirements and add memory analysis script for optimized model configuration a4ee3a6 ndc8 committed on Aug 11, 2025
Update Dockerfile and application entry point for GGUF backend; optimize memory usage in model parameters and requirements 358e717 ndc8 committed on Aug 11, 2025
Refactor backend service to support Gemma 3n model and update requirements; remove obsolete test script and add new dependency tests 4b4e9ed ndc8 committed on Aug 11, 2025
Add scripts for converting and generating UltraChat-style SFT dataset 7ecd130 ndc8 committed on Aug 9, 2025
Fix: Update to valid HuggingFace model and fix deprecation warnings 04d695c ndc8 committed on Aug 7, 2025
Set gemma-3n-E4B-it-GGUF as main model for all text generation endpoints 8d962fd ndc8 committed on Aug 6, 2025
Add multimodal AI capabilities with image-text-to-text pipeline 4e10023 ndc8 committed on Aug 6, 2025