moneim
/

query-classifier-model

Model card Files Files and versions

YAML Metadata Warning: empty or missing yaml metadata in repo card (https://huggingface.co/docs/hub/model-cards#model-card-metadata)

Query Classifier Model

This is a probabilistic multi-label classifier for routing business queries to appropriate systems.

Model Description

Architecture: Multi-layer neural network with sigmoid outputs
Input: TF-IDF vectorized queries (max 500 features)
Output: Independent probabilities for each system
Systems: accounting, communication, enterprise_resource_planning, knowledge_management, project_management, sales

Performance

Macro F1: 0.817
Micro F1: 0.803
Subset Accuracy: 0.704
Hamming Loss: 0.071

Usage

from src.core.services__OLD_REFACTOR.query_classifier_service import classify_query

# Classify a single query
result = classify_query("show total revenue for last quarter")
print(f"Top system: {result['top_system']}")
print(f"Confidence: {result['confidence']:.3f}")
print(f"All systems: {result['predicted_systems']}")

Training Data

Total queries: 621
Training samples: 496
Test samples: 125
Sources: Apache Calcite fine-tuning data (Jira, QuickBooks, HubSpot, Gmail, Notion, SAP)

Systems Supported

accounting: Accounting
communication: Communication
enterprise_resource_planning: Enterprise Resource Planning
knowledge_management: Knowledge Management
project_management: Project Management
sales: Sales

Model Files

final_model.pth: PyTorch model weights and configuration
vectorizer.pkl: TF-IDF vectorizer for text preprocessing
mlb.pkl: Multi-label binarizer for label encoding
metadata.json: Model metadata and performance metrics

Downloads last month: -; Downloads are not tracked for this model. How to track

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support