Model Card for pii-classifier-tab-dataset

The model is a Longformer with a classification head, fine-tuned on the Text Anonymization Benchmark (TAB) dataset to indicate whether each token is part of Personally Identifiable Information (PII) and should therefore be masked. The model outputs logits for each token of the input sequence over two classes, 1 (MASK) and 0 (NO-MASK); i.e., no IOB format is used.
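As a rough illustration of how the two-class output could be consumed, the sketch below turns per-token logits into MASK/NO-MASK labels by taking the higher-scoring class, then replaces the flagged tokens. It assumes the logits arrive as one `[NO-MASK, MASK]` pair per token; the helper names `logits_to_labels` and `apply_mask`, the `[MASK]` placeholder string, and the example tokens and logit values are all hypothetical and not taken from the model or the LeakPro code.

```python
from typing import List, Sequence

# Class ids as described in the model card: 1 = MASK, 0 = NO-MASK.
MASK, NO_MASK = 1, 0


def logits_to_labels(logits: Sequence[Sequence[float]]) -> List[int]:
    """Pick the higher-scoring class per token (argmax over the two logits).

    Ties resolve to NO_MASK, matching the usual argmax convention of
    returning the first maximal index.
    """
    return [MASK if pair[MASK] > pair[NO_MASK] else NO_MASK for pair in logits]


def apply_mask(
    tokens: Sequence[str],
    labels: Sequence[int],
    placeholder: str = "[MASK]",  # hypothetical placeholder, not the model's
) -> List[str]:
    """Replace every token labelled MASK with the placeholder string."""
    if len(tokens) != len(labels):
        raise ValueError("tokens and labels must have the same length")
    return [placeholder if label == MASK else tok for tok, label in zip(tokens, labels)]


# Made-up tokens and logits, for illustration only (not real model output).
tokens = ["Mr", "Smith", "lives", "in", "Oslo"]
logits = [[2.1, -1.0], [-0.5, 3.2], [1.8, -2.0], [2.5, -1.1], [-1.2, 2.7]]

labels = logits_to_labels(logits)
masked = apply_mask(tokens, labels)
print(labels)
print(" ".join(masked))
```

Because there are only two classes and no IOB tags, adjacent masked tokens are not grouped into entity spans here; any such grouping would be a separate post-processing step.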

The model is used as an example in the LeakPro repo. For further details, see the example notebook.

