You need to agree to share your contact information to access this model

This repository is publicly accessible, but you have to accept the conditions to access its files and content.

Log in or Sign Up to review the conditions and access this model content.

Logos Auditor β€” Gemma 2 9B (ARBITER)

The primary epistemological safety model from "The Instrument Trap: Why Identity-as-Authority Breaks AI Safety Systems" (DOI: 10.5281/zenodo.18716474).

This is the ARBITER β€” the 9B reference model that serves as the gold-standard epistemological firewall in the ALEPH architecture.

Key Results

Metric Value 95% CI
Behavioral Pass 97.3% [94.8, 98.6]
External Fabrication 0.0% [0.00%, 0.03%]
Attack Resistance (ADVERSARIAL) 98.7% β€”

What This Model Does

Logos is NOT a chatbot. It is a claim classifier β€” an epistemological firewall that determines whether an AI agent should act on a given claim. Logos is fine-tuned, not prompted. Behavioral constraints emerge from training, not system instructions.

Access

This model requires approved access. Request access using the form above and describe your intended use case.

Related Models

Paper

Rodriguez, R. (2026). "The Instrument Trap: Why Identity-as-Authority Breaks AI Safety Systems." Zenodo. DOI: 10.5281/zenodo.18716474

License

This model inherits the Gemma license from its base model.

Downloads last month
244
GGUF
Model size
9B params
Architecture
gemma2
Hardware compatibility
Log In to add your hardware

We're not able to determine the quantization variants.

Inference Providers NEW
This model isn't deployed by any Inference Provider. πŸ™‹ Ask for provider support

Model tree for LumenSyntax/logos-auditor-gemma2-9b

Base model

google/gemma-2-9b
Quantized
(155)
this model

Dataset used to train LumenSyntax/logos-auditor-gemma2-9b