Every Output Verified.
Every Decision Defended.
The dedicated verification agent that governs every AI output
inside Mentis OS — eliminating hallucinations, enforcing policy,
and producing audit-ready intelligence.
JUDGE is a dedicated, independent verification agent that sits between every AI output and the end user. It does not generate intelligence — it governs it.
JUDGE is not the same model that generated the output. It is a separate, dedicated verification agent with its own reasoning — eliminating self-validation bias.
Every output is evaluated across source fidelity, factual consistency, policy compliance, and confidence scoring — not just a single pass/fail gate.
Every verification produces an immutable trace — what was checked, what was found, what verdict was reached. Always reconstructible.
Every AI output passes through five independent verification stages before reaching the user. Each gate can reject, escalate, or approve.
Every claim in the output is traced back to the source documents or data that informed it. If a claim cannot be linked to an approved source, it is flagged. No unsupported assertions pass through.
JUDGE compares the generated output against retrieved evidence using semantic consistency checks. Fabricated facts, conflated entities, invented statistics, and logical inconsistencies are caught and rejected.
Outputs are validated against enterprise-defined rules — data classification, disclosure restrictions, role-based access policies, and regulatory constraints. JUDGE enforces what the enterprise allows, not what the model can produce.
A composite confidence score is computed across source strength, reasoning coherence, and verification pass rates. Low-confidence outputs are either rejected or escalated to human review — never silently delivered.
JUDGE issues a final verdict — PASS, REJECT, or ESCALATE. Every verdict generates an immutable audit record containing the full chain of evidence, checks performed, and reasoning. Reconstructible months or years later.
"What is our data retention policy for EU customers?"
EU customer personal data is retained for 48 months post-contract termination with automated deletion workflows. The policy was updated in January 2025 following board approval.
2 factual claims contradict source documents. Output sent back for re-reasoning with corrected evidence.
"What is our data retention policy for EU customers?"
EU customer personal data is retained for 36 months post-contract termination with automated deletion workflows. The policy was updated in March 2024 following board approval.
Fabricated statistics, invented dates, conflated entities, or claims that do not exist in any source document.
Correct facts attributed to wrong documents, or citations that reference non-existent sections.
Contradictory claims within the same response, or conclusions that do not follow from presented evidence.
Outputs that disclose restricted data, exceed role permissions, or violate enterprise-defined content policies.
Answers based on superseded documents or outdated data versions when newer authoritative versions exist.
Assertions delivered with high confidence when source evidence is weak, ambiguous, or insufficient to support the claim.
The audit trail is not a log file. It is a cryptographically verifiable chain of evidence — reconstructible by auditors, regulators, and forensic teams months or years later.
JUDGE is not a feature you enable. It is the governance layer that makes
every Genovation intelligence product safe to deploy in regulated enterprises.