Read every document. See every entity.
Structure-aware sensitive data detection across PDFs, scans, images and emails , with confidence scores, custom entities, and role awareness.
Reads layout, not just text.
Field labels, tables, form hierarchy, legal roles. Null understands what a document is before it looks at the words, so a name in a witness field never gets mistaken for a plaintiff.
Every detection, with its doubt.
Each entity carries a confidence score, alternatives, and provenance, NER, structural signal, custom rule. Thresholds are tunable per entity type, per workspace.
Describe what to hide, in words.
“Internal project codes starting with PRJ-”, and the engine compiles a detector. No regex gymnastics. No ML pipeline to wire up. Your compliance team writes the rule.
Write it in English. Get a detector.
Your legal or compliance team already knows what needs hiding, internal project codes, case IDs, product SKUs, claim numbers. Null's compiler turns plain-language descriptions into typed, versioned, threshold-tunable detectors, no regex syntax, no pipeline to maintain.
Same name. Different role, different token.
In a complaint, Mara Hoffmannis the plaintiff. In the next paragraph, as “Hoffmann”, she's still the plaintiff, the same token. But in a different document where she's a witness, she's a different token entirely. Structure-aware coreference keeps the semantics intact, so the model reasons correctly.
Whatever your team actually uses.
Native parsing where possible, OCR where needed. Structure awareness applies the same, regardless of format.
Trained on regulated corpora. Auditable end-to-end.
We don't train on your documents. The detectors are pretrained on 2.4M regulated corpora, insurance, legal, healthcare, financial. Your data stays in your vault; the engine just reads it.
Bring your hardest document. We'll parse it with you.
A real claim file, a contract, a discharge summary, whatever keeps your DPO up at night. Run it through the inspector. See every detection. Talk to the engineer who trained the detector.