- Intended use
- Auto-categorize free-text incident reports into 47 standard categories (security, safety, maintenance, medical, environmental, suspicious activity, access violation, etc.).
- Out of scope
- NOT intended for: auto-submitting incidents without human review, analyzing CCTV footage, identifying individuals.
- Training data
- Fine-tuned RoBERTa on a curated public + synthetic dataset of 80k incident reports. No customer data used in base model. Custom fine-tuning on customer historical data available on Complete (1,000+ labeled examples required).
- Precision / impact
- Precision: 91% (multi-label, top-3 categories). Languages supported in production: EN, ES, FR, PT, DE, IT.
- Known limitations
- Domain-specific jargon (industrial chemical names, healthcare procedures) may classify into wrong category. Custom fine-tuning recommended for sites with specialized terminology.
- Bias evaluation
- Base training data audited for geographic bias (US, EU, LATAM each ≥ 25% representation) and category balance (no category < 1% of training set). Bias evaluation report appended to each release.
- Update cadence
- Quarterly re-training of base model. Custom fine-tunes regenerated when customer adds 500+ new labeled examples.