With scaled datasets manual desensitization of documents is not even possible. The Data X-Ray moves you from a world of manual highlighting to one where you can desensitize data at 100,000s of words per second in a click of a button. The Data X-Ray enables ongoing privacy-preserving data science by deploying your desensitization algorithms across your team and enterprise with ease.
Ingest any normal type of file or database column as your raw dataset.
The out-of-the-box accuracy rate is around 92% and with supervision can quickly approach 99% accuracy on homogeneous datasets.
Redaction exports include statistical reports of what was redacted and why for supervised redaction.
In addition to basic ML algorithms, regular expressions, and dictionaries, you can also use our redaction filters if you want to achieve very high recall on sensitive data types.
Export raw files as well as text versions of the files for further processing downstream.