Dark data is part of every enterprise's information asset universe, but is either under-used or not used at all. At the very least, this presents costs in storage and security that weigh down your systems. At worst, this data can add unnecessary risk to your organization while adding little additional value. The Data X-Ray helps you find this data and remediate it across petabyte-scale data landscapes.
Ingest metadata and tag files at rates of over 100,000,000 files per day. With pure metadata extraction, you can find and move stale data that has been sitting in your storage far too long.
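As a rough illustration of metadata-only staleness detection, the sketch below walks a directory tree and flags files whose last-modified timestamp is older than a chosen age. It is not the Data X-Ray's implementation or API; the function name `find_stale_files` and the `max_age_days` parameter are assumptions made up for this example.

```python
import os
import time
from pathlib import Path


def find_stale_files(root, max_age_days, now=None):
    """Return paths under root whose last-modified time is older than max_age_days.

    Hypothetical helper for illustration only; not part of any product API.
    """
    now = time.time() if now is None else now
    cutoff = now - max_age_days * 86400  # 86400 seconds per day
    stale = []
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            path = Path(dirpath) / name
            try:
                # Only file metadata is consulted; file contents are never read.
                mtime = path.stat().st_mtime
            except OSError:
                continue  # skip files that disappear or cannot be accessed
            if mtime < cutoff:
                stale.append(path)
    return sorted(stale)
```

Because only `stat` metadata is read, a scan like this stays cheap even on large file shares. A real deployment would likely also weigh last-access time and ownership.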
Whether it is an email, a PDF, a PowerPoint, or any one of thousands of other formats, you can use the Data X-Ray algorithms to identify files based on your data governance strategies and taxonomies.
Databases are scanned on a column level at speeds that humans simply cannot match. Once dark data is identified, you can tag those tables and columns for migration or further consideration with your teams.
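To make the idea of a column-level scan concrete, here is a minimal sketch against SQLite. It samples each column and flags those whose values mostly look like email addresses. The function `flag_email_columns`, the simplified email pattern, and the `threshold` and `sample_size` parameters are assumptions for illustration; the source does not describe how the Data X-Ray scans databases internally.

```python
import re
import sqlite3

# Deliberately simple email pattern; an assumption for this sketch only.
EMAIL_RE = re.compile(r"^[^@\s]+@[^@\s]+\.[^@\s]+$")


def flag_email_columns(conn, threshold=0.8, sample_size=100):
    """Return (table, column) pairs whose sampled values mostly match EMAIL_RE."""
    flagged = []
    tables = [row[0] for row in conn.execute(
        "SELECT name FROM sqlite_master "
        "WHERE type = 'table' AND name NOT LIKE 'sqlite_%' ORDER BY name"
    )]
    for table in tables:
        # PRAGMA table_info returns one row per column; index 1 is the column name.
        columns = [row[1] for row in conn.execute(f'PRAGMA table_info("{table}")')]
        for column in columns:
            rows = conn.execute(
                f'SELECT "{column}" FROM "{table}" '
                f'WHERE "{column}" IS NOT NULL LIMIT ?',
                (sample_size,),
            ).fetchall()
            values = [str(row[0]) for row in rows]
            if not values:
                continue  # empty column: nothing to classify
            hits = sum(1 for value in values if EMAIL_RE.match(value))
            if hits / len(values) >= threshold:
                flagged.append((table, column))
    return flagged
```

The flagged pairs could then be tagged for migration or handed to a review team, as described above.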
The Data X-Ray has native integrations with tools like Collibra to trigger workflows based on your specific governance frameworks.
Customize our base ML algorithms with definitions specific to your data governance taxonomies. You can add new ML classes or employ more traditional methods like regular expressions and dictionaries to get the highest accuracy possible.
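The "traditional methods" mentioned above can be pictured with a small sketch that combines regular expressions and keyword dictionaries to assign taxonomy labels to a piece of text. The labels, patterns, and term lists here are invented for illustration and do not reflect the Data X-Ray's actual configuration format or classes.

```python
import re

# Hypothetical regex-based classes (assumptions for this example).
PATTERNS = {
    "email": re.compile(r"\b[\w.+-]+@[\w-]+\.[\w.-]+\b"),
    "us_ssn": re.compile(r"\b\d{3}-\d{2}-\d{4}\b"),
}

# Hypothetical dictionary-based classes: a label applies if any term appears.
DICTIONARIES = {
    "finance": {"invoice", "iban", "payroll"},
    "hr": {"salary", "resume", "appraisal"},
}


def classify(text):
    """Return the sorted list of labels whose pattern or dictionary matches text."""
    labels = set()
    for label, pattern in PATTERNS.items():
        if pattern.search(text):
            labels.add(label)
    words = set(re.findall(r"[a-z]+", text.lower()))
    for label, terms in DICTIONARIES.items():
        if words & terms:  # at least one dictionary term occurs in the text
            labels.add(label)
    return sorted(labels)
```

For example, `classify("Send the invoice to jane@example.com")` returns `['email', 'finance']`. Rules like these are easy to audit, which is one reason they are often used alongside ML classifiers.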