The Content Discovery Tool Kit (CDTK) is a bespoke software application designed for Hodge Bank to perform deep, comprehensive data searches for Personally Identifiable Information (PII) across structured and unstructured data stores.
Utilising the power of OCR (Optical Character Recognition) and metadata scanning of common file formats the CDTK investigates and uncovers instances of keywords or groups of keywords, as well as regular expressions, such as bank account numbers, passport numbers, email addresses and dates found in unstructured data stores. The CDTK then reports on its findings, enabling a range of workflow actions to be taken on the management and retention of this data to reduce areas of potential risk.