The removal of duplicate records from a dataset in order to avoid repetitious review. In performing deduplication, care must be taken to preserve multiple custodians and context. For example, a meeting schedule may appear innocuous on its own, but highly relevant when attached to an e-mail between competitors who see it as an opportunity to fix prices.