The removal from a document set of e-mails which are subsumed within larger e-mail conversations, in order to avoid repeated review of the same message. Care must be taken to avoid pitfalls relating to unreliable message IDs and subject headers, multiple custodians, and attachments.