The control set is a random sample that attempts to represent the universe of documents the predictive-coding software will analyze. The control set is used to measure accuracy. Some software providers, such as Dagger Analytics, allow parties to use part or all of the training set as the control set, reducing the number of documents requiring review. The random set is typically a few hundred to a few thousand documents, depending on prevalence and the desired margins of error.