Assume a set of data has been collected from the client as potentially responsive to requests for production from an opposing party. After de-duping, de-NISTing, etc., the remainder of the data yields a corpus of 1.1 million documents. A cursory “macro’ review (e.g. using document types, date ranges, or common email spam domains) yields 100,000 documents as clearly non-responsive and these are removed from the corpus.
At this point, there are 1 million documents remaining that are potentially responsive and need to be evaluated in more detail.