United States: Electronic Discovery & Information Governance - Tip of the Month: Making a Molehill Out of a Mountain: Tips for Handling Terabytes of Data


A medium-sized company is a defendant in a putative class action lawsuit. Outside counsel negotiated the scope of the plaintiffs' document requests as much as possible, agreed on a list of custodians, and sent an e-discovery vendor to the client's headquarters to collect email and copy hard drives. The vendor collected nearly a terabyte of data. The defendant's general counsel would like information on ways to manage the costs associated with this large amount of data, including incorporating data analytics tools into the document review strategy and using skilled, experienced people who understand how to deploy these tools as part of a defensible process.


Key goals for companies that must respond to e-discovery requests are identifying the relevant data and identifying the data that is responsive to the plaintiffs' production requests. These categories overlap, but each allows the producing party to reduce the massive volume of data to a more manageable level. Using early case assessment tools and review workflow techniques, the case team may be able to prioritize the review and production. A benefit of prioritizing the review is that the most relevant documents are typically reviewed early in the process, which allows for early case assessment (ECA) and strategy development.

Choose Your E-Discovery Partner Well

When contemplating the use of advanced analytics for filtering large data volumes, hiring the appropriate e-discovery vendor is an important first step. Counsel should consider working with a qualified e-discovery vendor that:

  • Can perform forensically sound data collections, process data using defensible workflows and prepare supporting documentation;
  • Can make available ECA tools for filtering, searching and developing review strategies;
  • Will host the results in a review application that facilitates further analysis; and
  • Has the experience and the resources to support the case team and meet discovery deadlines.

In cases with a large volume of data, recurring "hosting charges" can become a real burden, especially during a long-running case. Vendors typically charge a per-gigabyte fee for hosting data. The use of various ECA tools can result in additional hosting charges. Counsel may want to explore negotiating alternative fee arrangements for processing and/or hosting at the outset.
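The effect of a recurring per-gigabyte fee can be illustrated with a short calculation. This is only a sketch: the function name `hosting_cost` and every figure in the usage lines (rate, volumes, duration) are hypothetical placeholders, not actual vendor pricing.

```python
def hosting_cost(gigabytes: float, monthly_rate_per_gb: float, months: int) -> float:
    """Total hosting charge for a fixed volume over a number of months.

    Assumes a flat per-gigabyte monthly rate, which is a simplification;
    real fee arrangements may be tiered or negotiated differently.
    """
    return gigabytes * monthly_rate_per_gb * months


# Hypothetical figures for illustration only.
before_culling = hosting_cost(gigabytes=1000, monthly_rate_per_gb=10, months=24)
after_culling = hosting_cost(gigabytes=250, monthly_rate_per_gb=10, months=24)
savings = before_culling - after_culling
```

Because the fee recurs every month, any reduction in hosted volume is multiplied by the length of the case, which is one reason fee arrangements are worth raising at the outset.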

Simple Steps To Take Before Responsiveness Review

In addition to prioritizing the review to move relevant documents to the front of the queue, some case teams may concurrently consider removing data that is unlikely to lead to responsive documents. The most common step is to "DeNIST" data during the initial processing in order to remove particular file types, primarily program or system files. Next, a case team may consider targeted searches to identify non-relevant files, often in the form of music, videos and photos. Similarly, "junk" email, such as daily newspaper reports and newsletters, might be culled prior to the application of search terms in order to minimize false-positive hits. By excluding these files, the case team might gain greater insight into the data while reducing the volume of data promoted to attorney review.
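As a rough illustration of the DeNIST step, the sketch below hashes each collected file and sets aside any file whose hash appears in a list of known program and system files. It assumes the vendor has already loaded a reference hash list (of the kind published in NIST's National Software Reference Library) into a set; the names `denist` and `known_hashes` are hypothetical, and a vendor's processing platform would normally perform this step itself.

```python
import hashlib
from pathlib import Path


def sha1_of_file(path: Path, chunk_size: int = 1 << 20) -> str:
    """Return the SHA-1 hex digest of a file, reading it in chunks."""
    digest = hashlib.sha1()
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def denist(paths, known_hashes):
    """Split files into (kept, removed).

    A file is removed when its hash matches an entry in known_hashes,
    a hypothetical in-memory set of hashes for known system files.
    """
    kept, removed = [], []
    for path in paths:
        if sha1_of_file(Path(path)) in known_hashes:
            removed.append(path)
        else:
            kept.append(path)
    return kept, removed
```

Returning the removed files alongside the kept ones, rather than discarding them silently, makes it easier to report what was excluded and why.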

Another way to cull irrelevant material is through the use of date restrictions. By applying date filters, which often are agreed upon as part of the meet-and-confer dialogue with the requesting party, the case team can concentrate on a date-restricted set of documents for review and analysis. The case team might also consider custodian-specific time limitations. For example, if a custodian only worked in the relevant department for two months, there may be no reason to include email from that person's entire tenure at the company. This initial cut can be performed during processing, so that the out-of-range material is excluded from the reviewable data.
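The date restrictions described above can be sketched as a simple filter: a case-wide date range applies to everyone, and a narrower range applies to a custodian where one has been agreed. The `Document` record, the custodian names and all dates below are hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass
from datetime import date


@dataclass(frozen=True)
class Document:
    doc_id: str
    custodian: str
    sent: date


def filter_by_date(documents, case_window, custodian_windows):
    """Keep documents dated inside the applicable window.

    case_window is a (start, end) pair applied to every custodian.
    custodian_windows maps a custodian to a narrower (start, end) pair;
    it is intersected with the case window so it can never widen it.
    """
    kept = []
    for doc in documents:
        start, end = custodian_windows.get(doc.custodian, case_window)
        start = max(start, case_window[0])
        end = min(end, case_window[1])
        if start <= doc.sent <= end:
            kept.append(doc)
    return kept


# Hypothetical example: one custodian spent only two months in the department.
case_window = (date(2012, 1, 1), date(2013, 12, 31))
custodian_windows = {"lee": (date(2013, 3, 1), date(2013, 4, 30))}
docs = [
    Document("D1", "lee", date(2013, 3, 15)),
    Document("D2", "lee", date(2012, 6, 1)),
    Document("D3", "patel", date(2012, 6, 1)),
    Document("D4", "patel", date(2011, 6, 1)),
]
in_scope_ids = [d.doc_id for d in filter_by_date(docs, case_window, custodian_windows)]
```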

Once broad cuts are made, the next step is typically to run search terms against the remaining data. Creating a list of search terms is an iterative process, often developed through discussions with the client and testing of the terms against the database. The search term hit reports may suggest modifications of certain terms in order to identify additional relevant documents and to minimize the number of "false hits."
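A search term hit report can be approximated with a few lines of code. The sketch below counts, for each term, how many documents it hits and how many of those documents are hit by no other term; a term with many unique hits is often a candidate for closer testing. It assumes case-insensitive whole-word matching, which is far simpler than the search syntax of a real review platform, and the function name `hit_report` is hypothetical.

```python
import re


def hit_report(documents, terms):
    """Return {term: {"docs": n, "unique": m}} for a dict of doc_id -> text.

    "docs" is the number of documents containing the term; "unique" is
    the number of those documents that no other term on the list hits.
    """
    patterns = {
        term: re.compile(r"\b" + re.escape(term) + r"\b", re.IGNORECASE)
        for term in terms
    }
    hits = {term: set() for term in terms}
    for doc_id, text in documents.items():
        for term, pattern in patterns.items():
            if pattern.search(text):
                hits[term].add(doc_id)

    report = {}
    for term in terms:
        others = set().union(*(hits[o] for o in terms if o != term))
        report[term] = {
            "docs": len(hits[term]),
            "unique": len(hits[term] - others),
        }
    return report
```

Rerunning the report after each revision of the term list gives the case team a record of how each change affected the volume promoted to review.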

Consider the Use of Data Analytics Tools

New technologies can make the review process more efficient and can get attorneys' eyes on the key documents faster. For instance, "concept clustering" uses software to group emails about certain themes. Email threading can reduce review volume by showing reviewers only the most "complete" email in a long chain and automatically coding its subsidiary parts so they do not need to be individually reviewed.
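The idea behind email threading can be sketched as follows: within one chain, a message whose full text is quoted inside another message adds nothing new, so only the messages not contained in any other need a reviewer's eyes. This is a deliberately simplified sketch. It assumes replies quote earlier messages verbatim, it ignores headers and attachments, and the helper names are hypothetical; commercial threading tools use considerably more robust methods.

```python
def normalize(body: str) -> str:
    """Strip quote markers, collapse whitespace and lowercase the text."""
    lines = [line.lstrip("> ").strip() for line in body.splitlines()]
    return " ".join(" ".join(lines).split()).lower()


def inclusive_messages(thread):
    """Return ids of messages whose text is not contained in another message.

    thread maps message id -> body for a single email chain. When two
    messages are identical, only the first one listed is kept.
    """
    normalized = {mid: normalize(body) for mid, body in thread.items()}
    ids = list(normalized)
    inclusive = []
    for i, mid in enumerate(ids):
        text = normalized[mid]
        covered = False
        for j, other in enumerate(ids):
            if i == j:
                continue
            other_text = normalized[other]
            # Covered by a longer message, or by an earlier exact duplicate.
            if text in other_text and (len(other_text) > len(text) or j < i):
                covered = True
                break
        if not covered:
            inclusive.append(mid)
    return inclusive
```

In a chain that branches, each branch ends in its own inclusive message, so a reviewer would typically see one email per branch rather than one per chain.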

The case team might consider the use of predictive coding or technology-assisted review (TAR) tools during discovery and trial preparation. Although these tools were initially developed and marketed as a means for reducing the first-level attorney review costs, the focus today is trending toward using these tools to improve evaluation of both documents produced and those received in production. In addition, data analytics tools can be considered for prioritizing the review workflow; streamlining the second-level review, which is typically performed by outside counsel; and quality checking the review in order to prepare the documents for production. Data analytics can also save time, and potentially provide better results, during the preparation of witness files for depositions and trial.

Document, Document, Document

Whatever choices are made for data review, it is important to carefully document them. The use of these tools is relatively new and is still in the process of being fully understood by the legal community. As a result, a degree of skepticism can exist about the use of these tools. Thus, the case team is encouraged to work closely with their e-discovery provider to create supporting documentation that describes the process. This documentation can be used to replicate the process in future litigation and to explain and defend the process in the event of a challenge.

Learn more about Mayer Brown's Electronic Discovery & Records Management practice

Visit us at mayerbrown.com

Mayer Brown is a global legal services provider comprising legal practices that are separate entities (the "Mayer Brown Practices"). The Mayer Brown Practices are: Mayer Brown LLP and Mayer Brown Europe – Brussels LLP, both limited liability partnerships established in Illinois USA; Mayer Brown International LLP, a limited liability partnership incorporated in England and Wales (authorized and regulated by the Solicitors Regulation Authority and registered in England and Wales number OC 303359); Mayer Brown, a SELAS established in France; Mayer Brown JSM, a Hong Kong partnership and its associated entities in Asia; and Tauil & Chequer Advogados, a Brazilian law partnership with which Mayer Brown is associated. "Mayer Brown" and the Mayer Brown logo are the trademarks of the Mayer Brown Practices in their respective jurisdictions.

© Copyright 2015. The Mayer Brown Practices. All rights reserved.

This Mayer Brown article provides information and comments on legal issues and developments of interest. The foregoing is not a comprehensive treatment of the subject matter covered and is not intended to provide legal advice. Readers should seek specific legal advice before taking any action with respect to the matters discussed herein.
