The Importance of Process; THE Lesson from Da Silva Moore?
By Sonya Sigler on April 26, 2012
I recently wrote about Technology Assisted Review (“TAR”) and the importance of the combination of humans and technology in Let’s Make a Deal: The Document Review Version. This blog is a continuation of that topic.
In Judge Peck’s recent Da Silva Moore, et al., v. Publicis Groupe, et al. opinion (2/24/12), he emphasized the importance of having a well-thought out workflow and process when using TAR. In that case, the parties had more than three million emails and they agreed to the use of predictive coding. (Predictive coding is a particular type of search technology that generally relies on Latent Semantic Indexing, a statistics-based search methodology.)
The parties are now in a side dispute over the “scope and implementation” of that use of predictive coding technology in the eDiscovery portion of the case. This highly public and contested argument over which protocol and workflow to use when deploying TAR is the centerpiece of the dispute.
Perfection is never a goal of any technology assisted review software: rather, the goal is to find a reasonable and cost effective way to identify and produce relevant documents while identifying and withholding privileged material. The exact workflow and process has until now been the producing party’s purview. Here, the parties agreed to the defendant’s use of predictive coding, but now disagree on how to use it and the number of documents to be review from a statistically significant standpoint.
In Da Silva, the defendant agreed to turn over the seed set of non-privileged documents used to “train” the system. This disclosure would allow the plaintiff to see the decisions and coding calls made on the seed data. Seven iterative rounds of review were scheduled for the training phase. Once complete, the defendant was to sample the documents not marked relevant by the machine. The sample size the defendant’s were using for this phase was 2,399 documents. The plaintiffs questioned and disagreed that this protocol and workflow would work.
The Court reminded the parties that computer-assisted review “works better than most of the alternatives, if not all of the [present] alternatives. So the idea is not make this perfect, it’s not going to be perfect. The idea is to make it significantly better than the alternatives without nearly as much cost.”
The court agreed to use that workflow as described above, but with the caveat that the parties may need to revisit the proposed workflow if things aren’t stabilized or working as first thought. The fundamental disagreement is over whether the seed set of data used to train the predictive coding technology is representative of the entire data set.
All of these things are mere details of a workflow, which is used to decide how to actually locate and confirm a document is relevant (and not privileged) and marked to be turned over in a document production.
Judge Peck went on to say that the Daubert standard is “not applicable to how documents are searched for and found in discovery.” He also said that “it is the process used and the interaction of man and machine that the courts need to examine.” That interaction is the workflow and process behind what was done with the technology by the humans. That workflow is the cautionary tale from the Da Silva Moore decision. That workflow is what we need to be paying attention to when we are putting together an effective eDiscovery plan.
I consider the next to last sentence of this opinion to be the most use: “As with keywords or any other technological solution to ediscovery, counsel must design an appropriate process, including use of available technology, with appropriate quality control testing, to review and produce relevant ESI while adhering to Rule 1 and Rule 26(b)(2)(C) proportionality.” In the end, workflow matters.