top of page

Lexis LAW PreDiscovery - Early Data Analyzer


Lexis LAW comes with a separate application (or module) called Early Data Analyzer. EDA can be used to filter out or cull data, and reduce the overall cost of electronic discovery, based on several criteria:

1. File hashes - for files that do not contain user related data.

2. Duplicates

3. Date ranges.

4. File types.

5. Email sender domain.

6. Language

Note that EDA makes it possible to export email threads without attachments that are filtered out.

EDA has settings for analysis which give you the option to use either MD5 or SHA1 either across custodian sources, and within individual custodians. Text extraction can be skipped in order to improve performance. Language identification analysis can be restricted to only a specified number of KB from the beginning of a file, or restricted to only 'common languages'.

The Indexing features can be modified to ignore or include accents, make text case sensitive, or insert spaces between Asian language characters. Email addresses, dates, and credit card numbers can also be automatically identified.

During the import process LAW will generate an exception log which will indicate which files it could not recognize or process.

EDA has a Global Dashboard which allows it to take advantage of the processing power of multiple networked computers.

In addition to endorsing documents with bates numbers, and indexing text extracted from native files, LAW EDA has two different programs which can render OCR, Abbyy FineReader and ExperVision OpenRTK. As noted in the Tip of the Night for May 16, 2015 different kinds of OCR software will generate different results. Abbyy is 2 to 3 times slower than OpenRTK but it supports Asian language characters, can run multiple instances on a single PC, and can generate PDF/A files. See the Tip of the Night for September 25, 2015. EDA can be configured to detect multiple columns of text on a page while OCRing, and also be modified to deal differently with dot matrix, typewritten, or laser & inkjet printed pages.


Sean O'Shea has more than 20 years of experience in the litigation support field with major law firms in New York and San Francisco.   He is an ACEDS Certified eDiscovery Specialist and a Relativity Certified Administrator.

The views expressed in this blog are those of the owner and do not reflect the views or opinions of the owner’s employer.

If you have a question or comment about this blog, please make a submission using the form to the right. 

Your details were sent successfully!

© 2015 by Sean O'Shea . Proudly created with Wix.com

bottom of page