Follow me on Twitter and see How-To Videos on my YouTube channel.
New tips for paralegals and litigation support profesionals are posted to this site each night. Click on the blog headings for better detail.
The views expressed in this blog are those of the owner and do not reflect the views or opinions of the owner’s employer. All content provided on this blog is for informational purposes only. The owner of this blog makes no representations as to the accuracy or completeness of any information on this site or found by following any link on this site. The owner will not be liable for any errors or omissions in this information nor for the availability of this information. The owner will not be liable for any losses, injuries, or damages from the display or use of this information. This policy is subject to change at any time. The owner is not an attorney, and nothing posted on this site should be construed as legal advice. Litigation Support Tip of the Night does not provide confirmation that any e-discovery technique or conduct is compliant with legal, regulatory, contractual or ethical requirements.
Relativity allows users to use Lucene Search in on long text fields stored in Data Grid. So if your workspace is enabled for Data Grid, in the Search panel you click Add Condition, and then select Index Search. When the separate Index Search window opens you select Lucene Search. Note that Lucene is a free Java based information retrieval search tool that is particularly good at locating similar documents.
You can enter a standard Boolean search, and then click Apply to put it in the conditions list.
One of the most unique features of Lucene search is that it lets you specify the fuzziness level of your search in the search syntax. Numbers between 0 to 2 can be entered after a ~ to specify how many character terms in the results should differ from the search term. kCura's guide notes that 80% of misspellings have an edit distance of 1. So a search for:
. . . will find treasore, but not tresore.
A variation of this is a proximity search in which the syntax specifies the distance between two terms. So a search for
. . . will find "treasure in a box" and "box of treasure", but not "treasure of priceless diamonds in a box"