The views expressed in this blog are those of the owner and do not reflect the views or opinions of the owner’s employer. All content provided on this blog is for informational purposes only. The owner of this blog makes no representations as to the accuracy or completeness of any information on this site or found by following any link on this site. The owner will not be liable for any errors or omissions in this information nor for the availability of this information. The owner will not be liable for any losses, injuries, or damages from the display or use of this information. This policy is subject to change at any time. The owner is not an attorney, and nothing posted on this site should be construed as legal advice. Litigation Support Tip of the Night does not provide confirmation that any e-discovery technique or conduct is compliant with legal, regulatory, contractual or ethical requirements.
Western European (Windows) and Unicode (UTF-8) File Encoding
June 29, 2016
Relativity allows admins to import load files encoded as either Western European (Windows) or Unicode (UTF-8). What is the difference between the two? Western European (Windows), also called ANSI or Windows-1252, is a single-byte encoding that extends the standard ASCII English character set with characters used in other Latin-alphabet European languages. This chart shows the full character set:
UTF-8 is an encoding of Unicode, which covers more than 128,000 characters, including Greek, Chinese, Cyrillic, Japanese and many other non-Latin scripts. For a fuller discussion, see the Tip of the Night for November 25, 2015. In Relativity, if you attempt to import Unicode text into a field that is not Unicode-enabled, you'll get scrambled results. You can enable Unicode for a field by going to Administration . . . Fields.
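To see why mixing the two encodings scrambles text, here is a minimal Python sketch. It is only an illustration and has nothing to do with how Relativity works internally. The sample word and the variable names are my own, and `cp1252` is Python's name for Windows-1252.

```python
# The same word stored in each encoding.
text = "café"

windows_bytes = text.encode("cp1252")  # Western European (Windows)
utf8_bytes = text.encode("utf-8")      # Unicode (UTF-8)

print(windows_bytes)  # b'caf\xe9'      one byte for the accented e
print(utf8_bytes)     # b'caf\xc3\xa9'  two bytes for the accented e

# Reading UTF-8 bytes as if they were Windows-1252 scrambles the text.
scrambled = utf8_bytes.decode("cp1252")
print(scrambled)      # cafÃ©

# Characters outside the Latin alphabets cannot be stored in Windows-1252.
try:
    "Ω".encode("cp1252")
    greek_fits = True
except UnicodeEncodeError:
    greek_fits = False
print(greek_fits)     # False
```

Plain English text looks identical in both encodings, so the problem usually only shows up on accented or non-Latin characters.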
If you need to quickly determine which encoding a load file uses, download File Encoding Checker from CodePlex at https://encodingchecker.codeplex.com/ . In the file mask box, enter a pattern such as *.dat to find all of the load files, then click 'View'. You'll get a list showing each file's encoding.
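If you can't install a utility, a rough check can be scripted instead. The Python sketch below is my own heuristic, not the method File Encoding Checker uses, and the function names `guess_encoding` and `scan_folder` are made up for this example. It looks for a byte order mark, then tests whether the bytes are valid UTF-8, and otherwise assumes a single-byte encoding such as Windows-1252. It is a guess, not a guarantee.

```python
from pathlib import Path


def guess_encoding(data: bytes) -> str:
    """Make a rough guess at the encoding of raw file bytes."""
    # A byte order mark at the start of the file is the strongest hint.
    if data.startswith(b"\xef\xbb\xbf"):
        return "UTF-8 (with BOM)"
    if data.startswith(b"\xff\xfe") or data.startswith(b"\xfe\xff"):
        return "UTF-16"
    # Pure ASCII bytes read the same way in both encodings.
    if all(byte < 0x80 for byte in data):
        return "ASCII (valid as either encoding)"
    # Bytes above 0x7F must follow strict patterns to be valid UTF-8.
    try:
        data.decode("utf-8")
        return "UTF-8"
    except UnicodeDecodeError:
        return "Windows-1252 (or another single-byte encoding)"


def scan_folder(folder: str, mask: str = "*.dat") -> dict:
    """Guess the encoding of every file in folder that matches mask."""
    results = {}
    for path in sorted(Path(folder).glob(mask)):
        results[path.name] = guess_encoding(path.read_bytes())
    return results
```

Calling `scan_folder(".")` returns the guesses for every .dat file in the current folder, keyed by file name. This reads each file fully into memory, so for very large load files you may prefer to test only the first few megabytes.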