Percipient LogoPercipient LogoPercipient LogoPercipient Logo
  • About
  • Services
  • Articles & Resources
  • Contact
✕

Vertical, Horizontal and Global Deduplication Explained

June 2, 2015

“Deduplication” or “Deduping” is the process of comparing computer files in a data-set and removing or segregating duplicates. Two significant benefits of using e-discovery software are deduplication capabilities and identification of “near duplicates.” (Near duplicate documents are those that are closely related, such as contract drafts with textual differences, or a document in different formats). Deduping a document collection reduces the number of documents to review. E-discovery software generally dedupes document collections by analyzing the hash value of the files.

Vertical deduplication occurs when duplicates are removed from documents collected from individual data custodians. This is also sometimes called custodian deduplication.

Horizontal, or global, deduplication occurs when a whole data-set is analyzed and duplicates are removed.

For more on deduplication, please visit the deduplication page in the EDRM glossary.

 

Share
Percipient Team
Percipient Team

Related posts

Your guide to eDiscovery Review Protocol
September 29, 2022

The Complete Guide to Drafting Legal Document Review Protocols


Read more
Image for article on ediscovery search in microsoft 365
July 21, 2022

What Version of Microsoft 365 Do We Need for eDiscovery?


Read more
Artificial Intelligence Defensibility
June 23, 2022

Artificial Intelligence and Legal Defensibility – Distinguishing AI Concepts and Explaining in Plain Language


Read more
Percipient Logo

Learn

Articles & Resources

Technically Legal Podcast

Company

About

Services

Contact

Talk to Us
(c) Percipient, LLC – not a law firm and
not licensed to practice law in any jurisdiction.
Privacy Policy
Website construction by WorkSite, LLC