Search and Review in 280 Languages
In this "World is Flat" era, corporations and law firms work globally and communicate across many languages. Discovery documents likewise come in many languages, requiring language-specific search, analysis and review.
Catalyst was one of the first discovery companies to provide true multi-language search and review. Our discovery platform includes sophisticated language identification and tokenization software to ensure that multi-language documents are treated properly. Since 2007, we have specialized in the challenging CJK languages (Chinese, Japanese and Korean), along with Russian, Arabic and Hebrew.
Many systems today advertise themselves as “Unicode compliant.” We’ve been Unicode compliant since 2007, but that is only part of our story. Multi-language documents often come in proprietary code pages (e.g. Shift JIS, Big 5) as well as Unicode. We handle the major proprietary formats along with Unicode and ASCII.
Our platform also identifies the primary and secondary languages in discovery documents. Critically, we properly tokenize languages such as Chinese, Japanese, Korean and Arabic. If the words are not tokenized, a search engine won’t properly recognize search hits.
Our automated processing was designed for complex multi-language documents. We not only handle Unicode but can also process and index non-Unicode documents. We have helped clients manage terabytes of documents in Chinese, Japanese, Korean, Russian, Azerbaijani and many other languages.
Group Documents by Language for Review
As we index your documents, we can identify primary and secondary languages, allowing review administrators to group and assign documents by language. We can also analyze relative language content for more accurate assignments.
Translate Documents On the Fly
Translate documents from over 30 languages with one click. Our Translation Assistant allows you to translate search terms into multiple languages as well.
Multi-Language Search and Review Services
We provide search consultants fluent in multiple languages to assist with search strategy, culling, key document retrieval and analysis. We can also provide local or virtual review teams fluent in over 100 languages.
European Union Data Privacy Compliance
We have been hosting data coming from the European Union for years. We were one of the first to be registered with the U.S. Department of Commerce as a “Safe Harbor.”
Designed for a Global Regulatory Environment
We regularly support large corporations facing multi-language FCPA investigations and FTC Second Requests. Our grid-based system allows you to quickly process and load large volumes of multi-language files, search and organize them for review, and then assign the work to global review teams. Our system really shines when you need to review millions of multi-language documents on a tight deadline.
What People Are Saying
“Catalyst's foreign-language capabilities are industry-leading and its search technology is best-in-class.”