Building and Using Comparable Corpora for Multilingual Natural Language Processing

By Serge Sharoff, Reinhard Rapp

Hardcover, English, 2023

453 kr

Special-order item. Ships within 10-15 business days. Free shipping on orders over 249 kr.

Description

This book provides a comprehensive overview of methods for building comparable corpora and of their applications, including machine translation, cross-lingual transfer, and various kinds of multilingual natural language processing. The authors begin with a brief history of the topic, followed by a comparison with parallel resources and an explanation of why comparable corpora have become more widely used. In particular, comparable corpora provide the basis for the multilingual capabilities of pre-trained models such as BERT or GPT. The book then focuses on building comparable corpora, aligning their sentences to create a database of suitable translations, and using these sentence translations to produce dictionaries and term banks. Finally, the authors explain how comparable corpora can be used to build machine translation engines and to develop a wide variety of multilingual applications.

More from the same author

Genres on the Web

Alexander Mehler, Serge Sharoff, Marina Santini

Hardcover, 2010

1 680 kr

You may also be interested in

Genres on the Web

Alexander Mehler, Serge Sharoff, Marina Santini

Paperback, 2012

1 680 kr