Translational equivalence in Statistical Machine Translation or meaning as co-occurrence

Lieve Macken; Els Lefever

doi:10.52034/lanstts.v7i.215

Translational equivalence in Statistical Machine Translation or meaning as co-occurrence

Authors

Lieve Macken University College Ghent Faculty of Translation Studies
Els Lefever University College Ghent Faculty of Translation Studies Groot-Brittanniëlaan

DOI:

https://doi.org/10.52034/lanstts.v7i.215

Keywords:

co-occurrence, corpus-based Machine Translation, Ma- chine Translation, meaning, phrase translation table, Statistical Machine Translation, word alignment, word sense disambiguation

Abstract

In this paper, we will describe the current state-of-the-art of Statistical Machine Translation (SMT), and reflect on how SMT handles meaning. Statistical Machine Translation is a corpus-based approach to MT: it de-rives the required knowledge to generate new translations from corpora. General-purpose SMT systems do not use any formal semantic representa-tion. Instead, they directly extract translationally equivalent words or word sequences – expressions with the same meaning – from bilingual parallel corpora. All statistical translation models are based on the idea of word alignment, i.e., the automatic linking of corresponding words in parallel texts. The first generation SMT systems were word-based. From a linguistic point of view, the major problem with word-based systems is that the mean-ing of a word is often ambiguous, and is determined by its context. Current state-of-the-art SMT-systems try to capture the local contextual dependen-cies by using phrases instead of words as units of translation. In order to solve more complex ambiguity problems (where a broader text scope or even domain information is needed), a Word Sense Disambiguation (WSD) module is integrated in the Machine Translation environment.

Downloads

Download data is not yet available.

Downloads

Published

25-10-2021

How to Cite

Macken, L., & Lefever, E. (2021). Translational equivalence in Statistical Machine Translation or meaning as co-occurrence. Linguistica Antverpiensia, New Series – Themes in Translation Studies, 7. https://doi.org/10.52034/lanstts.v7i.215

Download Citation

Issue

Vol. 7 (2008): Looking for Meaning: Methodological Issues in Translations Studies

Section

Articles

License

This work is licensed under a Creative Commons Attribution 4.0 International License.

Authors who publish with this journal agree to the following terms:

Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under the CC BY-NC 4.0 Deed that allows others to share the work with an acknowledgement of the work's authorship and initial publication in this journal. The material cannot be used for commercial purposes.

Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgement of its initial publication in this journal.

Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work (See The Effect of Open Access).

Translational equivalence in Statistical Machine Translation or meaning as co-occurrence

Authors

DOI:

Keywords:

Abstract

Downloads

Downloads

Published

How to Cite

Issue

Section

License

Make a Submission

Information

linkedin

Browse