The FEMTI guidelines for contextual MT evaluation: principles and resources

Paula Estrella, Andrei Popescu-Belis, Maghi King

Abstract


A large number of evaluation metrics exist for machine translation (MT) systems, but depending on the intended context of use of such a system, not all metrics are equally relevant. Based on the ISO/IEC 9126 and 14598 standards for software evaluation, the Framework for the Evaluation of Machine Translation in ISLE (FEMTI) provides guidelines for the selection of quality characteristics to be evaluated depending on the expected task, users, and input characteristics of an MT system. This approach to contextual evaluation was implemented as a web-based application which helps its users design evaluation plans. In addition, FEMTI offers experts in evaluation the possibility to enter and share their knowledge using a dedicated web-based tool, tested in several evaluation exercises.

Keywords


MT evaluation; contextual evaluation; quality models for MT; contexts of use for MT; evaluation plans; web-based interfaces; FEMTI

Full Text:

PDF