Description of the Chinese-to-Spanish rule-based machine translation system developed with a hybrid combination of human annotation and statistical techniques

All rights reserved. This work is protected by the corresponding intellectual and industrial property rights. Without prejudice to existing legal exemptions, its reproduction, distribution, public communication, or transformation is prohibited without the authorization of the rights holder.

Abstract

Two of the most popular machine translation (MT) paradigms are rule-based (RBMT) and corpus-based, the latter including statistical systems (SMT). When parallel corpora are scarce, RBMT becomes particularly attractive. This is the case for the Chinese-Spanish language pair.

This article presents the first RBMT system for Chinese-to-Spanish translation. We describe a hybrid method for constructing this system that takes advantage of available resources such as parallel corpora, which are used to extract dictionaries and lexical and structural transfer rules.

The final system is open source and freely available online. Although its performance lags behind standard SMT systems on an in-domain test set, the results show that the RBMT system's coverage is competitive and that it outperforms the SMT system on an out-of-domain test set. This RBMT system is available to the general public, can be further enhanced, and opens up the possibility of creating future hybrid MT systems.

Citation

Ruiz, M., Centelles, J. Description of the Chinese-to-Spanish rule-based machine translation system developed with a hybrid combination of human annotation and statistical techniques. "ACM Transactions on Asian Language Information Processing", 1 November 2015, vol. 15, no. 1, pp. 1-13.

URI

https://hdl.handle.net/2117/104736

DOI

10.1145/2738045

ISSN

1530-0226

Publisher's version

http://dl.acm.org/citation.cfm?id=2738045

Collections

Others - Submission from DRAC
VEU - Grup de Tractament de la Parla - Journal articles
Departament de Teoria del Senyal i Comunicacions - Journal articles
