[ Skip to the content ]

Institute of Formal and Applied Linguistics

at Faculty of Mathematics and Physics, Charles University, Prague, Czech Republic


[ Back to the navigation ]

Publication


Year 2014
Type in proceedings
Status published
Language English
Author(s) Xue, Nianwen Bojar, Ondřej Hajič, Jan Palmer, Martha Urešová, Zdeňka Zhang, Xiuhong
Title Not an Interlingua, But Close: Comparison of English AMRs to Chinese and Czech
Czech title Ještě ne interlingva, ale už dost blízko: Srovnání anglických AMR s čínskými a českými
Proceedings 2014: Reykjavík, Iceland: LREC 2014: Proceedings of the 9th International Conference on Language Resources and Evaluation (LREC 2014)
Pages range 1765-1772
How published online
URL http://www.lrec-conf.org/proceedings/lrec2014/pdf/384_Paper.pdf
Supported by 2013-2015 GP13-03351P (Srovnání české a anglické valence sloves na základě korpusového materiálu (teorie a praxe)) 2012-2014 LH12093 (Strojový překlad se sémantickou informací) 2010-2015 LM2010013 (LINDAT-CLARIN: Institut pro analýzu, zpracování a distribuci lingvistických dat) 2012-2016 PRVOUK P46 (Informatika)
Czech abstract Reprezentace AMR abstrahují od morfologicko- syntaktických jevů, protože mnohé z nich mohou být důvodem mezijazykových rozdílů. V článku se snažíme zjistit, zda by AMR mohly napomoci počítačovému překladu. Využíváme k tomu porovnání překladu a anotace anglických vět do češtiny a čínštiny.
English abstract Abstract Meaning Representations (AMRs) are rooted, directional and labeled graphs that abstract away from morpho-syntactic idiosyncrasies such as word category (verbs and nouns), word order, and function words (determiners, some prepositions). Because these syntactic idiosyncrasies account for many of the cross-lingual differences, it would be interesting to see if this representation can serve, e.g., as a useful, minimally divergent transfer layer in machine translation. To answer this question, we have translated 100 English sentences that have existing AMRs into Chinese and Czech to create AMRs for them. A cross-linguistic comparison of English to Chinese and Czech AMRs reveals both cases where the AMRs for the language pairs align well structurally and cases of linguistic divergence. We found that the level of compatibility of AMR between English and Chinese is higher than between English and Czech. We believe this kind of comparison is beneficial to further refining the annotation standards for each of the three languages and will lead to more compatible annotation guidelines between the languages.
Specialization linguistics ("jazykověda")
Confidentiality default – not confidential
Open access no
WOS Code 000355611003061
Editor(s)* Nicoletta Calzolari; Khalid Choukri; Thierry Declerck; Hrafn Loftsson; Bente Maegaard; Joseph Mariani
ISBN* 978-2-9517408-8-4
Address* Reykjavík, Iceland
Month* May
Venue* Harpa Conference Centre
Publisher* European Language Resources Association
Creator: Common Account
Created: 4/14/14 9:50 AM
Modifier: Common Account
Modified: 11/9/15 9:45 AM
***

public384_Paper.pdfapplication/pdf
Not an InterlinguapublicCAMERA_READY.pdfapplication/pdf
Content, Design & Functionality: ÚFAL, 2006–2016. Page generated: Tue Oct 23 06:34:19 CEST 2018

[ Back to the navigation ] [ Back to the content ]

100% OpenAIRE compliant