David Mareček - Publications

  1. David Mareček, Rudolf Rosa (2019): From Balustrades to Pierre Vinken: Looking for Syntax in Transformer Self-Attentions. In: The BlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP at ACL 2019, pp. 263-275, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-950737-30-7 (url, local PDF, local PDF, bibtex)
  2. Tomáš Musil, Jonáš Vidra, David Mareček (2019): Derivational Morphological Relations in Word Embeddings. In: The BlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP at ACL 2019, pp. 173-180, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-950737-30-7 (url, bibtex)
  3. Jindřich Libovický, Jindřich Helcl, David Mareček (2018): Input Combination Strategies for Multi-Source Transformer Decoder. In: Proceedings of the Third Conference on Machine Translation, Volume 1: Research Papers, pp. 253-260, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-948087-81-0 (local PDF, local PDF, obd, bibtex)
  4. David Mareček, Rudolf Rosa (2018): Extracting Syntactic Trees from Transformer Encoder Self-Attentions. In: Proceedings of the First Workshop on Analyzing and Interpreting Neural Networks for NLP, pp. 347-349, The Assotiation of Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-948087-71-1 (url, local PDF, local PDF, obd, bibtex)
  5. Rudolf Rosa, David Mareček (2018): CUNI x-ling: Parsing under-resourced languages in CoNLL 2018 UD Shared Task. In: Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies, pp. 187-196, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-948087-82-7 (pdf, local PDF, local PDF, obd, bibtex)
  6. Ondřej Bojar, Tom Kocmi, David Mareček, Roman Sudarikov, Dušan Variš (2017): CUNI Submission in WMT17: Chimera Goes Neural. In: Proceedings of the Second Conference on Machine Translation, Volume 2: Shared Task Papers, pp. 248-256, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-945626-96-8 (obd, bibtex)
  7. David Mareček, Ondřej Bojar, Ondřej Hübsch, Rudolf Rosa, Dušan Variš (2017): CUNI Experiments for WMT17 Metrics Task. In: Proceedings of the Second Conference on Machine Translation, Volume 2: Shared Task Papers, pp. 604-611, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-945626-96-8 (obd, bibtex)
  8. Bedřich Pišl, David Mareček (2017): Communication with Robots using Multilayer Recurrent Networks. In: Proceedings of the First Workshop on Language Grounding for Robotics, pp. 44-48, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-945626-64-7 (pdf, obd, bibtex)
  9. Rudolf Rosa, Daniel Zeman, David Mareček, Zdeněk Žabokrtský (2017): Slavic Forest, Norwegian Wood. In: Proceedings of the Fourth Workshop on NLP for Similar Languages, Varieties and Dialects (VarDial4), pp. 210-219, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-945626-43-2 (pdf, local PDF, local PDF, obd, bibtex)
  10. David Mareček (2016): Delexicalized and Minimally Supervised Parsing on Universal Dependencies. In: Statistical Language and Speech Processing, pp. 30-42, Springer International Publishing, Cham, Switzerland, ISBN 978-3-319-45924-0 (local PDF, obd, bibtex)
  11. David Mareček (2016): Merged bilingual trees based on Universal Dependencies in Machine Translation. In: Proceedings of the First Conference on Machine Translation (WMT). Volume 2: Shared Task Papers, pp. 333-338, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-945626-10-4 (pdf, local PDF, local PDF, obd, bibtex)
  12. David Mareček (2016): Twelve Years of Unsupervised Dependency Parsing. In: Proceedings of the 16th ITAT: Slovenskočeský NLP workshop (SloNLP 2016), pp. 56-62, CreateSpace Independent Publishing Platform, Bratislava, Slovakia, ISBN 978-1537016740 (pdf, local PDF, obd, bibtex)
  13. David Mareček, Zdeněk Žabokrtský (2016): Gibbs Sampling Segmentation of Parallel Dependency Trees for Tree-Based Machine Translation. In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 105, pp. 101-110 (pdf, local PDF, obd, bibtex)
  14. Rudolf Rosa, Martin Popel, Ondřej Bojar, David Mareček, Ondřej Dušek (2016): Moses & Treex Hybrid MT Systems Bestiary. In: Proceedings of the 2nd Deep Machine Translation Workshop, pp. 1-10, ÚFAL MFF UK, Praha, Czechia, ISBN 978-80-88132-02-8 (url, local PDF, local PDF, obd, bibtex)
  15. Zhiwei Yu, David Mareček, Zdeněk Žabokrtský, Daniel Zeman (2016): If You Even Don't Have a Bit of Bible: Learning Delexicalized POS Taggers. In: Proceedings of the 10th International Conference on Language Resources and Evaluation (LREC 2016), pp. 96-103, European Language Resources Association, Paris, France, ISBN 978-2-9517408-9-1 (url, local PDF, obd, bibtex)
  16. Daniel Zeman, David Mareček, Zhiwei Yu, Zdeněk Žabokrtský (2016): Planting Trees in the Desert: Delexicalized Tagging and Parsing Combined. In: Proceedings of the 30th Pacific Asia Conference on Language, Information and Computation, pp. 199-207, Kyung Hee University, Seoul, Korea, ISBN 978-89-6817-428-5 (pdf, local PDF, local PDF, obd, bibtex)
  17. David Mareček (2015): Multilingual Unsupervised Dependency Parsing with Unsupervised POS tags. In: MICAI 2015: Advances in Artificial Intelligence and Soft Computing, Part I, pp. 72-82, Springer, Berlin / Heidelberg, ISBN 978-3-319-27059-3 (obd, bibtex)
  18. David Mareček, Zdeněk Žabokrtský (2014): Dealing with Function Words in Unsupervised Dependency Parsing. In: 15th International Conference on Computational Linguistics and Intelligent Text Processing, pp. 250-261, Springer, Berlin / Heidelberg, ISBN 978-3-642-54905-2 (local PDF, obd, bibtex)
  19. Pavel Pecina, Ondřej Dušek, Lorraine Goeuriot, Jan Hajič, Jaroslava Hlaváčová, Gareth J.F. Jones, Liadh Kelly, Johannes Leveling, David Mareček, Michal Novák, Martin Popel, Rudolf Rosa, Aleš Tamchyna, Zdeňka Urešová (2014): Adaptation of machine translation for multilingual information retrieval in medical domain. In: Artificial Intelligence in Medicine, ISSN 0933-3657, vol. 61, no. 3, pp. 165-185 (url, obd, bibtex)
  20. Loganathan Ramasamy, David Mareček, Zdeněk Žabokrtský (2014): Multilingual Dependency Parsing: Using Machine Translated Texts instead of Parallel Corpora. In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 102, pp. 93-104 (obd, bibtex)
  21. Rudolf Rosa, Jan Mašek, David Mareček, Martin Popel, Daniel Zeman, Zdeněk Žabokrtský (2014): HamleDT 2.0: Thirty Dependency Treebanks Stanfordized. In: Proceedings of the 9th International Conference on Language Resources and Evaluation (LREC 2014), pp. 2334-2341, European Language Resources Association, Reykjavík, Iceland, ISBN 978-2-9517408-8-4 (pdf, local PDF, local PDF, obd, bibtex)
  22. Daniel Zeman, Ondřej Dušek, David Mareček, Martin Popel, Loganathan Ramasamy, Jan Štěpánek, Zdeněk Žabokrtský, Jan Hajič (2014): HamleDT: Harmonized Multi-Language Dependency Treebank. In: Language Resources and Evaluation, ISSN 1574-020X, vol. 48, no. 4, pp. 601-637 (url, local PDF, obd, bibtex)
  23. Niraj Aswani, Thomas Beckers, Erich Birngruber, Célia Boyer, Andreas Burner, Jakub Bystroň, Khalid Choukri, Sarah Cruchet, Hamish Cunningham, Jan Dědek, Ljiljana Dolamic, René Donner, Ondřej Dušek, Sebastian Dungs, Ivan Eggel, Antonio Foncubierta, Norbert Fuhr, Adam Funk, Alba García Seco de Herrera, Arnaud Gaudinat, Georgi Georgiev, Julien Gobeill, Lorraine Goeuriot, Paz Gomez, Mark A. Greenwood, Manfred Gschwandtner, Allan Hanbury, Jan Hajič, Jaroslava Hlaváčová, Markus Holzer, Gareth J.F. Jones, Blanca Jordán, Matthias Jordan, Klemens Kaderk, Franz Kainberger, Liadh Kelly, Sascha Kriewel, Marlene Kritz, Georg Langs, Nolan Lawson, Johannes Leveling, David Mareček, Dimitrios Markonis, Iván Martínez, Vassil Momtchev, Alexandre Masselot, Hélène Mazo, Henning Müller, Michal Novák, Johann Petrak, João Palotti, Pavel Pecina, Konstantin Pentchev, Deyan Peychev, Natalia Pletneva, Martin Popel, Diana Pottecher, Angus Roberts, Rudolf Rosa, Patrick Ruch, Alexander Sachs, Matthias Samwald, Priscille Schneller, Veronika Stefanov, Aleš Tamchyna, Miguel Angel Tinte, Zdeňka Urešová, Alejandro Vargas, Dina Vishnyakova (2013): Khresmoi Professional: Multilingual Semantic Search for Medical Professionals. In: Proceedings of the ACM SIGIR Workshop on Health Search and Discovery: Helping Users and Advancing Medicine, pp. 31-34, Microsoft Research, Cambridge, UK (url, local PDF, obd, bibtex)
  24. David Mareček, Martin Popel, Loganathan Ramasamy, Jan Štěpánek, Daniel Zeman, Zdeněk Žabokrtský, Jan Hajič (2013): Cross-language Study on Influence of Coordination Style on Dependency Parsing Performance (technical report). In: (pdf, local PDF, bibtex)
  25. David Mareček, Milan Straka (2013): Stop-probability estimates computed on a large corpus improve Unsupervised Dependency Parsing. In: Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, pp. 281-290, Association for Computational Linguistics, Sofija, Bulgaria, ISBN 978-1-937284-50-3 (pdf, local PDF, obd, bibtex)
  26. Martin Popel, David Mareček, Jan Štěpánek, Daniel Zeman, Zdeněk Žabokrtský (2013): Coordination Structures in Dependency Treebanks. In: Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, pp. 517-527, Association for Computational Linguistics, Sofija, Bulgaria, ISBN 978-1-937284-50-3 (pdf, local PDF, local PDF, local PDF, obd, bibtex)
  27. Rudolf Rosa, David Mareček, Aleš Tamchyna (2013): Deepfix: Statistical Post-editing of Statistical Machine Translation Using Deep Syntactic Analysis. In: 51st Annual Meeting of the Association for Computational Linguistics Proceedings of the Student Research Workshop, pp. 172-179, Association for Computational Linguistics, Sofija, Bulgaria, ISBN 978-1-937284-53-4 (url, local PDF, local PDF, local PDF, obd, bibtex)
  28. Ondřej Bojar, Zdeněk Žabokrtský, Ondřej Dušek, Petra Galuščáková, Martin Majliš, David Mareček, Jiří Maršík, Michal Novák, Martin Popel, Aleš Tamchyna (2012): The Joy of Parallelism with CzEng 1.0. In: Proceedings of the 8th International Conference on Language Resources and Evaluation (LREC 2012), pp. 3921-3928, European Language Resources Association, İstanbul, Turkey, ISBN 978-2-9517408-7-7 (url, local PDF, obd, bibtex)
  29. Ondřej Dušek, Zdeněk Žabokrtský, Martin Popel, Martin Majliš, Michal Novák, David Mareček (2012): Formemes in English-Czech Deep Syntactic MT. In: Proceedings of the Seventh Workshop on Statistical Machine Translation, pp. 267-274, Association for Computational Linguistics, Montréal, Canada, ISBN 978-1-937284-20-6 (pdf, local PDF, obd, bibtex)
  30. David Mareček (2012): Unsupervised Dependency Parsing (PhD thesis). In: (local PDF, bibtex)
  31. David Mareček, Zdeněk Žabokrtský (2012): Exploiting Reducibility in Unsupervised Dependency Parsing. In: Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, pp. 297-307, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-937284-43-5 (obd, bibtex)
  32. David Mareček, Zdeněk Žabokrtský (2012): Unsupervised Dependency Parsing using Reducibility and Fertility features. In: The NAACL-HLT Workshop on the Induction of Linguistic Structure, pp. 84-89, The Association for Computational Linguistics, Montréal, Canada, ISBN 978-1-937284-20-6 (obd, bibtex)
  33. Rudolf Rosa, Ondřej Dušek, David Mareček, Martin Popel (2012): Using Parallel Features in Parsing of Machine-Translated Sentences for Correction of Grammatical Errors. In: Proceedings of Sixth Workshop on Syntax, Semantics and Structure in Statistical Translation (SSST-6), ACL, pp. 39-48, Association for Computational Linguistics, Jeju, Korea, ISBN 978-1-937284-38-1 (pdf, local PDF, local PDF, obd, bibtex)
  34. Rudolf Rosa, David Mareček (2012): Dependency Relations Labeller for Czech. In: Text, Speech and Dialogue: 15th International Conference, TSD 2012. Proceedings, Lecture Notes in Computer Science, ISSN 0302-9743, 7499, pp. 256-263, Springer Verlag, Berlin / Heidelberg, ISBN 978-3-642-32789-6 (url, local PDF, local PDF, obd, bibtex)
  35. Rudolf Rosa, David Mareček, Ondřej Dušek (2012): DEPFIX: A System for Automatic Correction of Czech MT Outputs. In: Proceedings of the Seventh Workshop on Statistical Machine Translation, pp. 362-368, Association for Computational Linguistics, Montréal, Canada, ISBN 978-1-937284-20-6 (pdf, local PDF, local HTML, local PDF, obd, bibtex)
  36. Daniel Zeman, David Mareček, Martin Popel, Loganathan Ramasamy, Jan Štěpánek, Zdeněk Žabokrtský, Jan Hajič (2012): HamleDT: To Parse or Not to Parse?. In: Proceedings of the 8th International Conference on Language Resources and Evaluation (LREC 2012), pp. 2735-2741, European Language Resources Association, İstanbul, Turkey, ISBN 978-2-9517408-7-7 (url, local PDF, local PDF, obd, bibtex)
  37. David Mareček (2011): Combining Diverse Word-Alignment Symmetrizations Improves Dependency Tree Projection. In: Lecture Notes in Computer Science, ISSN 0302-9743, 6608, pp. 144-154 (url, obd, bibtex)
  38. David Mareček, Rudolf Rosa, Petra Galuščáková, Ondřej Bojar (2011): Two-step translation with grammatical post-processing. In: Proceedings of the Sixth Workshop on Statistical Machine Translation, pp. 426-432, Association for Computational Linguistics, Edinburgh, UK, ISBN 978-1-937284-12-1 (url, local PDF, local PDF, obd, bibtex)
  39. David Mareček, Zdeněk Žabokrtský (2011): Gibbs Sampling with Treeness constraint in Unsupervised Dependency Parsing. In: Robust Unsupervised and Semisupervised Methods in Natural Language Processing, pp. 1-8, Incoma, Šumen, Bulgaria, ISBN 978-954-452-017-5 (obd, bibtex)
  40. David Mareček, Zdeněk Žabokrtský (2011): Unsupervised Dependency Parsing (technical report). In: (pdf, bibtex)
  41. Martin Popel, David Mareček, Nathan David Green, Zdeněk Žabokrtský (2011): Influence of Parser Choice on Dependency-Based MT. In: Proceedings of the Sixth Workshop on Statistical Machine Translation, pp. 433-439, Association for Computational Linguistics, Edinburgh, UK, ISBN 978-1-937284-12-1 (obd, bibtex)
  42. Ondřej Bojar, Kamil Kos, David Mareček (2010): Tackling Sparse Data Issue in Machine Translation Evaluation. In: Proceedings of the ACL 2010 Conference Short Papers, pp. 86-91, Association for Computational Linguistics, Uppsala, Sweden, ISBN 978-1-932432-69-5 (url, obd, bibtex)
  43. Natalia Klyueva, David Mareček (2010): Towards Parallel Czech-Russian Dependency Treebank. In: Workshop on Annotation and Exploitation of Parallel Corpora, NEALT Proceedings Series, ISSN 1736-6305, 10, pp. 44-52, Northern European Association for Language Technology, Tartu, Estonia (local PDF, local PDF, obd, bibtex)
  44. David Mareček, Martin Popel, Zdeněk Žabokrtský (2010): Maximum Entropy Translation Model in Dependency-Based MT Framework. In: Proceedings of the Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR, pp. 201-201, Association for Computational Linguistics, Uppsala, Sweden, ISBN 978-1-932432-71-8 (pdf, obd, bibtex)
  45. Martin Popel, David Mareček (2010): Perplexity of n-gram and Dependency Language Models. In: Text, Speech and Dialogue. 13th International Conference, TSD 2010, Brno, Czech Republic, September 6-10, 2010. Proceedings, Lecture Notes in Computer Science, ISSN 0302-9743, 6231, pp. 173-180, Springer, Berlin / Heidelberg, ISBN 978-3-642-15759-2 (local PDF, local PDF, obd, bibtex)
  46. Ondřej Bojar, David Mareček, Václav Novák, Martin Popel, Jan Ptáček, Jan Rouš, Zdeněk Žabokrtský (2009): English-Czech MT in 2008. In: Proceedings of the Fourth Workshop on Statistical Machine Translation, pp. 125-129, Association for Computational Linguistics, Athina, Greece (pdf, local PDF, bibtex)
  47. David Mareček (2009): Improving Word Alignment Using Alignment of Deep Structures. In: Proceedings of the 12th International Conference, TSD 2009, pp. 56-63, Springer, Berlin / Heidelberg, ISBN 978-3-642-04207-2 (pdf, bibtex)
  48. David Mareček (2009): Using Tectogrammatical Alignment in Phrase‐Based Machine Translation. In: WDS'09 Proceedings of Contributed Papers, pp. 22-27, Matfyzpress, Charles University, Praha, Czechia, ISBN 978-80-7378-101-9 (pdf, obd, bibtex)
  49. David Mareček, Natalia Klyueva (2009): Converting Russian Treebank SynTagRus into Praguian PDT Style. In: Multilingual resources, technologies and evaluation for Central and Eastern European languages, pp. 30-35, INCOMA Ltd., Shoumen, Bulgaria, ISBN 978-954-452-008-3 (pdf, bibtex)
  50. David Mareček (2008): Automatic Alignment of Tectogrammatical Trees from Czech-English Parallel Corpus (masters thesis). In: (local PDF, bibtex)
  51. David Mareček, Zdeněk Žabokrtský, Václav Novák (2008): Automatic Alignment of Czech and English Deep Syntactic Dependency Trees. In: Proceedings of the Twelfth EAMT Conference, pp. 102-111, HITEC e.V., Hamburg, Germany, ISBN 978-3-00-025770-4 (pdf, local PDF, obd, bibtex)