Rudolf Rosa

Office hours
I am usually available in my office at the following times:
Tue 16:30-17:30
Wed 15:30-16:30
Feel free to e-mail me for an appointment on other days/at other times!
Malostranské náměstí 25
118 00 Praha 1
Czech Republic

Main Research Interests

  • Dependency parsing
  • Unsupervised and semi-supervised methods, especially cross-lingual and multilingual
  • Automatic post-editing of machine translation
  • Machine translation

Visit my Research MicroBlog to see what I am currently up to!



This is my personal project on cross-lingual syntax modelling. It implements my PhD topic (the full official title is "Odhalování struktury vět přirozeného jazyka pomocí neřízených a částečně řízených metod"; the so far unofficial English translation is "Discovering the structure of natural language sentences by unsupervised and semi-supervised methods") and is carried out under my GAUK grant, Modelling dependency syntax across languages.


Currently, my main project is QTLeap, a project aimed at significantly improving the quality of machine translation using deep language processing approaches (also see TectoMT).


My second main project is HimL, which focuses on semantically sane translation of medical texts from English to Czech, German, Romanian and Polish (also see Depfix).

HamleDT & Universal Dependencies

I am a member of the HamleDT group, a project for harmonizing dependency treebanks across various languages. (This is closely related to my dissertation.) I am also a member of the Universal Dependencies project.


Depfix is a system for automatic post-editing of machine translation outputs. It was developed as a part of the Faust project and its development continues as a part of the HimL project.


MSTperl is a reimplementation of the Maximum Spanning Tree dependency parser (McDonald et al., 2005) in Perl. It is tuned for Czech and has several advanced features that are useful for parsing machine-translated sentences in Depfix. Recently, I have started using it for delexicalized parser transfer, so it now also has some advanced features for that.

Fairytale Child Chatbot / Chatbot Pohádkové dítě

A simple console chatbot that wants to hear a fairy tale from you!
Visit Pohádkové dítě / Fairytale Child

Curriculum Vitae

You can download my CV either in English or in Czech.


I am the student ambassador of ÚFAL, so feel free to contact me with anything related to studying here, including studying linguistics at Matfyz.

I also help Zdeněk Žabokrtský with teaching the NLP Technology course.

Selected Bibliography


I am one of the main organizers of the Slovakoczech NLP workshop for students and early-stage researchers -- see SloNLP 2015, SloNLP 2016.

My ORCID ID is 0000-0003-4908-6127.

Topic suggestions for course credit projects, bachelor's theses, master's theses and similar things (in Czech).

My Erdős number is 4 (me - Jaroslava Hlaváčová - Petr Savický - Zsolt Tuza - Paul Erdős).

My game index is 38 (as of just before Tma 2016); our puzzle-hunt team is called Divize nulou.