We kindly invite you to a lecture by Václav Cvrček, the director of the Institute of the Czech National Corpus, which will be held this Thursday, November 15th, 2015, at 15.00, at the Faculty of Arts of the University of Ljubljana (Modra soba hall, 5th floor). The lecture is partially endorsed by the Centre for Applied Linguistics within the Trojina Institute.
Introducing Czech National Corpus, Václav Cvrček
The Czech National Corpus (CNC, see www.korpus.cz) is an academic project that strives to continuously map the Czech language in all its dimensions. In 2011 it was acknowledged as a research infrastructure for empirical language-oriented research in the social sciences and humanities. Since its foundation in 1994, the CNC has been systematically collecting, processing and providing access to large language corpora of Czech and other languages for contrastive research. In my talk I would like to introduce the CNC project, both its current activities and its development outlook, with respect to the following topics: data collection (current data coverage and plans for the future), data processing (linguistic and structural annotation), and tools and applications for corpus-based research developed within the CNC.