This paper presents GramatiKat, a specialized corpus tool, in the context of Open Science principles, in particular data sharing, which opens opportunities for original research, makes research easier to verify, and allows others to build on previous work. The tool is designed primarily for examining grammatical categories from a quantitative point of view. It offers grammatical profiles of individual lemmas (currently 14 thousand Czech nouns) and the proportions of individual grammatical categories within a part of speech, i.e., the standard behavior of a word class. The data in GramatiKat are pre-processed, statistically evaluated, and presented in charts and tables for clarity; they are available to other linguists, especially those working in morphology and lexicography. The article aims to inspire and support corpus and non-corpus linguists in making fuller use of existing tools and in creating new specialized tools available to other users.
Keywords
- specialized corpus tools
- grammatical category
- morphology
- lexicography
- Open Science