This is the central repository for the SynSemClass project, which aggregates the source code from all related repositories created under this initiative.
SynSemClass is an event-type ontology in multiple languages. The project attempts to create specifications and definitions of a hierarchical event-type ontology, populated with words denoting events or states (primarily verbs, verbal nouns, and adjectives, but also any other single- or multiword units denoting events or states). It links its entries or "classes" (and the words that evoke them) to several existing lexical resources that exist and has, to some extent, similar goals; such linking allows for both theoretical and practical comparison and use of the resources.
- Please see the individual repositories for their corresponding publications for citations.
- Or, please select one of the publications from this list on the documentation page most relevant to your work.
- If unsure, the canonical publication for this project is Urešová et al. (2020):
@inproceedings{uresova-etal-2020-synsemclass,
title = "{S}yn{S}em{C}lass Linked Lexicon: Mapping Synonymy between Languages",
author = "Uresova, Zdenka and
Fucikova, Eva and
Hajicova, Eva and
Hajic, Jan",
editor = "Kernerman, Ilan and
Krek, Simon and
McCrae, John P. and
Gracia, Jorge and
Ahmadi, Sina and
Kabashi, Besim",
booktitle = "Proceedings of the 2020 Globalex Workshop on Linked Lexicography",
month = may,
year = "2020",
address = "Marseille, France",
publisher = "European Language Resources Association",
url = "https://aclanthology.org/2020.globalex-1.2",
pages = "10--19",
language = "English",
ISBN = "979-10-95546-46-7",
}
Copyright 2024 Institute of Formal and Applied Linguistics, Faculty of Mathematics and Physics, Charles University, Czech Republic.
This Source Code Form is subject to the terms of the Mozilla Public License, v. 2.0. If a copy of the MPL was not distributed with this file, You can obtain one at http://mozilla.org/MPL/2.0/.
Please see the individual repositories for additional licensing.
The data is released under CC BY-NC-ND 4.0.
You can either clone the related repository/repositories individually, for example, to clone just the SynSemClassSearch:
git clone https://github.com/ufal/SynSemClassSearchOr you can clone the entire project, including all the related repositories as submodules, each in a subdirectory (please note the --resurse_submodules which adds the submodules into subdirectories):
git clone --recurse-submodules https://github.com/ufal/SynSemClass/- SynSemClassML: machine learning for the SynSemClass project. Also accompanies Straková et al. (2023).
- SynSemClassMulti: a tool for manual annotation of SynSemClass lexicons - version with the separate lexicon for every language.
- SynSemClassPipeline: a toolchain for pre-annotation of a new language in a semantic ontology.
- SynSemClassSearch: a Search Tool for the SynSemClass ontology.
- SynSemClassLREC2026: code and data accompanying the paper accepted at LREC 2026: Automatic Suggestions Help Extending Eventive Ontology: A Case Study on SynSemClass.
This work has been supported by the Grant Agency of the Czech Republic under the EXPRO program as project “LUSyD” (project No. GX20-16819X), by the UMR project No. LUAUS23283 supported by the Czech Ministry of Education, Youth and Sports of the Czech Republic (MŠMT ČR), and by the Johannes Amos Comenius Programme (P JAC) project No. CZ.02.01.01/00/22_008/0004605, Natural and anthropogenic georisks.
The work described herein uses resources hosted by the LINDAT/CLARIAH-CZ Research Infrastructure (projects LM2018101 and LM2023062, supported by the Ministry of Education, Youth and Sports of the Czech Republic).
We also acknowledge the use of the corpora created, maintained, and made available by the Czech National Corpus (project No. LM2023044 (2023–2026) funded by the Ministry of Education of the Czech Republic).
- Zdeňka Urešová
uresova@ufal.mff.cuni.czis in charge of the SynSemClass project, - Jan Hajič
hajic@ufal.mff.cuni.czis the PI of LINDAT/CLARIAH-CZ and of the LuSyD project and coordinates the expansion work on the lexicon, - Jana Straková
strakova@ufal.mff.cuni.czis in charge of this repository.