Gárdos, Judit and Egyed-Gergely, Júlia and Horváth, Anna and Micsik, András and Kovács, László and Martin, Dániel and Marx, Attila and Meiszterics, Enikő and Pataki, Balázs and Siket, Melinda and Vajda, Róza (2023) A thematic exploration of textual research resources in CSS data repositories. [Data Collection]
Abstract
The aim of the project was to facilitate research on the interview texts collected at the Research Documentation Centre of the Centre for Social Sciences. Currently, it is difficult to find the passages related to specific topics or to obtain a thematic map of lengthy interviews and collections. To this end, we identified and tested the most promising NLP tools supporting the Hungarian language. Furthermore, a suitable, domain-oriented taxonomy was created for classifying the available texts.
Legal and ethical issues
For any secondary use, please cite the Research Documentation Centre of the Centre for Social Sciences as the distributor of the data, the title of the original research and its research head, and the URL or the DOI of the research collection at the RDC. If you gain access to any personal data in a source at the RDC, publishing it is prohibited.
Title in English: | A thematic exploration of textual research resources in CSS data repositories |
---|---|
Keywords in English: | natural language processing (NLP), named entity recognition, extreme classification, exploratory UI, text visualization, thesaurus |
Subjects: | H Social Sciences > H Social Sciences (General); H Social Sciences > HM Sociology |
Divisions: | Research Documentation Centre (KDK) |
Research funder: | MILAB |
Depositing User: | Enikő Meiszterics |
Date Deposited: | 04 Sep 2023 21:31 |
Last Modified: | 11 Sep 2023 09:36 |
Related papers or data collections: | Horváth, Anna and Szöllősi, Melinda and Annus, Szabolcs (2023) Subject headings and the word. Machine processing of interview collections at the Centre for Social Sciences. [Data Collection] |
URI: | https://openarchive.tk.mta.hu/id/eprint/598 |