How to Label? Combining Experts’ Knowledge for German Text Classification

D. Pelkmann, A. Tharwat, W. Schenck, in: 2020 7th Swiss Conference on Data Science (SDS), IEEE, 2020, pp. 61–62.

Download
No full text has been uploaded. Publication record only.
Conference paper | Published | English
Authors
David Pelkmann; Alaa Tharwat; Wolfram Schenck
Abstract
A supervised machine learning classifier can only be as good as the labeled training data. For this reason, there is a need for explicit human expert knowledge inside the workflow. Existing data collections often consist of classes different to the ones which are necessary for an individual application. Therefore, generating a new data set based on a predefined labeling guideline is mandatory. The aim of this work is to increase the quality of labeled data sets during their creation. We present a workflow for the labeling of unsorted data by a group of experts, including subsequent classifier training and evaluation. Even if combined with standard methods for feature extraction and classification, a performance improvement was achieved with the proposed labeling method. Furthermore, we offer access to our data set (German newspaper articles) including the labeling guideline as contribution to the research community.
Year of publication
2020
Proceedings title
2020 7th Swiss Conference on Data Science (SDS)
Pages
61-62
Conference
2020 7th Swiss Conference on Data Science (SDS)
Conference location
Luzern, Switzerland
Conference date
2020-06-26 – 2020-06-26
FH-PUB-ID

Cite

Pelkmann, David ; Tharwat, Alaa ; Schenck, Wolfram: How to Label? Combining Experts’ Knowledge for German Text Classification. In: 2020 7th Swiss Conference on Data Science (SDS) : IEEE, 2020, S. 61–62
Pelkmann D, Tharwat A, Schenck W. How to Label? Combining Experts’ Knowledge for German Text Classification. In: 2020 7th Swiss Conference on Data Science (SDS). IEEE; 2020:61-62. doi:10.1109/SDS49233.2020.00023
Pelkmann, D., Tharwat, A., & Schenck, W. (2020). How to Label? Combining Experts’ Knowledge for German Text Classification. In 2020 7th Swiss Conference on Data Science (SDS) (pp. 61–62). Luzern, Switzerland: IEEE. https://doi.org/10.1109/SDS49233.2020.00023
@inproceedings{Pelkmann_Tharwat_Schenck_2020, title={How to Label? Combining Experts’ Knowledge for German Text Classification}, DOI={10.1109/SDS49233.2020.00023}, booktitle={2020 7th Swiss Conference on Data Science (SDS)}, publisher={IEEE}, author={Pelkmann, David and Tharwat, Alaa and Schenck, Wolfram}, year={2020}, pages={61–62} }
Pelkmann, David, Alaa Tharwat, and Wolfram Schenck. “How to Label? Combining Experts’ Knowledge for German Text Classification.” In 2020 7th Swiss Conference on Data Science (SDS), 61–62. IEEE, 2020. https://doi.org/10.1109/SDS49233.2020.00023.
D. Pelkmann, A. Tharwat, and W. Schenck, “How to Label? Combining Experts’ Knowledge for German Text Classification,” in 2020 7th Swiss Conference on Data Science (SDS), Luzern, Switzerland, 2020, pp. 61–62.
Pelkmann, David, et al. “How to Label? Combining Experts’ Knowledge for German Text Classification.” 2020 7th Swiss Conference on Data Science (SDS), IEEE, 2020, pp. 61–62, doi:10.1109/SDS49233.2020.00023.
