{"doi":"10.1109/SDS49233.2020.00023","publisher":"IEEE","author":[{"id":"217023","first_name":"David","last_name":"Pelkmann","full_name":"Pelkmann, David"},{"full_name":"Tharwat, Alaa","last_name":"Tharwat","first_name":"Alaa"},{"id":"224375","first_name":"Wolfram","orcid":"0000-0003-3300-2048","orcid_put_code_url":"https://api.orcid.org/v2.0/0000-0003-3300-2048/work/94914458","last_name":"Schenck","full_name":"Schenck, Wolfram"}],"date_updated":"2024-06-13T09:58:30Z","page":"61-62","language":[{"iso":"eng"}],"type":"conference","conference":{"name":"2020 7th Swiss Conference on Data Science (SDS)","location":"Luzern, Switzerland","start_date":"2020-06-26","end_date":"2020-06-26"},"publication_identifier":{"eisbn":["978-1-7281-7177-7"]},"abstract":[{"lang":"eng","text":"A supervised machine learning classifier can only be as good as the labeled training data. For this reason, there is a need for explicit human expert knowledge inside the workflow. Existing data collections often consist of classes different to the ones which are necessary for an individual application. Therefore, generating a new data set based on a predefined labeling guideline is mandatory. The aim of this work is to increase the quality of labeled data sets during their creation. We present a workflow for the labeling of unsorted data by a group of experts, including subsequent classifier training and evaluation. Even if combined with standard methods for feature extraction and classification, a performance improvement was achieved with the proposed labeling method. Furthermore, we offer access to our data set (German newspaper articles) including the labeling guideline as contribution to the research community."}],"publication_status":"published","year":"2020","_id":"1206","status":"public","publication":"2020 7th Swiss Conference on Data Science (SDS)","title":"How to Label? Combining Experts’ Knowledge for German Text Classification","date_created":"2021-06-03T19:35:53Z","user_id":"220548","citation":{"bibtex":"@inproceedings{Pelkmann_Tharwat_Schenck_2020, title={How to Label? Combining Experts’ Knowledge for German Text Classification}, DOI={10.1109/SDS49233.2020.00023}, booktitle={2020 7th Swiss Conference on Data Science (SDS)}, publisher={IEEE}, author={Pelkmann, David and Tharwat, Alaa and Schenck, Wolfram}, year={2020}, pages={61–62} }","ama":"Pelkmann D, Tharwat A, Schenck W. How to Label? Combining Experts’ Knowledge for German Text Classification. In: 2020 7th Swiss Conference on Data Science (SDS). IEEE; 2020:61-62. doi:10.1109/SDS49233.2020.00023","mla":"Pelkmann, David, et al. “How to Label? Combining Experts’ Knowledge for German Text Classification.” 2020 7th Swiss Conference on Data Science (SDS), IEEE, 2020, pp. 61–62, doi:10.1109/SDS49233.2020.00023.","ieee":"D. Pelkmann, A. Tharwat, and W. Schenck, “How to Label? Combining Experts’ Knowledge for German Text Classification,” in 2020 7th Swiss Conference on Data Science (SDS), Luzern, Switzerland, 2020, pp. 61–62.","chicago":"Pelkmann, David, Alaa Tharwat, and Wolfram Schenck. “How to Label? Combining Experts’ Knowledge for German Text Classification.” In 2020 7th Swiss Conference on Data Science (SDS), 61–62. IEEE, 2020. https://doi.org/10.1109/SDS49233.2020.00023.","apa":"Pelkmann, D., Tharwat, A., & Schenck, W. (2020). How to Label? Combining Experts’ Knowledge for German Text Classification. In 2020 7th Swiss Conference on Data Science (SDS) (pp. 61–62). Luzern, Switzerland: IEEE. https://doi.org/10.1109/SDS49233.2020.00023","short":"D. Pelkmann, A. Tharwat, W. Schenck, in: 2020 7th Swiss Conference on Data Science (SDS), IEEE, 2020, pp. 61–62.","alphadin":"Pelkmann, David ; Tharwat, Alaa ; Schenck, Wolfram: How to Label? Combining Experts’ Knowledge for German Text Classification. In: 2020 7th Swiss Conference on Data Science (SDS) : IEEE, 2020, S. 61–62"}}