An Emotion Analysis Dataset of Course Comment Texts in Massive Online Learning Course Platforms
Saved in:
| Title: | An Emotion Analysis Dataset of Course Comment Texts in Massive Online Learning Course Platforms |
|---|---|
| Language: | English |
| Authors: | Xiang Feng (ORCID |
| Source: | Interactive Learning Environments. 2024 32(4):1219-1233. |
| Availability: | Routledge. Available from: Taylor & Francis, Ltd. 530 Walnut Street Suite 850, Philadelphia, PA 19106. Tel: 800-354-1420; Tel: 215-625-8900; Fax: 215-207-0050; Web site: http://www.tandf.co.uk/journals |
| Peer Reviewed: | Y |
| Page Count: | 15 |
| Publication Date: | 2024 |
| Document Type: | Journal Articles Reports - Research |
| Descriptors: | MOOCs, Psychological Patterns, Artificial Intelligence, Prediction, Accuracy, Data Analysis, Foreign Countries |
| Geographic Terms: | China |
| DOI: | 10.1080/10494820.2022.2115517 |
| ISSN: | 1049-4820 1744-5191 |
| Abstract: | Datasets are critical for emotion analysis in the machine learning field. This study aims to explore emotion analysis datasets and related benchmarks in online learning, since, currently, there are very few studies that explore the same. We have scientifically labeled the topic and nine-category emotion of 4715 comment texts in online learning platforms using the "three-person voting label method" based on the "sentence-level" and multi-category labeling dimensions with our self-developed system. After testing the consistency of the labeling results using the "Fleiss Kappa" method, we found that the consistency of the dataset was about 0.51, representing a moderate strength of agreement. Based on the dataset, the prediction accuracy of the Long-Short Term Memory (LSTM) method is about 0.68. This dataset provides a benchmark for the multi-category emotion dataset in the Chinese online learning field. It can provide a basis for the subsequent solution of emotion analysis, monitoring, and intervention in the education field. It can also provide a reference for constructing subsequent datasets in the education field. |
| Abstractor: | As Provided |
| Entry Date: | 2024 |
| Accession Number: | EJ1429173 |
| Database: | ERIC |
| Abstract: | Datasets are critical for emotion analysis in the machine learning field. This study aims to explore emotion analysis datasets and related benchmarks in online learning, since, currently, there are very few studies that explore the same. We have scientifically labeled the topic and nine-category emotion of 4715 comment texts in online learning platforms using the "three-person voting label method" based on the "sentence-level" and multi-category labeling dimensions with our self-developed system. After testing the consistency of the labeling results using the "Fleiss Kappa" method, we found that the consistency of the dataset was about 0.51, representing a moderate strength of agreement. Based on the dataset, the prediction accuracy of the Long-Short Term Memory (LSTM) method is about 0.68. This dataset provides a benchmark for the multi-category emotion dataset in the Chinese online learning field. It can provide a basis for the subsequent solution of emotion analysis, monitoring, and intervention in the education field. It can also provide a reference for constructing subsequent datasets in the education field. |
|---|---|
| ISSN: | 1049-4820 1744-5191 |
| DOI: | 10.1080/10494820.2022.2115517 |