When comments aren't what they seem: The social media comment toxicity detector for understanding contextual comments.
Saved in:
| Title: | When comments aren't what they seem: The social media comment toxicity detector for understanding contextual comments. |
|---|---|
| Authors: | Song, Zichen1 (AUTHOR) songzch21@lzu.edu.cn, Fan, Xiaopeng1 (AUTHOR) fanxp2023@lzu.edu.cn, Wang, Yutong1 (AUTHOR) yutongwang2023@lzu.edu.cn, Yan, Feixuan1 (AUTHOR) yanfx2023@lzu.edu.cn, Wu, Zijin1 (AUTHOR) wuzj2023@lzu.edu.cn, Kang, Zhongfeng1 (AUTHOR) kangzf@lzu.edu.cn |
| Source: | Expert Systems with Applications. Apr2026, Vol. 305, pN.PAG-N.PAG. 1p. |
| Subjects: | Context-aware computing, Generative pre-trained transformers, Machine learning, Sentiment analysis, Electronic data processing |
| Abstract: | Social media comments often contain toxic, sarcastic, or ambiguous language, challenging traditional toxicity detection models. However, most existing methods treat comments as isolated units and rely solely on surface-level textual cues, making them ineffective in identifying implicit toxicity such as sarcasm or context-dependent insults. These limitations hinder their performance in real-world scenarios, where toxicity often emerges across multi-turn interactions. This paper proposes a context-aware interactive framework using GPT models to simulate human-like interactions, transforming single comments into multi-level threads. This approach improves toxicity detection by leveraging contextual dialogue. We evaluate our method on 5018 comments from platforms such as Zhihu, Bilibili, Weibo, YouTube, and RedNote. Our experiments show that our method outperforms traditional models, especially on YouTube, where sensitive word frequency is high. Additionally, our approach demonstrates strong multilingual support, successfully detecting toxicity in six languages. This paper also compares the performance of GPT-4O, GPT-4, GPT-3.5 and DeepSeek-V3, highlighting the advantage of multimodal processing in the former for more robust detection. The dataset we have collected is publicly available at https://github.com/kanglzu/When-Comments-Aren-t-What-They-Seem. [ABSTRACT FROM AUTHOR] |
| Copyright of Expert Systems with Applications is the property of Pergamon Press - An Imprint of Elsevier Science and its content may not be copied or emailed to multiple sites without the copyright holder's express written permission. Additionally, content may not be used with any artificial intelligence tools or machine learning technologies. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.) | |
| Database: | Engineering Source |
| FullText | Text: Availability: 0 |
|---|---|
| Header | DbId: egs DbLabel: Engineering Source An: 191526000 AccessLevel: 6 PubType: Academic Journal PubTypeId: academicJournal PreciseRelevancyScore: 0 |
| IllustrationInfo | |
| Items | – Name: Title Label: Title Group: Ti Data: When comments aren't what they seem: The social media comment toxicity detector for understanding contextual comments. – Name: Author Label: Authors Group: Au Data: <searchLink fieldCode="AR" term="%22Song%2C+Zichen%22">Song, Zichen</searchLink><relatesTo>1</relatesTo> (AUTHOR)<i> songzch21@lzu.edu.cn</i><br /><searchLink fieldCode="AR" term="%22Fan%2C+Xiaopeng%22">Fan, Xiaopeng</searchLink><relatesTo>1</relatesTo> (AUTHOR)<i> fanxp2023@lzu.edu.cn</i><br /><searchLink fieldCode="AR" term="%22Wang%2C+Yutong%22">Wang, Yutong</searchLink><relatesTo>1</relatesTo> (AUTHOR)<i> yutongwang2023@lzu.edu.cn</i><br /><searchLink fieldCode="AR" term="%22Yan%2C+Feixuan%22">Yan, Feixuan</searchLink><relatesTo>1</relatesTo> (AUTHOR)<i> yanfx2023@lzu.edu.cn</i><br /><searchLink fieldCode="AR" term="%22Wu%2C+Zijin%22">Wu, Zijin</searchLink><relatesTo>1</relatesTo> (AUTHOR)<i> wuzj2023@lzu.edu.cn</i><br /><searchLink fieldCode="AR" term="%22Kang%2C+Zhongfeng%22">Kang, Zhongfeng</searchLink><relatesTo>1</relatesTo> (AUTHOR)<i> kangzf@lzu.edu.cn</i> – Name: TitleSource Label: Source Group: Src Data: <searchLink fieldCode="JN" term="%22Expert+Systems+with+Applications%22">Expert Systems with Applications</searchLink>. Apr2026, Vol. 305, pN.PAG-N.PAG. 1p. – Name: Subject Label: Subjects Group: Su Data: <searchLink fieldCode="DE" term="%22Context-aware+computing%22">Context-aware computing</searchLink><br /><searchLink fieldCode="DE" term="%22Generative+pre-trained+transformers%22">Generative pre-trained transformers</searchLink><br /><searchLink fieldCode="DE" term="%22Machine+learning%22">Machine learning</searchLink><br /><searchLink fieldCode="DE" term="%22Sentiment+analysis%22">Sentiment analysis</searchLink><br /><searchLink fieldCode="DE" term="%22Electronic+data+processing%22">Electronic data processing</searchLink> – Name: Abstract Label: Abstract Group: Ab Data: Social media comments often contain toxic, sarcastic, or ambiguous language, challenging traditional toxicity detection models. However, most existing methods treat comments as isolated units and rely solely on surface-level textual cues, making them ineffective in identifying implicit toxicity such as sarcasm or context-dependent insults. These limitations hinder their performance in real-world scenarios, where toxicity often emerges across multi-turn interactions. This paper proposes a context-aware interactive framework using GPT models to simulate human-like interactions, transforming single comments into multi-level threads. This approach improves toxicity detection by leveraging contextual dialogue. We evaluate our method on 5018 comments from platforms such as Zhihu, Bilibili, Weibo, YouTube, and RedNote. Our experiments show that our method outperforms traditional models, especially on YouTube, where sensitive word frequency is high. Additionally, our approach demonstrates strong multilingual support, successfully detecting toxicity in six languages. This paper also compares the performance of GPT-4O, GPT-4, GPT-3.5 and DeepSeek-V3, highlighting the advantage of multimodal processing in the former for more robust detection. The dataset we have collected is publicly available at https://github.com/kanglzu/When-Comments-Aren-t-What-They-Seem. [ABSTRACT FROM AUTHOR] – Name: AbstractSuppliedCopyright Label: Group: Ab Data: <i>Copyright of Expert Systems with Applications is the property of Pergamon Press - An Imprint of Elsevier Science and its content may not be copied or emailed to multiple sites without the copyright holder's express written permission. Additionally, content may not be used with any artificial intelligence tools or machine learning technologies. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract.</i> (Copyright applies to all Abstracts.) |
| PLink | https://search.ebscohost.com/login.aspx?direct=true&site=eds-live&db=egs&AN=191526000 |
| RecordInfo | BibRecord: BibEntity: Identifiers: – Type: doi Value: 10.1016/j.eswa.2025.130902 Languages: – Code: eng Text: English PhysicalDescription: Pagination: PageCount: 1 StartPage: N.PAG Subjects: – SubjectFull: Context-aware computing Type: general – SubjectFull: Generative pre-trained transformers Type: general – SubjectFull: Machine learning Type: general – SubjectFull: Sentiment analysis Type: general – SubjectFull: Electronic data processing Type: general Titles: – TitleFull: When comments aren't what they seem: The social media comment toxicity detector for understanding contextual comments. Type: main BibRelationships: HasContributorRelationships: – PersonEntity: Name: NameFull: Song, Zichen – PersonEntity: Name: NameFull: Fan, Xiaopeng – PersonEntity: Name: NameFull: Wang, Yutong – PersonEntity: Name: NameFull: Yan, Feixuan – PersonEntity: Name: NameFull: Wu, Zijin – PersonEntity: Name: NameFull: Kang, Zhongfeng IsPartOfRelationships: – BibEntity: Dates: – D: 05 M: 04 Text: Apr2026 Type: published Y: 2026 Identifiers: – Type: issn-print Value: 09574174 Numbering: – Type: volume Value: 305 Titles: – TitleFull: Expert Systems with Applications Type: main |
| ResultId | 1 |