- data augmentation
- generative pre-trained transformer
GAP
However, existing methods fail to learn the deep semantic concepts of rumor texts needed for detection. In addition, imbalanced datasets in the rumor domain reduce the effectiveness of these algorithms.
Idea
Leveraging the Generative Pre-trained Transformer 2 (GPT-2) model to generate rumor-like texts, thus creating a balanced dataset (use GPT-2 for data augmentation; see the sketch below).
- GPT-2 captures rich semantic information and can produce diverse, high-quality synthetic text samples.
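A minimal sketch of the augmentation step, assuming GPT-2 has already been fine-tuned on the minority (rumor) class; the model checkpoint, prompt, and decoding parameters here are illustrative assumptions, not the paper's settings:

```python
# Hypothetical sketch: sample rumor-like texts from GPT-2 for data augmentation.
from transformers import GPT2LMHeadModel, GPT2Tokenizer

tokenizer = GPT2Tokenizer.from_pretrained("gpt2")
model = GPT2LMHeadModel.from_pretrained("gpt2")  # swap in a rumor-fine-tuned checkpoint
model.eval()

def generate_rumor_like(prompt: str, n_samples: int = 3, max_new_tokens: int = 40):
    """Sample diverse continuations of a seed rumor text."""
    inputs = tokenizer(prompt, return_tensors="pt")
    outputs = model.generate(
        **inputs,
        do_sample=True,          # stochastic decoding for diversity
        top_p=0.92,              # nucleus sampling
        temperature=0.9,
        max_new_tokens=max_new_tokens,
        num_return_sequences=n_samples,
        pad_token_id=tokenizer.eos_token_id,
    )
    return [tokenizer.decode(o, skip_special_tokens=True) for o in outputs]

# Each generated text would be labeled as a rumor and added to the training
# set until the rumor / non-rumor classes are balanced.
for text in generate_rumor_like("BREAKING: reports claim that"):
    print(text)
```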
Datasets
PHEME, Twitter15, and Twitter16 datasets.