Intent aware data augmentation by leveraging generative AI for stress detection in social media texts

Citations

WEB OF SCIENCE

4
Citations

SCOPUS

10

초록

Stress is a major issue in modern society. Researchers focus on identifying stress in individuals, linking language with mental health, and often utilizing social media posts. However, stress classification systems encounter data scarcity issues, necessitating data augmentation. Approaches like Back-Translation (BT), Easy Data Augmentation (EDA), and An Easier Data Augmentation (AEDA) are common. But, recent studies show the potential of generative AI, notably ChatGPT. This article centers on stress identification using the DREADDIT dataset and A Robustly Optimized BERT Pretraining Approach (RoBERTa) transformer, emphasizing the use of generative AI for augmentation. We propose two ChatGPT prompting techniques: same-intent and posts with similar topics and sentiments, while opposite-intent prompts produce posts with contrasting sentiments. Results show a 2% and 3% performance increase for opposing and same sentiments, respectively. This study pioneers intent-based data augmentation for stress detection and explores advanced mental health text classification methods with generative AI. It concludes that data augmentation has limited benefits and highlights the importance of diverse Reddit data and further research in this field.

키워드

Stress detectionMental healthData augmentationText classificationGenerative AISentiment analysisPre-trained language modelsNatural language understandingCOINTEGRATION
제목
Intent aware data augmentation by leveraging generative AI for stress detection in social media texts
저자
Saleem, MinhahKim, Jihie
DOI
10.7717/peerj-cs.2156
발행일
2024-07
유형
Article
저널명
PeerJ Computer Science
10
페이지
1 ~ 22