Intent aware data augmentation by leveraging generative AI for stress detection in social media texts

Saleem, Minhah; Kim, Jihie

doi:10.7717/peerj-cs.2156

상세 보기

Intent aware data augmentation by leveraging generative AI for stress detection in social media texts

Saleem, Minhah;
Kim, Jihie

Citations

WEB OF SCIENCE

4

Citations

SCOPUS

10

초록

Stress is a major issue in modern society. Researchers focus on identifying stress in individuals, linking language with mental health, and often utilizing social media posts. However, stress classification systems encounter data scarcity issues, necessitating data augmentation. Approaches like Back-Translation (BT), Easy Data Augmentation (EDA), and An Easier Data Augmentation (AEDA) are common. But, recent studies show the potential of generative AI, notably ChatGPT. This article centers on stress identification using the DREADDIT dataset and A Robustly Optimized BERT Pretraining Approach (RoBERTa) transformer, emphasizing the use of generative AI for augmentation. We propose two ChatGPT prompting techniques: same-intent and posts with similar topics and sentiments, while opposite-intent prompts produce posts with contrasting sentiments. Results show a 2% and 3% performance increase for opposing and same sentiments, respectively. This study pioneers intent-based data augmentation for stress detection and explores advanced mental health text classification methods with generative AI. It concludes that data augmentation has limited benefits and highlights the importance of diverse Reddit data and further research in this field.

키워드

Stress detection; Mental health; Data augmentation; Text classification; Generative AI; Sentiment analysis; Pre-trained language models; Natural language understanding; COINTEGRATION

제목: Intent aware data augmentation by leveraging generative AI for stress detection in social media texts

저자: Saleem, Minhah; Kim, Jihie

DOI: 10.7717/peerj-cs.2156

발행일: 2024-07

유형: Article

저널명: PeerJ Computer Science

권: 10

페이지: 1 ~ 22