Workshop Proceedings of the 19th International AAAI Conference on Web and Social Media

Workshop: #SMM4H-HeaRD 2025: Joint 10th Social Media Mining for Health and Health Real-World Data Workshop and Shared Tasks

DOI: 10.36190/2025.72

Published: 2025-06-05
HpiVaxVigil at #SMM4H-HeaRD 2025: Detecting Adverse Vaccine Events on Reddit using GPT-4o Chain-of-Thought Reasoning and Fine-Tuned PLMs with Stratified Cross-Validation
Akhyar Ahmed, Esther-Maria Antao

Detecting vaccine adverse events in noisy social media texts is both challenging and crucial to timely pharmacovigilance. In this paper, we detail our participation in Task 6 of the Workshop #SMM4H (Social Media Mining for Health) 2025, specifically focusing on potentially identifying personally experienced vaccine adverse reactions from unstructured social media posts. We present an enhanced hybrid methodology that combines the sophisticated chain-of-thought (CoT) reasoning of GPT-4o and the enhancement of domain-specific knowledge with pre-trained language models (PLM) such as BERTweet-large and DeBERTa-v3-base. By systematically employing stratified cross-validation, robust regularization (mixout and layer-wise learning rate decay), and conditional text enhancement, our model significantly outperforms the previous benchmarks, achieving an F1 score of 0.96 on both validation and test sets, underscoring the effectiveness of integrating human-like reasoning capabilities with traditional classification models.