WhatsUp: An event resolution approach for co-occurring events in social media

Hettiarachchi, Hansi and Adedoyin-Olowe, Mariam and Bhogal, Jagdev and Gaber, Mohamed Medhat (2023) WhatsUp: An event resolution approach for co-occurring events in social media. Information Sciences, 625. pp. 553-577. ISSN 0020-0255

1-s2.0-S0020025523000014-main.pdf - Published Version
Available under License Creative Commons Attribution.

Abstract

The rapid growth of social media networks has resulted in the generation of a vast amount of data, making it impractical to conduct manual analyses to extract newsworthy events. Thus, automated event detection mechanisms are invaluable to the community. However, a clear majority of the available approaches rely only on data statistics without considering linguistics. The few approaches that do involve linguistics extract only textual event details, without the corresponding temporal details. Since linguistics defines the structure and meaning of words, ignoring it can cause severe information loss. Targeting this limitation, we propose a novel method named WhatsUp to detect temporal and fine-grained textual event details, using linguistics captured by self-learned word embeddings and their hierarchical relationships, together with statistics captured by frequency-based measures. We evaluate our approach on recent social media data from two diverse domains and compare the performance with several state-of-the-art methods. Evaluations cover temporal and textual event aspects, and results show that WhatsUp notably outperforms state-of-the-art methods. We also analyse the efficiency, revealing that WhatsUp is sufficiently fast for (near) real-time detection. Further, the use of unsupervised learning techniques, including self-learned embeddings, makes our approach extensible to any language, platform and domain and provides capabilities to understand data-specific linguistics.

Item Type: Article
Identification Number: https://doi.org/10.1016/j.ins.2023.01.001
Dates:
Accepted: 1 January 2023
Published Online: 7 January 2023
Uncontrolled Keywords: Word embedding, Dendrograms, Clustering, Social media
Subjects: CAH11 - computing > CAH11-01 - computing > CAH11-01-01 - computer science
Divisions: Faculty of Computing, Engineering and the Built Environment > School of Computing and Digital Technology
Depositing User: Gemma Tonks
Date Deposited: 25 Jan 2023 11:11
Last Modified: 25 Jan 2023 11:11
URI: https://www.open-access.bcu.ac.uk/id/eprint/14150
