TeleScope: A Longitudinal Dataset for Investigating Online Discourse and Information Interaction on Telegram
By: Susmita Gangopadhyay , Danilo Dessi , Dimitar Dimitrov and more
Potential Business Impact:
Lets researchers study how information spreads online.
Telegram is a globally popular instant messaging platform known for its strong emphasis on security, privacy, and unique social networking features. It has recently emerged as the host for various cross-domain analysis and research works, such as social media influence, propaganda studies, and extremism. This paper introduces TeleScope, an extensive dataset suite that, to our knowledge, is the largest of its kind. It comprises metadata for about 500K Telegram channels and downloaded message metadata for about 71K public channels, accounting for around 120M crawled messages. We also release channel connections and user interaction data built using Telegram's message-forwarding feature to study multiple use cases, such as information spread and message forwarding patterns. In addition, we provide data enrichments, such as language detection, active message posting periods for each channel, and Telegram entities extracted from messages, that enable online discourse analysis beyond what is possible with the original data alone. The dataset is designed for diverse applications, independent of specific research objectives, and sufficiently versatile to facilitate the replication of social media studies comparable to those conducted on platforms like X (formerly Twitter)
Similar Papers
CTI Dataset Construction from Telegram
Cryptography and Security
Finds online dangers from chat messages.
The Schwurbelarchiv: a German Language Telegram dataset for the Study of Conspiracy Theories
Social and Information Networks
Helps study how fake news spreads online.
Telegram as a Battlefield: Kremlin-related Communications during the Russia-Ukraine Conflict
Social and Information Networks
Shows how Telegram spread war news and lies.