Our newly-curated dataset, ArCOV-19 has just been released here and associated ArXiv paper is online here.
ArCOV-19 is an Arabic COVID-19 Twitter dataset that covers the period from 27th of January till 31st of March 2020 (and still ongoing). ArCOV-19 is the first publicly-available Arabic Twitter dataset covering COVID-19 pandemic that includes around 748k popular tweets (according to Twitter search criterion) alongside the propagation networks of the most-popular subset of them. The propagation networks include both retweets and conversational threads (i.e., threads of replies). ArCOV-19 is designed to enable research under several domains including natural language processing, data science, and social computing, among others.
Leave a Reply