This page is about ArabicWeb16 collection, the largest Arabic Web collection (150M pages)!

Check our SIGIR 2016 paper that fully describes the collection:

  • Reem Suwaileh, Mucahid Kultlu, Nihal Fathima, Tamer Elsayed, and Matthew Lease. ArabicWeb16: A New Crawl for Today’s Arabic Web. Proceedings of the 39th annual international ACM SIGIR conference on Research and development in information retrieval: SIGIR ’16, Pisa, Italy, July 2016. Download.

If you are interested in getting the collection, please send us an email.