Dmoz-tddli.rar -
Unlike machine-generated lists, DMOZ data was curated by over 90,000 volunteer editors, making the classifications highly accurate for its time.
Highly recommended for researchers looking to train text-classification models or explore the historical structure of the early-to-mid-2000s internet. Community Perspectives DMOZ-TDDLI.rar
About Dataset. This is an url classification dataset from dmoz directory. There are 15 class for classification. Unlike machine-generated lists, DMOZ data was curated by
Since DMOZ officially closed in March 2017, a significant portion of the URLs in this archive may lead to dead links or parked domains. Unlike machine-generated lists