Weblog

New release of the WebDataCommons RDFa, Microdata and Microformat data sets

We are happy to announce a new release of the WebDataCommons RDFa, Microdata, and Microformat data sets. The data sets have been extracted from the November 2013 version of the Common Crawl covering 2.24 billion HTML pages which originate from 12.8 million websites (pay-level-domains). Altogether we discovered structured data within 585 million HTML pages out … Continue reading

LOD2 at Mannheim Linked Open Data Meetup

Co-located with the last LOD2 project plenary in Mannheim, Germany, February 24-25, 2014, the Mannheim Linked Open Data Meetup will take place on Sunday, February 23. The meetup is organized and hosted by the Data and Web Science research group of the University of Mannheim. The meetup is meant to bring together researchers and practitioners … Continue reading