[rdfweb-dev] [ANN] Java RDF crawler
Matt Biddulph
matt at picdiary.com
Mon Apr 21 13:57:33 UTC 2003
Just released: version 0.1 of an RDF crawler (aka scutter) using Java
and Jena that spiders the web (following rdfs:seeAlso) gathering up RDF
data and storing it in any of Jena's backend stores (in-memory, Berkeley
DB, MySQL, etc.). It does multithreaded downloading, and retains
provenance information which it uses to maintain consistency over
multiple runs.
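
For readers new to scutters, here is a rough sketch of the crawl loop described above: start from a seed document, follow each rdfs:seeAlso link once, and remember which document pointed to which (a simple form of provenance). It is not the released code. The names ScutterSketch, crawl, seeAlsoOf and maxDocs are made up for illustration, the sketch is single-threaded, and it uses no Jena calls. The seeAlsoOf function stands in for "fetch this URL, parse the RDF, return its rdfs:seeAlso targets".

```java
import java.util.ArrayDeque;
import java.util.Arrays;
import java.util.Collections;
import java.util.Deque;
import java.util.HashMap;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.function.Function;

// Hypothetical illustration only; not the crawler announced in this post.
class ScutterSketch {

    /**
     * Breadth-first crawl from a seed URL.
     *
     * @param seed      URL to start from
     * @param seeAlsoOf assumed helper: returns the rdfs:seeAlso targets of a document
     * @param maxDocs   upper bound on the number of documents recorded
     * @return map from each discovered URL to the URL that referred to it
     */
    static Map<String, String> crawl(String seed,
                                     Function<String, List<String>> seeAlsoOf,
                                     int maxDocs) {
        // Insertion-ordered so the result reflects discovery order.
        Map<String, String> referrer = new LinkedHashMap<>();
        Deque<String> queue = new ArrayDeque<>();

        referrer.put(seed, "(seed)");
        queue.add(seed);

        while (!queue.isEmpty()) {
            String url = queue.poll();
            for (String next : seeAlsoOf.apply(url)) {
                if (referrer.size() >= maxDocs) {
                    return referrer; // stop once the budget is used up
                }
                if (!referrer.containsKey(next)) {
                    referrer.put(next, url); // provenance: who linked to 'next'
                    queue.add(next);
                }
            }
        }
        return referrer;
    }

    public static void main(String[] args) {
        // A tiny in-memory "web" in place of real HTTP fetches.
        Map<String, List<String>> web = new HashMap<>();
        web.put("http://example.org/a.rdf",
                Arrays.asList("http://example.org/b.rdf", "http://example.org/c.rdf"));
        web.put("http://example.org/b.rdf",
                Arrays.asList("http://example.org/a.rdf"));

        Map<String, String> result = crawl(
                "http://example.org/a.rdf",
                url -> web.getOrDefault(url, Collections.emptyList()),
                100);

        result.forEach((doc, from) -> System.out.println(doc + " <- " + from));
    }
}
```

Keeping the referrer for every document is what would let a later run decide which stored statements to refresh or drop when a source changes; a multithreaded version would need a concurrent queue and map in place of the plain ones used here.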
The code's only had me as a user so far; comments and corrections highly
appreciated.
More information at http://www.hackdiary.com/archives/000030.html
Cheers,
Matt.