[ddj] data set for semantic search
Daniel Koller
daniel at dakoller.net
Tue Nov 29 08:02:32 UTC 2011
Hi,
You might take the dumps of DBPedia project, which are available here:
http://wiki.dbpedia.org/Downloads37 .
English language has the best coverage, however other languages may provide
an smaller testbed of data.
The LOD cloud may also give you some to relevant datasets if you are
focusing on a specific domain:
http://richard.cyganiak.de/2007/10/lod/lod-datasets_2011-09-19_colored.png
and yes interested in the results ;-)
(Do you work on it during the Open Data Day on Saturday?)
Kind regards,
Daniel
----
Daniel Koller
Jahnstrasse 20 80469 München Germany daniel at dakoller.net - @dakoller
Mobile: +49.163.6191979
Von: Jan Vangrinsven <jan.vangrinsven at gmail.com>
Antworten an: "List about Data Driven Journalism and Open Data in
Journalism." <data-driven-journalism at lists.okfn.org>
Datum: Tue, 29 Nov 2011 08:56:44 +0100
An: <data-driven-journalism at lists.okfn.org>
Betreff: [ddj] data set for semantic search
We're setting up a proof of concept for a "semantic search engine" which
might be helpful for the newsrooms of the future. We're looking for an
interesting data set we could use as bench marking,
In this stadium the data should be in text or XML format, and we are looking
for a bulk of small files (articles).
If anyone is interested in the results, just drop me a note.
--
Jan Vangrinsven
Vaartstraat 77
3000 Leuven
0475/777.530
_______________________________________________ data-driven-journalism
mailing list data-driven-journalism at lists.okfn.org
http://lists.okfn.org/mailman/listinfo/data-driven-journalism
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.okfn.org/pipermail/data-driven-journalism/attachments/20111129/52c62c96/attachment-0001.html>
More information about the data-driven-journalism
mailing list