[datahub-discuss] LODVader v1.0. Released
ciro
cbaron at informatik.uni-leipzig.de
Fri Nov 13 13:03:09 UTC 2015
*
We are delighted to announce the very first release of the LODVader v1.0
available at http://lodvader.aksw.org/driven by requirements developed
in the LIDER project http://lider-project.eu/.
What is LODVader?
LODVader stands for LOD Visualization, Analytics and DiscovEry in
Real-time, and is available as a REST API. LODVader indexes RDF datasets
fetching statistical data for analysis and creating a diagram allowing
users to visualize links among datasets.
How does LODVader works?
LODVader parses your dataset description file that might be in different
formats such as VoID, DCAT and DataID. Then, we stream your RDF data in
order to extract links and statistical data, and compare with different
Bloom filters which contains index from other datasets.
The following features are available in the v1.0.
*
-Visualization: LODVader supports a multi-layer graph visualization
interface which visualizes datasets and their respective relations.
Moreover, in many cases it is important to identify dataset links
which are not connected within the imported dataset cloud. Therefore
we introduce the novel notion of the Dark Cloud. Using the above
features makes it possible to create a new LOD diagram showing
broken links between source and destination datasets. The broken
links discovery also relies on BFs.
*
-Dataset comparison: estimate similarity among different datasets
and perform datasets comparison based on their similarity.
Similarity metrics, such as the Jaccard similarity coefficient, are
being applied.
*
-Analysis via RDF Streaming: LODVader supports the ability to deal
with RDF streams, which enables the support of different kinds of
RDF input sources. Example RDF data sources are RDF dump files,
SPARQL endpoints or other RDF data streams.
*
-Link Extraction: LODVader uses an advanced approach to detect and
extract links between datasets using Bloom filters (BF). The
extraction if performed on-the-fly, when the datasets are being
streamed.
*
-Top-N Analysis: Based on the data which was collected during the
RDF streaming process, statistical analysis of each dataset
regarding the top-N used properties, links, relations and
similarities are performed and made available for further use.
*
-Dataset Search Index: Based on the BF search index is created by
indexing subjects and objects as BF vectors, thus allowing fast
access to this data for comparison and search operations. In
addition, LODVader allows to search and filter datasets or
ontologies by subject, property and objects.
*
-Dataset Statistics: Due to the vast amount of data which is stored
in each dataset, it is important to collect statistical information.
Accurate statistical analysis of each dataset regarding the top-N
used properties, links, relations and similarities is performed and
made available for further analysis.
Moreover, you can check the Wiki, and try our online demo at:
http://lodvader.aksw.org/
Your feedback is more than welcome,
Ciro Baron Neto.
Acknowledgments
*
LODVader is an open-source project maintained by the KILT subgroup
of AKSW at Leipzig University. You can download and deploy the
project from our source code available at GitHub
<https://github.com/AKSW/LODVader>.
*
Special thanks goes to the LODVader team Kay Müller, Martin Brümmer,
Dimitris Kontokostas, Sebastian Hellmann, Diego Esteves.
* This research activity was funded by grants from the FP7 & H2020 EU
projects ALIGNED (GA~644055), and LIDER (GA-610782) and the CAPES
foundation (Ministry of Education of Brazil) for the given scholarship.
* **
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.okfn.org/pipermail/datahub-discuss/attachments/20151113/d0edeac3/attachment-0002.html>
More information about the datahub-discuss
mailing list