[public-lod2] Data Mining on Linked Data (DMoLD'12) workshop with Linked Data Mining Challenge

Martin Kaltenböck m.kaltenboeck at semantic-web.at
Wed Jun 27 19:59:43 UTC 2012

Maybe interesting for you as member of this mailing list - regards - Martin
(apologies for cross posting)

Dear colleagues,

The following workshop (organized by UEP and I2G) and challenge are closely
related to the work we are doing in WP9a and WP10. Some of you are already
in the PC. Please, distribute this information further and consider to take

Vojtech and colleagues

Data Mining on Linked Data (DMoLD’12) workshop with Linked Data Mining
Challenge

to be held during the 20th International Symposium on Methodologies of
Intelligent Systems, ISMIS 2012, 4-7 December 2012, Macau

Workshop website: http://keg.vse.cz/ismis2012

!! Data for the Challenge track already available from the website !!

The workshop consists of an Open Track and a Challenge Track.

The Open Track invites submissions of regular research papers describing
novel approaches to applying Data Mining techniques to Linked Data sources.
The Data Mining techniques can be of logical, probabilistic,
information-theoretic or any other nature. Similarly, the Linked Data
sources considered can be those from the LOD cloud (http://linkeddata.org/)
or any other that follow the Linked Data paradigm; data mining experiments
on RDF data extracted from unstructured or semi-structured sources using
NLP techniques are also eligible.

Participation in the Challenge Track requires participants to download
a real-world RDF dataset from the domain of Public Contract Procurement and
accomplish at least one of the four tasks on it using their own or a
publicly available data mining tool. To get access to the data, participants
have to register for the Challenge Track at http://keg.vse.cz/ismis2012.
A partial mapping to external datasets is also available, which allows
extraction of further features from the Linked Open Data cloud in order to
augment the core dataset.

The tasks can be characterized as follows (for details see
* Task 1 corresponds to unrestricted discovery of potentially interesting
* Task 2 is similar but the category of interesting hypotheses is partially
* Task 3 concerns prediction of a specific quantitative feature in data
that only becomes known (and is added to the evaluation dataset) after the
result submission.
* Task 4 concerns prediction of a ‘soft’ binary feature manually added to a
sample of the data by a team of domain experts.

Important Dates:
Workshop paper and result data submission: August 10, 2012
Notification of paper acceptance: August 25, 2012
Workshop: December 4, 2012

Program Committee:
Jose Maria Alvarez, University of Oviedo, Spain
Soren Auer, Universitat Leipzig, Germany
Bettina Berendt, KU Leuven, Belgium
Key-Sun Choi, KAIST, Korea
Jana Chvalkovska, zIndex, Czech Republic
Fazel Famili, NRC Institute for Information Technology Ottawa, Canada
Stan Matwin, University of Ottawa, Canada
Pablo Mendez, Free University Berlin, Germany
Martin Necasky, Charles University Prague, Czech Republic
Zbigniew Ras, University of North Carolina, Charlotte, USA
Jiri Skuhrovec, zIndex, Czech Republic
Jerzy Stefanowski, Poznan University of Technology, Poland
Djamel Zighed, University Lumiere, Lyon, France

Workshop Organisers:
Petr Berka, University of Economics, Prague, Czech Republic
Vojtech Svatek, University of Economics, Prague, Czech Republic
Krzysztof Wecel, Poznan University of Economics, Poland

The Workshop and Challenge are supported by the EU LOD2 project (
