[open-linguistics] META-NET Data Liberation Campaign
John Judge
jjudge at computing.dcu.ie
Wed Dec 5 16:58:33 UTC 2012
Thanks for your feedback Nancy. You raise a very valid point about the
licensing restrictions and how a move towards fully open data is a goal
we would like to pursue. In this instance with our data liberation
campaign we are trying to take some small but concrete steps in this
direction.
If we were to lobby the owners of this data to try and ask that they
simply move to a fully open distribution model right away I think that
we wouldn't have much success in getting them to loosen their
restrictions at all. Instead through the approach we're taking of
inviting them to share the data in some situations we're hoping to be
able to "get our foot in the door" and show them that there's nothing to
fear from sharing their data and much to gain.
Through this approach we've already had some successes, which I hope to
report on more fully at a later stage when the details have firmed up.
But suffice to say I think that given the tight restrictive
circumstances under which much of this data is currently being held that
, for now at least, this softly softly approach where we offer the
owners everything from an open distribution channel to free legal help
with regards licensing is helping free up the data and opening the minds
of those holding it to the greater possibilities of loosening their
restrictions. So in my view this is just a first step, and an important
one to show that there are good citizens out there and that that
community wants and needs the data to be freely available through less
restrictive licences.
All the best,
John
On 05/12/2012 16:15, Nancy Ide wrote:
> I would like to raise a concern here that calling for "open for
> research" licensing is potentially damaging to our interests, in the
> sense that it promotes a practice that is counter to what I assume
> (hope) is the overall goal: fully open data, restricted to no one for
> any purpose and thereby /supportive of collaborative development
> across both nonprofit and commercial organizations/. Given the EU's
> promotion of collaboration between research and industry in their
> funding model, it would seem that this would be in the interest of the
> official language bodies in Europe as well.
>
> A look at the impact of the promotion of GNU (copyleft) and
> "share-alike" licenses makes my point: promotion of these licenses as
> the "good citizen's license" has had a subtle but pervasive impact on
> software and data licensing, in that these licenses are at this point
> the /de facto/ licenses of choice. Unfortunately, such licenses are
> often not suitable for commercial use because of the requirement to
> distribute results under the same terms. So what we have is a grass
> roots effort to be open that in fact has had the result of obstructing
> full openness. I fear that the promotion of the even stronger
> research-only restriction will have a similar, and even more damaging,
> effect.
>
> I would recommend promotion of something like the Apache 2.0 license
> (http://www.apache.org/licenses/LICENSE-2.0), even if (as pointed out
> in the note below) it is not likely that such a license would be
> acceptable in this instance. That would send the message that a fully
> open license is what the community feels is the good citizen's choice,
> in that it supports collaborative development among both nonprofit and
> commercial organizations. If there cannot be agreement to adopt this
> type of licensing, so be it, but the message from the community should
> be clear about what we see as the ideal.
>
> Nancy Ide
>
> =======================================================
> Nancy Ide
> Professor of Computer Science
>
> Department of Computer Science
> Vassar College
> Poughkeepsie, New York 12604-0520
> USA
>
> tel: (+1 845) 437 5988
> fax: (+1 845) 437 7498
> email: ide at cs.vassar.edu <mailto:ide at cs.vassar.edu>
> http://www.cs.vassar.edu/~ide <http://www.cs.vassar.edu/%7Eide>
> =======================================================
>
>
>
>
>
> On Dec 3, 2012, at 7:25 PM, Christian Chiarcos
> <christian.chiarcos at web.de <mailto:christian.chiarcos at web.de>> wrote:
>
>> Dear all,
>>
>> as most of us probably know, a number of "reference corpora" for
>> major (and minor) languages of Europe that have been produced in the
>> last decades, but many of them are not fully available to the public
>> (not even under a restrictive license), or available in a snippet
>> view on the web only (and hence unusable for NLP or advanced
>> statistical analyses), -- not to talk about open licenses.
>>
>> To address this issue, META-NET have prepared an open letter to all
>> the official language bodies in Europe and to those holding onto the
>> various corpora calling on them to consider trying to make this
>> important language data available for research purposes. If you feel
>> that there is a huge benefit to liberating these corpora and making
>> them available for research then please contact your local language
>> body and let them know that you are in favour of the META-NET proposal.
>>
>> More on this can be found on our blog, in a recent post by John
>> Judge, META-NET Ireland
>> (http://linguistics.okfn.org/2012/11/19/meta-net-data-liberation-campaign/,
>> from where the last paragraph was quoted). I'd like to thank John for
>> replicating his original post there and hope this initiative receives
>> some support from the OWLG.
>>
>> Certainly, making these resources available under a research license
>> would not be sufficient in the eyes of many on the list, but it would
>> definitely be an important (and more easily achievable) step towards
>> the further liberation of linguistic data.
>>
>> Thank you,
>> Christian
>> --
>> Christian Chiarcos
>> Information Sciences Institute
>> University of Southern California
>> 4676 Admiralty Way #1001
>> Marina del Rey, CA 90292
>> tel: +1-310-448-9391
>> fax: +1-310-448-8599
>> http://purl.org/chiarcos/home
>> chiarcos at isi.edu
>>
>> _______________________________________________
>> open-linguistics mailing list
>> open-linguistics at lists.okfn.org
>> http://lists.okfn.org/mailman/listinfo/open-linguistics
>> Unsubscribe: http://lists.okfn.org/mailman/options/open-linguistics
>
--
John Judge
Centre for Next Generation Localisation
META-NET CIO
Email: jjudge at computing.dcu.ie
Phone: +353 1 700 6729
Mob: +353 87 218 9093
Skype: jjudge2
http://www.cngl.ie
http://www.meta-net.eu
Email Disclaimer
"This email and any files transmitted with it are confidential and are
intended solely for use by the addressee. Any unauthorised
dissemination, distribution or copying of this message and any
attachments is strictly prohibited. If you have received this email in
error please notify the sender and delete the message. Any views or
opinions presented in this email may solely be the views of the author
and cannot be relied upon as being those of Dublin City University.
E-mail communications such as this cannot be guaranteed to be virus
free, timely, secure or error free and Dublin City University do not
accept liability for any such matters or their consequences. Please
consider the environment before printing this Email."
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.okfn.org/pipermail/open-linguistics/attachments/20121205/34fe2b4e/attachment.html>
More information about the open-linguistics
mailing list