[open-linguistics] survey of interest in a potential new catalog of annotation resources
Mark Finlayson
markaf at fiu.edu
Mon Dec 19 18:50:57 UTC 2016
Dear Colleagues,
I am seeking community feedback and expressions of interest regarding a
proposal for a new type of annotation resource catalog. The idea is
described briefly in the appended text, and was proposed, discussed, and
endorsed at an NSF-funded workshop <http://uat2015.cs.fiu.edu/> held
last year (described in detail in this technical report
<https://dspace.mit.edu/handle/1721.1/105270>).
If you have ever done linguistic annotation and have a moment, I would
very much appreciate if you would fill out this very short survey (3
questions):
http://online.cs.fiu.edu/survey/index.php?sid=74491&lang=en
Responses may remain anonymous, if you wish. If you have any questions,
please get in touch. Best regards,
Mark
> It is our observation that, despite several ongoing cataloging
> efforts, researchers still struggle to find, understand, and evaluate
> the annotation resources available to support their particular
> annotation tasks (e.g., software tools, schema, and corpora). We
> propose to develop a new catalog that addresses this problem, which
> will add important new metadata on top of existing catalogs (e.g.,
> those maintained by LDC, ELRA, OLAC, etc.,with whom we will partner)
> as well as adding many additional uncatalogued resources.
>
> The key feature of the catalog is to provide new, sophisticated, and
> formal metadata that allows sophisticated and accurate search for
> linguistic-annotation-related resources. The metadata will also enable
> enhanced evaluation of the relevance of resources for a particular
> annotation task. This metadata is not currently captured anywhere,
> except insofar as it is described informally in academic publications.
> The metadata will also enable automatic inference as to what resources
> are relevant to which linguistic phenomena and annotation tasks. On
> the front-end the catalog will be a website: it will list linguistic
> annotation tools, annotation schema and guidelines, and corpora. On
> the back-end, it will capture formal relationships between these items
> in specially designed metadata schemes and description languages,
> which will enable sophisticated search by linguistic phenomenon,
> software capability, and semantic interoperability.
>
> The catalog will also incorporate a number of useful features:
>
> * *Wizard Interface for Novices*, that will guide novice users (such
> as undergraduates, young graduate students, or researchers new to
> the field) to the tools, schemes, data, and best practices most
> appropriate for their task.
> * *Free Text Reviews* which, like modern online shopping sites, will
> allow researchers to help others evaluate resources.
> * *Archive of Abandoned Resources*, providing a
> download-site-of-last-resort for resources that have lost their homes.
> * *Streamlined Intake and Archiving*, which will allow researchers
> to quickly and easily add their newly-developed resources to the
> catalog to enable other researchers to more easily find and cite
> their resources, as well as to comply with data archiving
> requirements by funding agencies (e.g., the NSF Data Management Plan).
>
> This idea was proposed and endorsed by a group of 23 researchers in
> linguistic annotation who gathered at an NSF-funded workshop
> <http://uat2015.cs.fiu.edu/> held in March 2015 (described in detail
> in this technical report
> <https://dspace.mit.edu/handle/1721.1/105270>). We will be soon
> seeking NSF funding to support the implementation of the idea.
>
_________________
Mark A. Finlayson
Assistant Professor, FIU SCIS
11200 SW 8th Street, ECS Room 362, Miami, FL 33199
+1.305.348.7988 (office); +1.617.515.0708 (mobile);markaf at fiu.edu
.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.okfn.org/pipermail/open-linguistics/attachments/20161219/7e9b99f5/attachment-0002.html>
More information about the open-linguistics
mailing list