[Bibjson-dev] Schema.org

William Waites ww at styx.org
Tue Jun 7 19:21:58 UTC 2011


Some points:

  * There's already an RDF mapping for the schema.org stuff
    (schema.rdfs.org), so interoperating isn't very hard and can
    be largely automatic
  * This is a "standard" dictated by a cartel that largely ignores
    previous and current work.
  * schema.org is very close to RDFa, why didn't they just use RDFa?
    Their arguments boil down to "because we felt like it" which is a
    bit anti-social - see the excellent article by Manu Sporny
  * Are you seriously suggesting that we embed XML fragments in JSON?

The two middle points may be true, but it may also be that, because the
cartel is so powerful, it gains some sort of critical mass (or maybe
not: this is not the first time Google et al. have proposed
microformats of various types that have not been widely adopted).

The real news is that this is an admission by the big three search
engines that heuristics and natural language techniques are not enough
- this is a significant departure from their previous positions. This
much is a very good thing.

Cheers,
-w


* [2011-06-07 12:11:27 -0700] Jim Pitman <pitman at stat.Berkeley.EDU> wrote:

] This http://schema.org/ looks like an important development for both SchHTML and BibJSON.
] Peter, please can you forward to the SchHTML community?
] 
] >This site provides a collection of schemas, i.e., html tags, that webmasters can use to markup their pages in ways recognized by major search providers. Search engines including Bing, Google and Yahoo! rely on this markup to improve the display of search results, making it easier for people to find the right web pages.
] >Many sites are generated from structured data, which is often stored in databases. When this data is formatted into HTML, it becomes very difficult to recover the original structured data. Many applications, especially search engines, can benefit greatly from direct access to this structured data. On-page markup enables search engines to understand the information on web pages and provide richer search results in order to make it easier for users to find relevant information on the web. Markup can also enable new tools and applications that make use of the structure.
] >A shared markup vocabulary makes it easier for webmasters to decide on a markup schema and get the maximum benefit for their efforts. So, in the spirit of sitemaps.org, Bing, Google and Yahoo! have come together to provide a shared collection of schemas that webmasters can use.
] 
] The search engines should strongly motivate data providers to make their data available according to these schemas.
] 
] I think we should respond to this initiative by
] 
] 1) quickly identifying how objects of interest to us can be generically embedded in crude forms of BibJSON. These objects include especially
] 
] http://schema.org/ScholarlyArticle
] 
] Note the schema is crude, and does not even have placeholders for the usual "journal", "pages" or "howpublished", or for identifier attributes like "doi", "issn", ....
] 
] http://schema.org/Person
] 
] Note that there appear to be no attributes like "homepage" or "profilepage" so it is not easy to indicate the person by a URI.
] 
] 2) develop and support tools for harvesting data from a page compliant with schema.org and mapping it to BibJSON for further processing
] 
] 3) provide sample mappings back from BibJSON to HTML compliant with schema.org, and to richer forms regarded as acceptable SchHTML.
] 
] There will be finer things we want to indicate in BibJSON and ScHTML data, but this should be possible by extensions of schema.org.
] We can always fall back on BIBO, FOAF etc, but it could easily be that schema.org largely replaces all of these.
] 
] I think some hand-in-hand development of BibJSON and ScHTML consistent with the broad modelling of object-types laid out by schema.org
] could be very rewarding.
] 
] 
] --Jim
] ----------------------------------------------
] Jim Pitman
] Director, Bibliographic Knowledge Network Project
] http://www.bibkn.org/
] 
] Professor of Statistics and Mathematics
] University of California
] 367 Evans Hall # 3860
] Berkeley, CA 94720-3860
] 
] ph: 510-642-9970  fax: 510-642-7892
] e-mail: pitman at stat.berkeley.edu
] URL: http://www.stat.berkeley.edu/users/pitman
] 
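
Steps 2) and 3) in the quoted message could be sketched roughly as
follows. This is only a minimal illustration, not an agreed mapping:
the BibJSON-style keys ("type", "title", "author", "year"), the
correspondence to the schema.org properties "name", "author" and
"datePublished", and the helper names to_microdata, from_microdata
and FlatItemParser are all assumptions made for the example. It only
handles a flat item whose property values are element text.

```python
from html import escape
from html.parser import HTMLParser

ARTICLE_TYPE = "http://schema.org/ScholarlyArticle"

# Assumed mapping from schema.org properties to BibJSON-style keys.
PROP_TO_KEY = {"name": "title", "author": "author", "datePublished": "year"}

# Elements without an end tag; skipped so the depth counter stays balanced.
VOID = {"br", "hr", "img", "input", "link", "meta"}


def to_microdata(record):
    """Step 3: render a BibJSON-style dict as flat schema.org microdata."""
    parts = ['<div itemscope itemtype="%s">' % ARTICLE_TYPE]
    parts.append('<span itemprop="name">%s</span>' % escape(record["title"]))
    for author in record.get("author", []):
        parts.append('<span itemprop="author">%s</span>' % escape(author["name"]))
    if "year" in record:
        parts.append('<span itemprop="datePublished">%s</span>' % escape(record["year"]))
    parts.append("</div>")
    return "\n".join(parts)


class FlatItemParser(HTMLParser):
    """Collect (itemprop, text) pairs from the first ScholarlyArticle item.

    Nested items and attribute-valued properties (href, content) are ignored.
    """

    def __init__(self):
        super().__init__()
        self.depth = 0       # nesting depth inside the item; 0 means outside
        self.done = False    # set once the first item has been closed
        self.current = None  # (property name, depth, text chunks) being read
        self.props = []

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag in VOID or self.done:
            return
        if self.depth == 0:
            if "itemscope" in attrs and attrs.get("itemtype") == ARTICLE_TYPE:
                self.depth = 1
            return
        self.depth += 1
        if "itemprop" in attrs and self.current is None:
            self.current = (attrs["itemprop"], self.depth, [])

    def handle_endtag(self, tag):
        if tag in VOID or self.depth == 0:
            return
        if self.current is not None and self.current[1] == self.depth:
            prop, _, chunks = self.current
            self.props.append((prop, "".join(chunks).strip()))
            self.current = None
        self.depth -= 1
        if self.depth == 0:
            self.done = True

    def handle_data(self, data):
        if self.current is not None:
            self.current[2].append(data)


def from_microdata(html):
    """Step 2: harvest the first ScholarlyArticle item into a BibJSON-style dict."""
    parser = FlatItemParser()
    parser.feed(html)
    parser.close()
    record = {"type": "article"}
    for prop, value in parser.props:
        key = PROP_TO_KEY.get(prop)
        if key == "author":
            record.setdefault("author", []).append({"name": value})
        elif key is not None:
            record[key] = value
    return record


record = {
    "type": "article",
    "title": "An example title & subtitle",
    "author": [{"name": "A. Author"}, {"name": "B. Author"}],
    "year": "2011",
}
print(to_microdata(record))
print(from_microdata(to_microdata(record)))
```

Anything the schema lacks ("journal", "pages", "doi", ...) is simply
lost in this round trip, which is where extensions of schema.org or a
fallback to BIBO or FOAF would have to come in.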

-- 
William Waites                <mailto:ww at styx.org>
http://river.styx.org/ww/        <sip:ww at styx.org>
F4B3 39BF E775 CF42 0BAB  3DF0 BE40 A6DF B06F FD45



