WG2: Existing Standards

STORAGE

-- text encoding standards
  • Unicode (properly a character encoding standard)
  • Text Encoding Initiative (TEI) P5 Standard "These guidelines make recommendations about suitable ways of representing those features of textual resources which need to be identified explicitly in order to facilitate processing by computer programs. In particular, they specify a set of markers (or tags) which may be inserted in the electronic representation of the text, in order to mark the text structure and other features of interest."
  • ...
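
To make the idea of TEI "markers (or tags)" concrete, here is a minimal sketch in Python that reads a small TEI-style fragment and pulls out a few tagged features. The sample text is invented for illustration and is not a complete, valid TEI document (it has no teiHeader, for instance); the function name `summarize` is also hypothetical.

```python
import xml.etree.ElementTree as ET

# Namespace used by TEI P5 elements.
TEI_NS = "http://www.tei-c.org/ns/1.0"

# Hypothetical fragment, invented for illustration only.
SAMPLE = """<TEI xmlns="http://www.tei-c.org/ns/1.0">
  <text>
    <body>
      <div type="chapter" n="1">
        <head>Sample heading</head>
        <p>First paragraph with a <persName>Jane Doe</persName> mention.</p>
        <p>Second paragraph.</p>
      </div>
    </body>
  </text>
</TEI>"""


def summarize(tei_xml):
    """Collect headings, paragraph text and personal names from TEI markup."""
    root = ET.fromstring(tei_xml)
    ns = {"tei": TEI_NS}
    heads = [h.text for h in root.iterfind(".//tei:head", ns)]
    # itertext() flattens nested tags such as <persName> into plain text.
    paragraphs = ["".join(p.itertext()) for p in root.iterfind(".//tei:p", ns)]
    names = [n.text for n in root.iterfind(".//tei:persName", ns)]
    return {"heads": heads, "paragraphs": paragraphs, "names": names}


if __name__ == "__main__":
    print(summarize(SAMPLE))
```

Because the structure is marked explicitly, a program can ask for "all personal names" or "all headings" without guessing from layout, which is the kind of processing the guidelines aim to facilitate.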
-- (domain-specific) terminological standards
-- storage and retrieval standards
  • repository systems (e.g. www.escidoc.org and www.fedora-commons.org) might be relevant in relation to long-term archiving
  • DELAMAN is 'an international umbrella body for archives and other initiatives with the goal of documenting and archiving endangered languages and cultures worldwide. Our aim is to stimulate interaction about practical matters that result from the experiences of fieldworkers and archivists, and to act as an information clearinghouse.'
  • The Rosetta Project is another archive actively promoting a set of best practices for storage of language data.

RETRIEVAL

-- reference/identification standards (i.e. metadata)
-- Citation Standards
  • COinS (ContextObjects in Spans, a simple convention for embedding citation metadata, such as Dublin Core, in a web page)
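
As a rough illustration of how such embedded citation metadata can look, the following Python sketch builds an empty HTML span whose title attribute carries Dublin Core fields as URL-encoded key/value pairs. The function name `coins_span` and the sample values are hypothetical, and the field set is deliberately minimal; treat it as a sketch under those assumptions rather than a complete implementation.

```python
from html import escape
from urllib.parse import urlencode


def coins_span(title, creator, date):
    """Return an HTML span carrying Dublin Core citation metadata."""
    # Key/value pairs in OpenURL form, declaring the Dublin Core format.
    pairs = [
        ("ctx_ver", "Z39.88-2004"),
        ("rft_val_fmt", "info:ofi/fmt:kev:mtx:dc"),
        ("rft.title", title),
        ("rft.creator", creator),
        ("rft.date", date),
    ]
    context_object = urlencode(pairs)
    # Escape "&" and quotes so the string is safe inside an HTML attribute.
    return '<span class="Z3988" title="{}"></span>'.format(
        escape(context_object, quote=True)
    )


if __name__ == "__main__":
    # Sample values are invented for illustration.
    print(coins_span("An Example Title", "Doe, Jane", "2009"))
```

The span is invisible to readers, but citation tools that scan a page for this class can pick the metadata up.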

SEARCH
  • OpenSearch. A very simple standard for sharing search results, usually by expressing them in the Atom Syndication Format. Although it is not the best-designed or most extensible standard, it is relatively easy to adopt. OpenSearch also has proposed geographic extensions that describe how to query a collection by geographic parameters.
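
Part of what makes this easy to adopt is that a collection advertises a URL template, and a client only has to fill in the placeholders. The Python sketch below shows that step. The template, the example.org host and the helper name `fill_template` are assumptions made for illustration; the `geo:box` placeholder follows the draft geographic extension and is likewise assumed here.

```python
import re
from urllib.parse import quote

# Matches {name} (required) and {name?} (optional), allowing a prefix
# such as "geo:" for extension parameters.
PLACEHOLDER = re.compile(r"\{([A-Za-z:]+)(\??)\}")


def fill_template(template, params):
    """Substitute search parameters into an OpenSearch-style URL template."""
    def substitute(match):
        name = match.group(1)
        optional = match.group(2) == "?"
        if name in params:
            return quote(str(params[name]), safe="")
        if optional:
            return ""  # optional placeholders may be left empty
        raise KeyError(name)  # required placeholder with no value supplied

    return PLACEHOLDER.sub(substitute, template)


if __name__ == "__main__":
    # Hypothetical template; a real one would come from the collection's
    # OpenSearch description document.
    template = ("http://example.org/search?q={searchTerms}"
                "&page={startPage?}&bbox={geo:box?}&format=atom")
    print(fill_template(template, {
        "searchTerms": "endangered languages",
        "geo:box": "-10,35,30,60",  # west,south,east,north (assumed order)
    }))
```

The resulting URL can be fetched with any HTTP client, and the Atom response parsed with a standard XML library.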

ACCESS/REUSE

-- Cultural Heritage Global Schema and Ontologies
  • CIDOC CRM (the CIDOC Conceptual Reference Model, an ontology mainly applied by European museums and other heritage organizations; it is nicely abstracted and very generalized, but it is complex and can be difficult to apply)
  • OCHRE/ArchaeoML (a somewhat simpler global schema / ontology for cultural heritage applications, including archaeology, epigraphy and philology. It is highly abstract, so projects and collections retain their native descriptive terminologies while some degree of interoperability and shared services is facilitated.)

-- Copyright and Intellectual Property
  • Creative Commons provides a series of standard copyright licenses and associated metadata that explicitly grant certain permissions and set conditions for use/reuse of copyrighted content. These are useful for defining how content can be used. However, they are complicated to apply to scientific data, since US copyright law distinguishes between "facts" (ideas, concepts, objective data) and "expressions". Since many scientific datasets contain factual measurements and observations, they may not be protected by copyright. To make matters more complicated, the line between a fact and an expression is blurred. This legal ambiguity and complexity makes it harder to use and reuse scientific data. Therefore, Creative Commons' scientific arm, "Science Commons", recommends that scientists not use Creative Commons copyright licenses for scientific data. Instead, Science Commons recommends that scientific applications explicitly dedicate data to the public domain using the "CC-Zero" (CC0) declaration. CC-Zero removes legal ambiguity around data, removes all restrictions on reuse, and, in theory, maximizes the scientific value of data.

-- APIs/standards for interfaces with other resources (e.g. corpora, lexica/lexical resources, treebanks?, ...)
  • WordNet "This document presents a standard conversion of Princeton WordNet to RDF/OWL. It describes how it was converted and gives examples of how it may be queried for use in Semantic Web applications."



Latest page update: made by DeborahAnderson, Aug 28 2009, 12:45 AM EDT