Version User Scope of changes
Jul 19 2009, 10:27 AM EDT (current) paul.trilsbeek 174 words added, 167 words deleted
Jul 19 2009, 1:31 AM EDT paul.trilsbeek 226 words added

Availability of services
  • storage as a service
  • software as a service
  • high-performance computing as a service
  • distribution of the data services, with agreements between "centers" to take over if one center ceases to exist

"Pipelines" of services
  • provenance metadata becomes essential
  • existing metadata set: PREMIS, which includes provenance metadata
  • intermediate results may be worth storing as a new resource
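
To make the provenance point concrete, here is a minimal sketch of a pipeline that attaches a provenance record to every intermediate result, so that any of them could later be stored as a new resource. Everything in it is an assumption for illustration: the names `Resource`, `ProvenanceRecord`, `run_step` and `run_pipeline` are hypothetical, the record layout is not the PREMIS schema, and the "services" are plain Python functions standing in for remote services.

```python
import hashlib
from dataclasses import dataclass, field
from typing import Callable, List, Tuple


@dataclass
class ProvenanceRecord:
    """One entry per pipeline step (hypothetical layout, not PREMIS)."""
    step: str           # name of the service that was applied
    input_digest: str   # checksum of the data the step received
    output_digest: str  # checksum of the data the step produced


@dataclass
class Resource:
    """A piece of data plus the provenance trail that produced it."""
    content: str
    provenance: List[ProvenanceRecord] = field(default_factory=list)


def digest(text: str) -> str:
    """Checksum used to tie a provenance record to concrete data."""
    return hashlib.sha256(text.encode("utf-8")).hexdigest()


def run_step(resource: Resource, name: str,
             service: Callable[[str], str]) -> Resource:
    """Apply one service and return a new resource with extended provenance."""
    output = service(resource.content)
    record = ProvenanceRecord(name, digest(resource.content), digest(output))
    return Resource(output, resource.provenance + [record])


def run_pipeline(source: Resource,
                 steps: List[Tuple[str, Callable[[str], str]]]) -> List[Resource]:
    """Run all steps, keeping every intermediate result."""
    results = [source]
    for name, service in steps:
        results.append(run_step(results[-1], name, service))
    return results


# Two stand-in "services" applied to a short example text.
example_steps = [
    ("lowercase", str.lower),
    ("tokenize", lambda text: " | ".join(text.split())),
]
example_results = run_pipeline(Resource("Ein Beispiel Satz"), example_steps)
```

Each element of `example_results` carries the full chain of steps that led to it, so an intermediate result can be stored on its own without losing track of its sources.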

What types of data are there?
  • Can one generalize over all types, e.g. regarding the notion of "data publication"?
  • Some types of data, e.g. in typology or field linguistics, are very time-consuming to produce, whereas other types may be generated in seconds.

Customizable editions of data
  • tuning the way you look at the data to specific research needs: one view is not enough
  • Wittgenstein example
  • how to reference such a rendition -> the PID needs to include all parameter settings; guaranteed availability of the "tool"
  • data mashups: the user participates in creating data
  • store mashups again as new data? Keeping provenance information of the sources is again essential
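
One possible reading of "the PID needs to include all parameter settings" is sketched below: the reference to a rendition is the PID of the underlying data plus a canonical, sorted encoding of the tool and its settings, so that the same view always yields the same reference. This is only an assumption about how such a scheme could look. The functions `rendition_reference` and `parse_reference`, the query-string encoding, and the example PID and tool name are all hypothetical and not part of any PID system.

```python
from typing import Dict, Tuple
from urllib.parse import parse_qsl, urlencode


def rendition_reference(pid: str, tool: str, settings: Dict[str, str]) -> str:
    """Build a reference that pins down the data, the tool and all settings.

    Settings are sorted by name so that the same rendition always produces
    the same reference string. Assumes the PID itself contains no "?".
    """
    if "tool" in settings:
        raise ValueError('"tool" is reserved for the tool identifier')
    items = sorted({**settings, "tool": tool}.items())
    return pid + "?" + urlencode(items)


def parse_reference(reference: str) -> Tuple[str, Dict[str, str]]:
    """Split a reference back into the PID and its parameter settings."""
    pid, _, query = reference.partition("?")
    return pid, dict(parse_qsl(query))


# Hypothetical PID, tool name and settings, for illustration only.
example_reference = rendition_reference(
    "hdl:0000/example",
    "viewer-1.2",
    {"view": "diplomatic", "layer": "2"},
)
```

Including the tool identifier in the reference makes the dependency on the "tool" explicit: the rendition can only be reproduced for as long as that tool version stays available.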

Privacy issues
  • some technology exists to anonymize source materials, e.g. by masking, but this is often not automatic and sometimes not possible, e.g. in sign language.
  • becomes very complex with international access, where different countries may have different rules for guarding privacy. This situation may require different country-specific licences and legal advice.
  • DOBES example: legal specialists advised keeping all data closed. A Code of Conduct with ethical rules is the only workable solution; some data needs to remain closed.