Sign in or 

• Linked in interim report 1
• Edited main page: decisions about relevance and organization
• Distribution of tasks
• SR: interim report 2 (stop by 5:15 to discuss)
• SC: take notes WS3
• Decided to have coordination discussion with working group 3: tools
• MB: Instead, have our conversation via the wiki’s top-level page on “Privacy and Ethics”
• Should we begin WS3 by working on “tools”?
• MB: Add to the Tools Working Group page, or approach with a thread first?
• SR/RW: Less invasive to begin as a thread
• Tools for making tools for checking well-formedness of annotations
• SR: libraries, toolkits
• con: requires software engineering knowledge
• pro: non-standalone, provides anyone to roll your own
• e.g. NLTK toolkit
• Automation tools
• RW: translation tools, for converting non-identical standards for communication between users
• importing, exporting formats
• simple, dedicated purposes
• e.g. Scott Farrar’s Praat -> Unicode transcription tool
• ELAN
• Pros and cons
• MB: [+] can now call Praat, toggle video/audio
• RW: [+] allows you to verify tags, [-] but not align them with someone doing audio
• SC: [+] is open source, allows developers to extend
• RW: This should be part of the Annotation’s Desiderata
• Interoperability: - open standards should have open sources
• all together: This is the kind of fresh viewing we want to avoid conveying.
• what is on there now, that is severely not worded well:•
• Why (#2) is necessary:
• First, the kinds of resistance from native communities, such as the Deaf community, to something that is immediately perceived as a written system. The convention of tags aims to capture the spatial signal that is visual
• Consensus:• Second, prosodic signals, for example, exist only to add information to the primary data. So they require an implementation for tagging the primary signal
• 2 components provide information
• 1st is the input (the raw, primary data)
• So if we’re all in agreement about what annotation is, what is problematic with the wording above such that a first-pass reader would make such off interpretations?• 2nd is the information that adds the analysis, the tags, the transcription (the annotation)
• SR: Is it that it looks like it’s saying transcription and tagging are one and the same, rather than significant aspects to annotation?
• GC: It’s that it looks like it undersells annotation. Many linguists already feel like they know what certain terms mean...
• MB: Let’s not use “transcription” without prefacing what kind of transcription we mean... Let’s not use transcription at all. Let’s go back to the three-way annotation that Chuck brought up yesterday.
• you can think of dissatisfaction with annotation as having multiple ways of saying it. Given the example “You had’ve seen it” instead of “You hadn’t have seen it”
• 1 - leaving it out altogether
• 2 - annotating it with an analysis that is wrong given what you’re presumed to be looking at
• 3 - locate the annotator, made a query about it. looked at standards, and had a way of extracting it.
• MB: I think annotation is good, if there’s a built-in way of making queries, modifications, backtracking, in order to recover the primary data using the conventions around the annotation
• This way, the goal is still to stay true to the primary data, not to stay true to the idea of the “standard”
• And I think that having annotative tags helps with this
• CF: Take the example “whose” vs. “who’s”, whose orthographic convention should predict “who’s” as the correct
• This suggests that the orthography (if we consider it an annotation) is not true to its own conventions
• MB: Let’s not try to make a distinction, then, but clean up our writing
• P1: ...“added conventionalized representation”... “to primary linguistic data”... what is linguistic data? Then... what is “linguistic data”?
• ex1: orthography
• ex2: segmental transcription
• ex3: tagging of named entities
• ex4: parsing
• RW: present these examples just as bullets for now?
• MB: as prose, pack it in
• SC: as prose, with clear examples of one source of primary data annotated with each example?
• RW: Bow et al.’s 2002 paper with many different examples of interlinear glosses
• SC: Does this flow of ideas mean we want to collapse “What is annotation?” with “What is annotation for?”
• P2 ...examples show need for reference of set of analyses and community of analysts who would provide the annotation conventions
• GC: annotation may provide value
• 1- that saves expended work, so that I can just proceed with the analysis I was going to make anyway
• 2 - that otherwise wouldn’t be available at all, because I have no access to the language otherwise (e.g. morphosyntactic analysis of some remote language).
• MB: I think this goes back to the purpose of the distinction I was trying to make before. It goes back to the meaning of what it means to make a “transcription”
• a cogent example, when you supply, as best you can, an alphabetic transcription of a two-year-old’s speech. You have to make decisions about the steps of transcription you make
• For our purposes, we need to get out of what the annotator is doing--2 kinds of information that are in conflict with each other
• 1 - Would this lingual obstruent be conceived as a stop for the adult phonemic target? (by the native speaker transcriber)
• 2 - What does the phonetician hear as closest to the adult phonetic space (by the native speaker phonetician)
• SR: I think MB’s point is, “annotation is not what you think it is”, and I think GC’s point is “annotation adds value that wouldn’t be there otherwise”
• SR/SC: Annotation (standards) can really help with bringing about the culture change for valuing crosslinguistic/crosslingual domains of interest using understood platforms of convention. SR: different domains can talk to each other. SC: different languages within the same domains, even, which doesn’t yet obtain.
• MB: So this leads to an expanded section on “What is annotation standards?”
• SR: how about 2 sections:
• (1) What is annotation and what is it good for?
• (2) What are annotation standards and what are they good for?
• Invite others to add to bibliography
• Recruits others into community
• addresses some of M.Liberman’s issues
• Clean up front end
• Walk through new wiki sections
|
ashraGurnch |
Latest page update: made by ashraGurnch
, Jul 19 2009, 3:12 AM EDT
(about this update
About This Update
2 words added 1 word deleted view changes - complete history) |
|
Keyword tags:
None
More Info: links to this page
|