Sign in or 

| Case study subdiscipline: | Sociolinguistics |
| Project title: | DIALECT EVOLUTION AND ONGOING VARIABLE LINGUISTIC INPUT: ENGLISH IN THE PACIFIC NORTHWEST 200 YEARS AFTER LEWIS AND CLARK National Science Foundation Award: BCS-0643374 (Alicia Beckford Wassink, Principle Investigator, University of Washington, Department of Linguistics) |
| Software used: | Microsoft Sharepoint Server 2007 (for version control and remote collaboration) |
| Goals of this case study: | Demonstrate the use of versioning software in an online research collaboration area (ORCA) to register the changes made to spoken language recordings and associated data to enable tracking of modifications, and make transparent the nature of and motivations for the changes |
• Elicitation of data involves the utilization of a hybrid methodology, combining phonetic analysis with a standard multi-part variationist sociolinguistic interview schedule allowing collection of data in different spoken registers (unscripted conversation, one-on-one interview, reading passage, word lists, semantic differentials, syntactic diagnostic prompts).
• While we cannot make all elicitation instruments available in this wiki (to avoid exposing materials to potential respondents), similar elicitation materials are publicly available in the Elicitation Materials Clearinghouse, Sociolinguistics Laboratory, University of Washington.
• Two-part sample includes data recorded in the field for a judgement sample and in the laboratory using telephony devices to acquire data for a complementary random sample.
• Original recordings (recorded at a 44.1kHz sampling rate in uncompressed form to compact flash media, using M-Audio MicroTrack digital flash recording devices) are stored in four locations (as required by IRB protocols): 1) on a file server subjected to regular, incremental backups, 2) in ISO-9660 formatted compact disks, locked in a cabinet accessible only to the principal investigator, 2) in redacted form on compact disks in a CD archive, 3) in redacted form on a file server in an online research collaboration area (Microsoft SharePoint ORCA).
• Redacted formats have been edited in Praat software for the removal of potentially identifying subject information, i.e., the acoustic signal has been attenuated to zero, while leaving the time dimension intact. This allows all versions of the soundfiles to retain original timings, enabling location of temporal events of interest across versions of the recordings and transcriptions (which have been time-stamped based upon the non-redacted versions of the signal).
• Version control is provided via an online research collaboration workspace created using Microsoft SharePoint 2007. SharePoint runs on any platform (Researchers in our team are currently using MAC, Windows and Linux operating systems). Versioning is particularly useful in the document libraries where soundfiles, transcriptions, and praat text tiers are stored (Fig 1).
Fig 1. Screenshot showing organization of ORCA main page.
• Version control requires (in this case, although other versioning software varies) that each user check out a soundfile or transcript from the document library. SharePoint allows only one user to check out a file at a time, but other software (such as CSV, Subversion, etc) does not have this limitation.
• The file is modified by the user.
• At the end of a work session, the user uploads the modified version of the file to the document library. The software prompts the user to provide comments regarding what changes were made to the document, and automatically timestamps the new file with the upload time and version (Figs 2a,b). Figure 2a shows the SharePoint pulldown for a file called SR2CF2A_non-conversational. This is an orthographic transcription file that exists in several versions because it has been subjected to a process of anonymization. We desire to view the version history for this file. Figure 2b shows the version history for this file. The current iteration is version 6, which has been wiped of information that potentially identifies a study participant. Clicking on any version (from 1-6) will result in a prompt by the system to view or restore that iteration.
Fig 2a. version history pulldown from main document library
Fig 2b. Screenshot of version history comment page
• Crucially, all prior versions are available to the user. This allows full control and comparison of different versions of the documents stored in the library without overwriting data.
• A discussion area within the ORCA allows discussion of substantive changes to collection, analysis, and other protocols so that important decisions may be registered as part of the project history.
Fig 3. Screenshot showing topic list from the general discussion site
• Akustyk software is used for associating project, speaker and token level metadata with events in the sound file.
• A project handbook registers methodology and decisions made.
- The metadata associated with all recordings is here
• Sharepoint allows for restriction of access depending on permissions criteria for each member of the research team. It is possible, in principle, to share redacted versions of the recordings with all members of the team with data analysis functions, and restrict access to the non-redacted versions to the PI. Permissions criteria are set by principal investigator.
• The public face of the project includes: 1) the project website, 2) exemplifying soundfiles that may be played out or downloaded from maps on the project website, 3) individuals and organizations may download datafiles for particular speakers from the project website, for the set of speakers who have consented that their materials be made available in this way (see Human Subjects consent form sample).
|
AliciaBW |
Latest page update: made by AliciaBW
, Jul 22 2009, 12:21 PM EDT
(about this update
About This Update
1 word added 1 word deleted view changes - complete history) |
|
Keyword tags:
None
More Info: links to this page
|