Aligning text to audio and video using ELAN 1

Andrea Berez
Christopher Cox

Course days, time, and location:
6/22, 6/23, 6/24, & 6/28
1:30 - 3:15
Knight Library, Room 267b

Course Information:
This is a two-part workshop on the use of ELAN software, which enables the creation of archival-quality time-aligned transcriptions of audio and video. The Level 1 workshop is an introduction to the basic functions of ELAN, including conceptualizing transcripts and step-by-step instructions for building annotation files of different levels of complexity. The Level 2 workshop will show students how to integrate ELAN into typical language documentation workflows, including transforming time-aligned texts into friendly presentation formats that are suitable for language learners and speakers, such as subtitled DVDs and web-based formats using tools like CuPED.

Instructor:  Please use the following link to access all wav, eaf, and mp4 files:
If you have any questions, please call Marnie on her cell phone.  Thank you!

Course Documents:
ELAN 1 Coursepack

Exercise 1 - Language Opinion
Language Opinion (wav)  

Exercise 2 - Television
Television Text
Television (wav)

Exercise 3 - Tortilla Making
Tortilla Making Text
Tortilla Making (wav)

Exercise 4 - Pear Story
Pear Story Text
Pear Story (wav)

Exercise 5 - Morning
Morning Text
Morning (wav)
Morning Camera One  
Morning Camera Two  
Morning (eaf)

Exercise 6 - Roadtrip
Roadtrip (wav)

Instructor(s) Bio:
Andrea Berez is a doctoral candidate in the linguistics department at the University of California, Santa Barbara. She is a descriptive and documentary linguist who works primarily with speakers of Ahtna and Dena’ina, two endangered Athabascan languages of southcentral Alaska. Her linguistic interests include intonation, spatial cognition and discourse-functional approaches to grammar. She is also interested in the development of the technological infrastructure to support language documentation and archiving.

Christopher Cox is a doctoral student in the Department of Linguistics at the University of Alberta.  His research centres on language documentation and description and corpus and computational linguistics, concentrating upon the collaborative development of permanent collections of language resources for both community and academic use.  He has been involved in language documentation efforts with speakers of Tsuut'ina, an endangered language of southern Alberta, and of Plautdietsch, the traditional language of the Dutch-Russian Mennonites, and currently serves as the moderator of the "Plautdietsch-L" community listserv.

                                              Updated July 20, 2010 at 3:49 am