Causality Across Languages (CAL): Datooga
|Affiliation||University of Bristol|
|Funding Body||National Science Foundation|
|Collection Status||Collection online|
|Landing Page Handle||http://hdl.handle.net/2196/0584c102-ee8a-4e47-b235-a2c752aaa95a|
Summary of the collection
This collection contains audio and video recordings of twelve Datooga speakers conducting the Causality Across Languages discourse task. The task involved showing participants a series of short video clips to elicit event descriptions (e.g., a woman breaking an egg). Each session in the collection corresponds with one participant. Sessions contain a compressed video recording of the participant conducting the task, an extracted WAV file, and an ELAN file with a transcription of the recording.
This collection represents speakers of the Datooga language who belong to the Datooga ethnic group of Tanzania. Within Datooga there are different ethnic subgroups which speak different dialects of the language. Sessions 4, 11, and 12 involve participants who identify as Gisamjanga-Datooga. Sessions 7 and 10 involve participants who identify as Rootigeenga-Datooga. Session 9 involves a Bureadiga-Datooga participant. All other sessions involve Barabaiga-Datooga participants. There are therefore some dialectal differences in the collection. All speakers come from two adjacent districts of Tanzania: Hanang’ and Mbulu.
Datooga is a Southern Nilotic language spoken in northern and central Tanzania. Datooga are traditionally semi-nomadic pastoralists and are thus fairly widely dispersed across the country, especially so in recent decades. The speakers represented in this collection all live in Mbulu District or Hanang’ District of Manyara Region, Tanzania. Working languages of the collection include Swahili and English.
The transcriptions use the Datooga orthography developed by the Datooga Bible Translation Project, with the only difference being that
is used rather than for the voiceless uvular stop. Datooga is a tonal language but the transcriptions do not (yet) contain tone marking.
The collection contains twelve sessions, each session representing one participants’ response to the discourse task. Sessions contain a compressed MP4 video recording, an extracted WAV file, and an ELAN file with a transcription of the recording.
The data for this collection was collected by Alice Mitchell in April and May 2016. All recordings were made at the IM Hotel in Haydom, Manyara Region, Tanzania.
Acknowledgement and citation
Users of any part of the collection should acknowledge Alice Mitchell as the data collector and researcher. Users should also acknowledge Juergen Bohnemeyer as the Primary Investigator of the Causality Across Languages project and the National Science Foundation for funding the research. Individual speakers do not need to be acknowledged when extracting linguistic examples from their speech.
To refer to any data from the collection, please cite as follows:
Mitchell, Alice. 2017. Causality Across Languages (CAL): Datooga. Endangered Languages Archive. Handle: http://hdl.handle.net/2196/00-0000-0000-000F-F304-0. Accessed on [insert date here].