Link Search Menu Expand Document

Transcribing tasks from the Talk Together Study (TTS)

Last updated: May 21th 2021

This page is our protocol how to approach transcribing data from the various Talk Together Study activities.

Please refer to the previous pages for specific guidance on transcription conventions, language tags, the tiers needed, or the ELAN template.

The playlist of our video tutorials geared towards transcribing data from the Talk Together Study can be found here. However, please cross-reference the wiki for the most up-to-date protocol. Any changes to the protocol since the creation of the videos have been noted in the video description.

Additionally, watch our video regarding things to keep in mind during the transcription process.

Do not hesitate to contact a full-time staff member should you have questions.

Contents


Transcribing Voicemail Game Picture Cards Recordings

For a video demonstration of transcribing a picture card recording, please watch this video from our playlist.

As part of the Voicemail Game in this study, participants will record an audio recording of their responses to three different Voicemail Game activities in English and also in their other preferred language (Mandarin, Malay, or Tamil). The first activity of the Voicemail Game is the Picture Cards activity. We may also refer to this as the Vowel Triangle activity.

The picture cards include a picture and an associated “target” word. Participants will read the individual words one-by-one in English and their other preferred language language. The recording should only contain one adult speaker.

Visit this page to see all the picture cards sorted by language.

  1. Import the audio file (.wav) and the template file (.etf) into ELAN. Refer to the “Transcription Template” section of the wiki on instructions on how to handle the template subsequent to importing it.

  2. Go to “File” and click “Save” or “Save As” like a normal document. The file should be named according to the file naming conventions outlined on the wiki. You can also set an automatic backup of the file by going to “File” > “Automatic Backup”. Choose a time interval between 1 to 30 minutes that suits your preference. Note: the backup file will be saved with the extension *.eaf.001.

  3. In order to see the hierarchical relationship between the dependent tiers, right click on the tier sidebar > Sort Tiers > Sort by Hierarchy.

  4. Listen to the audio file in full before you begin transcribing. Mentally note what languages are being spoken. Note any muffled parts or parts where all speech become indecipherable. Information on how to deal with these parts can be found under Frequently Asked Questions (FAQ).

  5. Edit the tier attributes to replace placeholder participant codes with accurate participant ID codes for all the participants present in the recording. This should be done on all the present participant’s primary and secondary tiers.(e.g. if the participant ID is P1234TT and the mother is the only speaker in this recording, the Mother code MPXXXXTT should be edited to MP1234TT on all the Mother primary and secondary tiers). DO NOT DELETE UNUSED TIERS. Right click on the tier panel and hide them if they are getting in your way

  6. Adjust the vertical zoom (right click on the waveform > Vertical Zoom > adjust the %) and horizontal zoom (adjust the slider at the bottom right of the ELAN screen) to help you see the waveform clearly.

  7. Put on your headphones and begin transcribing the audio file.
    • Isolate and annotate the individual target words on the Utterance, Chunk, Language, and relevant Target_Language (i.e., Target_EL, Target_ML, etc.) tiers. Additionally, translate any non-English words on the Translation tier.
    • The Target_Language tiers (English Target_EL, Mandarin Target_CL, Malay Target_ML) have assigned controlled vocabularies that contain all the picture cards words. The Tamil Target_TL tier requires manual input (i.e., typing) as the ELAN software does not allow for Tamil script in controlled vocabularies (yet).
  8. Ensure you mark out the start and end of the Picture Cards recording on the ActivityMarkers tier. Do this even if the recording only contains one word, or if the recording for the Picture Cards activity is contained within a larger recording that also includes the other Voicemail Game activities (Picture Prompts/Green Grass Park).

    • :marker:start:ads_voweltri_[language] should be used to mark the start of the picture cards activity (e.g., :marker:start:ads_voweltri_Tamil would be used to demarcate the start of the Tamil picture cards recording).
    • :marker:end:ads_voweltri_[language] should be used to mark the end of the picture cards activity (e.g., :marker:start:ads_voweltri_Tamil would be used to demarcate the end of the Tamil picture cards recording).
    • Watch our video here or the read the activity markers section in our tier guide in this wiki should you need more guidance on how the ActivityMarkers tier works.

Back to table of contents


Transcribing Voicemail Game Talk Prompts Recordings

For a video demonstration of transcribing a talk prompts recording, please watch this video from our playlist.

As part of the Voicemail Game in this study, participants will record an audio recording of their responses to three different Voicemail Game activities in English and also in their other preferred language (Mandarin, Malay, or Tamil. The second activity of the Voicemail Game is the Talk Prompts activity.

Participants are given a set of eight (8) topics in English and in their other preferred language. They select one topic in each language, and record their response to the chosen topic prompt in the respective languages. The recording should only contain one adult speaker.

Visit this page to see all the topic prompts sorted by language.

  1. Import the audio file (.wav) and the template file (.etf) into ELAN. Refer to the “Transcription Template” section of the wiki on instructions on how to handle the template subsequent to importing it.

  2. Go to “File” and click “Save” or “Save As” like a normal document. The file should be named according to the file naming conventions outlined on the wiki. You can also set an automatic backup of the file by going to “File” > “Automatic Backup”. Choose a time interval between 1 to 30 minutes that suits your preference. Note: the backup file will be saved with the extension *.eaf.001.

  3. In order to see the hierarchical relationship between the dependent tiers, right click on the tier sidebar > Sort Tiers > Sort by Hierarchy.

  4. Listen to the audio file in full before you begin transcribing. Mentally note what languages are being spoken. Note any muffled parts or parts where all speech become indecipherable. Information on how to deal with these parts can be found under Frequently Asked Questions (FAQ).

  5. Edit the tier attributes to replace placeholder participant codes with accurate participant ID codes for all the participants present in the recording. This should be done on all the present participant’s primary and secondary tiers.(e.g. if the participant ID is P1234TT and the mother is the only speaker in this recording, the Mother code MPXXXXTT should be edited to MP1234TT on all the Mother primary and secondary tiers). DO NOT DELETE UNUSED TIERS. Right click on the tier panel and hide them if they are getting in your way

  6. Adjust the vertical zoom (right click on the waveform > Vertical Zoom > adjust the %) and horizontal zoom (adjust the slider at the bottom right of the ELAN screen) to help you see the waveform clearly.

  7. Put on your headphones and begin transcribing the audio file. Utilise the Utterance, Chunk, and Language tier. For recordings of responses to prompts in the preferred languages (Mandarin, Malay, Tamil), utilise the Translation tier as well.

  8. Ensure you mark out the start and end of the Talk Prompts recording on the ActivityMarkers tier. Do this even if the recording for the Talk Prompts activity is contained within a larger recording that also includes the other activities (Picture Cards/Green Grass Park).

    • :marker:start:ads_prompts_[language] should be used to mark the start of the talk prompts activity (e.g., :marker:start:ads_prompts_Eng would be used to demarcate the start of the English talk prompts recording).
    • :marker:end:ads_prompts_[language] should be used to mark the end of the talk prompts activity (e.g., :marker:start:ads_voweltri_Eng would be used to demarcate the end of the Eng talk prompts recording).
    • Watch our video here or the read the activity markers section in our tier guide in this wiki should you need more guidance on how the ActivityMarkers tier works.

Back to table of contents


Transcribing Voicemail Game Green Grass Park Recordings

For a video demonstration of transcribing a green grass park recording from the voicemail game, please watch this video from our playlist.

As part of the Voicemail Game in this study, participants will record an audio recording of their responses to three different Voicemail Game activities in English and also in their other preferred language (Mandarin, Malay, or Tamil. The third and last activity of the Voicemail Game is the Green Grass Park activity.

Participants are given a picture prompt called “Green Grass Park”. There are three version of the picture; a Mandarin version, a Malay version, and a Tamil Version. All three versions also have English on them. The Mandarin/Malay/Tamil words on the image are “target” words. Participants select one version and record themselves describing what they see in the picture. They are given the option to use whichever presented language they are most comfortable with, or to mix them. The recording should only contain one adult speaker.

Visit this page to see all the Green Grass Park pictures sorted by language (click the images on the page to enlarge them).

  1. Have the Green Grass Park image open. This is so you can reference it to identify the participant’s use of any Mandarin/Malay/Tamil target words in their recording.

  2. Import the audio file (.wav) and the template file (.etf) into ELAN. Refer to the “Transcription Template” section of the wiki on instructions on how to handle the template subsequent to importing it.

  3. In order to see the hierarchical relationship between the dependent tiers, right click on the tier sidebar > Sort Tiers > Sort by Hierarchy.

  4. Go to “File” and click “Save” or “Save As” like a normal document. The file should be named according to the file naming conventions outlined on the wiki. You can also set an automatic backup of the file by going to “File” > “Automatic Backup”. Choose a time interval between 1 to 30 minutes that suits your preference. Note: the backup file will be saved with the extension *.eaf.001.

  5. Listen to the audio file in full before you begin transcribing. Mentally note what languages are being spoken. Note any muffled parts or parts where all speech become indecipherable. Information on how to deal with these parts can be found under Frequently Asked Questions (FAQ).

  6. Edit the tier attributes to replace placeholder participant codes with accurate participant ID codes for all the participants present in the recording. This should be done on all the present participant’s primary and secondary tiers.(e.g. if the participant ID is P1234TT and the mother is the only speaker in this recording, the Mother code MPXXXXTT should be edited to MP1234TT on all the Mother primary and secondary tiers). DO NOT DELETE UNUSED TIERS. Right click on the tier panel and hide them if they are getting in your way

  7. Adjust the vertical zoom (right click on the waveform > Vertical Zoom > adjust the %) and horizontal zoom (adjust the slider at the bottom right of the ELAN screen) to help you see the waveform clearly.

  8. Put on your headphones and begin transcribing the audio file.
    • Transcribe utterances on the Utterance, Chunk, and Language tiers. Also use the Translation tier should you need to translate non-English utterances into English.
    • Isolate and transcribe any instances of individual target words on the relevant Target_Language (i.e., Target_CL, Target_ML, or Target_TL) tiers.
    • The Target_Language tiers (Mandarin Target_CL, Malay Target_ML) have assigned controlled vocabularies that contain all the Green Grass Park Mandarin/Malay Green Grass Park target words. The Tamil Target_TL tier requires manual input (i.e., typing) as the ELAN software does not allow for Tamil script in controlled vocabularies (yet).
  9. Ensure you mark out the start and end of the Green Grass Park recording on the ActivityMarkers tier. Do this even if the recording for the Green Grass Park activity is contained within a larger recording that also includes the other activities (Picture Cards/Talk Prompts).

    • :marker:start:ads_ggp should be used to mark the start of the Voicemail Game Green Grass Park activity.
    • :marker:end:ads_ggp should be used to mark the end of the Voicemail Game Green Grass Park activity
    • Watch our video here or the read the activity markers section in our tier guide in this wiki should you need more guidance on how the ActivityMarkers tier works.

Back to table of contents


Transcribing Video Call Storytime Phase 1 Recordings

The Video Call Storytime (VCST) activity in the Talk Together Study is a recording of an interaction between a parent and child over a video call (Zoom). The Phase/Timepoint 1 VCST recording includes:

  • The parent reading “The Little Orangutan: What a Scary Storm” book to their child. The researcher shares this book on screen.

The VCST will usually include one Parent (Father or Mother), one Baby, and one Researcher. There may be instances where another person’s voice may appear in the recording (e.g., a sibling of the baby, another family member, etc.)

  1. Watch the VIDEO recording file in full before you begin transcribing.
    • Familiarize yourself with the participants’ voices and any contextual information.
    • Note how many participants there are. Identify the participant by their relationship to the baby.
    • Note what languages are being spoken.
    • Note down the timepoints when the story reading segment starts and ends for ActivityMarker purposes
      • the start of the story segment is the very start of the recording, including researcher instruction/speech
      • the end of the story segment is usually when the parent/child says ‘the end’ or finishes recapping the story or finishes talking about the orangutan holding pencils on the last page of the book. The ending marker should be placed right before the researcher comes back into the recording and says something along the lines of “Thank you! You and your child looked like you were having so much fun….” If you’re unsure, ask a full-time staff member.
    • Note any muffled parts or parts where all speech become indecipherable. Information on how to deal with these parts can be found under Frequently Asked Questions (FAQ).
  2. Import the associated AUDIO (.wav) file and the template file (.etf) into ELAN. Refer to the “Transcription Template” section of the wiki on instructions on how to handle the template subsequent to importing it.

  3. a) Go to “File” and click “Save” or “Save As” like a normal document. The file should be named according to the file naming conventions outlined on the wiki. You can also set an automatic backup of the file by going to “File” > “Automatic Backup”. Choose a time interval between 1 to 30 minutes that suits your preference. Note: the backup file will be saved with the extension *.eaf.001.

    b) Update the transcription log - namely AudioFileName (Column B), TranscribedFileName (Column C), Project (Column D, in this case would be TTS), Audio Length (Column E), Language(s) spoken in the recording (Column G), Transcriber 1 (Column H, which would be your name), Date issued (Column I). It will also help to write down anything of note (e.g., audio dropping at any point, the length of an activity) into the Comments by Transcriber (Column M).

  4. Return to ELAN. In order to see the hierarchical relationship between the dependent tiers in ELAN, right click on the tier sidebar > Sort Tiers > Sort by Hierarchy.

  5. Edit the tier attributes to replace placeholder participant codes with accurate participant ID codes for all the participants present in the recording. This should be done on all the present participant’s primary and secondary tiers.(e.g. the participant ID is P1234TT and the mother, baby, and researcher (R099) are present in this recording. The Mother code MPXXXXTT should be edited to MP1234TT on all the Mother primary and secondary tiers. The Baby code PXXXXTT should be edited to P1234TT on all the Baby primary and secondary tiers. The Researcher code R00XPxxxxTT should be edited to R099P1234TT on all the Researcher primary and secondary tiers). DO NOT DELETE UNUSED TIERS. Right click on the tier panel and hide them if they are getting in your way

  6. Adjust the vertical zoom (right click on the waveform > Vertical Zoom > adjust the %) and horizontal zoom (adjust the slider at the bottom right of the ELAN screen) to help you see the waveform clearly.

  7. Put on your headphones and begin transcribing the story reading segment of the audio file.
    • Transcribe utterances on the Utterance, Chunk, and Language tiers. Use the Translation tier should you need to translate any non-English utterances into English.
    • Isolate instances (e.g., mention of the baby’s name) that need redacting on the Sensitive_Masking tier. Live demo on how to use the Sensitive_Masking tier here.
    • What counts as an utterance? Read “A Problem Named Utterance” by Dr. Shamala Sundaray, which can be found here
  8. Ensure you mark out the start and end of the story reading segment in the recording on the ActivityMarkers tier. Follow the guidelines in step 1 to get an understanding of where each activity starts and ends. Do consult a full-time staff member should you have any questions.

    • :marker:start:cds_book should be used to mark the start of the Story Reading segment.
    • :marker:end:cds_book should be used to mark the end of the Story Reading Segment.
    • Watch our video here or the read the activity markers section in our tier guide in this wiki should you need more guidance on how the ActivityMarkers tier works.
  9. Update the transcription log to note your transcription progress on a given file.

    a) If you’re still working on the file, note “ongoing” in Column J (actual trans. hrs) and add the date and amount of audio transcribed in the 1st or 2nd listen column (Column K and L, respectively).

    b) If you have finished transcribing a file alert a full-time staff member, update Column J (actual trans. hrs) with the number of hours it took you to complete the transcription, and write the dates of completion of each listen in Column K and L.

    For more information about each column in the Transcription Log, please refer to this page in the wiki.

Back to table of contents


Transcribing Video Call Storytime Phase 2 Recordings

The Video Call Storytime (VCST) activity in the Talk Together Study is a recording of an interaction between a parent and child over a video call (Zoom). The Phase/Timepoint 2 VCST recordings includes:

  • The parent describing the “Green Grass Park” picture prompt to their child. The parent has a maximum of 5 minutes but can choose to end this activity earlier. The Green Grass Park pictures for the VCST are different from the Voicemail game version, and the transcriber does not need to isolate and transcribe any target words.
  • The parent reading “The Little Orangutan: What a Scary Storm” book to their child

The VCST will usually include one Parent (Father or Mother), one Baby, and one Researcher. There may be instances where another person’s voice may appear in the recording (e.g., a sibling of the baby, another family member, etc.)

  1. Watch the VIDEO recording file in full before you begin transcribing.
    • Familiarize yourself with the participants’ voices and any contextual information.
    • Note how many participants there are. Identify the participant by their relationship to the baby.
    • Note what languages are being spoken.
    • Note down the timepoints when the Green Grass Park (GGP) segment starts and ends for ActivityMarker purposes
      • The start of the GGP segment is the very start of the recording, including researcher instruction/speech
      • The end of the GGP segment is usually when the researcher says something along the lines of “Time’s up! That was great” (meaning the parent reacher the 5 minutes), or if the parent says something along the lines of “Next. I’m ready to move on to the next activity”.
    • Note down the timepoints when the story reading segment starts and ends for ActivityMarker purposes
      • The start of the story segment is when the researcher says something along the lines of “I will now switch off my camera. Just remember, once you are ready to go to the next page, you can tell me ‘Next!’ Are you ready? Okay let’s begin”. Ensure you include this researcher instruction/speech within the starting marker.
      • The end of the story segment is usually when the parent/child says ‘the end’ or finish recapping the story or finish talking about the orangutan holding pencils on the last page of the book. The ending marker should be placed right before the researcher comes back into the recording and says something along the lines of “Thank you! You and your child looked like you were having so much fun….” If you’re unsure, ask a full-time staff member.
    • Note any muffled parts or parts where all speech become indecipherable. Information on how to deal with these parts can be found under Frequently Asked Questions (FAQ).
  2. Import the associated AUDIO (.wav) file and the template file (.etf) into ELAN. Refer to the “Transcription Template” section of the wiki on instructions on how to handle the template subsequent to importing it.

  3. a) Go to “File” and click “Save” or “Save As” like a normal document. The file should be named according to the file naming conventions outlined on the wiki. You can also set an automatic backup of the file by going to “File” > “Automatic Backup”. Choose a time interval between 1 to 30 minutes that suits your preference. Note: the backup file will be saved with the extension *.eaf.001.

    b) Update the transcription log - namely AudioFileName (Column B), TranscribedFileName (Column C), Project (Column D, in this case would be TTS), Audio Length (Column E), Language(s) spoken in the recording (Column G), Transcriber 1 (Column H, which would be your name), Date issued (Column I). It will also help to write down anything of note (e.g., audio dropping at any point, the length of an activity) into the Comments by Transcriber (Column M).

  4. Return to ELAN. In order to see the hierarchical relationship between the dependent tiers, right click on the tier sidebar > Sort Tiers > Sort by Hierarchy.

  5. Edit the tier attributes to replace placeholder participant codes with accurate participant ID codes for all the participants present in the recording. This should be done on all the present participant’s primary and secondary tiers.(e.g. the participant ID is P1234TT and the mother, baby, and researcher (R099) are present in this recording. The Mother code MPXXXXTT should be edited to MP1234TT on all the Mother primary and secondary tiers. The Baby code PXXXXTT should be edited to P1234TT on all the Baby primary and secondary tiers. The Researcher code R00XPxxxxTT should be edited to R099P1234TT on all the Researcher primary and secondary tiers). DO NOT DELETE UNUSED TIERS. Right click on the tier panel and hide them if they are getting in your way

  6. Adjust the vertical zoom (right click on the waveform > Vertical Zoom > adjust the %) and horizontal zoom (adjust the slider at the bottom right of the ELAN screen) to help you see the waveform clearly.

  7. Put on your headphones and begin transcribing the story reading segment and GGP segment of the audio file.
    • Transcribe utterances on the Utterance, Chunk, and Language tiers. Use the Translation tier if you need to translate any non-English utterances into English.
    • Isolate instances (e.g., mention of the baby’s name) that need redacting on the Sensitive_Masking tier. Live demo on how to use the Sensitive_Masking tier here.
    • What counts as an utterance? Read “A Problem Named Utterance” by Dr. Shamala Sundaray, which can be found here
  8. Ensure you mark out the start and end of the GGP segment and story reading segment in the recording on the ActivityMarkers tier. Follow the guidelines in step 1 to get an understanding of where each activity starts and ends. Do consult a full-time staff member should you have any questions.

    • :marker:start:cds_ggp should be used to mark the start of the VCST Green Grass Park segment.
    • :marker:end:cds_ggp should be used to mark the end of the VCST Green Grass Park segment.
    • :marker:start:cds_book should be used to mark the start of the Story Reading segment.
    • :marker:end:cds_book should be used to mark the end of the Story Reading Segment.
    • Watch our video here or the read the activity markers section in our tier guide in this wiki should you need more guidance on how the ActivityMarkers tier works.
  9. Update the transcription log to note your transcription progress on a given file.

    a) If you’re still working on the file, note “ongoing” in Column J (actual trans. hrs) and add the date and amount of audio transcribed in the 1st or 2nd listen column (Column K and L, respectively).

    b) If you have finished transcribing a file alert a full-time staff member, update Column J (actual trans. hrs) with the number of hours it took you to complete the transcription, and write the dates of completion of each listen in Column K and L.

    For more information about each column in the Transcription Log, please refer to this page in the wiki.

Back to table of contents


Transcribing Video Call Storytime Phase 3 Recordings

The Video Call Storytime (VCST) activity in the Talk Together Study is a recording of an interaction between a parent and child over a video call (Zoom). The Phase/Timepoint 3 VCST recordings includes:

  • The parent reading the picture cards/vowel triangle cards to their child in English and their preferred language (Mandarin, Malay, or Tamil). As per protocol change on 15th November 2021, the Picture Cards activity will now be fully transcribed. See point 7(a) below for more details.
  • The parent reading “The Little Orangutan: What a Scary Storm” book to their child

The VCST will usually include one Parent (Father or Mother), one Baby, and one Researcher. There may be instances where another person’s voice may appear in the recording (e.g., a sibling of the baby, another family member, etc.)

  1. Watch the VIDEO recording file in full before you begin transcribing.
    • Familiarize yourself with the participants’ voices and any contextual information.
    • Note how many participants there are. Identify the participant by their relationship to the baby.
    • Note what languages are being spoken.
    • Note down the timepoints when the Picture Cards (PC) segment starts and ends for ActivityMarker purposes
      • The start of the PC segment is the very start of the recording, including researcher instruction/speech
      • The end of the PC segment is usually when the researcher says something along the lines of “That was great! Thank you so much” (indicating that the parents has read through all the necessary cards), or if the parent says something along the lines of “Next. I’m ready to move on to the next activity”.
    • Note down the timepoints when the story reading segment starts and ends for ActivityMarker purposes
      • The start of the story segment is when the researcher says something along the lines of “I will now switch off my camera. Just remember, once you are ready to go to the next page, you can tell me ‘Next!’ Are you ready? Okay let’s begin”. Ensure you include this researcher instruction/speech within the starting marker.
      • The end of the story segment is usually when the parent/child says ‘the end’ or finish recapping the story or finish talking about the orangutan holding pencils on the last page of the book. The ending marker should be placed right before the researcher comes back into the recording and says something along the lines of “Thank you! You and your child looked like you were having so much fun….” If you’re unsure, ask a full-time staff member.
    • Note any muffled parts or parts where all speech become indecipherable. Information on how to deal with these parts can be found under Frequently Asked Questions (FAQ).
  2. Import the associated AUDIO (.wav) file and the template file (.etf) into ELAN. Refer to the “Transcription Template” section of the wiki on instructions on how to handle the template subsequent to importing it.

  3. a) Go to “File” and click “Save” or “Save As” like a normal document. The file should be named according to the file naming conventions outlined on the wiki. You can also set an automatic backup of the file by going to “File” > “Automatic Backup”. Choose a time interval between 1 to 30 minutes that suits your preference. Note: the backup file will be saved with the extension *.eaf.001.

    b) Update the transcription log - namely AudioFileName (Column B), TranscribedFileName (Column C), Project (Column D, in this case would be TTS), Audio Length (Column E), Language(s) spoken in the recording (Column G), Transcriber 1 (Column H, which would be your name), Date issued (Column I). It will also help to write down anything of note (e.g., audio dropping at any point, the length of an activity) into the Comments by Transcriber (Column M).

  4. In order to see the hierarchical relationship between the dependent tiers, right click on the tier sidebar > Sort Tiers > Sort by Hierarchy.

  5. Edit the tier attributes to replace placeholder participant codes with accurate participant ID codes for all the participants present in the recording. This should be done on all the present participant’s primary and secondary tiers.(e.g. the participant ID is P1234TT and the mother, baby, and researcher (R099) are present in this recording. The Mother code MPXXXXTT should be edited to MP1234TT on all the Mother primary and secondary tiers. The Baby code PXXXXTT should be edited to P1234TT on all the Baby primary and secondary tiers. The Researcher code R00XPxxxxTT should be edited to R099P1234TT on all the Researcher primary and secondary tiers). DO NOT DELETE UNUSED TIERS. Right click on the tier panel and hide them if they are getting in your way

  6. Adjust the vertical zoom (right click on the waveform > Vertical Zoom > adjust the %) and horizontal zoom (adjust the slider at the bottom right of the ELAN screen) to help you see the waveform clearly.

  7. Put on your headphones and begin transcribing the PC segment (a) and story reading segment (b) of the audio file.

    7a. Transcribing the PC segment t3_picturecards_eng_20211126.png

    t3_picturecards_mandarin_20211126.png

    • For the picture cards segment during the timepoint 3 video call story time, transcribers need to also can isolate the target words spoken by the parent or child on the participants’ respective Target word tiers (e.g., Target_EL for target English words). Please see the two images above as examples.
    • Target word tiers for the baby do not have a controlled vocabulary assigned to them. Transcribers can freely transcribe any instance of the baby’s attempt at vocalising a target word (e.g., if a baby says ‘pa’ instead of ‘pea’).
    • The picture card activity is looking to elicit the following sounds:
      • xi (for Mandarin Picture Cards)
      • si (for Mandarin Picture Cards)
      • pi (for English, Mandarin, Tamil, Malay Picture Cards)
      • pa (for English, Mandarin, Tamil, Malay Picture Cards)
      • pu (for English, Mandarin, Tamil, Malay Picture Cards)
    • For incomplete target words, it is necessary to isolate them in the target word tiers if the target sounds are elicited. See the images below as examples.

    Example: in ‘西瓜’ if parent says ‘西’, we should tag it in the Target_CL tier. However, if only ‘瓜’ is said, it is not necessary to label it in the Target_CL tier. This would apply to English, Malay and Tamil as well, whereby in Malay if ‘pa’ was said, it should be indicated in the Target_ML tier but there is no need to do so for ‘dang’.

    t3_targetwords_xigua.png

    t3_targetwords_padang.png

    • At times, parents might say the plural versions of the target words (e.g. peas, partners). In this case, it is still necessary to isolate them in the target word tiers as the target sounds were elicited.

    t3_targetwords_plural_partners.png

    t3_targetwords_plural_peas.png

    • If target words are spelt out letter by letter i.e. parent says each letter out loud, they do not need to be tagged in the target word tiers. When a word is being said out letter by letter, it should be annotated like this: p... e... a... . Do refer to the example as shown below.

    t3_targetwords_spellingofpea.PNG

    • Isolate instances (e.g., mention of the baby’s name) that need redacting on the Sensitive_Masking tier. Live demo on how to use the Sensitive_Masking tier here.
    • Use the Translation tier if you need to translate any non-English utterances into English.

    7b. Transcribing the story reading segment

    • Transcribe utterances on the Utterance, Chunk, and Language tiers. Use the Translation tier if you need to translate any non-English utterances into English.
    • Isolate instances (e.g., mention of the baby’s name) that need redacting on the Sensitive_Masking tier. Live demo on how to use the Sensitive_Masking tier here.
    • What counts as an utterance? Read “A Problem Named Utterance” by Dr. Shamala Sundaray, which can be found here
  8. Ensure you mark out the start and end of the PC segment and story reading segmentin the recording on the ActivityMarkers tier. Follow the guidelines in step 1 to get an understanding of where each activity starts and ends. Do consult a full-time staff member should you have any questions.

    • :marker:start:cds_voweltri_Eng should be used to mark the start of the VCST Picture Cards/Vowel Triangles English segment.
    • :marker:end:cds_voweltri_Eng should be used to mark the end of the VCST Picture Cards/Vowel Triangles English segment.
    • :marker:start:cds_voweltri_[language name] should be used to mark the start of the VCST Picture Cards/Vowel Triangles preferred language segment (i.e., Mandarin, Malay, or Tamil).
    • :marker:end:cds_voweltri_[language name] should be used to mark the end of the VCST Picture Cards/Vowel Triangles preferred language segment (i.e., Mandarin, Malay, or Tamil).
    • :marker:start:cds_book should be used to mark the start of the Story Reading segment.
    • :marker:end:cds_book should be used to mark the end of the Story Reading Segment.
    • Watch our video here or the read the activity markers section in our tier guide in this wiki should you need more guidance on how the ActivityMarkers tier works.
  9. Update the transcription log to note your transcription progress on a given file.

    a) If you’re still working on the file, note “ongoing” in Column J (actual trans. hrs) and add the date and amount of audio transcribed in the 1st or 2nd listen column (Column K and L, respectively).

    b) If you have finished transcribing a file alert a full-time staff member, update Column J (actual trans. hrs) with the number of hours it took you to complete the transcription, and write the dates of completion of each listen in Column K and L.

    For more information about each column in the Transcription Log, please refer to this page in the wiki.

Back to table of contents