Video recordings represent an abstraction from an activity or event in reality. A video cannot capture all aspects of the event recorded. A thousand analytic decisions influence how useful the video is, including when the recording is started and stopped, where the camera is placed, where the camera is pointed, how the image is framed by zooming, where microphones are placed, how many cameras are used, and how intrusive the recording operation is allowed to be. The video cannot capture everything. Still, video can be a very useful tool for research.
Transcripts represent a further degree of abstraction. A transcript cannot capture all aspects of data in a video. What is captured in the transcript? Speech is common, but to what level of detail? Are sounds that are not words included? “Um”s and “uh”s and partial words in imperfect spoken language, and to what level of detail? How is intonation represented, if at all? Movement? Posture? Body language? Facial expression? Does a participant’s sarcasm, grief, or excitement come through clearly? How does one reflect the surroundings, the environment where recording occurs? How do you transcribe music, or dance? Do you even have time to transcribe, given that the customer wants a final analysis by Tuesday? Another thousand analytic decisions must be made.
The following describes how to transcribe in Transana. What you transcribe and how you transcribe are up to you.
Types of Transcripts
- with Deepgram Server
- with Speechmatics Server
Transana's Built-In Transcription Shortcut Keys
Advanced Time Codes
Time Codes with Multiple Simultaneous Transcripts
Define New Transcription Shortcuts
Loading Existing Media Files and Controlling Media Playback
Add a Still Image to a Transcript