Automatic Audio Transcription (AAT)

Automatic Audio Transcription (AAT) is contolled by a single check box on the Data Conversion Tool. Under the covers the ResCarta Toolkit uses the open source SPHINX software from Carnegie Mellon University of Pittsburgh. This software converts digital audio to text much like printed text is converted to characters and words by Optical Character Recognition software (OCR). The ResCarta Toolkit currently supports English as the language output and uses the Wall Street Journal dictionary to provide more recognition of family names.

Recent software developments by OpenAI have produced WHISPER tools for creating SRT files and TXT files from audio and video formats. Whisper supports many languages and is more accurate than our internal SPHINX. We here at the foundation have adapted the DATA CONVERSION TOOL to accept SRT files from WHISPER tools. To do so just place the SRT and the WAV file or SRT and MP4 file in a directory and UNCHECK the AAT check box. The DCT will read the SRT into the resulting ResCarta Object and make it searchable.

We also advise the use of another tool called SRTeditor from Niskee This tool includes WHISPER and is an excellant SRT tool.

The Audio Transcription Editor (ATE) allows the automatically produced transcription to be corrected for final inclusion in your digital archive. Each word is stored within the Broadcast Wave formatted file along with its timecode so that word search in your audio files can be accomplished.