Automatic Audio Transcription (AAT) is controlled by a single check box on the Data Conversion Tool. Under the covers, the ResCarta Toolkit uses the open source SPHINX software from Carnegie Mellon University in Pittsburgh. This software converts digital audio to text, much as Optical Character Recognition (OCR) software converts printed text to characters and words. The ResCarta Toolkit currently supports English as the output language and uses the Wall Street Journal dictionary to improve recognition of family names.
The Audio Transcription Editor (ATE) lets you correct the automatically produced transcription before it is included in your digital archive. Each word is stored in the Broadcast Wave formatted file together with its timecode, so you can search your audio files by word.
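To illustrate why storing a timecode with each word makes word search possible, here is a minimal Python sketch. It does not read a Broadcast Wave file and does not reflect how the ResCarta Toolkit stores or indexes its data; the TimedWord record, the find_word function, and the sample transcript are all hypothetical names and values chosen for this example.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class TimedWord:
    """One transcribed word with its position in the audio (hypothetical layout)."""
    text: str
    start_s: float  # offset from the start of the recording, in seconds
    end_s: float


def find_word(words, query):
    """Return the start time of every word that matches the query, ignoring case."""
    q = query.casefold()
    return [w.start_s for w in words if w.text.casefold() == q]


# Invented sample data standing in for a corrected transcription.
transcript = [
    TimedWord("welcome", 0.00, 0.42),
    TimedWord("to", 0.42, 0.55),
    TimedWord("the", 0.55, 0.68),
    TimedWord("archive", 0.68, 1.20),
    TimedWord("the", 1.90, 2.01),
    TimedWord("collection", 2.01, 2.70),
]

# Each hit is an offset a player could seek to.
hits = find_word(transcript, "The")
for start in hits:
    print(f"match at {start:.2f} s")
```

Because every word carries its own start time, a search result can point straight to the moment in the recording where the word is spoken, rather than only to the file that contains it.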