ResCarta® Toolkit: ResCarta Data Conversion Tool
The ResCarta Data Conversion Tool converts your TIFF, JPEG, PDF (image only), PDF (image and text), MP4 video and Wave Audio files into ResCarta archive data format.
The ResCarta archive data format for documents and photographs is comprised of TIFF 6.0 image files and Library of Congress METS-formatted XML files for object- and collection-level metadata, as well as a Library of Congress Standard MODS XML file, which is placed into the header of each TIFF file. When you use the ResCarta Data Conversion Tool to convert image and text or Normal PDF files, the tool will also write the word data into a public tag along with word location and font information.The Data Conversion Tool uses the included pdfbox to create transcriptions of image files containing printed text.
ResCarta archive data format for Audio Objects is comprised of a Broadcast Wave file (With embedded Library of Congress Standard MODS XML file in the aXML chunk of BWF file and included BEXT metadata, and Transcription information, if any, stored in marker chunks), an OGG presentation audio file and a Library of Congress METS-formatted XML files for object- and collection-level metadata. The Data Conversion Tool uses the included CMU SPHINX project to create transcriptions of audio files containing spoken words. It can also take text input from SRT files for audio and video created by Subtitle Edit or Whisper.
ResCarta archive data format for Video Objects is comprised of an MPEG-4 video in an MP4 container with object level metadata expressed in METS. The object may also contain SRT (SubRip Text) subtitle files in various languages.
Any new, digitized object that you create for inclusion in your ResCarta database needs to be converted into the ResCarta archive data format in order to be recognized.
Watch the ten minute website on YouTube.