Virage Launches AudioLogger 1.0 at Demo 99 Conference
Industry's first real-time audio cataloger makes video searchable by spoken words, speaker names and audio types
INDIAN WELLS, Calif.--Feb. 8, 1999--Virage, Inc. today debuted and announced the immediate availability of version 1.0 of its AudioLogger software, the first application that automatically transforms the audio content of video into searchable text in real time. By intelligently "listening" to the audio track of a video, the AudioLogger identifies spoken words, speaker names and audio types, generating a powerful index that allows users to search for and find specific segments of video. The announcement was made at the Demo 99 Conference, the industry's premier showcase of leading-edge products and technologies.
"At CNN, the vast majority of video that we deal with is raw footage that contains no embedded textual material, such as closed captioning, from which to construct an index of the content," said Kevin Ivey, Vice President of Research & Development at Cable News Network, Inc. "We are very excited about the introduction of the AudioLogger because of its potential to allow broadcasters to extract and index, in real time, the wealth of information contained in the audio tracks of video. The ability to search incoming news feeds by speaker or by spoken content would enable us to accelerate the rate at which we bring stories to air, one of CNN's major objectives. We see this as an important developing component to our ongoing content management efforts."
"Making video searchable is the perfect application for today's audio and speech technologies," said Paul Lego, chief executive officer of Virage. "Virage extends the benefits of searchable video beyond the broadcast market by automatically turning the audio portion of any video into searchable text. The combination of the AudioLogger and VideoLogger creates a unique solution that transforms video into a powerful decision support and communication tool for the corporate and Internet markets."
Developed around a powerful and extensible framework, AudioLogger is a PC-based application that combines three unique audio processing engines to automatically generate keyword, speaker identification and audio classification indices from a raw audio signal. Whether the signal originates from a satellite, a VTR, a live feed or directly from a microphone, AudioLogger transforms the audio portion into a rich index of textual content in real time, thereby eliminating the dependency on tedious manual annotation or costly closed captioning.
Speech Keyword Engine
AudioLogger generates a keyword index for any speech sample using IBM's ViaVoice technology for Broadcast Speech Transcription. This speech recognition engine handles continuous speech in real time and is speaker independent, eliminating the need for it to be pre-trained for individual speakers. The engine also incorporates special filtering to eliminate background noise and other signal contamination.
Speaker Identification Engine
The AudioLogger is the first commercial product to incorporate IBM's real-time ViaVoice technology for Broadcast Speaker Identification. AudioLogger identifies voices from a user-defined library of up to 300 speakers per session, regardless of the words or even the language spoken. By simply providing a short speech sample, users can easily add new speakers to the library. Multiple libraries can be created to support different content types and sources.
Audio Classification Engine
AudioLogger also generates an audio classification index that allows users to locate specific audio cues. For example, a segment might be classified as speech, music, ambient noise or silence.
AudioLogger combines these various audio indices with the video indices created by Virage's VideoLogger. The result is an extremely accurate index of the video that can be exported to a variety of data stores, such as Web servers, database management systems or media asset management systems, allowing searchable video to be incorporated into a broad range of applications used throughout the enterprise. With the Virage VideoLogger and AudioLogger applications, users no longer have to spend hours creating transcripts or adding annotations manually to make video searchable--it all happens automatically, effortlessly and in real time.
The Virage AudioLogger 1.0 is currently shipping with a unit price of $15,000 (U.S.). For additional product or purchase information, contact Virage Sales at (650) 573-3236 or info@virage.com.
About Virage, Inc.
Virage is the pioneer and recognized market leader in video and image search products. The Virage VideoLogger and AudioLogger software set the standard for real-time indexing and distribution of video across the Internet or corporate intranets, and Virage has been named the market winner by industry analyst group Frost & Sullivan. Virage customers include ABC News, AltaVista, BBC, CBS News, CNN, CNN Interactive, Compaq, Federal Bureau of Investigation, General Motors, Harvard Business School, Lockheed Martin, Lucent Technologies, NASA, NBC News, Reuters and several classified U.S. government agencies. These organizations rely on the Virage VideoLogger as the critical foundation technology for effectively deploying video within their operations. Headquartered in San Mateo, California, Virage was incorporated in 1994 to provide organizations with advanced methods for accessing and leveraging media assets. For more information, see the Virage Web site or call (650) 573-3210.