Virage Launches AudioLogger 1.0 at Demo 99 Conference
Industry's first real-time audio cataloger makes video searchable by spoken words, speaker names and audio types INDIAN WELLS, Calif.--Feb. 8, 1999--Virage, Inc. today debuted and announced the immediate availability of the 1.0 version of its AudioLogger software, the first application that automatically transforms the audio content of video into searchable text in real time. By intelligently "listening" to the audio track of a video, the AudioLogger identifies spoken words, speaker names and audio types, generating a powerful index that allows users to search for and find specific segments of video. The announcement was made at the Demo 99 Conference, the industry's premiere showcase of leading edge products and technologies.
"At CNN, the vast majority of video that we deal with is raw footage that contains no embedded textual material, such as closed captioning, from which to construct an index of the content," said Kevin Ivey, Vice President of Research & Development at Cable News Network, Inc. "We are very excited about the introduction of the AudioLogger because of its potential to allow broadcasters to extract and index, in real time, the wealth of information contained in the audio tracks of video. The ability to search incoming news feeds by speaker or by spoken content would enable us to accelerate the rate at which we bring stories to air, one of CNN's major objectives. We see this as an important developing component to our ongoing content management efforts.""Making video searchable is the perfect application for today's audio and speech technologies," said Paul Lego, chief executive officer of Virage. "Virage extends the benefits of searchable video beyond the broadcast market by automatically turning the audio portion of any video into searchable text. The combination of the AudioLogger and VideoLogger create a unique solution that transforms video into a powerful decision support and communication tool for the corporate and Internet markets.
Developed around a powerful and extensible framework, AudioLogger is a PC-based application that combines three unique audio processing engines to automatically generate keyword, speaker identification and audio classification indices from a raw audio signal. Whether the signal originates from a satellite, a VTR, a live feed or directly from a microphone, AudioLogger transforms the audio portion into a rich index of textual content in real time, thereby eliminating the dependency on tedious manual annotation or costly closed captioning.
Speech Keyword Engine AudioLogger generates a keyword index for any speech sample using IBM's ViaVoice technology for Broadcast Speech Transcription. This speech recognition engine handles continuous speech in real time and is speaker independent, eliminating the need for it to be pre-trained for individual speakers. The engine also incorporates special filtering to eliminate background noise and other signal contamination.
Speaker Identification Engine The AudioLogger is the first commercial product to incorporate IBM's real-time ViaVoice technology for Broadcast Speaker Identification. AudioLogger identifies voices from a user-defined library of up to 300 speakers per session, regardless of the words or even the language spoken. By simply providing a short speech sample, users can easily add new speakers to the library. Multiple libraries can be created to support different content types and sources.
Audio Classification Engine AudioLogger also generates an audio classification index that allows users to locate specific audio cues. For example, a segment might be classified as speech, music, ambient noise or silence. AudioLogger combines these various audio indices with the video indices created by Virage's VideoLogger. The result is an extremely accurate index of the video that can be exported to a variety of data stores, such as Web servers, database management systems or media asset management systems, allowing searchable video to be incorporated into a broad range of applications used throughout the enterprise. With the Virage VideoLogger and AudioLogger applications users no longer have to spend hours creating transcripts or adding annotations manually to make video searchable--it all happens automatically, effortlessly and in real time.
The Virage AudioLogger 1.0 is currently shipping with a unit price of $15,000 (U.S.). For additional product or purchase information, contact Virage Sales at (650) 573-3236 or info@virage.com.
About Virage, Inc.
Virage is the pioneer and recognized market leader in video and image search products. The Virage VideoLogger and AudioLogger software set the standard for real-time indexing and distribution of video across the Internet or corporate intranets and Virage has been named the market winner by industry analyst group Frost & Sullivan. Virage customers include ABC News, AltaVista, BBC, CBS News, CNN, CNN Interactive, Compaq, Federal Bureau of Investigations, General Motors, Harvard Business School, Lockheed Martin, Lucent Technologies, NASA, NBC News, Reuters and several classified U.S. government agencies. These companies rely on the Virage VideoLogger as the critical foundation technology for effectively deploying video within their operations. Headquartered in San Mateo, California, Virage was incorporated in 1994 to provide organizations with advanced methods for accessing and leveraging media assets. For more information, see the Virage Web site or call (650) 573-3210.
Summary: ...clinics, as well as community living centers and Vet Centers. The VHA offers hospital-based surgical and critical care services, as well as specialty services such as audiology and speech pathology, dermatology, dental, oncology, prosthetics, and vision care. And the quality of that care has been called...
Summary: ...website. In another use of Aurasma, when a conference attendee aimed their mobile device at an ad in the conference guide book, an introductory video launched showing university president Richard H. Hart, MD, Dr PH speaking. Users also had the option to click-through to a website that was specifically...
Summary: ...The trial clearly indicated that Autonomy was head and shoulders above the competition, in terms of the product set up and administration, ease of use and stability." (Kevin Phillips, Head of Information Systems). Autonomy at BAE SYSTEMS aggregates content from many sources in many different formats,...
Summary: ...devices. A half-day training session was held for all users. Benefts Helping address the ‘needle in a haystack’ problem The trial was quickly deemed a success, and extended into November. The most obvious impact was, even using selective data ingestion (London area-only tweets and feeds), that the...
Summary: ...brand IT matters •Enabled university students to access Aurasmapowered augmented reality app via mobile devices •Attracted young customers with an integrated campaign that incorporated video, animations, 3D objects, and social media feeds Business matters •More than 56,000 free vouchers for burgers...
Summary: ...would be able to log onto from anywhere in the world and that could provide immediate access to the latest pictures and news articles. With the help of the portal, journalists and business clients should be able to trace critical information swiftly and effortlessly, saving valuable time on research....
Summary: ...by Borough Commanders and Association of Chief Police Officers (ACPO) users via user definable dashboards In addition, MPS is building a series of ‘real time tension indicators’ to help Commanders take stock of a situation. It is considering linking the dashboard via video link to Special Operations...
Summary: ...convert video and audio natural language to text and time synchronize with a streaming preview of the content. Video assets can be quickly and easily found with pinpoint accuracy to the exact location within a video where a word or phrase is spoken. Virage MediaBin also provides a new and intuitive user...
Summary: ...only find a specific feature programme, but also a specific place within the programme that mentions the topic they are searching for. As IDOL’s search functionality is based on the underlying meaning of words and concepts spoken on the radio, search results returned to online users are always of maximum...
Summary: ...achieved a blended Netcentricity Index of 22% and margin contribution of £57M (approx. $115M USD). “We all pay close attention to these numbers,” says Farnsworth. “They’re a vivid indicator of the success of our implementation.” BT is now working toward a unified experience across all channels:...
Summary: ...KeySpan - Case Study. Before Autonomy was in place, this was very time-consuming and tedious to achieve.” By enabling KeySpan to tag content as regulated or unregulated, and based on customer data (client status, location, information need), Autonomy has made if possible for KeySpan to dynamically feed...
Summary: ...Rogers Communications - Case Study. PROMOTE POWER PROTECT Rogers Communications—Turning Up the Volume on Customer Self-Service Dubbed the “CNN” of Canada, Rogers Communications has grown into one of the largest media conglomerates in the world. In addition to its television and radio holdings, the...
This is a small selection of the Autonomy case studies available, please visit our publications site at http://publications.autonomy.com/ for more information.
Summary: ...can convert audio to text using an extensive vocabulary and out-of-the-box speaker independent technology. The solution detects changes among speakers and identifies copy by speaker. Once captured, key words and phrases can be detected and flagged for further attention. 3 Automatic vehicle plate recognition...
Summary: ...such as PowerPoint presentations, Word documents and web pages. To enable analysis of audio files or video audio tracks, IUS’s Rich Media Module is able to understand the meaning of live or recorded communication. Sophisticated audio recognition and analysis technology process spoken interactions based...
Summary: ...HP Autonomy Broadcast Monitoring. Features Index, analyze, and export • Schedule, monitor and manage live, incoming content feeds • Real-time processing with encoding control and synchronization • Advanced audio analysis including speech-totext, speaker identification, entity extraction and language...
Summary: ...TeamSite will automatically transform and optimize the asset depending on the context. For example, if one original high resolution Photoshop file is stored, the image is automatically transformed to fit a web page, a PowerPoint presentation, or any other application. You can create innovative rich media...
Summary: ...Photoshop PSD, vector EPS, TGA, TIFF, JPEG, GIF, PNG, BMP, and flash video, and downloads images in any format, size, resolution, and color space. Advanced file fromat support Supports over 1,000 file types, including 2D CAD, 3D CAD, and 3D Graphic file formats, as well as Camera RAW and DNG digital camera...
Summary: ...Autonomy Explore - Product brief. speech analytics Voice of the customer Customer experience analytics Customer interaction survey Real-time topic indexing Fraud and risk mitigation Social media monitoring 2 Automatic alerting and tagging Once an emerging trend has been discovered, Autonomy Explore offers...
Summary: ...of what customers want and need • Recogniz einteractions with customers, regardless of the language spoken or written • Identify hidden patterns and emerging trends in customer behavior • Profile, segment, and deliver relevant content across multiple channels Sophisticated Reporting Autonomy Optimost...
Summary: ...Autonomy KeyView IDOL Viewing SDK Product Brief. By converting any supported word processing or spreadsheet format to RTF, the performance of applications that process source document content can be dramatically improved. Rapidly Integrate Viewing into Your Applications The KeyView IDOL Viewing SDK contains...
Summary: ...is under license. [AUT TB] 27.06.07 Defense Taxonomy The Autonomy Defense Taxonomy is based on the Defense Technical Information Centre (DTIC) thesaurus published by the U.S. Department of Defense (DoD). This provides a basic multidisciplinary vocabulary that includes close to 12,000 topics, such as “communications,...
Summary: ...ways: • Faster and much less expensive than manual transcription leveraging an array of speech analytics and automation to save time and money • Battle tested and sophisticated speech technology overcomes issues around audio quality, accents / dialects, people talking over each other, etc. • Advanced...
Summary: ...Faster and much less expensive than manual transcription leveraging an array of speech analytics and automation to save time and money • Battle tested and sophisticated speech technology overcomes issues around audio quality, accents / dialects, people talking over each other, etc.
...
This is a small selection of the Autonomy Product Briefs available, please visit our publications site at http://publications.autonomy.com/ for more information.
Summary: ...Solutions for SharePoint 7 Speech Analytics Autonomy’s speaker-independent technology, based on superior speech processing algorithms, enables live or recorded speech to be manipulated, edited, searched, and hyperlinked as freely as text. It also develops a wide range of speech processing technologies,...
Summary: ...Building a GARP®-compliant solution. This process is afected by a variety of factors, such as the speaker’s language, dialect or accent, as well as background noise or interference. Due to the variables in speech and language, legacy approaches like phoneme matching and word-spotting alone are not...
Summary: ...specific segments of the call to ‘mask’ out. A time event is the amount of time that has elapsed or is remaining in the call. A linguistic event is based on words or concepts that are spoken during the call. Activity events are based on desktop activity such as screens and fields used while on the...
Summary: ...computers can send back every instance of a particular word or combination of words. Because these methods do not understand the meaning of the word “DOG,” you get results that contain the word, but you will still have to sift through the results to find what you want. More recent methods of using...
Summary: ...Email Archiving - Analyzing the Return on Investment. • Lost end-user productivity – Statistics indicate that the average end-user spends more than 30 minutes a day managing his / her mailbox.2 Executive Summary 4 White Paper Storage Costs 50–80% Savings Mail Server Costs 20–40% Savings Backup...
Summary: ...caching can be leveraged to improve overall content delivery performance. The CEM data layer can store any format of content, from raw formats such as XML, to fully formed web pages, digital assets, and video files. Access to the data layer can be achieved via RESTful APIs that extract raw data, or with...
Summary: ...of a matter, formerly written on the outside of the expandable file, now becomes metadata, making all information easily searchable according to client name, matter, practice, type of matter, date opened, date closed, size of deal or litigation, matter outcome, jurisdiction, court, industry and other...
Summary: ...languages contain a high degree of redundancy, or nonessential content. For example, a conversation in a noisy room can be understood even when some of the words cannot be heard, and the essence of a news article can be grasped simply by skimming over the text. Information Theory provides a framework...
Summary: ...digital communications systems. Claude Shannon stated that information could be treated as a quantifiable value in communications. Natural languages contain a high degree of redundancy or nonessential content. For example, a conversation in a noisy room can be understood even when some of the words cannot...
Summary: ...Alleged Ponzi Scheme. http://online.wsj.com/article/SB122928886040304911.html?mod=articleoutset-box 13 Ibid at note 11 14 Feeder funds, also known as funds of funds, “feed” the investments to other hedge funds, providing access to closed funds, simplified management, and presumably due diligence to...
Summary: ...boost the service experience without letting costs spiral out of control. One way has been to use speech analytics to leverage the enormous amount of data already collected in the form of call recordings. THE SPEECH ANALYTICS ANSWER The contact center generates, through the course of its normal operations,...
Summary: ...do not assert that any sampling was done of the text searchable ESI files that were determined not to contain privileged information on the basis of the keyword search to see if the search results were reliable.” 11 DESI II Background Paper Feb.
...
This is a small selection of the Autonomy White Papers available, please visit our publications site at http://publications.autonomy.com/ for more information.