IDOL natively ingests XML files and fully supports the searching, processing, and analyzing of semi-structured content. Standard Boolean operators can be used to help establish relevancy, such as WHEN (structural match), WHENn (nested structural match), and vWHEN (structural weighted search), and as in structured data queries, many other search operators are also supported.
IDOL allows organizations to eliminate the inefficiencies of the manual issues associated with creating XML tags by understanding the content and purpose of either the tag itself, related information, or both. Its key benefits include:
Removing the need to manually insert XML tags
Allowing interoperability between applications that use different XML tagging rules
Allowing applications to use idea distancing (vital relationship between seemingly separately tagged subjects) to increase findability of information
Automating processes that were previously performed manually
Natively indexing XML directly into the engine
Accessibility by XQuery as a query language
Obtaining all output from the engine in XML format
Adding Intelligence to XML
The use of XML is already widespread, but its deployment has significant limitations. Not only are tags often chosen manually in a costly and time-consuming process, but XML also has no built-in understanding of concepts that are similar to one another. In XML, for example, the tag <aircraft> and the tag <plane> are wholly unrelated items. Typically, this presents considerable problems because information from different sources that has been structured using different tagging rules cannot be reconciled, even if there are important conceptual similarities. This lack of conceptual understanding is a considerable handicap to the success of XML as the standard provider for information exchange.
IDOL addresses both issues directly. Its conceptual understanding enables it to automatically insert XML tags and links into documents based on the concepts contained in the information. This eliminates all manual cost. Secondly, IDOL enables XML applications to understand conceptual information independent of variations in tagging schemas or the variety of applications in use. This means, for example, that legacy data from disparate sources, tagged using different schemas, can be automatically reconciled and operated upon.
Seamless XML Interoperability
IDOL provides an infrastructure for complete and automatic interoperability between applications using different XML tagging rules. The IDOL infrastructure is based on a conceptual understanding of XML documents, rather than on the tags themselves.
The use and nature of XML varies hugely between implementations, and IDOL natively handles the full range of schemas. For example, many clients use a huge number of different tags within the schema, a situation that often causes issues for XML-handling software. Autonomy's enterprise-scaling means that such data causes no problems, with the servers switching into more appropriate modes of storage without any prompting.
The use of particular tags within a single schema also varies hugely; some contain full text, some contain product codes or other metadata, and some contain internal information. IDOL is able to treat each of these types separately and automatically so that its statistical processing of the information adapts to the exact data provided. In this way, fields are assigned properties that allow them to be interpreted as fields to perform tokenization on, fields to process numerically - whether they contain single or multiple values, fields whose value is to be stored for optimized retrieval or matching, or even fields that are to be hidden or ignored.
Furthermore, the language-independent nature of all of Autonomy's algorithms means that widely differing XML systems can be integrated, regardless of the language, script or encoding used in the data.
Summary: ...the university’s visibility and accessibility to applicants outside the U.S. Despite these gains, faculty and staff in the reviewing units sometimes had to wait eight weeks to receive the paper file containing an application and supporting materials. This delay impacted ASU’s ability to be competitive...
Summary: ...the relevant information, the lack of metadata and incorrectly weighted search parameters meant that searches were often unrepeatable and information was lost again within the system. The South Yorkshire Police needed a solution that could harness the wealth of information contained in HOLMES 2. It would...
Summary: ...value of the performance metric. To maximize database insert speed and compression, all the data items within each sample are numerically encoded (and interpreted within reports by joining with a metadata table containing device and metric descriptions, etc.). The volume of data collected and the throughput...
Summary: ...TeleForm software, was created to make these matches.” “Before any study starts, a ‘protocol’ or formal document defining the experimental plan is completed. At OSUCCC we administer over 200 protocols, each 40 to 50 pages in length.” The Solution Physicians often find the volume of information...
Summary: ...TeleForm software, was created to make these matches.” “Before any study starts, a ‘protocol’ or formal document defining the experimental plan is completed. At OSUCCC we administer over 200 protocols, each 40 to 50 pages in length.” The Solution Physicians often find the volume of information...
Summary: ...Toshiba - Case Study. PROMOTE POWER PROTECT Toshiba Realizes Huge Cost Savings Through Autonomy Toshiba America Business Solutions Inc. (TABS) is an independent division of Toshiba Corporation, the 5th largest electronics/electrical equipment company in the world. For more than twenty years, Toshiba has...
Summary: ...providers who can serve them across markets with tailored products. Second, the Internet has raised the bar on customers’ expectations of the timeliness and accessibility of information. For many years, Aon allowed its acquired companies to maintain separate brands with loose ties to the corporate brand—often...
Summary: ...Cardiff Case Study; American Express. [CDF AMX CS] www.cardiff.com way Linmar and (Cardiff) strove to address issues and improve capabilities.” For Mark King, senior manager, Customer Process Listening, the long-term success of the solution came from the constant commitment to improve the product. “TeleForm®...
Summary: ...oil. Statoil is one of the world’s largest crude oil traders. Hutchinson says: “This means that Statoil can plan ahead and be ready to use new products when they become available and benefit from the adaptable nature of Meridio and Microsoft to ever-changing customer needs. For example, Statoil has...
Summary: ...external). The IT department has quotas on mailbox size (1.4GB). These are relatively large due to the nature of academia and research, as both communities have the requirement to share large files (e.g, academic and research documents). When email is used as the primary means of sharing files, the result...
Summary: ...external). The IT department has quotas on mailbox size (1.4GB). These are relatively large due to the nature of academia and research, as both communities have the requirement to share large files (e.g, academic and research documents). When email is used as the primary means of sharing files, the result...
Summary: ...example, after learning that many of the callers were ill, elderly, or just frustrated by the reordering process, the NCAL Pharmacy Call Center agents often had to take extra steps to provide members with the services they need. The NCAL Pharmacy created courses that addressed specific issues such as:...
This is a small selection of the Autonomy case studies available, please visit our publications site at http://publications.autonomy.com/ for more information.
Summary: ...information, or both. IDOL can automatically insert XML tags and links into documents based on the concepts contained in the information. IDOL’s meaning-based technology also provides an infrastructure for complete and automatic interoperability between applications using different XML tagging rules....
Summary: ...related information, or both. IDOL automatically inserts XML tags and links into documents based on concepts contained in the information, and uses meaning-based technology to provide an infrastructure for complete and automatic interoperability between applications using different XML tagging rules....
Summary: ...tools to define and model complex logic and actions within template tags. Tags include "Condition", "Loop", "Variable", "Expression", "Barcode" and others. This means, for instance, that users can insert 2D barcodes into a template or define and evaluate Microsoft Word formulas simply and efficiently....
Summary: ...with EDL Control Automated clipping and segmentation with AutoClip™ Identification and SmartClips™ Real-time information access using Boolean, natural language and other retrieval methods Fast, scalable and language independent retrieval and data processing with IDOL Server Dashboard for personalized...
Summary: ...manual intervention. Keyword and Boolean Searches: Returns only those documents that contain the terms queried. This method is heavily reliant on user skill and adeptness with Boolean operators. It ignores the context in which the keywords were found. While weighting keywords only mitigates this issue,...
Summary: ...results of the natural language retrieval, users can quickly refine their search to precisely focus on the context they require. • Cross-Language Search Autonomy delivers a language independent software infrastructure that enables content to be conceptually retrieved in any language delivering both...
Summary: ...utmost accuracy. Autonomy’s accuracy is rooted in highly sophisticated pattern-matching process that is based on concepts to categorize documents and automatically insert tag data sets, route content or alert users to highly relevant information pertinent to the user’s profile.
...
Summary: ...concepts, ensuring all documents are classified in context and with utmost accuracy. Autonomy’s accuracy is rooted in highly sophisticated pattern-matching process that is based on concepts to categorize documents and automatically insert tag data sets, route content or alert users to highly relevant...
Summary: ...source or date, how often the connector downloads information from the moreover site, how much information it downloads, which words the information must contain or may not contain etc. Please note, the Moreover Fetch can only operate correctly if there is an agreement present with moreover.com to access...
Summary: ...and automate the entire workflow process, for complete lifecycle management of trade documentation. The workflow engine is designed around the Workflow Management Consortium (WfMC) specifcations. It is an XPDL- (WfMC XML Process Defnition Language)compliant engine, designed using EJB 2.0 and JMS. The...
Summary: ...email and attachment viewing • TIFF on-demand • Conceptual, phoneme, keyword and Boolean search • Sophisticated duplicate and near-dupe filtering options • Full support for EDRM XML load file and all legacy load file formats • Full forensic metadata extraction such as: unhiding columns, extracting...
This is a small selection of the Autonomy Product Briefs available, please visit our publications site at http://publications.autonomy.com/ for more information.
Summary: ...itself, related information, or both. Its key benefits include: • Removing the need to manually insert XML tags • Allowing interoperability between applications that use different XML tagging rules • Allowing applications to use idea distancing (vital relationship between seemingly separately tagged...
Summary: ...insert XML tags and links into documents, based on the concepts contained in the information. This eliminates all manual cost. Secondly, IDOL server enables XML applications to understand conceptual information, independent of variations in tagging schemas or the variety of applications in use. This means,...
Summary: ...of rich media, widespread adoption of VOIP, growing use of IPTV and increased scrutiny of white collar crimes. This widespread adoption of rich media has necessitated the “findability” of such content, especially as it has seen increasing importance in eDiscovery cases. “Search of video files is...
Summary: ...does not fit neatly into a structured database. It includes text in the form of emails, documents, IMs, social media, SMS messages, audio in the form of speech and sounds, video, XML, and images. • Ideas do not match, they have a distance.
...
Summary: ...powerful retrieval features, including natural language, conceptual search, refine by example, crosslanguage search and query by example. Autonomy also supports legacy retrieval mechanisms, such as keyword, Boolean, Proximity, Exact Phrase, Soundex and many others etc. ? Active matching Proactively link...
Summary: ...standard called MARC (Machine Readable Catalog). Because of the multiple sources the data was derived from and the diverse nature of the data that is cataloged, the structure of the data is quite irregular. We converted the Barton data from RDF/XML syntax to triples using the Redland parser [3] and then...
Summary: ...all relevant data. Queries can be constructed using and extensive feature list of operators including parametric, Boolean, fielded Boolean, free form queries, geo-term, and conceptual search. Archiving Autonomy Scrittura now encompasses the ability to natively archive to compliant storage all aspects...
Summary: ...of the conceptual matching is done at index time, as opposed to query time; the documents are analyzed while the data is being processed to form a statistical “pool” from which queries can draw key conceptual information, as well as an overlying Bayesian network in which apparently unrelated pieces...
Summary: ...formatted in XML with all data encoded in the variable-byte industry standard, UTF-8. Use of UTF-8 enables Autonomy to encode any human language internally, but conversion is often needed between legacy encoding schemes such as the ASCII and UCS2 data found in existing enterprise repositories. Autonomy...
Summary: ...Autonomy iManage Whitepaper - Managing Electronic Documents: Drafting Guidelines that Protect Both Law Firms and their Clients. Additionally, lawyers have their own separate and independent duties to preserve and, in appropriate circumstances, produce documents from their own files. A lawyer has an independent...
Summary: ...technology, video and audio files are still notoriously large in size, often exceeding the two gigabyte restriction. Moreover, as is often the case with SharePoint deployments, users will often upload variations of the same file, needlessly multiplying storage needs. Therefore it is essential to integrate...
Summary: ...strong in rich media, from its eTalk and Virage applications, and in search, pattern matching, workflow (Cardiff) compliance, and email archiving. At this point, it is clear that Autonomy should no longer be considered purely a search vendor. It builds search-based applications to answer market demands...
This is a small selection of the Autonomy White Papers available, please visit our publications site at http://publications.autonomy.com/ for more information.
There do not seem to be any press releases related to this page in 2012 at the moment, please visit the news section on www.autonomy.com for the latest news.