IDOL natively ingests XML files and fully supports searching, processing, and analyzing semi-structured content. Standard Boolean operators can be used to help establish relevancy, along with structural operators such as WHEN (structural match), WHENn (nested structural match), and vWHEN (structural weighted search). As in structured data queries, many other search operators are also supported.
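As a rough illustration of what a structural match adds over a flat Boolean AND, the sketch below uses Python's standard xml.etree.ElementTree module rather than IDOL itself. The element names (catalog, product, name, colour) and both helper functions are hypothetical, and the code reflects only one plausible reading of "structural match": both conditions must hold inside the same repeated element, not merely somewhere in the document.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample document; the schema is invented for illustration.
DOC = """
<catalog>
  <product><name>kettle</name><colour>red</colour></product>
  <product><name>toaster</name><colour>blue</colour></product>
</catalog>
"""


def flat_and(root, name, colour):
    """Boolean AND over the whole document: each value may occur anywhere."""
    names = {e.text for e in root.iter("name")}
    colours = {e.text for e in root.iter("colour")}
    return name in names and colour in colours


def structural_and(root, name, colour):
    """Both values must occur inside the same <product> element."""
    return any(
        p.findtext("name") == name and p.findtext("colour") == colour
        for p in root.iter("product")
    )


root = ET.fromstring(DOC)
print(flat_and(root, "kettle", "blue"))        # True
print(structural_and(root, "kettle", "blue"))  # False
print(structural_and(root, "kettle", "red"))   # True
```

The flat test accepts "kettle" and "blue" because each value appears somewhere in the document, whereas the structural test rejects the pair because no single product carries both.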
IDOL allows organizations to eliminate the inefficiencies of creating XML tags manually by understanding the content and purpose of the tag itself, the related information, or both. Its key benefits include:
Removing the need to manually insert XML tags
Allowing interoperability between applications that use different XML tagging rules
Allowing applications to use idea distancing (the vital relationships between seemingly separate tagged subjects) to increase the findability of information
Automating processes that were previously performed manually
Natively indexing XML directly into the engine
Providing access through XQuery as a query language
Obtaining all output from the engine in XML format
Adding Intelligence to XML
The use of XML is already widespread, but its deployment has significant limitations. Not only are tags often chosen manually in a costly and time-consuming process, but XML also has no built-in understanding of concepts that are similar to one another. In XML, for example, the tag <aircraft> and the tag <plane> are wholly unrelated items. Typically, this presents considerable problems because information from different sources that has been structured using different tagging rules cannot be reconciled, even if there are important conceptual similarities. This lack of conceptual understanding is a considerable handicap to the success of XML as the standard provider for information exchange.
IDOL addresses both issues directly. First, its conceptual understanding enables it to automatically insert XML tags and links into documents based on the concepts contained in the information, which eliminates the cost of manual tagging. Second, IDOL enables XML applications to understand conceptual information independent of variations in tagging schemas or the variety of applications in use. This means, for example, that legacy data from disparate sources, tagged using different schemas, can be automatically reconciled and operated upon.
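The reconciliation problem can be sketched in a few lines of Python using the standard xml.etree.ElementTree module. This is not IDOL's method: the TAG_TO_CONCEPT table is a hypothetical, hand-written stand-in for the conceptual understanding described above, and the sample documents and function names are invented for illustration.

```python
import xml.etree.ElementTree as ET

# Hypothetical hand-written mapping from source-specific tags to a shared
# concept. The text describes such relationships being derived from the
# content itself; this fixed table is only a stand-in for that step.
TAG_TO_CONCEPT = {"aircraft": "aircraft", "plane": "aircraft"}

# Two invented sources that tag the same kind of item differently.
SOURCE_A = "<fleet><aircraft>item from source A</aircraft></fleet>"
SOURCE_B = "<hangar><plane>item from source B</plane></hangar>"


def find_by_tag(xml_text, tag):
    """Literal XML lookup: only elements with exactly this tag name match."""
    return [e.text for e in ET.fromstring(xml_text).iter(tag)]


def find_by_concept(xml_text, concept):
    """Match any element whose tag maps to the requested concept."""
    return [
        e.text
        for e in ET.fromstring(xml_text).iter()
        if TAG_TO_CONCEPT.get(e.tag) == concept
    ]


# Searching source B for <aircraft> finds nothing, because the tag is <plane>.
literal_hits = find_by_tag(SOURCE_B, "aircraft")

# Searching both sources by concept reconciles the two tagging schemes.
concept_hits = [
    text
    for source in (SOURCE_A, SOURCE_B)
    for text in find_by_concept(source, "aircraft")
]

print(literal_hits)
print(concept_hits)
```

A fixed table like this has to be maintained by hand for every pair of schemas, which is exactly the manual cost the text says a conceptual approach avoids.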
Seamless XML Interoperability
IDOL provides an infrastructure for complete and automatic interoperability between applications using different XML tagging rules. The IDOL infrastructure is based on a conceptual understanding of XML documents, rather than on the tags themselves.
The use and nature of XML vary widely between implementations, and IDOL natively handles the full range of schemas. For example, many clients use a very large number of different tags within a single schema, a situation that often causes issues for XML-handling software. Autonomy's enterprise scalability means that such data causes no problems: the servers switch to more appropriate modes of storage without any prompting.
The use of particular tags within a single schema also varies widely; some contain full text, some contain product codes or other metadata, and some contain internal information. IDOL is able to treat each of these types separately and automatically, so that its statistical processing of the information adapts to the exact data provided. In this way, fields are assigned properties that determine how they are interpreted: as fields to tokenize, fields to process numerically (whether they contain single or multiple values), fields whose value is to be stored for optimized retrieval or matching, or even fields that are to be hidden or ignored.
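The idea of per-field properties can be sketched as follows. The property names (tokenize, numeric, stored, hidden), the field names, and the sample record are all hypothetical and are not IDOL configuration settings; the sketch also assigns properties by hand, whereas the text describes this happening automatically.

```python
import xml.etree.ElementTree as ET

# Hypothetical property names and field assignments, for illustration only.
FIELD_PROPERTIES = {
    "description": "tokenize",
    "price": "numeric",
    "product_code": "stored",
    "internal_note": "hidden",
}

# Invented sample record with a repeated numeric field.
RECORD = """
<item>
  <description>Stainless steel kettle</description>
  <price>19.99</price>
  <price>24.50</price>
  <product_code>KT-100</product_code>
  <internal_note>supplier review pending</internal_note>
</item>
"""


def process(record_xml):
    """Route each field to the handling that its property calls for."""
    out = {"tokens": [], "numbers": {}, "stored": {}}
    for elem in ET.fromstring(record_xml):
        # Fields without an assigned property are treated as hidden.
        prop = FIELD_PROPERTIES.get(elem.tag, "hidden")
        text = (elem.text or "").strip()
        if prop == "tokenize":
            out["tokens"].extend(text.lower().split())
        elif prop == "numeric":
            # A field may repeat, so numeric values are collected in a list.
            out["numbers"].setdefault(elem.tag, []).append(float(text))
        elif prop == "stored":
            out["stored"][elem.tag] = text
        # Hidden fields are skipped entirely.
    return out


result = process(RECORD)
print(result)
```

Keeping the routing in a single table means that changing how a field is treated is a one-line edit rather than a change to the processing code.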
Furthermore, the language-independent nature of all of Autonomy's algorithms means that widely differing XML systems can be integrated, regardless of the language, script or encoding used in the data.