IDOL natively ingests XML files and fully supports the searching, processing, and analyzing of semi-structured content. Standard Boolean operators can be used to help establish relevancy, such as WHEN (structural match), WHENn (nested structural match), and vWHEN (structural weighted search), and as in structured data queries, many other search operators are also supported.
IDOL allows organizations to eliminate the inefficiencies of the manual issues associated with creating XML tags by understanding the content and purpose of either the tag itself, related information, or both. Its key benefits include:
Removing the need to manually insert XML tags
Allowing interoperability between applications that use different XML tagging rules
Allowing applications to use idea distancing (vital relationship between seemingly separately tagged subjects) to increase findability of information
Automating processes that were previously performed manually
Natively indexing XML directly into the engine
Accessibility by XQuery as a query language
Obtaining all output from the engine in XML format
Adding Intelligence to XML
The use of XML is already widespread, but its deployment has significant limitations. Not only are tags often chosen manually in a costly and time-consuming process, but XML also has no built-in understanding of concepts that are similar to one another. In XML, for example, the tag <aircraft> and the tag <plane> are wholly unrelated items. Typically, this presents considerable problems because information from different sources that has been structured using different tagging rules cannot be reconciled, even if there are important conceptual similarities. This lack of conceptual understanding is a considerable handicap to the success of XML as the standard provider for information exchange.
IDOL addresses both issues directly. Its conceptual understanding enables it to automatically insert XML tags and links into documents based on the concepts contained in the information. This eliminates all manual cost. Secondly, IDOL enables XML applications to understand conceptual information independent of variations in tagging schemas or the variety of applications in use. This means, for example, that legacy data from disparate sources, tagged using different schemas, can be automatically reconciled and operated upon.
Seamless XML Interoperability
IDOL provides an infrastructure for complete and automatic interoperability between applications using different XML tagging rules. The IDOL infrastructure is based on a conceptual understanding of XML documents, rather than on the tags themselves.
The use and nature of XML varies hugely between implementations, and IDOL natively handles the full range of schemas. For example, many clients use a huge number of different tags within the schema, a situation that often causes issues for XML-handling software. Autonomy's enterprise-scaling means that such data causes no problems, with the servers switching into more appropriate modes of storage without any prompting.
The use of particular tags within a single schema also varies hugely; some contain full text, some contain product codes or other metadata, and some contain internal information. IDOL is able to treat each of these types separately and automatically so that its statistical processing of the information adapts to the exact data provided. In this way, fields are assigned properties that allow them to be interpreted as fields to perform tokenization on, fields to process numerically - whether they contain single or multiple values, fields whose value is to be stored for optimized retrieval or matching, or even fields that are to be hidden or ignored.
Furthermore, the language-independent nature of all of Autonomy's algorithms means that widely differing XML systems can be integrated, regardless of the language, script or encoding used in the data.
Summary: ...certain types of cancer do not respond to conventional therapies. In these instances doctors often recommend that patients participate in a clinical trial to evaluate promising new treatments. About 200 OSUCCC patients take part in these trials each year. “The problem is determining which study is appropriate...
Summary: ...the university’s visibility and accessibility to applicants outside the U.S. Despite these gains, faculty and staff in the reviewing units sometimes had to wait eight weeks to receive the paper file containing an application and supporting materials. This delay impacted ASU’s ability to be competitive...
Summary: ...external). The IT department has quotas on mailbox size (1.4GB). These are relatively large due to the nature of academia and research, as both communities have the requirement to share large files (e.g, academic and research documents). When email is used as the primary means of sharing files, the result...
Summary: ...BAE Systems Customer Case Study. And it automatically alerts BAE SYSTEMS employees to documents in the system that relate to what they're doing, or to other employees in the company whose interests and expertise match their own. BAE SYSTEM’S CEN Clustering. This intuitive java based user interface allows...
Summary: ...and accessibility of its information resources, the Department can now focus on the quality of its content to make sure that the full potential of its website is realized. Objective Manage a vast amount of information and ensure its fast and accurate accessibility to users throughout Queensland. Solution...
Summary: ...fi nancial, insurance, new technology and real estate sectors of Canada, the U.S. and beyond. Like many law fi rms, BD&P use email communications for effective and effi cient means of business correspondences. Email had become so widely used within the organization that it became a part of the practice....
Summary: ...oil. Statoil is one of the world’s largest crude oil traders. Hutchinson says: “This means that Statoil can plan ahead and be ready to use new products when they become available and benefit from the adaptable nature of Meridio and Microsoft to ever-changing customer needs. For example, Statoil has...
Summary: ...the latest regulations and best practice advice but will also be able to cross-reference this with the internal policies and uncover, often hidden, stores of unstructured information contained in documents around the company network. By using the inherent intelligence of Aungate (powered by the Autonomy...
Summary: ...regulations and best practice advice but will also be able to cross-reference this with the internal policies and uncover, often hidden, stores of unstructured information contained in documents around the company network. By using the inherent intelligence of Aungate (powered by the Autonomy IDOL software)...
Summary: ...Croatian Justice System Case Study. The Customer The government in the Republic of Croatia is organized on the principle of separation of powers into legislative, executive and judicial branches. Judicial power is exercised by the courts. The judiciary is autonomous and independent. The courts administer...
Summary: ...pieces of information contained within HOLMES 2. During major incidents, such as unsolved murders, IDOL is used to automatically compare all data to identify hidden connections that otherwise may have gone unnoticed , enabling new lines of enquiry to be opened. The technology complements officers’ existing...
Summary: ...The Need Developing the medicines of tomorrow is time-consuming, complex and costly. Pharmaceutical professionals spend years tearing apart molecules, running experiments and refining and building on their discoveries in a bid to find new and better ways to treat disease. The process is made even more...
This is a small selection of the Autonomy case studies available, please visit our publications site at http://publications.autonomy.com/ for more information.
Summary: ...information, or both. IDOL can automatically insert XML tags and links into documents based on the concepts contained in the information. IDOL’s meaning-based technology also provides an infrastructure for complete and automatic interoperability between applications using different XML tagging rules....
Summary: ...authentication • web services access and SOAP RPC/XML messaging • access to JDBC databases • HTTP requests • tags for the insertion of targeted content into existing websites High Performance Lightweight and robust, LiveSite has a negligible impact on response times while scaling with the application...
Summary: ...Boolean, natural language and other retrieval methods Dashboard for personalized views Review, assemble and edit content Playlists for ordering and sequencing Create, save and reuse personalized projects for easy organization Collaborate by sharing or e-mailing content Data export options for XML, ALE,...
Summary: ...with EDL Control Automated clipping and segmentation with AutoClip™ Identification and SmartClips™ Real-time information access using Boolean, natural language and other retrieval methods Fast, scalable and language independent retrieval and data processing with IDOL Server Dashboard for personalized...
Summary: ...results. • Related Concept Generation and Idea Distancin– g automatically categorizes concepts in relationship to one another by identifying vital relationships between seemingly separate subjects. • Sentiment and Vibe Analysis– determines the degree to which a sentiment is positive, negative...
Summary: ...found. Additional complications arise when subjects incorporate multiple themes. Interoperability of Tagging If two organizations are going to interoperate and apply the same meaning to the same tags, they have to explicitly agree upon their classification schemes in advance. Scale As the number of tags...
Summary: ...utmost accuracy. Autonomy’s accuracy is rooted in highly sophisticated pattern-matching process that is based on concepts to categorize documents and automatically insert tag data sets, route content or alert users to highly relevant information pertinent to the user’s profile.
...
Summary: ...builds a time synchronized index providing immediate, specific retrieval of content Media Analysis Plug-Ins – Allow content owners to enhance indexing capabilities Database Plug-Ins – Enable communication between VideoLogger and any digital asset management systems based on XML or SQL standards ControlCenter-...
Summary: ...through advanced natural language processing techniques, treating words as abstract symbols of meaning and deriving its understanding through the context of their occurrence rather than a rigid definition of the language and grammar. This means that IDOL has no problem understanding slang, industry specific...
Summary: ...of topics with visual navigation and cluster drill-down - Early understanding of hidden and language data • Native support for over 100 Languages & 1000 file types processed • Direct Discovery and Manage In-Place process • Full support for EDRM XML load file and all legacy load file formats • Petabyte...
Summary: ...analytics that is.Powered by the Mighty IDOL.” —Barb Mosher, CMS Wire Discover - Analyze - Act color, and the areas that contain heightened emotion are automatically marked during conversation. The combination of speaker separation, cross-talk identification, and emotion detection allows organizations...
Summary: ...document widely accessible and usable by delivering Web-ready HTML and valid XML to end-users and applications. Convert Multiple Documents Simultaneously KeyView IDOL Export can be configured to convert files to XML and HTML in the same process as the calling application (in-process) or as a separate...
This is a small selection of the Autonomy Product Briefs available, please visit our publications site at http://publications.autonomy.com/ for more information.
Summary: ...of working with native XML automatically. IDOL allows organizations to eliminate the inefficiencies introduced by many of the manual issues associated with creating XML tags by understanding the content and purpose of either the tag itself, related information, or both. IDOL provides the critical layer...
Summary: ...source or date, how often the connector downloads information from the moreover site, how much information it downloads, which words the information must contain or may not contain etc. Please note, the Moreover Fetch can only operate correctly if there is an agreement present with moreover.com to access...
Summary: ...results of the natural language retrieval, users can quickly refine their search to precisely focus on the context they require. • Cross-Language Search Autonomy delivers a language independent software infrastructure that enables content to be conceptually retrieved in any language delivering both...
Summary: ...most important concepts within the text, and automates the processing of this content regardless of its format, location, language or the application with which it has been created. Using Autonomy connectors, Autonomy’s award-winning Intelligent Data Operating Layer (IDOL) integrates unstructured, semi-structured...
Summary: ...efficiencies never experienced before. Autonomy is capable of aggregating any form of structured, semi-structured and unstructured data. This "data agnostic" capability is facilitated through a variety of Autonomy connectors for a considerable number of proprietary data repositories and file formats....
Summary: ...ImportSlave, OmniSlave, BinSlave & PDFSlave • Combine data from any number of tables into a single document • Support for multiple jobs performing different actions • Schedule jobs independently of each other • Extract data as any text based format including HTML & XML • Extract binary document...
Summary: ...Expertise The Expertise Locator Portlet allows users to find people who have been dealing with a specific subject by entering a brief natural language description of the subject. It returns all agents and profiles that match this description together with the names of the users who own the agents or profiles....
Summary: ...can be used interchangeably which means that it does not matter which encoding a language is given in. This makes it, for example, possible to query in one recognized encoding for a language and receive results that are in other encodings. Transliteration schemes Transliteration is the ability to represent...
Summary: ...experienced before. Autonomy is capable of aggregating any form of structured, semi-structured and unstructured data. This "data agnostic" capability is facilitated through a variety of Autonomy connectors for a considerable number of proprietary data repositories and file formats. Autonomy supports many...
Summary: ...is capable of aggregating any form of structured, semi-structured and unstructured data. This "data agnostic" capability is facilitated through a variety of Autonomy connectors for a considerable number of proprietary data repositories and file formats. Autonomy supports many other document management...
Summary: ...efficiencies never experienced before. Autonomy is capable of aggregating any form of structured, semi-structured and unstructured data. This "data agnostic" capability is facilitated through a variety of Autonomy Connectors (also referred to as Fetches) for a considerable number of proprietary data repositories...
This is a small selection of the Autonomy Technical Briefs available, please visit our publications site at http://publications.autonomy.com/ for more information.
Summary: ...itself, related information, or both. Its key benefits include: • Removing the need to manually insert XML tags • Allowing interoperability between applications that use different XML tagging rules • Allowing applications to use idea distancing (vital relationship between seemingly separately tagged...
Summary: ...insert XML tags and links into documents, based on the concepts contained in the information. This eliminates all manual cost. Secondly, IDOL server enables XML applications to understand conceptual information, independent of variations in tagging schemas or the variety of applications in use. This means,...
Summary: ...insert XML tags Allowing interoperability between applications that use different XML tagging schemes Indexing native XML directly into the engine Obtaining all output from the engine in XML format • • • • 9 By making full use of XML, Autonomy is able to support a massive range of delivery methods...
Summary: ...on a language-independent, patternmatching model that uses predictable statistical word patterns and probabilistic modelling to understand content. This means that Autonomy can process information in any language, making it ideal for international, globally dispersed organizations. Furthermore, Autonomy...
Summary: ...network in which apparently unrelated pieces of information are automatically linked via dynamic probabilities. The second reason is that the documentmatching algorithm itself within IDOL uses widespread “short-circuiting” and iterative calculation to ensure that it only performs exactly as much calculation...
Summary: ...network in which apparently unrelated pieces of information are automatically linked via dynamic probabilities. The second reason is that the documentmatching algorithm itself within IDOL uses widespread “short-circuiting” and iterative calculation to ensure that it only performs exactly as much calculation...
Summary: ...formatted in XML with all data encoded in the variable-byte industry standard, UTF-8. Use of UTF-8 enables Autonomy to encode any human language internally, but conversion is often needed between legacy encoding schemes such as the ASCII and UCS2 data found in existing enterprise repositories. Autonomy...
Summary: ...Strategies for Simplifying .NET Application Deployment. ■ XML schema definition (XSD) files define and validate XML content and the structure of XML data. If an application needs to access the schema, the XSD files must be deployed. For example, if an application accesses a Web service that returns...
Summary: ...powerful retrieval features, including natural language, conceptual search, refine by example, crosslanguage search and query by example. Autonomy also supports legacy retrieval mechanisms, such as keyword, Boolean, Proximity, Exact Phrase, Soundex and many others etc. ? Active matching Proactively link...
Summary: ...in contrast, effectively inverts the problem by using pattern matching to compare incoming content directly with agents. This approach delivers optimal alerting performance and is inherently scalable. 11 Platform Agnostic The ability to roll out modules on any desired platform means Autonomy customers...
Summary: ...a natural modeling of processes without resor-t ing to creation of separate workflows for each sub-workflow and linking them artifcially. Figure 1 shows how the tasks in an order processing workflow can be hierarchically decomposed. The arrows depict parent-child containment relationships; predecessor...
Summary: ...for many generations, is riddled with obvious drawbacks, especially when team members are geographically dispersed. Maintaining a separate set of paper files in each firm location requires extensive duplication of effort and introduces the possibility of version control problems and other errors. With...
This is a small selection of the Autonomy White Papers available, please visit our publications site at http://publications.autonomy.com/ for more information.