On top of the PoolParty service-interface a variety of Semantic WebSemantic Web is a group of methods and technologies to allow machines to understand the meaning - or "semantics" - of information on the World Wide Web. The term was coined by World Wide Web Consortium (W3C) director Tim Berners-Lee. According to the original vision, the availability of ... applications can be realized, for example:
- Corporate Thesaurus & Controlled Vocabularies Any form of controlled vocabularyControlled vocabularies provide a way to organize knowledge for subsequent retrieval. They are used in subject indexing schemes, subject headings, thesauri and taxonomies. Controlled vocabulary schemes mandate the use of predefined, authorised terms that have been preselected by the designer of ..., be it a taxonomyTaxonomy is the practice and science of classification. The word finds its roots in the Greek τάξις, taxis (meaning 'order' or 'arrangement') and νόμος, nomos (meaning 'law' or 'science'). Taxonomy uses taxonomic units, known as taxa. In addition, the word is also used as a count noun: ..., thesaurusA thesaurus is a book that lists words grouped together according to similarity of meaning, in contrast to a dictionary, which contains definitions and pronunciations. The largest thesaurus in the world is the Historical Thesaurus of the Oxford English Dictionary, which contains more than ... or ontology, an ontology is a formal representation of knowledge as a set of concepts within a domain, and the relationships between those concepts. It is used to reason about the entities within that domain, and may be used to describe the domain. In theory, an ontology is a "formal, explicit ... greatly helps you in managing, organizing and finding your content. We find that thesauriA thesaurus is a book that lists words grouped together according to similarity of meaning, in contrast to a dictionary, which contains definitions and pronunciations. The largest thesaurus in the world is the Historical Thesaurus of the Oxford English Dictionary, which contains more than ... have the best cost/benefit ratio of all controlled vocabularies, as they are easy to create and to understand, while providing all the semantics that are needed for a wide range of applications.
Terms in a thesaurus are not mere keywords, but controlled concepts that have serveral so called labels attached. The labels are all the words (such as synonyms, abbreviations, misspellings or translations in other languages) that can serve as a name for a concept. E.g. a concept might have the labels United Kingdom, UK, Great Britain, and Großbritannien. All concepts are connected in a hierarchy through statements specifying “broader” and “narrower” relationships, e.g. European Union is a broader concept of United Kingdom which in turn is a broader concept of England. There is also the possibility to connect terms in a non-hierarchic way, by stating two concepts are in some way “related” to each other, e.g. concepts Shakespeare and United Kingdom are related.
Thesauri are of great value when it comes to integrating data within your organization as well as with external sources. All the use cases below can be realized with a thesaurus and the services provided by PoolParty. - Tag Recommender Systems TaggingAn annotation is notes that you make to yourself while you are reading information in a book, document, online record, video, software code or other information, "in the margin", or perhaps just underlined or highlighted passages. Annotated bibliographies, give descriptions about how each source ... can be very helpful in increasing the findability of information, but we find that in convential tagging systems many users rarely apply tags and that the inevitable inconsistencies in tagging behavior considerably hold back its usefulness.
Having a thesaurus backed tagging system allows for consistent use of tags, as tags are no longer mere words, but concepts that include synonyms, abbreviations, misspellings, etc. Your content can be automatically analysed by PoolParty’s Natural Language Processing modules and related concepts can be suggested as tags. The user can choose these controlled tags, or start entering their own tags, which prompt the system to suggest concepts from the thesaurus that match the entered string (see use case AutocompleteAutocomplete is a feature provided by many web browsers, e-mail programs, search engine interfaces, source code editors, database query tools, word processors, and command line interpreters. Autocomplete involves the program predicting a word or phrase that the user wants to type in without the ...). In this way PoolParty lowers the barriers to tagging and improves social tagging, a bookmark is a locally stored Uniform Resource Identifier (URI). All modern web browsers include bookmark features. Bookmarks are called favorites or Internet shortcuts in Internet Explorer, and by virtue of that browser's large market share, these terms have been synonymous with bookmark ... & social bookmarking, a bookmark is a locally stored Uniform Resource Identifier (URI). All modern web browsers include bookmark features. Bookmarks are called favorites or Internet shortcuts in Internet Explorer, and by virtue of that browser's large market share, these terms have been synonymous with bookmark ... in your systems. - Autocomplete
An autocomplete functionality comes in handy for supporting the user in both searching and tagging. Once a user starts entering a keyword the system compares the entered string to the terms from the thesaurus with all their synonyms and suggests possible completions. Those of course are concepts, not keywords: When someone enters e.g. “jav” in a search or tagging widget, the system can suggest “Java (programming language)” and “Java (island)” to the user, thereby providing concept disambiguation and tagging consistency. - Semantic Search Engines Being able to search for concepts instead of keywords not only increases precision and recallPrecision and recall are two widely used statistical classifications. Precision can be seen as a measure of exactness or fidelity, whereas recall is a measure of completeness. In an information retrieval scenario, precision is defined as the number of relevant documents retrieved by a search ... but also allows for new ways of user friendly search mechanisms. No matter whether users searches for Great Britain, UK or Großbritannien, they will get the same results with the documents containing any of these words. The system can offer a moderated searchSemantic search seeks to improve search accuracy by understanding searcher intent and the contextual meaning of terms as they appear in the searchable dataspace, whether on the Web or within a closed system, to generate more relevant results. Author Seth Grimes lists "11 approaches that join ..., where terms related to the search terms are suggested to the user, so she can include or exclude e.g. documents refering to London or Shakespeare. An example application of a semantic searchSemantic search seeks to improve search accuracy by understanding searcher intent and the contextual meaning of terms as they appear in the searchable dataspace, whether on the Web or within a closed system, to generate more relevant results. Author Seth Grimes lists "11 approaches that join ... engine can be seen at reegle.info.
PoolParty is making innovative use of Linked DataLinked Data is a sub-topic of the Semantic Web. The term Linked Data is used to describe a method of exposing, sharing, and connecting data via dereferenceable URIs on the Web.. When creating a thesaurus with PoolParty you have the possiblity to link concepts to their counterparts on external systems. This means a thesaurus manager can retrieve a globally recognized identifier for e.g. the concept London and get additional information like abstracts, category information, related concepts, longitude, latitude and pictures of London. In this way one can enrich thesaurus concepts and consequently the semantic indexThis is referring to Index in the context of Information Technology. For other meanings, see Index (disambiguation). In computer science, an index can be: an integer which identifies an array element a data structure that enables sublinear-time lookup in which document tags and other meta dataMetadata is loosely defined as data about data. Metadata is a concept that applies mainly to electronically archived or presented data and is used to describe the a) definition, b) structure and c) administration of data files with all contents in context to ease the use of the captured and ... is stored, thereby improving search results and content similarity calculations. - Faceted Browsing and Faceted Search
Faceted BrowsingFaceted search, also called faceted navigation or faceted browsing, is a technique for accessing a collection of information represented using a faceted classification, allowing users to explore by filtering available information. A faceted classification system allows the assignment of multiple ... and Faceted SearchFaceted search, also called faceted navigation or faceted browsing, is a technique for accessing a collection of information represented using a faceted classification, allowing users to explore by filtering available information. A faceted classification system allows the assignment of multiple ... are a new approach to help users explore available content in a structured manner. Whereas “advanced modes” of search engines are used only by a few power users, a user interface with faceted search capabilities can stimulate more users to filter and refine search results. It can be used to browse content according to the categories defined in a thesaurus or to further filter search results to include only content related to certain facets. An example application of faceted search can be seen at the PoolParty DemoZone - Similar Documents Recommendations
In the same way as with Semantic Search a concept’s labels and relationships can be used to improve similarity recommendations. When a user looks at a document the system can use its semantic fingerprint derived from the concepts it is about and the information about their related concepts to suggest similar content. From this follows that a document mentioning London will have some similarity to a document mentioning the United Kingom if the relationship between these concepts is modelled in a thesaurus. PoolParty’s use of Linked Data (see Semantic Search) is paramount in improving the semantic fingerprint of documents. - Semantic Wikis and CMS Your wiki can greatly benefit from thesaurus backed autocomplete, tagging and search services provided by PoolParty. Additionally semantic wikis can provide the users with forms for entering data and the fields of the forms can be can be automatically marked up with predefined semantics from a thesaurus or ontology. PoolParty makes use of semantic wikiA semantic wiki is a wiki that has an underlying model of the knowledge described in its pages. Regular, or syntactic, wikis have structured text and untyped hyperlinks. Semantic wikis, on the other hand, provide the ability to capture or identify information about the data within pages, and the ... interfaces to provide an end-user interface to maintain thesauri in collaborative working environments. See an example: Open Data Thesaurus/Wiki .
- Personal Information Systems Using a personal thesaurus greatly enhances the findability of your information. Modelling a thesaurus according to your ideas and needs also helps you to acquire deeper understanding of a knowledge domain you are interested in. PoolParty can be a great tool for e.g. information workers, teachers and students for gaining and organizing knowledge.