Skip to content

VeSNet resources

VeSNet is built based on following LOD resources. Resources differ in type, size and domain. VeSNet is designed as a modular resource. All integrated components — individual thesauri, WordNets, and the mappings between them — can be downloaded and reused separately. This modular structure allows users to:

  • work with a single thesaurus or WordNet without the overhead of the entire network,
  • experiment with selected subsets of mappings (e.g., only exactMatch relations),
  • integrate VeSNet components into their own pipelines and datasets,
  • extend or update the resource incrementally with new thesauri and mappings.

Links to the individual resources and mappings, provided in a unified format, are available below.

AGROVOC

Since the early 1980's, the Food and Agriculture Organization of the United Nations (FAO) has coordinated AGROVOC, a valuable tool for data to be classified homogeneously, facilitating interoperability and reuse. AGROVOC is a multilingual and controlled vocabulary designed to cover concepts and terminology under FAO's areas of interest. It is the largest Linked Open Data set about agriculture available for public use and its highest impact is through facilitating the access and visibility of data across domains and languages. AGROVOC is the largest thesaurus published as linked open data about food and agriculture available for public use.

VeSNet makes use of current core version of Agrovoc for Arabic, Chinese, English, French, Russian and Spanish

VeSNet-AGROVOC

CPA 2.1

The Statistical Classification of Products by Activity, abbreviated as CPA, is the classification of products (goods as well as services) at the level of the European Union (EU). Product classifications are designed to categorise products that have common characteristics. They provide the basis for collecting and calculating statistics on the production, distributive trade, consumption, international trade and transport of such products. CPA is the European Union counterpart of the world standard CPC (Central Product Classification) developed and maintained by the United Nations. The classification has some 5,500 categories.

CPA is a thesaurus published by the Publications Office of the European Union. CPA 2.1 is a part of a legal act and, therefore, freely reusable. The vocabulary is also available in the Official Journal (https://eur-lex.europa.eu/legal-content/EN/TXT/?uri=uriserv:OJ.L_.2014.336.01.0001.01.ENG).

VeSNet-CPA

DBpedia

DBpedia aims at extracting structured content from the information created in Wikipedia and other Wikimedia projects. DBpedia is a crowd-sourced community effort to extract structured content from the information created in various Wikimedia projects. This structured information resembles an open knowledge graph (OKG) which is available for everyone on the Web. A knowledge graph is a special kind of database which stores knowledge in a machine-readable form and provides a means for information to be collected, organised, shared, searched and utilised. Google uses a similar approach to create those knowledge cards during search. We hope that this work will make it easier for the huge amount of information in Wikimedia projects to be used in some new interesting ways.

DDC [new]

DDC or Decimalised Database of Concepts is an open collection of topics inspired by Dewey Decimal Classification, suitable for use in linked data. No guarantees, however, are made about the closeness or the resemblance of the two systems as a whole. SKOS mapping links are provided from this database to the Dewey system, to Library of Congree Classification codes and to DBpedia resources where possible.

VeSNet-DDC

ELSST [new]

The European Language Social Science Thesaurus (ELSST) is a broad-based, multilingual thesaurus for the social sciences. It is owned and published by the Consortium of European Social Science Data Archives (CESSDA) and its national Service Providers. The thesaurus consists of over 3,000 concepts and covers the core social science disciplines: politics, sociology, economics, education, law, crime, demography, health, employment, information and communication technology and, increasingly, environmental science.

VeSNet-ELSST

EUNIS Habitats [new]

EUNIS, that is European Nature Information System, provides access to the publicly available data in the EUNIS database for species, habitat types and protected sites across Europe. It is part of the European Biodiversity data centre (BDC), and is maintained by the European Environment Agency (EEA). The Habitats classification covers all types of habitat types from natural to artificial, from terrestrial to freshwater and marine. EUNIS Habitats was created by Søren Roug.

VeSNet-EUNIS Habitats

EUNIS Species [new]

EUNIS provides access to the publicly available data in the EUNIS database, among others for biological species across Europe. It is part of the European Biodiversity data centre (BDC), and is maintained by the European Environment Agency (EEA). EUNIS Species was created by Søren Roug.

VeSNet-EUNIS Species

EuroVoc

EuroVoc is a multilingual, multidisciplinary thesaurus covering the activities of the EU. It contains terms in 24 EU languages, plus in three languages of countries which are candidate for EU accession (Albanian, Macedonian and Serbian). EuroVoc is managed by the Publications Office of the European Union, which moved forward to ontology-based thesaurus management and semantic web technologies conformant to W3C recommendations as well as latest trends in thesaurus standards. EuroVoc users include the European Union institutions, the Publications Office of the EU, national and regional parliaments in Europe, plus national governments and private users around the world.

VeSNet-EUROVOC

FrameNet [new]

The FrameNet project is building a lexical database of English that is both human- and machine-readable, based on annotating examples of how words are used in actual texts. It is a dictionary of more than 13,000 word senses, most of them with annotated examples that show the meaning and usage. For the researcher in Natural Language Processing, the more than 200,000 manually annotated sentences linked to more than 1,200 semantic frames provide a unique training dataset for semantic role labeling, used in applications such as information extraction, machine translation, event recognition, sentiment analysis, etc. From the point of view of lexicography it serves as a valence dictionary, with uniquely detailed evidence for the combinatorial properties of a core set of the English vocabulary.

  • official site,
  • download questionnaire,
  • licence: CC BY-3.0
  • citation: Fillmore, C. J., & Baker, C. (2010). A frames approach to semantic analysis. In: The Oxford handbook of linguistic analysis. Oxford Handbooks in Linguistic.
VeSNet-FrameNet

GEMET

GEMET (GEneral Multilingual Environmental Thesaurus) is a source of common and relevant terminology used under the ever-growing environmental agenda. GEMET is developed by EEA and Eionet - the institutional environmental network of almost 40 European countries.

VeSNet-GEMET

GeoWordNet

GeoWordNet is a semantic resource built from the full integration of WordNet, GeoNames and the Italian part of MultiWordNet. GeoWordNet Public Dataset contains 3,698,238 entities, 3,698,237 part-of relations between entities, 334 concepts, 182 relations between concepts, 3,698,238 relations between instances and concepts, and 13,562 (English and Italian) alternative entity names.

  • download, v. 2011,
  • download, v. 2011 mirror,
  • licence: CC BY 3.0
  • citation: Giunchiglia, F., Maltese, V., Farazi, F., & Dutta, B. (2010, May). GeoWordNet: a resource for geo-spatial applications. In: Extended Semantic Web Conference (pp. 121-136). Springer, Berlin, Heidelberg.
VeSNet-GeoWordNet

HASSET [new]

The Humanities and Social Science Electronic Thesaurus (HASSET) is the leading British English thesaurus for the social sciences. Developed by the UK Data Service, HASSET is used in-house for indexing and retrieval of UK Data Service data, and by many third party organisations. It consists of over 4,000 concepts and covers the core social science disciplines: politics, sociology, economics, education, law, crime, demography, health, employment, information and communication technology, history and, increasingly, environmental science. It was originally developed from the UNESCO thesaurus, becoming a separate product in 1997.

VeSNet-HASSET

KABA [new]

KABA subject headings language (Catalogs of Automatic Academic Libraries) - an information and search language presented in the form of a model file, used in Polish libraries. KABA is a language that includes a set of subject headings and the rules for their creation, including the construction of categories and subcategories (then the rules of subordination and superiority are defined). This language of library catalogs is consistent with that of the Library of Congress and other leading libraries in the world.

VeSNet-KABA

LCSH [new]

Library of Congress Subject Headings (LCSH) has been actively maintained since 1898 to catalog materials held at the Library of Congress. By virtue of cooperative cataloging other libraries around the United States also use LCSH to provide subject access to their collections. In addition LCSH is used internationally, often in translation.

VeSNet-LCSH

MeSH®

The Medical Subject Headings (MeSH®) thesaurus is a controlled and hierarchically-organized vocabulary produced by the National Library of Medicine. It is used for indexing, cataloging, and searching of biomedical and health-related information. MeSH® includes the subject headings appearing in MEDLINE/PubMed, the NLM Catalog, and other NLM databases. - official browser, - official site, - [download][mesh-download], - licence: MeSH® licence, attribution requirement

VeSNet-MeSH

NAL

National Agricultural Library Thesaurus Concept Space is a state-of-the-art multischeme concept space with added structural features for enhanced scalability and machine readability (with English and Spanish labels). "NALT Core", a trim NALT subscheme with just 13,791 frequently used agricultural concepts including 4,396 agriculturally important organisms (taxa), and structural updates (see below) for a lean and efficient machine readable agricultural knowledge base. "NALT Full" (all NALT concepts) contains NALT Core plus over 48,000 additional agricultural related organisms (taxa) and several thousand less frequently used concepts for a total of 76,933 concepts. NALT Full is a more granular knowledge base.

Polish WordNet

plWordNet is a lexico-semantic database of the Polish language. It includes sets of synonymous lexical units (synsets) followed by short definitions. plWordNet serves as a thesaurus-dictionary where concepts (synsets) and individual word meanings (lexical units) are defined by their location in the network of mutual relations, reflecting the lexico-semantic system of the Polish language. plWordNet is also used as one of the basic resources for the construction of natural language processing tools for Polish.

VeSNet-PEWN

Princeton WordNet®

WordNet® is a lexical database of semantic relations between words and their senses. WordNet® links words and word senses with semantic relations including synonymy, hyponymy, and meronymy. Synonyms are grouped into synsets with short definitions (glosses) and usage examples. WordNet® can thus be seen as a combination and extension of a dictionary and thesaurus. While it is accessible to human users via a web browser, its primary use is in automatic text analysis and artificial intelligence applications. WordNet® was a first such electronic lexical data base, a first wordnet in a long series of off-spring wordnets created for many languages. WordNet® database and software tools have been released under a BSD style license and are freely available for download from the WordNet® website.

Rameau [new]

Rameau is the subject headings thesaurus of the National Library of France. The thesaurus is linked to LCSH and many other thesauri. Published on an open licence with attribution demand (analogical to CC BY).

VeSNet-Rameau

Roget's Thesaurus [new]

Roget's Thesaurus is a widely used English-language thesaurus, created in 1805 by Peter Mark Roget (1779–1869), British physician, natural theologian and lexicographer. The original edition had 15,000 words and each successive edition has been larger, with the most recent edition (the eighth) containing 443,000 words. The book is updated regularly and each edition is heralded as a gauge to contemporary terms; but each edition keeps true to the original classifications established by Roget. We built our version on the 1911 edition.

VeSNet-Roget's

Schema.org [new]

Schema.org is a collaborative, community activity with a mission to create, maintain, and promote schemas for structured data on the Internet, on web pages, in email messages, and beyond. Schema.org vocabulary can be used with many different encodings, including RDFa, Microdata and JSON-LD. These vocabularies cover entities, relationships between entities and actions, and can easily be extended through a well-documented extension model. Over 10 million sites use Schema.org to markup their web pages and email messages. Many applications from Google, Microsoft, Pinterest, Yandex and others already use these vocabularies to power rich, extensible experiences. Founded by Google, Microsoft, Yahoo and Yandex, Schema.org vocabularies are developed by an open community process, using the public-schemaorg@w3.org mailing list and through GitHub.

VeSNet-Schema.org

STW [new]

The STW Thesaurus for Economics is the world's most comprehensive bilingual thesaurus for representing and searching for economics-related content. With its almost 6,000 subject headings in English and German and more than 20,000 synonyms it covers all economics-related subject areas and, on a broader level, the most important related subject fields. The STW is published and continuously further developed by the ZBW according to the latest changes in the economic terminology.

VeSNet-STW

SUMO [new]

The Suggested Upper Merged Ontology (SUMO) and its domain ontologies form a large formal public ontology. They are being used for research and applications in search, linguistics and reasoning. SUMO is the only formal ontology that has been mapped to all of the WordNet lexicon. SUMO is written in the SUO-KIF language. SUMO is free and owned by the IEEE. The ontologies that extend SUMO are available under GNU General Public License. Adam Pease is the Technical Editor of SUMO.

Adimen SUMO [new]

Adimen-SUMO is an off-the-shelf first-order ontology that has been obtained by reengineering out of the 88% of SUMO (Suggested Upper Merged Ontology). Adimen-SUMO can be used appropriately by FO theorem provers (like E-Prover or Vampire) for formal reasoning. The contribution of Adimen-SUMO to the area of ontological formal reasoning is threefold. Firstly, we translated SUMO from its original format into the standard first order language. Secondly, we used first-order theorem provers as inference engines for debugging the ontology. Thus, we detected and repaired several significant problems with the axiomatization of the SUMO ontology. Problems we encountered include incorrectly defined axioms, redundancies, non-desirable properties, and axioms that do not produce expected logical consequences. Thirdly, as a result of the process of adapting the SUMO ontology, we discovered a basic design problem of the ontology which impedes its appropriate use with first order theorem provers. Consequently, we also propose a new transformation to overcome this limitation. As a result of this process, we obtain a validated first-order version of the ontology to be used by first-order theorem provers.

VeSNet-SUMO

UAT [new]

The Unified Astronomy Thesaurus (UAT) is an open, interoperable and community-supported thesaurus which unifies the existing divergent and isolated Astronomy & Astrophysics thesauri into a single high-quality, freely-available open thesaurus formalizing astronomical concepts and their inter-relationships. The UAT builds upon the existing IAU Thesaurus with major contributions from the Astronomy portions of the thesauri developed by the Institute of Physics Publishing and the American Institute of Physics.

VeSNet-UAT

UMBEL [new]

UMBEL (Upper Mapping and Binding Exchange Layer) by Structured Dynamics LLC is a logically organized knowledge graph of 34,000 concepts and entity types that can be used in information science for relating information from disparate sources to one another. It was retired at the end of 2019. UMBEL was first released in July 2008. Its current release is version 1.50. Since UMBEL is an open-source extract of the OpenCyc knowledge base, it can also take advantage of the reasoning capabilities within Cyc. Including OpenCyc, UMBEL has about 65,000 formal mappings to DBpedia, PROTON, GeoNames, and schema.org, and provides linkages to more than 2 million Wikipedia pages (English version). All of its reference concepts and mappings are organized under a hierarchy of 31 different "super types", which are mostly disjoint from one another. Each of these "super types" has its own typology of entity classes to provide flexible tie-ins for external content. 90% of UMBEL is contained in these entity classes.

VeSNet-UMBEL

Wikidata [new]

Wikidata is a free and open knowledge base that can be read and edited by both humans and machines. Wikidata acts as central storage for the structured data of its Wikimedia sister projects including Wikipedia, Wikivoyage, Wiktionary, Wikisource, and others.

Wikipedia

Wikipedia is a multilingual open online encyclopedia written and maintained by a community of volunteers through open collaboration and a wiki-based editing system. It is hosted by the Wikimedia Foundation, an American non-profit organization funded mainly through donations.

VeSNet-Wikipedia

WordNet Domains

WordNet Domains is a lexical resource created in a semi-automatic way by augmenting WordNet with domain labels. WordNet Synsets have been annotated with at least one semantic domain label, selected from a set of about two hundred labels structured according the WordNet Domain Hierarchy.

VeSNet-WordNet Domains

YAGO 3.0 [new]

YAGO is a product of the Max-Planck Institute for Informatics. YAGO 3 combines the information from the Wikipedias in multiple languages with WordNet, GeoNames, and other data sources. YAGO 3 taps into multilingual resources of Wikipedia, getting to know more local entities and facts. This version has been extracted from 10 different Wikipedia versions (English, German, French, Dutch, Italian, Spanish, Polish, Romanian, Persian, and Arabic).

VeSNet-YAGO

YSO [new]

General Finnish Ontology YSO (Yleinen Suomalainen Ontologia) by National Library of Finland is a trilingual ontology consisting mainly of general concepts. YSO has been founded on the basis of concepts in Finnish cultural sphere. Following the international standards for thesauri, the terms for concepts are usually plural nouns. Terms in singular are usually mass nouns or terms referring to actions or abstract concepts.

VeSNet-YSO

Last update: 2025-09-17
Back to top