WORDNET BASED CATEGORIZATION DICTIONARY

 

DICTIONARY INFORMATION

Description: This categorization dictionary is derived from the WordNet® database to provides basic categorizations of noun, verbs, adjectives and adverbs currently found in the WordNet 2.0 database into 44 syntactic category and logical groupings. Four versions of this categorization dictionary are currently available:

 

Words only

  • The full version offers categorization of 109231 words into 44 WordNet lexical categories. More than 43,000 of those entries are categorized in more than one category.
  • The limited version consists of 65425 unambiguous words categorized into those same categories. Unambiguous words are defined as words that are categorized in only one Wordnet lexical category.

 

© All rights reserved Provalis Research 2024

Words & Phrases

  • The full version offers categorization of 174268 words and phrases into 44 WordNet lexical categories. More than 47,000 of those entries are categorized in more than one category.
  • The limited version consists of 126869 unambiguous words and phrases categorized into those same categories. Unambiguous words and phrases are defined as entries that are categorized in only one Wordnet lexical category.

Requirement: WordStat 4.0 (with English lemmatization option enabled) or WordStat 3.1 if the text corpus to analyze has already been lemmatized.

 

Note: WordNet® is an online lexical reference system developed by the Cognitive Science Laboratory at Princeton University under the direction of Professor George A. Miller (Principal Investigator). WordNet 1.7.1 Copyright © 2001 by Princeton University. All rights reserved.

 

DOWNLOAD CATEGORIZATION DICTIONARY

Click here to download the four dictionaries (about 2Mb).