Skip to content

Tag: WordNet

‘Fire’: Some observations

At the end of class on Tuesday, November 21, we were asked to further investigate the word ‘fire’ as a noun and a verb. I made the following observations during my short investigation:

The fact that ‘fire’ is a noun and a verb can be confirmed by looking at ‘fire’ in the Oxford English Dictionary. Although polysemy is often studied by using Princeton’s WordNet or the various wordnets in other languages (please see my post WordNet and wordnets for more information), it is clear that the dictionary entry in this case gives plenty of detail about the polysemy of the word, as both the entry for ‘fire’ as a verb and the entry for ‘fire’ as a noun contain dozens of different meanings.

I wanted to compare the usage of ‘fire’ as noun and a verb to see which one is used more frequently. I decided to refer to the British National Corpus on Sketch Engine. By looking at the Wordlist feature, I discovered that ‘fire’ appears 17,348 times as a lemma. It appears 14,172 times as a noun and 3,176 times as a verb, showing that it is used far more frequently as a noun.

Leave a Comment

WordNet and wordnets

Princeton’s WordNet is a lexical database showing semantic relationships between words in the English language. It focuses on nouns, verbs, adjectives and adverbs, as words within these word classes are all content words, meaning that they have meaning by themselves (as opposed to function words). Princeton’s WordNet takes these content words and groups them into ‘synsets’, which are groups of cognitive synonyms, or words with the same meaning or sense (Sources: PARTS OF SPEECH, WordNet | A Lexical Database for English).

Wordnets have emerged in other languages based on this concept, including in my languages of study – Irish, Spanish and German. Wordnets for each of these languages can be found by following these links:

  • EuroWordNet database: a multilingual database providing wordnets for several European languages, including Spanish and German. Free samples from each language can be downloaded here.
  • Líona Séimeantach na Gaeilge (LSG), or the Language Semantic Network: an Irish-language wordnet, providing a comprehensive database of Irish words and the semantic links between them.  The PDF version can be downloaded here.

The PDF version of the LSG displays the wordnet in alphabetical order. As in the Princeton WordNet, content words are presented in synsets, showing relationships between words. The word ‘comhchiall’ denotes synonymous words, ‘aicmí’ denotes the class to which the word belongs and ‘fo-aicmí’ the subclasses stemming from the word. ‘Gaolta’ shows a related word that is not synonymous. In this screenshot below from the PDF, for example, one can see that the word ‘teangeolaíocht’ (linguistics) is shown to be in the class of ‘eolaíocht’ (science) with one subset being ‘pragmataic’ (pragmatics). It is shown to be related to, but not synonymous with, ‘gramadach’ (grammar).

This shows that synsets on the LSG, like those in Princeton’s WordNet, have a hierarchical element. Using the relations expressed through hypernyms and hyponyms, the LSG shows where each word lies within the hierarchy of similar words in the synset. In the example above, ‘eolaíocht’ is a hypernym for ‘teangeolaíocht’, and ‘pragmataic’ a hyponym for ‘teangeolaíocht’.  Antonyms are not shown within the LSG PDF file, unlike in Princeton’s WordNet. 

The entries are linked to the synsets available in the Princeton WordNet, which its creator, Kevin Scannell, states is helpful for his work on English-Irish machine translation. The entries are not mapped directly, however, partly due to the distinctions within Irish that do not exist within English (such as the difference between ‘rua’ and ‘dearg’) (Sources: LSG: Home, LSG: Details).

Leave a Comment