Correspondence

The key understanding of a sense is as a correspondence betwen an ontology entity and a lexical entry and as such the set of senses in the lexicon constitute a many-to-many mapping between lexical entries and ontology entities. This means that each sense must apply to exactly one lexical entry and one ontology element. Thus, for any ontology-based disambiguation task the challenge can be simply specified as selecting the correct sense for the task. For this reason lemon provides a number of features that should prove useful for this task, which are detailed in section [*].

There are several reasons for defining the correspondence as an element of the lexicon. Firstly, to allow further description of pragmatic usage between the two terms: for example, “cats and dogs” would be considered a natural usage whereas “Felis Catus and dogs” would be considered odd as the pragmatic register of the two terms is different. Although lemon also includes features to assign words to a pragmatic context, it does require the definition of a pragmatic taxonomy, which must be done for any particular application of the model.

The second case is where some terms are closely lexically bound, i.e., they generally occur together. An example from French is “rivière et fleuve”, which are often used together to express the same concept in English as “river”. It is natural that we should wish to connect these two terms by indicating they have disjoint senses that together constitute the idea of “river”. As the meaning of “rivière” in French is defined by Larousse as

Cours d'eau de faible ou moyenne importance qui se jette dans un autre cours d'eau (Watercourse of small size or little importance, which flows into another watercourse)

This can be considered in a semantic sense to be nearly equivalent to the French term “affluent”, defined as

Un cours d'eau qui se jette dans un autre. (A watercourse which flows into another)

However, a system generating “fleuve et affluent” as the equivalent concept to that of “river” would generally not be considered to be correct, due to lexical collocations. In fact for reference resolution it may be better to assert “affluent” as a broader term for “rivière”, as every usage of “rivière” is necessarily an “affluent”.

The final case is where terms are used in a metaphoric or metonymic manner. For example,

The White House said today, ...

Here “the White House” does not refer to the literal building but the spokesperson and other members of the organization in that building. As such the correspondence would give an ontology entity with the properties of being a human or organization; although reasoning using this information, may generate odd statements such as

*The White House is a member of the species Homo Sapiens

Thus we need to define the lexical semantics as clearly separate from those of the ontological element. This could be done for example by asserting something as a metonymic raising of another sense e.g.,

:white_house
  lemon:canonicalForm [ lemon:writtenRep "The White House"@en ] ;
  lemon:sense :white_house_sense .
:white_house_spokesperson
  lemon:canonicalForm [ lemon:writtenRep
     "White House Spokesperson"@ en ] ;
  lemon:sense :white_house_spokesperson_sense .

:white_house_sense :metonymicRaisingOf :white_house_spokesperson_sense.

:metonymicRaisingOf rdfs:subPropertyOf lemon:senseRelation.

John McCrae 2012-07-31