Project 1 — Word Sense Disambiguation

Within ULM-1, Rubén Izquierdo & Marten Postma focus on Word Sense Disambiguation (WSD).

Word Sense Disambiguation is defined as the task of deciding on the meaning of a word by computers. Humans are very good at this task. So good, in fact, that they do not even notice how complicated the task is. For example, the sentence “After going for a run, the man took a shower.” is perfectly understandable and would not be considered difficult by most. However, for a computer, this sentence alone has more than 1.3 million meaning combinations.

Ambiguity demoFor the English sentence “After going for a run, the man took a shower”, the number of possible meaning combinations are shown. The number evokes how complex it would be for a machine to understand natural language sentences.

The ambiguity of natural language is a problem that is not well understood. We do not have a clear idea about the size and complexity of the problem. We are investigating the problem systematically by determining the relation between 3 variables: Word (W) – Meaning (M) – context (C). The goal is to more properly define this complex relation and apply this knowledge to come closer to the optimal solution for the Word Sense Disambiguation task.

We do not notice these other meanings in our social communication.
 
But machines really suffer from this ambiguity.

Resources
In order to get a better understanding the ambiguity of natural language, several resources have been created within the project:

Publications
The following publications are the main publications from the ULM project 1:

A word can refer to many different things
 
but many different words can also refer to the same thing.