Some ideas of semantic analysis for anaphora resolution Dmitry P. Vetrov Dorodnicyn Computing Centre of RAS.

Transcript:

Some ideas of semantic analysis for anaphora resolution Dmitry P. Vetrov Dorodnicyn Computing Centre of RAS

Anaphora resolution
Cannot be done by syntactic analysis alone. In many cases it is unclear which antecedent a pronoun refers to without understanding the meaning of the phrase.

Example
A Rebel fighter attacked an Imperial ship that was about to take off. It exploded into pieces and sank.
From a formal point of view, both readings are acceptable: "it" may refer to the fighter or to the ship. But without knowing which is right, we cannot merge the sentences within an abstract, which may be necessary for automatic text annotation.

Training corpus
Using a training corpus is difficult because of its limited size. Moreover, we do not have an annotated corpus for the Russian language. It would be desirable to use an unannotated corpus instead: then very large sets of texts could be used for training and for building an ontology.

Hierarchical vocabulary
The vocabulary by Baranov contains a 5-level hierarchical classification of words:
Fighter I -> flying transportation -> transportation -> device -> artificial -> substantial
Fighter II -> character -> psychology -> human -> biological organism -> substantial
Most words can be referred to by a 6-number code that gives the category number at each level. This opens up great opportunities for generalizing from short texts.
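As a rough illustration of how such a hierarchy supports generalization, the sketch below stores the two category chains from the slide and finds the most specific category two senses share. The dictionary layout, the sense labels and the function names are assumptions made for this example; they are not the format of Baranov's vocabulary.

```python
# Category chains copied from the slide, ordered from most specific to most general.
# The storage format (a dict of lists) is an assumption for illustration only.
SENSES = {
    "fighter I": ["flying transportation", "transportation", "device",
                  "artificial", "substantial"],
    "fighter II": ["character", "psychology", "human",
                   "biological organism", "substantial"],
}


def shared_categories(sense_a, sense_b):
    """Return the categories both senses belong to, most specific first."""
    chain_b = set(SENSES[sense_b])
    return [category for category in SENSES[sense_a] if category in chain_b]


def most_specific_shared(sense_a, sense_b):
    """Return the most specific common category, or None if there is none."""
    shared = shared_categories(sense_a, sense_b)
    return shared[0] if shared else None


print(most_specific_shared("fighter I", "fighter II"))  # substantial
```

The two senses of "fighter" meet only at the top category, which is what makes the word ambiguous at every lower level.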

Training
During text processing, count the associations between categories of various levels: the number of associations of a word w with any words from the j-th category of the k-th level. This can be done relatively easily using ONLY formal analysis (e.g. we may consider the fighter both as a plane and as a man). The resulting inaccuracies will be compensated for by the large volume of text.
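A minimal sketch of this counting step, under two assumptions that the slide does not state: an "association" is taken to be co-occurrence within one sentence, and levels are indexed from 0 (most specific). The toy sentences and the `VOCAB` layout are hypothetical; only the two "fighter" chains come from the slides.

```python
from collections import Counter
from itertools import permutations

# word -> list of senses; each sense is a category chain indexed by level
# (0 = most specific). Layout and level numbering are assumptions.
VOCAB = {
    "fighter": [
        ["flying transportation", "transportation", "device",
         "artificial", "substantial"],
        ["character", "psychology", "human",
         "biological organism", "substantial"],
    ],
}


def count_associations(sentences, vocab):
    """Count how often word w co-occurs with any word of category j at level k.

    An ambiguous word contributes all of its senses (the fighter is counted
    both as a plane and as a man); no sense annotation is required.
    """
    counts = Counter()
    for sentence in sentences:
        for w, other in permutations(sentence, 2):
            for chain in vocab.get(other, []):
                for level, category in enumerate(chain):
                    counts[(w, level, category)] += 1
    return counts


# Hypothetical pre-tokenized sentences.
sentences = [["fighter", "attacked"], ["fighter", "exploded"]]
counts = count_associations(sentences, VOCAB)
print(counts[("exploded", 0, "flying transportation")])  # 1
print(counts[("exploded", 4, "substantial")])  # 2
```

The top-level count is 2 because both senses of "fighter" end in "substantial", which shows how the higher levels accumulate evidence faster than the lower ones.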

Semantic analysis
After training is finished, we can estimate the relevance measure between two words, using the following quantities:
- the number of the category from level k for word w
- the number of associations of words from the i-th category of the k-th level with ANY other words
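The transcript names the quantities involved but not the formula that combines them, so the sketch below is only one plausible form and should be read as an assumption: for each level, the count of associations between w and the category of v at that level is divided by the total number of associations of that category, and the ratios are summed over levels. The function name, the dictionary layouts and the toy counts are all hypothetical.

```python
def relevance(w, chain_v, assoc, totals):
    """Assumed relevance of word w to a word v with category chain chain_v.

    assoc[(w, level, category)]  : associations of w with words of that category
    totals[(level, category)]    : associations of that category with ANY words
    The sum of per-level ratios is an illustrative choice, not the slide's formula.
    """
    score = 0.0
    for level, category in enumerate(chain_v):
        total = totals.get((level, category), 0)
        if total:
            score += assoc.get((w, level, category), 0) / total
    return score


# Hypothetical counts, not taken from any real corpus.
assoc = {
    ("exploded", 0, "flying transportation"): 2,
    ("exploded", 1, "transportation"): 4,
}
totals = {
    (0, "flying transportation"): 8,
    (1, "transportation"): 16,
}
chain = ["flying transportation", "transportation"]
print(relevance("exploded", chain, assoc, totals))  # 0.5
```

Dividing by the category total keeps very frequent categories from dominating purely because they occur often.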

Decision making
If there are several candidate referents v1,…,vm for the word w, we prefer the most relevant one.
[Diagram: "exploded" and "it", with the candidate referents "ship?" and "fighter?"]
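The decision rule amounts to an argmax over the candidates. The sketch below assumes a relevance function is already available; the scores used here are made-up placeholders, not results of any trained model.

```python
def choose_referent(w, candidates, relevance):
    """Return the candidate v with the highest relevance(w, v)."""
    return max(candidates, key=lambda v: relevance(w, v))


# Hypothetical scores standing in for a trained relevance measure.
TOY_SCORES = {("exploded", "ship"): 0.75, ("exploded", "fighter"): 0.5}


def toy_relevance(w, v):
    """Look up a placeholder score; unknown pairs score 0."""
    return TOY_SCORES.get((w, v), 0.0)


print(choose_referent("exploded", ["ship", "fighter"], toy_relevance))  # ship
```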

Ambiguity resolution
[Diagram: the words "fighter", "attacked", "ship", "take off", "it", "exploded, sank" with their candidate senses: fighter = man / plane; attacked = fired / criticized; ship = spacecraft / boat; it = spacecraft / plane / man / boat]

Dynamic programming
[Diagram: the words "fighter", "attacked", "ship", "take off", "it", "exploded, sank" with their candidate senses: fighter = man / plane; attacked = fired / criticized; ship = spacecraft / boat; it = spacecraft / plane / man / boat]
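The slides do not spell out the graph structure or the objective, so the following is a sketch under assumptions: the candidate senses form a left-to-right lattice (one list of senses per word), only adjacent senses are scored, and the goal is the path with the largest total score. This is a Viterbi-style dynamic program; the function name and the pair scores are hypothetical.

```python
def best_sense_path(lattice, score):
    """Pick one sense per position so that the sum of adjacent scores is maximal.

    lattice : non-empty list of lists of candidate senses, one list per word
    score   : function (previous_sense, sense) -> float
    Returns (total_score, path).
    """
    # best[s] = (total, path) of the best path that ends in sense s
    best = {s: (0.0, [s]) for s in lattice[0]}
    for options in lattice[1:]:
        new_best = {}
        for s in options:
            prev, (total, path) = max(
                best.items(), key=lambda item: item[1][0] + score(item[0], s)
            )
            new_best[s] = (total + score(prev, s), path + [s])
        best = new_best
    return max(best.values(), key=lambda item: item[0])


# Hypothetical pairwise scores for the senses shown on the slide.
PAIR_SCORES = {
    ("plane", "fired"): 1.0,
    ("man", "criticized"): 0.5,
    ("fired", "spacecraft"): 1.0,
    ("criticized", "boat"): 0.25,
}


def pair_score(a, b):
    """Look up a placeholder score; unknown pairs score 0."""
    return PAIR_SCORES.get((a, b), 0.0)


lattice = [["man", "plane"], ["fired", "criticized"], ["spacecraft", "boat"]]
print(best_sense_path(lattice, pair_score))
# (2.0, ['plane', 'fired', 'spacecraft'])
```

Each position only needs the best partial path per sense from the previous position, so the cost grows linearly with the number of words rather than with the number of sense combinations.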

Advantages
- Does not need the text to be marked (annotated).
- May generalize to words that were not met during training, thanks to the hierarchical system of categories.
- Dynamic programming allows processing quite large graphs built from the set of sentences.

Thank you
Contact persons: Pavel Tolpegin, Dmitry Vetrov, Dmitry Kropotov