Distributions of verbs and adverbs on the multidimensional sphere Vvedensky V.L. Kurchatov Institute.

Презентация:



Advertisements
Похожие презентации
Victor Vvedensky, Kurchatov Institute. Direct contact with technical devices using human speech implies ability of the computer to understand in some.
Advertisements

Victor Vvedensky. Adverbs and adjectives Nouns and verbs are two most basic lexical elements of speech. Usually they correspond to entities and actions.
In mathematics, the notion of permutation is used with several slightly different meanings, all related to the act of permuting (rearranging) objects.
© 2005 Cisco Systems, Inc. All rights reserved. BGP v Customer-to-Provider Connectivity with BGP Connecting a Multihomed Customer to Multiple Service.
How can we measure distances in open space. Distances in open space.
Knot theory. In topology, knot theory is the study of mathematical knots. While inspired by knots which appear in daily life in shoelaces and rope, a.
© The McGraw-Hill Companies, Inc., Chapter 4 Counting Techniques.
11 BASIC DRESS-UP FEATURES. LESSON II : DRESS UP FEATURES 12.
Multiples Michael Marchenko. Definition In mathematics, a multiple is the product of any quantity and an integer. in other words, for the quantities a.
Computers are a necessary part of modern life. Computers play an important role in the lives of most of us today, whether we realize it or not. Some people,
Loader Design Options Linkage Editors Dynamic Linking Bootstrap Loaders.
© 2009 Avaya Inc. All rights reserved.1 Chapter Two, Voic Pro Components Module Two – Actions, Variables & Conditions.
Учимся писать Эссе. Opinion essays § 1- introduce the subject and state your opinion § 2-4 – or more paragraphs - first viewpoint supported by reasons/
Statistics Probability. Statistics is the study of the collection, organization, analysis, and interpretation of data.[1][2] It deals with all aspects.
Combination. In mathematics a combination is a way of selecting several things out of a larger group, where (unlike permutations) order does not matter.
Purposes Working with students Working with teachers Opinion Conclusion.
Topology Topology (from the Greek τόπος, place, and λόγος, study) is a major area of mathematics concerned with properties that are preserved under continuous.
Family Relationships (Семейные Отношения). Family How could you describe the word family? First of all family means a close unit of parents and their.
Tool: Pareto Charts. The Pareto Principle This is also known as the "80/20 Rule". The rule states that about 80% of the problems are created by 20% of.
Teens problems Semenova Nastya The 10 Б form student Teacher: Pshennikova E.D
Транксрипт:

Distributions of verbs and adverbs on the multidimensional sphere Vvedensky V.L. Kurchatov Institute

Direct contact with technical devices using human speech implies ability of the computer to understand in some way the idea of the message. The people feel semantic similarity of words and can find the word with close or just opposite meaning. How we can implement this feeling of sense in the computer?

The meaning of words have to convey interpreters into the other language. Their experience is accumulated in the dictionaries which can be used for the analysis of words. One can determine proximity of words and can construct the mathematical space imbedding these words in accordance with their meaning.

Участок пространства близости английских глаголов. The space imbedding the set of English verbs with close meaning. It is constructed using their translations into many languages.

This technique is rather cumbersome, which is difficult to apply for all the verbs of a language. In order to proceed reasonably quickly, more handy approach should be used for the construction of the space imbedding words. It would be more convenient if the data of only one language were sufficient for this procedure.

Each verb can be attributed with the set of properties. The adverbs, which can be readily used together with this verb in many different contexts, reflect these properties. One can смело прыгать to jump bravely ловко прыгать to jump deftly возбужденно прыгать to jump excitedly

The language allows only certain combinations of verbs with adverbs though the logic behind matching verbs and adverbs is sometimes quite peculiar. One can горячо спорить argue hotly, though can not горячо видеть see hotly, and Russians do not say горячо кипеть, although sometimes the blood boils hotly in English.

In Russian there are about 1900 words used as adverbs. For each verb one can say whether it is compatible with any of those adverbs. Since the majority of language objects are fuzzy the compatibility measure should be selected from the range [0,1]

Compatibility values for representative sets of 25 verbs and 25 adverbs of Russian language

The set of compatible adverbs is specific for every verb and reflects the individuality of the word. The verbs with close meaning always have about 4% of adverbs for which we can say that they can be used with one verb and not with another. There is a certain distance between different verbs and each verb occupies a certain cell in the multidimensional space, which can be called the mental space of verbs.

Using the table of compatibility one can calculate the distance between any two verbs. Each verb finds definite place in the «mental space» in accordance with these distances. There is no need to use all the adverbs for this procedure, just 100 adverbs from the properly selected «basic reference set» are sufficient to determine accurately enough these distances.

2D projections of 900 verbs in first 9 dimensions of the multidimensional space Each verb is presented as a dot. The numbers indicate the order of dimensions. Each 2- dimensional plane shows distributions for the pair of dimensions with numbers indicated for rows and columns. The uppermost square represents projection on the plane in dimensions one and two or X and Y. The scale is normalized to unity

Sphericity of the space imbedding verbs Chineese paper lantern The verbs in the multidimensional space are nearly equidistant from a point lying between two hypothetical verbs - universal, compatible with any adverb, and individual, which can not be used with adverbs. Slight nonsphericity is observed.

Representative set of 100 verbs The regular distribution of verbs makes feasible accurate selection of the representative set of verbs, which fill evenly the observed distribution - this is one example: предлагать, начинать, изменять, направлять, вынимать, возвращать, обвинять, просить, помогать, убирать, указывать, решать, соблюдать, разрешать, преодолевать, приносить, уменьшать, поворачивать, проникать, закрывать, работать, нападать, втягивать, приказывать, хранить, отталкивать, выставлять, покрывать, обещать, носить, попадать, определять, прикреплять, входить, вставать, скрывать, освобождать, приходить, беречь, очищать, выходить, судить, пересекать, кидать, перевозить, служить, лгать, уступать, возникать, привязывать, завязывать, спасать, избегать, побеждать, обнаруживать, беспокоить, запоминать, отвлекать, узнавать, расти, выручать, шутить, возить, бегать, терять, рассыпать, отсутствовать, жить, любить, мучить, катать, выдерживать, зависеть, летать, отдыхать, стучать, обманывать, плавать, блуждать, будить, грабить, бывать, праздновать, дарить, дружить, радовать, умирать, успевать, блестеть, воровать, восхищать, спать, баловать, уставать, болеть, сушить, уважать, плевать, перекашивать, бесить.

Distributions of verbs and adverbs in first two dimensions of the multidimensional space 900 verbs and 600 adverbs of Russian language in first two dimensions. The distributions nearly mirror each other so that the most compatible verbs and adverbs are close, while incompatible pairs are on the far ends. Sets of verbs compatible with three adverbs are shown as hazel points. Green points indicate fuzzy cases. To cost, to fill over, to throw down, to continuepatiently, industriously, motionlessly

Representative set of 100 adverbs This is an example of the representative set of adverbs, which fill evenly the corresponding distribution: неизменно, предусмотрительно, неожиданно, часто, редко, открыто, независимо, уверенно, легко, охотно, дружно, усердно, покорно, торопливо, много, незаметно, старательно, радостно, хитро, медленно, обоснованно, бодро, забавно, тайно, невозмутимо, небрежно, грустно, безнаказанно, твердо, весело, беззаботно, мужественно, поспешно, нерешительно, нетерпеливо, точно, загадочно, дополнительно, невольно, справедливо, резко, основательно, безошибочно, преданно, воинственно, безотказно, смешно, верно, четко, безупречно, чрезмерно, скрытно, неловко, тщательно, непреклонно, ненадежно, полезно, сложно, молчаливо, понятно, насильно, тяжело, устало, оживленно, кропотливо, придирчиво, неудобно, отчётливо, непримиримо, осуждающе, враждебно, безвольно, чудесно, неодобрительно, чисто, развязно, добротно, громко, кратко, задумчиво, подробно, ласково, неограниченно, болезненно, бесперебойно, доверчиво, приветливо, счастливо, глубоко, трезво, проникновенно, сердечно, смутно, сжато, густо, смертельно, дорого, звонко, незыблемо, пристально.

The general layout of distributions suggests the idea that the verbs and adverbs are presented in two adjacent portions of cortical tissue. These areas are rich with internal connections, and provided with multiple crossing links. Density of these links falls out quickly with the distance - only few connections link far ends of these cortical patches. The presence or absence of such a link makes possible or prohibits combined use of a verb with a certain adverb in a fluent intelligent speech.

Small cortical patches with a certain function D.H.Hubel, Eye, Brain, and Vision Brain studies, using special functional staining techniques, reveal confined areas performing definite tasks. These portions of the monkey cortex extract visual features with certain orientation. They are about 1 mm x 2 mm large. Rich internal connections can impart this cortical patch with properties reflected as additional dimensions. That is like blowing up the flat soap film making the bulb in the space with multiple dimensions.

Our results show that the verbs and adverbs of human language are closely and strictly mathematically interrelated. Our data on nouns and adjectives indicate that this holds for them also. We believe that the rules controlling compatibility of words and the number of words in the language are determined by the layout of the space imbedding these objects of human speech. We see the way, how this abstract mental space can find material substrate in the cortex of the human brain.

To be continued…