CSC 121 Computers and Scientific Thinking Fall 2005 1 Applications in Artificial Intelligence BY Adish 09463021148.

Transcript:

2 Artificial Intelligence
- Artificial Intelligence (AI) is a subfield of computer science closely tied with biology and cognitive science
- AI is concerned with computing techniques and models that simulate/investigate intelligent behavior
- AI research builds upon our understanding of the brain and evolutionary development
  - in return, AI research provides insights into the way the brain works, as well as the larger process of biological evolution
- two hot research areas in AI are:
  1. neural networks: building a model of the brain and "training" that model to recognize certain types of patterns
  2. genetic algorithms: "evolving" solutions to complex problems (especially problems that are intractable using other methods)

3 Neural Networks
- the idea of neural networks predates modern computers
  - in 1943, McCulloch and Pitts described a simple computational model of a neuron
  - neural networks were a focus of CS research in the 1950's
- humans lack the speed & memory of computers, yet are capable of complex reasoning/action
  - maybe our brain architecture is well-suited for certain tasks
- general brain architecture: many (relatively) slow neurons, interconnected
  - dendrites serve as input devices (receive electrical impulses from other neurons)
  - cell body "sums" inputs from the dendrites (possibly inhibiting or exciting)
  - if sum exceeds some threshold, the neuron fires an output impulse along the axon

4 Artificial Neurons
- neural networks are based on the brain metaphor
  - large number of simple, neuron-like processing elements
  - large number of weighted connections between neurons
    note: the weights encode information, not symbols!
  - parallel, distributed control
  - emphasis on learning
- McCulloch & Pitts (1943) described an artificial neuron
  - inputs are either electrical impulse (1) or not (0)
  - each input has a weight associated with it
  - the activation function multiplies each input value by its weight
  - if the sum of the weighted inputs reaches the threshold $\theta$, then the neuron fires (returns 1), else it doesn't fire (returns 0)
    if $\sum_i w_i x_i \ge \theta$, output = 1
    if $\sum_i w_i x_i < \theta$, output = 0

5 Computation via Neurons
- can view an artificial neuron as a computational element
  - accepts or classifies an input if the output fires
- example neuron: weight .75 on each of two inputs, threshold 1
  INPUT: x1 = 1, x2 = 1    .75*1 + .75*1 = 1.5 >= 1    OUTPUT: 1
  INPUT: x1 = 1, x2 = 0    .75*1 + .75*0 = .75 < 1     OUTPUT: 0
  INPUT: x1 = 0, x2 = 1    .75*0 + .75*1 = .75 < 1     OUTPUT: 0
  INPUT: x1 = 0, x2 = 0    .75*0 + .75*0 = 0 < 1       OUTPUT: 0
- this neuron computes the AND function
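The threshold neuron on this slide can be sketched in a few lines of Python. The weights (.75 and .75) and the threshold (1) come from the slide; the function names `neuron` and `and_gate` are illustrative choices, not part of the original material.

```python
def neuron(inputs, weights, threshold):
    """Return 1 (fire) when the weighted sum of the inputs reaches the threshold."""
    total = sum(w * x for w, x in zip(weights, inputs))
    return 1 if total >= threshold else 0


# Values from the slide: weight .75 on each input, threshold 1.
AND_WEIGHTS = (0.75, 0.75)
AND_THRESHOLD = 1.0


def and_gate(x1, x2):
    """The slide's example neuron, which fires only when both inputs are 1."""
    return neuron((x1, x2), AND_WEIGHTS, AND_THRESHOLD)


# Reproduce the four cases listed on the slide.
for x1 in (1, 0):
    for x2 in (1, 0):
        print(f"x1 = {x1}, x2 = {x2} -> output {and_gate(x1, x2)}")
```

Keeping the same weights but lowering the threshold to .5 would make the neuron fire whenever either input is 1, so the same element can compute OR as well.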

6 Learning Algorithm
- Rosenblatt (1958) devised a learning algorithm for artificial neurons
  - start with a training set (example inputs & corresponding desired outputs)
  - train the network to recognize the examples in the training set (by adjusting the weights on the connections)
  - once trained, the network can be applied to new examples
- e.g., Optical Character Recognition (OCR)
  - perhaps one hidden unit "looks for" a horizontal bar
  - another hidden unit "looks for" a diagonal
  - another looks for the vertical base
  - the combination of specific hidden units indicates a 7
- while this algorithm is simple and easy to execute, it doesn't always work
  - there are some patterns that cannot be recognized by a single neuron
  - however, by adding additional layers of neurons, the network can develop complex feature detectors (i.e., internal representations)
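The slide does not spell out the weight-adjustment step, so the following is only a sketch that assumes the standard perceptron update rule (add `rate * error * input` to each weight). The threshold is folded into a bias term, so the neuron fires when the weighted sum plus the bias is at least 0. All names (`predict`, `train_perceptron`, `rate`, `max_epochs`) are hypothetical.

```python
def predict(weights, bias, inputs):
    """Threshold neuron with the threshold expressed as a bias (bias = -threshold)."""
    total = sum(w * x for w, x in zip(weights, inputs)) + bias
    return 1 if total >= 0 else 0


def train_perceptron(examples, rate=1, max_epochs=100):
    """Adjust weights until every training example is classified correctly.

    Returns (weights, bias, converged). Assumes the standard perceptron rule,
    which the slide does not state explicitly.
    """
    weights = [0] * len(examples[0][0])
    bias = 0
    for _ in range(max_epochs):
        mistakes = 0
        for inputs, target in examples:
            error = target - predict(weights, bias, inputs)
            if error != 0:
                mistakes += 1
                weights = [w + rate * error * x for w, x in zip(weights, inputs)]
                bias += rate * error
        if mistakes == 0:
            return weights, bias, True
    return weights, bias, False


AND_EXAMPLES = [((0, 0), 0), ((0, 1), 0), ((1, 0), 0), ((1, 1), 1)]
XOR_EXAMPLES = [((0, 0), 0), ((0, 1), 1), ((1, 0), 1), ((1, 1), 0)]

and_weights, and_bias, and_ok = train_perceptron(AND_EXAMPLES)
_, _, xor_ok = train_perceptron(XOR_EXAMPLES)
print("AND learned:", and_ok, "XOR learned:", xor_ok)
```

The XOR run illustrates the slide's caveat: no single threshold neuron separates those four examples, so training never reaches an error-free pass however many epochs are allowed.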

7 Neural Net Example
- consider the following political poll, taken by six potential voters
  - each ranked topics as to their importance, scale of 0 to 10
  - voters 1-3 identified themselves as Republicans, voters 4-6 as Democrats

           Budget   Defense   Environment
  voter 1     9        6           3
  voter 2     8        8           6
  voter 3     7        5           2
  voter 4     7        4           6
  voter 5     3        1           8
  voter 6     6        3           7

- based on survey responses, can we train a neural net to recognize Republicans and Democrats?

8 Neural Net Example
- the most commonly used training algorithm for multi-layer neural networks is called backpropagation
  - training the network can take many iterations
  - the algorithm is not guaranteed to converge on a solution in all cases, but works well in practice
- backpropagation simulator:
  - note: inputs to the network can be real values between -1.0 and 1.0
  - for this example, a response of 8 becomes an input value of 0.8
- generalization problem
  - you can train a network to recognize a collection of patterns, but you can't be sure of what features it is using to decide
  - how do you know if the trained network will behave "reasonably" on new inputs?
  - classic example: A military neural net was trained to identify tanks in photos. After extensive training on both positive and negative examples, it proved very effective at classification. But when tested on new photos, it failed miserably. WHY?
  - various techniques are used to select training examples to help guard against these types of bad generalizations, but can't know for sure!
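As a rough illustration of backpropagation on the survey data, here is a minimal pure-Python sketch. Several things in it are assumptions rather than details from the slides: the survey rows are read as three single-digit scores per voter, party is coded as 1 for Republican and 0 for Democrat, and the network shape (2 hidden units), learning rate, epoch count, and the class name `TinyNet` are all arbitrary illustrative choices. It is not the simulator the slide refers to.

```python
import math
import random

# (budget, defense, environment) per voter; label 1 = Republican, 0 = Democrat.
# Both the digit reading and the 1/0 coding are assumptions.
SURVEY = [
    ((9, 6, 3), 1), ((8, 8, 6), 1), ((7, 5, 2), 1),
    ((7, 4, 6), 0), ((3, 1, 8), 0), ((6, 3, 7), 0),
]
# Scale each response into the network's input range: 8 becomes 0.8.
EXAMPLES = [([score / 10 for score in scores], label) for scores, label in SURVEY]


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


class TinyNet:
    """One hidden layer of sigmoid units feeding a single sigmoid output."""

    def __init__(self, n_inputs, n_hidden, seed=0):
        rng = random.Random(seed)
        # Each unit stores its input weights followed by one bias term.
        self.hidden = [[rng.uniform(-0.5, 0.5) for _ in range(n_inputs + 1)]
                       for _ in range(n_hidden)]
        self.output = [rng.uniform(-0.5, 0.5) for _ in range(n_hidden + 1)]

    def forward(self, x):
        # zip stops at the shorter sequence, so the trailing bias is added separately.
        h = [sigmoid(sum(w * xi for w, xi in zip(unit, x)) + unit[-1])
             for unit in self.hidden]
        o = sigmoid(sum(w * hi for w, hi in zip(self.output, h)) + self.output[-1])
        return h, o

    def train_step(self, x, target, rate):
        h, o = self.forward(x)
        # Error signal at the output, then propagated back to each hidden unit
        # (using the output weights as they were before this update).
        delta_o = (target - o) * o * (1 - o)
        delta_h = [delta_o * self.output[j] * h[j] * (1 - h[j])
                   for j in range(len(h))]
        for j, hj in enumerate(h):
            self.output[j] += rate * delta_o * hj
        self.output[-1] += rate * delta_o
        for unit, dh in zip(self.hidden, delta_h):
            for i, xi in enumerate(x):
                unit[i] += rate * dh * xi
            unit[-1] += rate * dh


def total_error(net, examples):
    """Sum of squared differences between targets and network outputs."""
    return sum((target - net.forward(x)[1]) ** 2 for x, target in examples)


net = TinyNet(n_inputs=3, n_hidden=2)
error_before = total_error(net, EXAMPLES)
for _ in range(5000):
    for x, target in EXAMPLES:
        net.train_step(x, target, rate=0.5)
error_after = total_error(net, EXAMPLES)
print("error before:", error_before, "error after:", error_after)
```

Even when the error on these six voters drops, nothing in the trained weights says which topics the network relied on, which is the generalization concern the slide raises.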

9 Neural Net Applications
- pattern classification
  - 9 of top 10 US credit card companies use Falcon
    - uses neural nets to model customer behavior, identify fraud
    - claims improvement in fraud detection of 30-70%
  - scanners, tablet PCs, PDAs -- Optical Character Recognition (OCR)
- prediction & financial analysis
  - Merrill Lynch, Citibank, … -- financial forecasting, investing
  - Spiegel – marketing analysis, targeted catalog sales
- control & optimization
  - Texaco – process control of an oil refinery
  - Intel – computer chip manufacturing quality control
  - AT&T – echo & noise control in phone lines (filters and compensates)
  - Ford engines utilize neural net chip to diagnose misfirings, reduce emissions
  - ALVINN project at CMU: trained a neural net to drive a van
    - backpropagation network: video input, 9 hidden units, 45 outputs

10 Evolutionary Models
- neural networks are patterned after the processes underlying brain activity
  - artificial neurons are interconnected into networks
  - information is sub-symbolic, stored in the strengths of the connections
- genetic algorithms represent an approach to problem-solving that is patterned after the processes underlying evolution
  - potential solutions to problems form a population
  - better (more fit) solutions evolve through natural selection
- Darwin saw "… no limit to the power of slowly and beautifully adapting each form to the most complex relations of life …"
  - through the process of introducing variations into successive generations and selectively eliminating less fit individuals, adaptations of increasing capability and diversity emerge in a population
  - evolution and emergence occur in populations of embodied individuals, whose actions affect others and that, in turn, are affected by others
  - selective pressures come not only from the outside, but also from the interactions between members of the population

11 Evolution & Problem-Solving
- evolution slowly but surely produces populations in which individuals are suited to their environment
  - the characteristics/capabilities of individuals are defined by their chromosomes
  - those individuals that are most fit (have the best characteristics/capabilities for their environment) are more likely to survive and reproduce
  - since the chromosomes of the parents are combined in the offspring, combinations of fit characteristics/capabilities are passed on
  - with a small probability, mutations can also occur, resulting in offspring with new characteristics/capabilities
- in 1975, psychologist/computer scientist John Holland applied these principles to problem-solving
- genetic algorithms
  - solve a problem by starting with a population of candidate solutions
  - using reproduction, mutation, and survival-of-the-fittest, evolve even better solutions

12 Genetic Algorithm (GA)
- for a given problem, must define:
  - chromosome: bit string that represents a potential solution
  - fitness function: a measure of how good/fit a particular chromosome is
  - reproduction scheme: combining two parent chromosomes to yield offspring
  - mutation rate: likelihood of a random mutation in the chromosome
  - replacement scheme: replacing old (unfit) members with new offspring
  - termination condition: when is a solution good enough?
- in general, the genetic algorithm:
  start with an initial (usually random) population of chromosomes
  while the termination condition is not met
    1. evaluate the fitness of each member of the population
    2. select members of the population that are most fit
    3. produce the offspring of these members via reproduction & mutation
    4. replace the least fit members of the population with these offspring
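The general loop above can be sketched as follows. This is one possible reading, not the course's implementation: the termination condition is simply a fixed number of generations, the fitter half of the population is kept as parents, and single-point crossover is used for reproduction. The function name `evolve`, its parameters, and the toy fitness function `count_ones` are all hypothetical.

```python
import random


def evolve(fitness, length=8, pop_size=8, generations=20,
           mutation_rate=0.05, seed=0):
    """Run a simple genetic algorithm over bit-string chromosomes.

    Returns the fittest chromosome found and the best fitness per generation.
    """
    rng = random.Random(seed)
    # Start with a random population of bit strings.
    population = ["".join(rng.choice("01") for _ in range(length))
                  for _ in range(pop_size)]
    best_per_generation = []
    for _ in range(generations):  # termination: fixed generation count (assumption)
        # Steps 1-2: evaluate fitness and select the fittest half as parents.
        ranked = sorted(population, key=fitness, reverse=True)
        best_per_generation.append(fitness(ranked[0]))
        parents = ranked[: pop_size // 2]
        # Step 3: produce offspring via crossover, then mutate bits at random.
        offspring = []
        while len(offspring) < pop_size - len(parents):
            mom, dad = rng.sample(parents, 2)
            cut = rng.randint(1, length - 1)
            child = mom[:cut] + dad[cut:]
            child = "".join(
                ("1" if bit == "0" else "0") if rng.random() < mutation_rate else bit
                for bit in child
            )
            offspring.append(child)
        # Step 4: the offspring replace the least fit half of the population.
        population = parents + offspring
    return max(population, key=fitness), best_per_generation


def count_ones(chromosome):
    """Toy fitness function for demonstration: the number of 1 bits."""
    return chromosome.count("1")


best, history = evolve(count_ones)
print(best, history)
```

Because the parents survive into the next generation, the best fitness in the population can never decrease from one generation to the next; a scheme that replaced the whole population would not have that property.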

13 GA example
- A thief has a bag in which to carry away the 'loot' from a robbery. The bag can hold up to 50 pounds. There are 8 items he could steal, each with a monetary value and a weight. What items should he steal to maximize his $$ haul?

  item              value   weight
  tiara             $…      … lbs
  coin collection   $…      … lbs
  HDTV              $…      … lbs
  laptop            $…      … lbs
  silverware        $…      … lbs
  stereo            $800    25 lbs
  PDA               $600    1 lb
  clock             $300    4 lbs

- could try a greedy approach (take next highest value item that fits)
  - based on value: tiara + coins + HDTV + PDA = 49 lbs, $9,900
- note that this collection is not optimal
  - tiara + coins + laptop + silverware + PDA + clock = 31 lbs, $11,300
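To make the greedy-versus-better comparison concrete, the sketch below runs a greedy pass by value and an exhaustive search over all 256 subsets. The values and weights of the tiara, coin collection, HDTV, laptop, and silverware did not survive in this transcript, so the figures used for those five items are assumptions, chosen only so that they reproduce the totals quoted on the slide (49 lbs and $9,900 for the greedy pick; 31 lbs and $11,300 for the better collection). The stereo, PDA, and clock figures are from the slide.

```python
from itertools import combinations

CAPACITY = 50  # pounds the bag can hold

# (name, value in dollars, weight in lbs)
ITEMS = [
    ("tiara", 5000, 3),             # assumed figures
    ("coin collection", 2200, 5),   # assumed figures
    ("HDTV", 2100, 40),             # assumed figures
    ("laptop", 2000, 8),            # assumed figures
    ("silverware", 1200, 10),       # assumed figures
    ("stereo", 800, 25),            # from the slide
    ("PDA", 600, 1),                # from the slide
    ("clock", 300, 4),              # from the slide
]


def total_value(items):
    return sum(value for _, value, _ in items)


def total_weight(items):
    return sum(weight for _, _, weight in items)


def greedy_by_value(items, capacity):
    """Take the next highest-value item that still fits in the bag."""
    chosen = []
    for item in sorted(items, key=lambda it: it[1], reverse=True):
        if total_weight(chosen) + item[2] <= capacity:
            chosen.append(item)
    return chosen


def best_subset(items, capacity):
    """Check every subset; feasible here because 8 items give only 256 subsets."""
    best = ()
    for size in range(len(items) + 1):
        for subset in combinations(items, size):
            if total_weight(subset) <= capacity and total_value(subset) > total_value(best):
                best = subset
    return list(best)


greedy = greedy_by_value(ITEMS, CAPACITY)
optimal = best_subset(ITEMS, CAPACITY)
print("greedy: ", [n for n, _, _ in greedy], total_weight(greedy), total_value(greedy))
print("optimal:", [n for n, _, _ in optimal], total_weight(optimal), total_value(optimal))
```

Exhaustive search is only practical because the example is tiny; the number of subsets doubles with every added item, which is the motivation for trying a genetic algorithm instead.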

14 GA example (cont.)
- chromosome: a string of 8 bits with each bit corresponding to an item (in the order tiara, coin collection, HDTV, laptop, silverware, stereo, PDA, clock)
  - 1 implies that the corresponding item is included; 0 implies not included
  - e.g., 11100000 represents (tiara + coins + HDTV)
          01101000 represents (coins + HDTV + silverware)
- fitness function: favor collections with higher values
  - fit(chromosome) = sum of dollar amounts of items, or 0 if weight > 50
  - e.g., fit(11100000) = 9300
          fit(01101000) = 0
- reproduction scheme: utilize crossover (a common technique in GA's)
  - pick a random index, and swap left & right sides from parents
  - e.g., parents 11100000 and 01101000, pick index 4
          1110|0000 and 0110|1000 yield offspring 11101000 and 01100000
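The chromosome encoding, fitness function, and crossover described on this slide can be sketched as below. As before, the value and weight figures for the first five items are assumptions (they were lost from the transcript) chosen to agree with the fitness values the slides quote; the bit order is assumed to follow the item list; and the helper names `decode`, `fit`, `crossover`, and `mutate` are illustrative.

```python
import random

CAPACITY = 50  # pounds

# (name, value in dollars, weight in lbs), one entry per chromosome bit.
ITEMS = [
    ("tiara", 5000, 3),             # assumed figures
    ("coin collection", 2200, 5),   # assumed figures
    ("HDTV", 2100, 40),             # assumed figures
    ("laptop", 2000, 8),            # assumed figures
    ("silverware", 1200, 10),       # assumed figures
    ("stereo", 800, 25),            # from the slide
    ("PDA", 600, 1),                # from the slide
    ("clock", 300, 4),              # from the slide
]


def decode(chromosome):
    """List the items whose bit is 1."""
    return [item for bit, item in zip(chromosome, ITEMS) if bit == "1"]


def fit(chromosome):
    """Sum of dollar amounts of the included items, or 0 if they exceed the bag."""
    items = decode(chromosome)
    if sum(weight for _, _, weight in items) > CAPACITY:
        return 0
    return sum(value for _, value, _ in items)


def crossover(mom, dad, index):
    """Swap the right-hand sides of two parents at the given index."""
    return mom[:index] + dad[index:], dad[:index] + mom[index:]


def mutate(chromosome, rate, rng):
    """Flip each bit independently with probability `rate`."""
    return "".join(
        ("1" if bit == "0" else "0") if rng.random() < rate else bit
        for bit in chromosome
    )


print(fit("11100000"), fit("01101000"))
print(crossover("11100000", "01101000", 4))
print(mutate("11100000", 0.1, random.Random(0)))
```

Returning 0 for an overweight collection is the simplest way to keep infeasible chromosomes from being selected; a graded penalty would be another option, but the slide specifies the zero rule.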

15 GA example (cont.)
- Generation 0 (randomly selected), fitness of the 8 chromosomes:
  9300, 0, 9300, 9200, 2800, 4300, 0, 8200
  - choose fittest 4, perform crossover with possibility of mutation
- Generation 1 (replacing least fit from Generation 0), fitness of the 8 chromosomes:
  9300, 0, 9300, 9200, 8700, 10400, 7000, 8200
  - choose fittest 4, perform crossover with possibility of mutation
- Generation 2 (replacing least fit from Generation 1), fitness of the 8 chromosomes:
  9300, 8400, 9300, 9200, 9300, 10400, 11300, 9200
- visual example: english/gavgb.html

16 GA Applications
- genetic algorithms for data mining
  - using GA's, it is possible to build statistical predictors over large, complex sets of data
    e.g., stock market predictions, consumer trends, …
  - GA's do not require a deep understanding of correlations, causality, …
    - start with a random population of predictors
    - fitness is defined as the rate of correct predictions on validation data
    - "evolution" favors those predictors that correctly predict the most examples
  - e.g., Prediction Company was founded in 1991 by astrophysicists (Farmer & Packard)
    - developed software using GA's to predict the stock market – very successful
- genetic algorithms for scheduling complex resources
  - e.g., Smart Airport Operations Center by Ascent Technology
    - uses GA for logistics: assign gates, direct baggage, direct service crews, …
    - considers diverse factors such as plane maintenance schedules, crew qualifications, shift changes, locality, security sweeps, …
    - too many variables to juggle using a traditional algorithm (NP-hard)
    - GA is able to evolve sub-optimal schedules, improve performance
    - Ascent claims 30% increase in productivity (including SFO, Logan, Heathrow, …)