• 검색 결과가 없습니다.

Learning Automata Chapter 1~1.2

N/A
N/A
Protected

Academic year: 2022

Share "Learning Automata Chapter 1~1.2"

Copied!
28
0
0

로드 중.... (전체 텍스트 보기)

전체 글

(1)

Learning Automata Chapter 1~1.2

Sunuwe Kim

(2)

Contents

Introduction

Learning

Learning in Psychology

Deterministic, Stochastic, and Adaptive Process

Hill Climbing

Deductive and Inductive Inference

Indentification and Control

Pattern Recognition

Bayesian Learning

2

(3)

Introduction

System theory

Feedback control concept

1930

Cybernetics by Norbert Wiener 1948

Communication and control in Man and

Machine

Socio- economic environment

(4)

Introduction

© 2015, SNU CSE Biointelligence Lab., http://bi.snu.ac.kr 4

Systems Theory

The identification and control of well defined deterministic and stochastic systems

Interest gradually shifted to substantial amount of uncertainty

Adaptation

Learning

Pattern recognition

Self-organization

Deterministic and Stochastic approach

(5)

Introduction

In more Complex Systems

Highly Uncertainty

Lack of parameter information

Hard to find dynamic relation

Distributed processing

Due to distributed databases

Collecting, processing, accessing data

Learning Automata

Input output searching under the influence of reinforcement feedback

Collective behavior of a number of automata operating in

Pattern recognition Adaptive control Self-organization

(6)

Learning

© 2015, SNU CSE Biointelligence Lab., http://bi.snu.ac.kr 6

Learning in Psychology

Empiricism

Experience is the only source of knowledge

All complex ideas are made up of simpler ideas

Complex ideas are connected together through the association of experience

Rationalism

The interrelations among elementary ideas are just as fundamental as the ideas themselves

(7)

Learning

Learning in Psychology

Stimulus-response theories

Importance of motivations, reward, punishment

Behaviorist

A detailed understanding of the internal workings of an organism is not necessary for developing a theory of behavior

Cognitive theories

Collection, transmission, storage, retrieval of information

(8)

Learning in Psychology

Expectancy learning

Classical learning

Instrumental conditioning

Operant conditioning

© 2015, SNU CSE Biointelligence Lab., http://bi.snu.ac.kr 8

(9)

Learning in Psychology

Expectancy conditioning  inductive inference

Instrumental conditioning

Classical conditioning

(10)

Learning in Psychology

Expectancy conditioning  inductive inference

Instrumental conditioning

Classical conditioning

© 2015, SNU CSE Biointelligence Lab., http://bi.snu.ac.kr 10

(11)

Learning in Psychology

Operant conditioning

(12)

Mathematical Learning Theory

Hull (1943)

Argued for the development of quantitative theories in learning

Estes (1959)

The development of learning theory for individual organisms is an elaboration of association theory

Association theory

Complex stimulus and response patterns at higher levels of learning

© 2015, SNU CSE Biointelligence Lab., http://bi.snu.ac.kr 12

(13)

Deterministic, Stochastic, and Adaptive Process

Adaptive Control Processes - A Guided Tour

(Bellman , 1961)

Deterministic

Stochastic

Adaptive Processes

Prior information

Less Prior information

(14)

Deterministic, Stochastic, and Adaptive Process

Deterministic Processes

The controller is designed

To satisfy either a set of performance criteria

To optimize a given index of performance

The design of practical estimators and controllers may be difficult due to analytical and computational reasons

© 2015, SNU CSE Biointelligence Lab., http://bi.snu.ac.kr 14

(15)

Deterministic, Stochastic, and Adaptive Process

Stochastic Control Processes

Probability characteristic

Relevant distributions

Optimization of linear system with quadratic performance

The optimization problems as well as the numerical procedures used to solve them are same

(deterministic process)

(16)

Deterministic, Stochastic, and Adaptive Process

Adaptive Processes

Prior information is considerably less

Ex. economics, biology, engineering, psychology, operations research, and AI

So, information needed for their estimation by off-line experiment

© 2015, SNU CSE Biointelligence Lab., http://bi.snu.ac.kr 16

(17)

Hill Climbing

Optimization problems

𝐽 𝛼 < 𝐽 𝛼 for all 𝛼 ≠ 𝛼

(18)

Deductive and Inductive Inference

Inference reasoning

© 2015, SNU CSE Biointelligence Lab., http://bi.snu.ac.kr 18

All orange are fruits

All fruits grow on trees

Therefore, all orange grow on trees

This cat is black.

That cat is black. A third cat is black

Therefore all cats are black

Inductive and deductive inference do not contradict but merely complement each other and both are found to be essential for the learning.

(19)

Deductive and Inductive Inference

Credibility and Induction

Inductive probability : “credibility”

Measure of confidence we place in a hypothesis on the basis of observed data

𝑞 𝐻𝑖 ≥ 0, � 𝑞 𝐻𝑖 = 1

𝑖

(20)

Identification and Control

Identification vs Control

Decide better action in the future, or taking the best action on the basis of past experience

Feldbaum (1965) – Dual control problem Contains uncertainty

New inputs it has to improve its knowledge of the characteristics of the system

New knowledge, it has to determine what actions are necessary for successful control

© 2015, SNU CSE Biointelligence Lab., http://bi.snu.ac.kr 20

(21)

Pattern Recognition

Definition

Deals with the issue of building a machine or a program that will display some capability of living organisms for classifying or discriminating sensory signals

Object

To sort patterns into different classes so that those

patterns which belong to a class share some common properties

Medical diagnosis, speech recognition, and scene analysis

(22)

Pattern Recognition

Pattern recognition process

Collect raw data

Raw data of the patterns is converted into n-dimensional feature vectors

Feature : contain the essential attributes of the given patterns

Preprocessing (Data Transformation, Smoothing, Normalize, Balancing etc)

Filtering (FFT, DWT, CWT, etc.),

Feature extraction(PCA)

Feature selection (ICA)

Classification

© 2015, SNU CSE Biointelligence Lab., http://bi.snu.ac.kr 22

𝑝 𝑥 𝑤1 > 𝑝 𝑥 𝑤2 ⇒ 𝑥 ∈ 𝑤1 X is the pattern, w is class

(23)

Pattern Recognition

Limitation

Unknown parameters

Non linear separable case, complex pattern

Image, speech, natural language etc.

(24)

Bayesian Learning

Frequentist vs Bayesian (Degree of Belief)

Frequentist

Parameters – quantities whose values are fixed but unknown

The best estimate of their values – the one that

maximizes the probability of obtaining the observed samples

Bayesian

Parameters – random variables having some known prior distribution

Observation of the samples converts to a posterior density; revising our opinion about the true values of the parameters

© 2015, SNU CSE Biointelligence Lab., http://bi.snu.ac.kr 24

(25)

Bayesian Learning

Bayesian approach

Extract information regarding the unknown parameter 𝜃 from observation 𝑥1, 𝑥2, 𝑥3, … , 𝑥𝑁 on the system

𝑝 𝜃 𝑥

𝑖

= 𝑝 𝑥

𝑖

𝜃 𝑝 𝜃 𝑝(𝑥

𝑖

)

Posterior Likelihood

The probability of getting this evidence if this hypothesis were true

Prior

The probability of 𝜃being true, before gathering evidence

The probability that the hypothesis

𝜃, 𝑝𝑝𝑝𝑝𝑝𝑝𝑝𝑝𝑝) is true Evidence

(26)

Further discussion

© 2015, SNU CSE Biointelligence Lab., http://bi.snu.ac.kr 26

(27)

Learning automata

Example

(28)

Hill Climbing

Optimization problems

© 2015, SNU CSE Biointelligence Lab., http://bi.snu.ac.kr 28

참조

관련 문서

Gombrich's theory of representation is based on the theory of perceptual trial and error, 'seeing' is not mere registration of image of stimuli as

[r]

An efficient algorithm for mining association rules in large databases. An effective hash-based algorithm for

Introduction to the Distance Learning Course Development Guide ...139 Designing Instruction for Distance Learning Programs...140 Objectives of this Course Development Guide

This study is carried out vocabulary learning through image association as a method on efficient vocabulary learning for middle school students and wanted to know if

[r]

저탄소원의 전력 생산량을 기존 탄소중립 공표안보다 확대해야 하며 이를 위해 재생 에너지, 원자력 활용과 수소 생산 확대를 언급함.. ■ 보고서에서는 2050

등록제 민간자격 운영 사실을 특정한 등록 기관에 비치된 장부에 2. 기재하는 행위로서 등록한 경우에만 민간자격으로