• 검색 결과가 없습니다.

Variance Reduction: Techniques

N/A
N/A
Protected

Academic year: 2022

Share "Variance Reduction: Techniques"

Copied!
31
0
0

로드 중.... (전체 텍스트 보기)

전체 글

(1)

SNU/NUKE/EHK SNU/NUKE/EHK

Variance Reduction: Techniques

몬테카를로 방사선해석 (Monte Carlo Radiation Analysis)

Notice: This document is prepared and distributed for educational purposes only.

(2)

Sums of Random Variables

• Draw N samples x1, x

2, …, xN from f(x) and define the following linear combination

where cn’s are real constants and gn(x) are real-valued functions.

• The mean and variance of G are

review

(3)

SNU/NUKE/EHK

• Consider the special case where gn(x) = g(x) and cn=1/N:

• The expectation value for G is

--- The expectation value for the average (not the average itself) of N observations of the random

variable g(x) is simply the expectation value for g(x).

- The simple average is an unbiased estimator for the mean.

review

Sums of Random Variables (cont.)

(4)

Sums of Random Variables (cont.)

• Also, the variance is given by

--- The variance in the average value of N samples of g(x) is smaller than the variance in the original

random variable g(x) by a factor of N.

review

(5)

SNU/NUKE/EHK

Estimation Mean and Variance

• To estimate the integral

• Define the function

• Now define the random variable V(x) as

(6)

Estimation Mean and Variance (cont.)

• The expected value of V is then

which gives E(V) = tan-1(5) – tan-1(0) ≒ 1.373400.

• The mean value of V2 is, on the other hand,

(7)

SNU/NUKE/EHK

Estimation Mean and Variance (cont.)

• The variance of the random variable V is then var(V) = E(V2) – [E(V)]2 ≒ 2.028042, and

the standard deviation is, therefore,

(8)

Improvement of Certainty in Estimate

• The purpose of MC calculation is usually to obtain an estimate of the expected value of a random variable.

• The usual measure of the certainty of the result is the standard deviation of the estimate obtained.

• The improvement in the certainty of an estimate of the mean can be made by increasing the number of samples:

which might cost too much time.

• Other methods for improvement in certainty?

(9)

SNU/NUKE/EHK

• A measurement system can be accurate but not precise, precise but not accurate, neither, or both. For example, in an experiment that contains a systematic error,

- increasing the sample size generally increases precision but does not improve accuracy;

- eliminating the systematic error improves accuracy but does not change precision.

• A measurement system is called valid if it is both accurate and precise.

- bias (systematic error) vs. random error

Certainty: Accuracy vs. Reliability (consistency)

review

(10)

Stratified Sampling

• The range over which the independent variable is sampled is divided into strata and each stratum is sampled

separately.

• Exercise

- Estimate the integral of

(11)

SNU/NUKE/EHK

Stratified Sampling: exercise

• Step 1. Define the pdf f1(x) for the first stratum

and the pdf f2(x) for the second stratum

(12)

Stratified Sampling: exercise (cont.)

• Step 2. Define the corresponding random variables as

so that

which leads to

or I = E(V1) + E(V2). (9)

(13)

SNU/NUKE/EHK

Stratified Sampling: exercise (cont.)

• Step 3. Calculate the expected value E(V1) in the first stratum and E(V2) in the second stratum:

(14)

Stratified Sampling: exercise (cont.)

• Step 4. Calculate the variances V1 for the first stratum and V2 for the second stratum:

Hence,

2

(15)

SNU/NUKE/EHK

Stratified Sampling: exercise (cont.)

• Step 5. The value of the original integral is

• Step 6. Compare the mean and the variance E(V1) + E(V2) ≒ 1.373400 as (3) var[E(V

1)+E(V

2)]

= var[E(V1)] + var[E(V2)]

= var(V

1)/n

1 + var(V

2)/n

2 and the standard deviation is

• ex. var[E(V1)+E(V2)] ≒4.8144 x 10-5 with n1=5000 and n2=5000

(16)

Demonstration of the Stratification Property:

nomenclature

• Conditional probability P{X=x│B}

≡ the probability of X=x given condition B.

• E(V│A) ≡ the expected value of random variable V(x) over the set A of xs.

• “~” indicates the subtraction of one set from another.

(17)

SNU/NUKE/EHK

Demonstration of the Stratification Property

• Theorem

Given a variable function V(x) for x in a set R of real numbers such that E(V│A) in a subset A and E(V│R) and var(V│R) in the set R exist and are finite, if

E(V│A) does not equal E(V│R) then the variance of the estimate of E(V│R) using the strata defined by A and R~A is less than the variance of the estimate

without stratification.

(18)

Demonstration of the Stratification Property :

Proof

• Let f(x) be the pdf of x for V(x) in R and let

• Define

(19)

SNU/NUKE/EHK

Demonstration of the Stratification Property :

Proof (cont.)

• Then

E(V│A) = 1 = 1 = and

E(V│R~A) = 2 = ~ 2 = ~

whereas E(V│R) = = .

And thus,

E(V│R) = a·E(V│A) + (1-a)·E(V│R~A) or = a 1 + (1-a) 2.

• Since E(V│A) ≠ E(V│R), defining = E(V│A) - E(V│R) gives E(V│R~A) = E(V│R) - .

Therefore, if > 0, E(V│A) >E(V│R) then E(V│R~A) <E(V│R) and vice versa.

(20)

Demonstration of the Stratification Property : Proof (cont.)

• The variance of V is given by

• The variance of the mean estimate of V or , based on n samples is then [var(V)]/n, or

f(x)dx + f(x)dx .

f(x)dx + f(x)dx (13)

(21)

SNU/NUKE/EHK

Demonstration of the Stratification Property : Proof (cont.)

• In an unbiased sampling scheme with total n samples for these two strata, nP(A) or na samples are used for stratum A and nP(R~A) or n(1-a) are used for stratum R~A. Therefore, the variance of [a 1 + (1-a) 2] is

var[a 1 + (1-a) 2]

= a2 · var( 1) + (1-a)2 · var( 2)

= a2

na · var( 1) + (1−a)2

n(1−a) · var( 2)

= a

n 1 2 1 + (1−a)

n ~ 2 2 2

= 1

n 1 2 + 1

n ~ 2 2 (14)

(22)

Demonstration of the Stratification Property : Proof (cont.)

• From (13) and (14),

var[ ] = var[a 1 + (1-a) 2] if = 1 = 2.

var[ ] = 1

n 2 + 1

n ~ 2 (13)

var[a 1 + (1-a) 2]

= 1

n 1 2 + 1

n ~ 2 2 (14)

(23)

SNU/NUKE/EHK

Demonstration of the Stratification Property : Proof (cont.)

• When E(V│A) ≠ E(V│R), which is the assumption of the theorem, consider a function h(v) defined by

• Taking the derivative of h(v) with respect to v gives

and dh(v)/dv = 0 implies v = 1 .

(24)

Demonstration of the Stratification Property : Proof (cont.)

• Note that the second derivative of h(v) with respect to v is 2a > 0. Then, v = 1 as the minimum value for h(v),

which means that if E(V│A) ≠ E(V│R) then

1 2 < − 2 .

Similarly, if E(V│R~A) ≠ E(V│R) then

2 2

~ < ~2 .

• Therefore, var[a 1 + (1-a) 2] < var[ ].

(25)

SNU/NUKE/EHK

Optimal Choice of the Number of Samples

• Let w denote the variance of the mean of the stratified sample, i.e.,

made with total n samples, with m samples for the first stratum and n-m samples for the second stratum.

This gives

• By setting dw(m)/dm = 0, the optimum value m is

(26)

Biased Sampling Scheme

• In the general case of biased sampling, the samples are picked from a modified pdf.

• Consider the random variable V(x) with pdf f(x). The expected value of V is then given by

• Given a pdf g(x) such that g(x) > 0 everywhere, one has

(27)

SNU/NUKE/EHK

Biased Sampling Scheme (cont.)

• The random variable V’ associated with the pdf g(x) is V’(x) = V(x)·f(x)/g(x) and E(V’) = E(V).

• The proper selection of g(x) can result in the variance being significantly reduced.

- Select g(x) such that

V’(x) = V(x)·f(x)/g(x) = c ≡ Constant. (3) Then,

and

(28)

Biased Sampling Scheme (cont.)

• Therefore, in theory, it is possible to choose a modified pdf such that the sampling process gives an answer with zero variance if one can find g(x) that has the same shape as V(x)f(x) to satisfy (3).

(29)

SNU/NUKE/EHK

Biased Sampling Scheme: example

(30)

Biased Sampling Scheme: example (cont.)

• Select a pdf for biasing

- Considering a straight line from (0,1) to (1,2.718) is somewhat the same shape as V(x)f(x), define a modified pdf as

• The MC results by the biased sampling are

- the integral value = 1.71823566 - standard deviation of random variable = 0.06507303 - standard deviation of the integral estimate = 0.00006507 (7)

(31)

SNU/NUKE/EHK

Biased Sampling/ weighting factor

• A new random variable V’ is defined for a modified pdf g(x) as

V’(x) = V(x)·f(x)/g(x) to give E(V’) = E(V).

• The original random variable V(x) is weighted by V’(x)/V(x) = f(x)/g(x) ≡ weighting factor

to compensate the modification of pdf from f(x) to g(x).

참조

관련 문서

• Time stamps that are used to stamp documents with the time, and a turret clock used in a clock tower. • &#34;program clock&#34; is a timer that can be programmed with

A study on Creative Teaching for

Ejectors is a fluid transportation device for which a principle is used that high-pressure fluid are spouted through driving pipe and the pressure

(Attached to a noun) This is used to indicate the topics, to compare two information or to emphasize the preceding noun in a sentence.. (Attached to a noun) This is used

The juice of the Dragon Blood Tree looks like blood and the Desert Rose looks like an elephant ’ s leg... I will see Niagara Falls there and enjoy pure nature in

Many people are trying to save the earth with special days like “ Earth Day, ” “ Buy Nothing Day ” and “ Meatout Day.. Then you should remember these special days and

These samples are measured by the Agilent 8453 UV-visible dissolution testing system using standard cells or, for more convenience, with a sipper. In both cases the operator

• The values of P r and v r listed for air in Table A-17 are used for calculating the final temperature during an isentropic process.. EX 4) Isentropic Compression of Air in