자료구조와 알고리즘

(1)

자료구조와 알고리즘

충북대학교

소프트웨어학과 이충세

(2)

문의 사항

Email : csrhee@cbnu.ac.kr 전화 : 043-261-2237

강의 자료 : web.cbu.ac.kr/~algor 참조

(3)

알고리즘 : 주제



기초 자료구조



알고리즘의 분석 기법



트리와 그래프



점화식



알고리즘 기법

분할정복, 동적 프로그램, Greedy 방법

(4)

1.1 데이터와 데이터 객체

1.1.1 데이터의 개념

* 정보(information) : 어떤 목적을 가진 활동에 직접 또는 간접적 인 도움을 주는 지식

* 데이터(data) : 정보라는 제품의 생산에 입력되는 원재료 자원->

사실(fact), 개념(concept), 명령(instruction)의 총칭

* 데이터 형(data type) : 프로그래밍 언어의 변수(variable)들이 가질 수 있는 데이터의 종류

- a collection of objects and a set of operations that act on

(5)



FORTRAN : 정수형(integer), 실수형(real),논리형 (logical),복소수형(complex), 배정도 실수형(double precision) 등



PL/I : 문자형(character)



SNOBOL : 문자열(character string)



LISP : 리스트(list) 또는 s-수식(s-expression)



Pascal : 정수형, 실수형, 논리형(boolean), 문자형

(6)

1.1.2 데이터와 전산학 전산학의 연구범위

* 데이터를 저장하는 기계(machine)

* 데이터 취급에 관련된 내용을 기술하는 언어 (language)

* 원시 데이터로부터 생성할 수 있는 여러 종 류의 정제된 데이터를 기술하는 기초 내용 * 표현되는 데이터에 대한 구조

(7)

* 데이터 객체 : 데이터 형의 실체를 구성하는 집합(set) 의 원소(element)

* 변수(variable) : 데이터 객체의 명칭

* 정수형 데이터 객체 : D = { …,-2, -1, 0, 1, 2, …}

* 길이가 30자 이내인 영문자 문자열(alphabetic character string)의 데이터 객체 :

D = { ‘’, ‘A’, …, ‘Z’, ‘AA’, …}

1.1.3 데이터 객체

(8)

1.2 데이터 구조의 개념

1.2.1 데이터 구조

* 데이터 구조(data structure)란:데이터 객체의 집합 및 이들 사이의 관계를 기술-> 데이터 객체의 원소에 적용될 연산(operation)이 수행되는 방법을 보여줌 - the organized collections of data to get more

efficient algorithms

(9)

* 추상적 데이터 형(abstract data type) D: 데이터 구조의 정의 영역(domain) 의 집합

F: 함수(function) 집합 A: 공리(axiom) 집합 d = (D, F, A)

- a data type that is organized in such a way that the specification of the objects and the specification of the operations on the objects is separated from the representation of the objects and the

implementation of the operations

1.2.2 데이터 구조의 표현

(10)

Abstract data type

structure Natural_Number is

objects: an ordered subrange of the integers starting at zero and ending at the maximum integer (INT_MAX) on the computer

functions:

for all x, y ∈ Nat_Number; TRUE, FALSE ∈ Boolean

and where +, -, <, and == are the usual integer operations Nat_NoZero() ::= 0

Boolean Is_Zero(x) ::= if (x) return FALSE else return TRUE

Nat_No Add(x,y) ::= if ((x + y) <= INT_MAX) return x + y else return INT_MAX

Boolean Equal(x, y) ::= if ( x == y) return TRUE else return FALSE

Nat_No Successor(x) ::= if ( x == INT_MAX) return x else return x + 1

Nat_No Subtract(x, y) ::= if ( x < y) return 0 else return x – y

(11)

데이터 객체 natno={0,1,2,…}일 때,

* 함수 집합 F의 정의 ZERO( ) -> natno

ISZERO(natno) -> boolean SUCC(natno) -> natno

ADD(natno, natno) -> natno EQ(natno, natno) -> boolean

(12)

*

공리 집합 A의 정의

ISZERO(ZERO)::=true

ISZERO(SUCC(x))::=false ADD(ZERO, y)::=y

ADD(SUCC(x),y)::=SUCC(ADD(x,y))

EQ(x, ZERO)::=if ISZERO(x) then true else false EQ(ZERO, SUCC(y))::=false

(13)

1.3 데이터 구조의 영역

1.3.1 데이터 구조론

* 데이터 구조론: 데이터 처리 시스템에서 취급 하는 데이터 객체들을 기억 공간 내에 표현하 고 저장하는 방법과, 데이터 상호간의 관계를 파악하여 수행할 수 있는 연산과 관련된 알고 리즘을 연구하는 학문

(14)



선형 구조(linear structure, sequential structure): 데 이터 상호간에 1:1의 관계를 가진 것-> 연접 리스트, 연결 리스트, 스택, 큐 등



비선형 구조(non-linear structure): 데이터 상호간에 1:n 또는 n:m의 관계를 가진 것-> 트리, 그래프



파일 구조(file structure): 레코드의 집합체로 이루어 지는 특수한 형태의 데이터 구조

1.3.2 데이터 구조의 형태

(15)

- 처리 능률 : 어떤 데이터 구조를 선택함에 따라 영향을 크게 받음 - 데이터 구조를 선택하는 기준

* 데이터의 양

* 데이터의 활용 빈도 * 데이터의 갱신 정도

* 데이터 처리를 위하여 사용 가능한 기억 용량

* 데이터 처리 시간의 제한

* 데이터 처리를 위한 프로그래밍의 용이성

1.3.3 데이터 구조의 선택

(16)

알고리즘과 프로그램 2

2.1 알고리즘 2.2 프로그램

2.3 프로그램의 분석

(17)

2.1 알고리즘

 An example of software development in action



Specification : a precise description of the problem



Design : formulating the steps to solve the problem



Implementation : the actual source code to carry out the design

 2.1.1 알고리즘의 개념

*

주어진 문제 해결을 위하여 실행되는 명령어들의 유한 집합 -> 데이터 변환을 위해서 적용

되는 잘 정의된 방법

(18)

Specification and implementation

 Celsius ToFahrenheit

public static double celsiusToFahrenheit(double c)

Convert a temperature from Celsius degree to Fahrenheit degree Parameters:

c – a temperature in Celsius degrees Precondition:

c >= -273.16.

Returns(Postcondition):

the temperature c converted to Fahrenheit degree Throws: IllegalArgumentException

indicates that c is less than the smallest Celsius temperature(-273.160) Public static double celsiusToFahrenheit(double c)

{

final double MINIMUM_CELSIUS = -273.16 if ( c < MINIMUM_CELSIUS)

throw new IllegalArgumentEXception(“Argument “ + c + “ is too small.”);

return (9.0/5.0) * c + 32;

(19)

알고리즘이 만족 해야 할 조건



입력(input)



출력(output)



명확성(definiteness)



유한성(finiteness)



효과성(effectiveness)

(20)

2.1.2 알고리즘과 전산학



컴퓨터: 데이터 변환을 위해 사용하는 수단 -

> 알고리즘



전산학의 연구영역

* 컴퓨터 시스템의 기계구성과 조직 형태 * 언어의 설계와 번역

* 알고리즘의 기초(추상적 컴퓨터 모델) * 알고리즘의 분석

(21)

알고리즘

 Finite number od instruction to solve a problem : well defined

 The theoretical study of computer- program

 performance and resource usage.



What’s more important than performance

(22)

알고리즘의 고려사항



What’s more important than performance?

• modularity • correctness

• maintainability • functionality

• robustness • user-friendliness

• programmer time • simplicity

• extensibility • reliability

(23)

2.2 프로그램



2.2.1 프로그램 작성 절차

* 요구 사항(requirement)의 정의 * 설계(design)

* 평가(evaluation)

* 상세화 및 코딩(refinement and coding)

* 검증(verification)

(24)

2.2.2 프로그램의 작성 요령



하향식 방법(top-down method)



논리적 모듈(module)



부 프로그램(subprogram)을 사용



순차(sequencing), 분기(branching),

반복(repeating)등 세가지 표준 논리 제어 구조



GOTO문의 사용을 피한다



연상 이름(mnemonic-name) 문서화

(25)

2.2.3 순환 기법



순환(recursion): 자기 자신을 호출하도록 구 성하는 것 -> 프로그램을 단순화하고 이해하 기 용이 할 경우가 많음



순환 프로그램의 작성단계 * 순환관계 파악

* 알고리즘 구성

* 프로그램 언어로 기술

(26)

The tree-recursive process

Fibo 4

Fibo 3 Fibo 2

Fibo 2 Fibo 1 Fibo 1 Fibo 0

Fibo 1 Fibo 0

1 1 0

Int Fibo(int n) {

if (n <= 1) return n;

(27)

2.3 프로그램의 분석



2.3.1 프로그램의 평가 기준 * 바른 수행

* 정확한 동작 * 설명서

* 부 프로그램 * 해독

(28)

다른 평가 기준



프로그램의 수행에 필요한 기억장치의 용량 -> 비교 적 용이



프로그램의 연산시간 -> 매우 어려움 * 기계

* 기계의 명령어의 집합 * 명령어의 수행시간 * 컴파일러의 번역 시간

-> 정확한 판정이 불가능 -> 명령문의 수행 빈도수를 계산

(29)

2.3.2 분석 기법

 Big oh 표시법 : 두 함수T(n)과 f(n)이 있을 때, n>=n⁰을 만족하 는 모든 n에 대하여 |T(n)| <= C*|f(n)|인 양의 상수 C와 n⁰가 존재하면 T(n)=O(f(n))이라고 정의

 Big omega notation : 만일 양의 상수 C와 n0가 존재하여 n>=n0인 모든 n에 대해서 |T(n)|>=C*|f(n)|이 성립하면 T(n)=Ω(f(n))으로 나타냄

 양의 상수 C¹, C², n⁰가 존재하여 모든 n>=n⁰에 대해서

C¹|f(n)|<=|T(n)|<=C²|f(n)|이 성립하면 T(n)=Θ(f(n))이라 한 다.

(30)

3. How to Measure Complexities

How good is our measure of work done to compare algorithms ?

How precisely can we compare two

algorithms using our measure of work ?

Measure of work

# of passes of a loop

# of basic operations

(31)

Big Oh Notation



(32)

32

For solving a problem

P

, suppose that two algorithms

A ₁

and

A ₂

need 10

⁶ n

and 5

n

basic operations, respectively, i.e.

basic operations

A ₁

10

⁶ n

A ₂

5

n

Which one is better ?

What are their time complexities ?

(33)

Now, suppose that algorithms

A

₁ and

A

₂ need the following amount of time:

time complexity

A

₁ 10⁶

n

O(

n

)

A

₂

n

² O(

n

²) Which one is better ?

A

₁ is better if

n

> 10⁶

A

₂ is better if

n

< 10⁶

Then, why time complexity ? Suppose that

n

→ ∞

Then,

n

² grows much faster than 10⁶

n

. i.e.,

Under the assumption that

n

_{→ ∞}

A

is better than

A

∞

∞ →

→

n

n 6

2

lim10

10⁶

n

T

(

n

)

(34)

34

N = {0, 1, 2, …}

N

⁺

= {1, 2, 3, …}

R = the set of real numbers

R

⁺

= the set of positive real numbers R

^*

= R

⁺

∪ {0}

f

: N → R

^*

and

g

: N → R

^* g

is:

Ω (

f

):

g

grows at l t f t

f

(35)

Definition: Let

f

: N → R^*. O(

f

) is the set of functions,

g

: N →R^* such that for some

c

∈ R⁺ and some

n

₀ ∈ N,

g

(

n

) ≤

c

⋅

f

(

n

) for all

n

≥

n

₀.

O(

f

) is usually called

“big oh of

f

”, “oh of

f

”, or “order of

f

”.

Note: In other books,

g

(

n

) = O(

f

(

n

)) if and only if there exist two positive constants

c

and

n

₀ such

that |

g

(

n

)| ≤

c

⋅ |

f

(

n

)| for all

n

≥

n

₀

Under the assumption that

f

: N → R^* and g: N → R^*, two definitions have

a minor difference.

How to check :

) ( '

) ( lim ' )

( ) lim (

rule s Hopital' L'

By : note

) ( O ) ,

( )

lim (

^*

n f

n g n

f n g

f g

c n c

f n g

n n

n

∞

→

∞

→

∞

→

=

∈

⇒

∈

= R

What is it ?

(36)

36

Definition: Let

f

: N → R

^*

. Ω(

f

) is the set of functions,

g

: N →R

^*

such that for some

c

_∈ R

⁺

and some

n ₀

_{∈ N,}

g

(

n

) ≥

c

⋅

f

(

n

) for all

n

_≥

n ₀

. Ω(

f

) is usually called

“big omega of

f

” or “omega of

f

”.

Note: In other books,

g

(

n

) = Ω(

f

(

n

c

and

n ₀

such that |

g

(

n

)| ≥

c

⋅ |

f

(

n

)| for all

n

_≥

n ₀

How to check :

) ( f

g

∈Ω

⇒

or

) 0

( )

lim ( → >

∞

→

c

n f

n g

n

∞

∞ →

→ ( )

) lim (

n f

n g

n

(37)

37

Definition: Let

f

: N → R

^*

. θ(

f

) = O(

f

) ∩ Ω(

f

).

θ(

f

) is usually called

“theta of

f

” or “order of

f

”.

Note: (Alternative definition of θ(

f

))

g

(

n

) = θ(

f

(

n

c ₁

and

c ₂

and

n ₀

such that

c ₁

⋅ |

f

(

n

)| ≤ |

g

(

n

)| ≤

c ₂

⋅

|

f

(

n

)| for all

n

_≥

n ₀

How to check :

) (

) , (

)

lim ( c c g f

n f

n g

n

= ∈

⁺

⇒ ∈ θ

∞

→

R

(38)

Definition: Let

f

: N → R

^*

. o(

f

) = O(

f

) - θ(

f

).

o(

f

) is usually called “little oh of

f

”.

How to check :

n

- 5,

n

,

n ²

, 10

¹⁰ n ²

+ 10

⁵ n

+ 10

⁹

,

n ²

- 9

∈ o(

n ³

)

⇒

∞ →

→ 0

) (

) lim (

n f

n g

n

g

(

n

) ∈ o(

f

(

n

))

(39)

Definition: Let

f

: N → R

^*

. ω(

f

) = Ω(

f

) - θ(

f

).

ω(

f

) is usually called “little omega of f”

How to check :

⇒

∞

→

( )

) lim (

n f

n g

n

∈

− +

+ 10 10 , 9

10 ,

¹⁰ ² ⁵ ⁹ ³

2

n n n

n

ω(n)

Note:

g ( n ) ∈ o ( f ( n )) ⇔ f ( n ) ∈

ω(g(n))

g

(

n

) = ω(

f

(

n

))

(40)

How important is time complexity ?

(41)

(42)

∆

T

= a fixed amount of time

S

= the maximum input size that a

particular algorithm can handle within

∆

T

Suppose that our computer speed increases by a factor

t

.

f

(

S

) = the number of steps executed in

∆

T

by the old computer.

f

(

S _new

) = the number of steps executed in

∆

T

by the new computer

t S

S t S

n S

tS S

n

S S

n

S S

n T

n

t new

2 log

) ( log

) (

4

3 4

3 2

2 2

1 1

+

(43)

43

Properties of O, Ω, θ

Let

f

,

g

,

h

: N →R

^*

. Then,

P1: (Transitivity)

f

_{∈ O(}

g

) and

g

_{∈ O(}

h

) ⇒

f

_{∈ O(}

h

) How about Ω, θ, o, ω ?

P2:

f

_{∈ O(}

g

) ⇔

g

_{∈ Ω(}

f

)

f

_{∈ o(}

g

) ⇔

g

_{∈ ω(}

f

) Duality

P3:

f

_∈θ(

g

) ⇔

g

_∈θ(

f

)

P4: θ is an equivalence relation P5: O(

f

+

g

) = O(max{

f

,

g

})

How about Ω, θ, o, ω ? [Proof] Exercise. (Homework)

(44)

44 Transformability

Definition: (

Lower bound

)

A lower bound in time complexity for solving a problem

P

is said to be time required to solve

P

.

(Alternatively) A (tight) lower bound in time complexity for solving a problem is the least amount of time to solve the most difficult instance of the problem.

Definition: (

Upper bound

)

An upper bound in time complexity for

(45)

45

Definition: (

Transformability

)

A problem

A

is said to be transformable into a problem

B

if the following is true:

(1) The input to the problem

A

can be converted into a suitable

input to the problem

B.

(2) The problem

B

is solved.

(3) The output of the problem

B

is transformed into a correct

solution of the problem

A

.

B

A ∝

_τ₍_n)

(46)

46

Theorem: (

Lower bound via transformability

)

If a problem

A

requires

L

(

n

) time and if , then the problem

B

requires at least

L

(

n

) - O(τ(

n

)) time.

[Proof] Exercise. (Homework) (Hint:

by contradiction)

Theorem: (

Upper bound via transformability

)

B A ∝

_τ₍_n)

B

A ∝

_τ₍_n)

(47)

Example: (Lower bound via transformability)

A

: Given a set of

n

real numbers, sort them. Ω (

n

log

n

)

B

: Given a set

L

of vertical segments and a pair of points

p

and

q

, find the shortest path between

p

and

q

avoiding

L

.

(1) input transform (

A

_→

B

) O(

n

) input to

A

input to

B

{

x

₁,

x

₂, …,

x

_n} {

l

₁,

l

₂, …,

l

_n}

x

_i ⇒ ((

x

_i, -

c

), (

x

_i, +

c

)) =

l

_i

p q

O(

n

)

c

) 0 , (

} { max

} { min

max min max

min

c x

q

c x

p

x x

i i i i

+

=

−

=

(48)

(2) Suppose that

B

is solved

x

_i ⇒ ((

x

_i, -

c

), (

x

_i, +

c

)) =

I

_i

Solution: (

x

₁ -

c

, 0), (

x

₁,

c

), (

x

₄, c), (

x

₃,

c

), (

x

₂, c), (

x

₅, c), (

x

₅ +

c

, 0)

p

q

(3) Output transform O(

n

)

Drop

p

and

q

from the solution.

Read off each

x

-coordinate.

What do you obtain ?

 The sorted list The solution of

A

.

p q

l

₁

l

₄

l

₃

l

₂

l

₅

B A ∝

_O₍_n)

n n log

) log (

) ( O ) log

( n n n n n

L = Ω − = Ω

∴

(49)

49 Searching an Ordered List

BIN

: Given a value

x

and an array of

L

containing

n

entries in

the non- decreasing order, find an index of

x

in the list or,

if

x

_∉

L

, return 0 as the answer.

SEQ

:

L

is an unordered array.

Is

BIN

=

SEQ

? No !!!

A

solves

SEQ

_⇒

A

solves

BIN

⇐

(50)

Review

What is a lower bound in time

complexity for solving

SEQ

? Ω(

n

)

Any optimal algorithm for solving

SEQ

? Yes, sequential search.

What is time complexity for the sequential search algorithm ? O(

n

) in the worst case,

in average case

n n q

q ( 1 ) 2

) 1

( + + −

(51)

51

What is a lower bound in time complexity for solving

BIN

? Ω(

n

). Why ?

What if considering only searching time ?

BIN-D

: Given

L

&

x

, is

x

_∈

L

?

L

= (

x ₁

,

x ₂

, …,

x _n

)

W

= {(

x ₁

,

x ₂

, …,

x _n

) |

x _i < x _i+1 i

and

x _j =x

for some

j

}

∀

(52)

Alternative proof: Adversary argument (from book)

α= {

A

|algorithm

A

solves

BIN-D

} Take any

A

_{∈ α.}

Let

d _A

= the depth of decision tree for

A.

A lower bound is Ω(log

₂ n

) ⇔

d _A

_{≥ log}

₂ n

∴ We need to show

d _A

≥

log ₂ n

!!!

(53)

N _A

= # of nodes in the decision tree for

A

Then,

d _A

≥

log ₂ N _A

. Why?

∴

If

N _A

_≥

n

, then

d _A

≥

log ₂ n

, Claim :

N _A

_≥

n.

(54)

In order to prove that

N _A

_≥

n

,

let each node of the decision tree be labeled

i

ⁱ

⇔

x

:

L

(

i

) at the node.

x:L(j) j

x:L(i)

(55)

Suppose that

N _A

<

n

for a

contradiction. Then, there is no node which is labeled “

i

” for some 1≤

i

_≤

n

.

Let

S

= (

S ₁ , S ₂ , … , S _i , … , S _n

) be a sorted list,

where

(1)

S _j

<

S _j+1

in all 1≤

j

≤

n

(2)

S _i

=

x

Now, we make two list

L ₁

and

L ₂

:

55 1,2 for

)) ( , ) 2 ( ), 1 ( (

L

_k

= L

_k

L

_k

 L

_k

n k = x

≡

(56)

) ( ...

...

) 2 ( ) 1 (

...

) ( ...

...

) 2 ( ) 1 (

...

2 1

n i

L

n i

L

S S

S

S S

ⁱ ⁿ

↓

≡ x

x i

L

₂( ) ≠

x i

L

₁( ) =

(57)

By construction,

(1)

L ₁

and

L ₂

are sorted,

(2)

L

₁(

j

) =

L

₂(

j

) ∀

j

_≠

i,

and

(3)

L

₁(

i

) =

x

and

L

₂(

i

) ≠

x

Since no node in the decision tree is labeled “

i

”, the algorithm

A

gives the same answer for different input

L

₁ and

L

₂.

∴ The algorithm

A

is wrong #

∴

N _A

_≥

n

_⇒

d _A

_{≥ log}₂

N _A

_{≥ log}₂

n

(58)

58

Can we modify the sequential search algorithm for obtaining a better time bound to solve

BIN

?

(1) (2) (3) ………... (n)

L

Now,

L

is in the non-decreasing order !!!

⁽

ⁱ

⁾

x

………...

 



⇒

<

⇒

>

≠

stop )

(

continue )

(

) (

i L x

Improvement is here !!

(59)

Compare

x

to the every

k ^th

elements in

L

!!!

( k ) (2 k )

x >

L

((

r

-1)

k

) x <

L

(

rk

)

case worst in the

s comparison )

1 (  

  +

− k

k n

Why ?

elements )

1 ( k −

(60)

How about the average case ?

index = 0 ⇔

x

∈

g

_i for some

i

I

_i = {input

L

|

x

=

L

(

i

)},

i

= 1, 2, …,

n I

_n+i = {input

L

|

x

∈

g

_i},

i

= 1, 2, …,

n

+1

t

(

I

_j) = # of comparisons for

I

_j,

j

= 1, 2, …, 2

n

+1

p

(

I

_j) = probability that

L

∈

I

_j,

j

= 1, 2, …, 2

n

+1

…….….

(1) (2) ………. (i)

 



⇒

<

⇒

>

≠

stop )

(

continue )

(

) (

i L x

 

 



+

=

>

≤

<

−

=

<

=

1 if

)

(

2 if ) ( )

1 (

1 if

)

1 (

n i n

L x

n i i

L x i

L

i L

x g

_i

g

1

g

₂

g

₃

g

_i

g

_n₊₁

) 1 (

L L(2) L(i−1) L(i) L(n)

...

... ...

...

(61)

Assumption:

(1)

P

(

x

∈

L

) = 1/2

(2)

x

is equally likely to be in

L

(

i

) if

x

∈

L

(3)

x

is equally likely to be in

g

_i if

x

∉

L

2 1

1

1 1

2 1 2 1

1 1

2 2( 1) 2( 1)

1 1

1 1 1

( 1) 1

4 4 2 1

( ) ( )

( ) ( ) ( ) ( )

( ) ( ) ( ) ( ) ( ) ( )

( )

n

i i

i

n n

i i n i n i

i i

n n

i i n i n i n n

i i

n n

n

n n n

i i

n n n

P I t I

P I t I P I t I

P I t I P I t I P I t I

A

i i

+

=

+

+ +

= =

+ + + +

= =

+ +

= =

+ +

+

+ +

=

= ⋅ + ⋅ +

= + + −

∑

∑ ∑

g

1

g

₂

g

₃

g

_i

g

_n₊₁

) 1 (

L L(2) L(i−1) L(i) L(n)

...

... ...

...

(62)

62 Binary Search

Divide and Conquer !!!



Binary search tree



Array

) (log O ) (

) 1 (

2 ) ( )

(

2 1

n n

T

c T

T n c n T

=

⇓

=

 

  +

=

2 ) ( 1

:  

  + n L

x

(63)

Average-case Analysis

I

_i = {input

L

|

x

=

L

(

i

)},

i

= 1, 2, …,

n I

_n+i = {input

L

|

x

∈

g

_i},

i

= 1, 2, …,

n

+1

t

(

I

_j) = # of comparisons for

I

_j,

j

= 1, 2, …, 2

n

+1

p

(

I

_j) = probability that

L

∈

I

_j,

j

= 1, 2, …, 2

n

+1

Assumptions:

(1)

p

(

I

_j) = 1/(2

n

+1) (2)

n

= 2^k - 1,

k

_≥1

# of comp. # of node 1 2

⁰

2 2

¹

3 2

²

….…………..

k-1

……....……….….…………..

………...

2 1

1

2 1

1

( ) ( ) ( )

1 ( )

2 1

n

i i

i

n i i

A n P I t I

n t I

+

=

+

=

= ⋅

= +

∑

L

x

∈

(64)

1 2

) 1 (

1 2

) 1 (

1 2 2

1 ) ( )

1 ( 2

) 1 1 (

2 1

) ( ) ( )

(

1

1 2

1 1

1 2

1 1

2

1

+ + +

+ +

= −

 

 



 ⋅ + +

= +

 

 



 +

= +

⋅

=

∑

=

−

+ +

=

= +

=

n n k n

k

n k n i

I t I

n t I

n t

I t I p n

A

k k

i

n

n i

i n

i

i n

i

i n

i

i i

1 2

Now, n =

^k

− 1 2 = +

∴

^k

n

) 1 (

log

₂

+

= n

k

1

See P22 in the textbook

자료구조와 알고리즘