• 검색 결과가 없습니다.

Evaluate the distance to the affine functions of Boolean vector functions

N/A
N/A
Protected

Academic year: 2022

Share "Evaluate the distance to the affine functions of Boolean vector functions"

Copied!
10
0
0

로드 중.... (전체 텍스트 보기)

전체 글

(1)

EVALUATE THE DISTANCE TO THE AFFINE FUNCTIONS OF BOOLEAN VECTOR FUNCTIONS

Y. O. SUNG, J. H. JEONG AND C. H. SEO

1. Introduction

The Data Encryption Standard(DES)[3,8,9] was developed by an IBM team around 1974 and adopted as a national standard in 1977.

Since that time, many cryptanalysts have attempted to find shortcuts for breaking the system.The entire algorithm was published in the Fed- eral Register[l]' Boolean functions from GF(2n) into GF(2) are com- monly found in cryptographic applications. Usually they are designed to be nonlinear and to produce a balanced output, and often one finds the additional requirement that from knowledge of the output bit it should not be possible to reliably guess one or more input bits. Con- sider for example DES, where the S-box[2,7] in the encryption system is originally defined as a device with n-bit input and n-bit output so that 2n output vectors are some permutation of2n input vectors. Thus the Boolean vector function of an n-bit inputjn-bit output S-box can be considered as an injective function,whose domain and range are both GF(2n). We can extend the definition of S-box for the case when the number n of input bits is greater than the number m of output bits.

Then the Boolean vector function is from GF(2n) to GF(2m) with the additional requirement that the output should be balanced. In fact, in DES, the S-boxes with 6-inputj4-output are employed. We can even further extend the definition of S-box by removing the condition that the output should be balanced. The balanced Boolean vector function is generally nonlinear. The linear function is easy to attack via dif- ferential cryptanalysis. It is generally believed that the further away from linear functions a balanced Boolean vector function is the more difficult it is to attack the S-box. In the sense, we are interested in the

Received November 26, 1994.

1991 AMS Subject Classification: 46S20, 94A60.

Key words: Hamming distance, Boolean vector function.

(2)

Boolean vector function which is furthest away from linear Boolean functions[4] inview of the Hamming distance[5].

We have outlined the criteria[6,13] that IBM used to design the S- boxes and permutation.These criteria were developed specifically to thwart attacks based on differential cryptanalysis. A measure of the success of IBM's approach to the design of the S-boxes and permuation is the enormous amount of chosen plaintext (in excess of 1015 bytes) required by Biham and Shamir's attack[6].

Inthis paper, the lower and the upper bounds on the maximum dis- tance to the Affine functions of Boolean vector functions and balanced Boolean vector functions from GF(2n) to GF(2m) are derived. Insec- tion 2, we state some necessary notation and definition,extend distance concept to Boolean vector functions, in section 3,the Boolean vector functions ofS~boxesin DES are investigated and the distances of the Boolean vector functions of the 32 4-bit input/4-bit output S-boxes are evaluated. In the appendix, the functions are tabulated and com- puted the distance to Affine functions of each of the 32 Boolean vector functions.The implementation has been done on a SUN SPARC-2 sta- tion using C-language.

2. Preliminaries

We state here some necessary notation and definition in order to compute the distance to Affine functions and construct DES like S- boxes. LetEfldenote the addition over GF(2n) or the bit-wise exclusive- or, I . Idenotes the cardinality of a set. wtO denotes the Hamming weight function,Cfn) denotes an n dimensional vector with Hamming weight 1 at the i-th position.

DEFINITION 2.1. The Hamming weight of a word Co,Cl,'" ,On-I, Ci E GF(2) is defined as the sum of the Hamming weights of its digits, where

{ 0 if wt(C·) =

z 1 if

Letf and 9 be Boolean functions from GF(2n)which is n-dimensional vector space over GF(2) to GF(2): GF(2n) ~ GF(2). The Hamming

(3)

distance dU,g) between two Boolean functions f and 9 is defined as

(1) d(j,g) = wt[f+g].

Now, we can extend this distance concept to Boolean vector func- tions, namely F and G. Let F and G be m-dimensional vectors such that each component ofF and G is same Boolean function in GF(2n).

Then the distance between F and G can be defined as (2) d(F,G) = I{x E GF(2n)lF(x)+G(x) =1= O}I

To find the distance from a Boolean function f to Affine functions is an interesting subject in the study of Boolean functions. Here the distance DJ from a function f to Affine functions is defined as

(3) DJ = min{d(j,£)}.

lEL,.

whereLn is the set ofallAffine Boolean functions inGF(2n). Similarly we can define the Hamming distance to Affine functions of a Boolean vector function. Let F : GF(2n ) -+ GF(2m) be a Boolean vector function. The Hamming distance DF of a Boolean vector function F to Affine functions is defined as

(4) DF = min{d(F,L)},

L

where L

=

(£1,£2,'" ,£m) is an Affine Boolean vector function such that £i ELn , for all i = 1,2, ... ,m.

For a Boolean vector function F = {!I,12, . .. ,fm}, the definition in(4) can be equivalently expressed as

(5)

m

DF = 2n - max{wt[I1(Ji+Ri)]}'

L .

1=1

Now, let us introduce the parameter d(n,m). Let Fn,m be the set of allBoolean vector functions from GF(2n) to GF(2m ). Thenden,m) is defined as

(6) d(n,m)

=

max {DF}.

FEF,.,m

(4)

Especially when m = 1, to find outd(n,1) is equivalently to find out the maximum of the weights of coset leaders in a first-order Reed-Muller code[5]R(1,n) of length2n.

For even values ofn, the Boolean functions which achieves d(n, 1) are called bent functions[4] and it is known that

1 n-2

(7) den,1) = 2n- - 2-2- , n is even.

For odd values ofn, we have derived the following lower bound.

THEOREM 2.1. d(2m +1, 1) ~ 2d(2m, 1).

Proof. Let x = (XbX2"" ,X2m+l) be in GF(22m+l

). Set y =

(X2,'" ,X2m+t). Then any Boolean function j(x) in GF(22ffl+l) can be expressed as

j(x) =Xlg(y) +hey),

and any Affine function.e(x)in GF(22m+l ) can be expressed as .e(x) = ClXl+£1(y),

where 9 and h are Boolean functions in GF(22m),Cl is in GF(2) and .el is an Affine Boolean function in GF(22m).

THEOREM 2.2. d(2m

+

1,1) ::; 22m - 2m- l - 2.

Proof. H d(2m +1,1) > 2(22m- l - 2m- l) = 2d(2m,1) then it

means that for any j\ffine Boolean function £1 in GF(22m) and any number Cl, there exists some functions 9 and h such that

(8) wt[h+£1] +wt[g+h+£1 +Cl] > 2(22m- l - 2m- l).

Since inequality(8) should be satisfied for any Cl, we may say that for any.e1

(9) wt[h+£1] ::; 22m-l

(10) wt[g+h+£1 +Cl] ::;22m-1.

From(8)-(10), it Can be concluded that Dh >22m- l - 2m andDh+g >

22m- l - 2m. And in this case, the maximum of the LHS of (8) is no greater than (22m-l

- 2m- 1-1)+(22m- 1- 1). Therefore d(2m+ 1,1) ::;22m - 2m- l

- 2. Q.E.D.

(5)

3. Evaluate Distance to the Affine Function of Boolean Vector Functions with Balance Property

As mentioned in the introduction, the Boolean vector functions of a S-box must satisfy the balance property. Let F: GF(2n ) - t GF(2m)

be the Boolean vector function of an S-box with n-bit input and m-bit output. The balance property implies that the number ofxEGF(2n)

such that F(x) = cis exactly2n - m for everyC E GF(2m ).

Inthis section, we are going to investigate into t.he distance from the Boolean vector functions to Affine functions. Let's define the maximum distance parameter D(n, m).

DEFINITION 3.1.

D(n,m) = maxF{minL{d(f,f)}},

whereF is a balanced Boolean vectorfunction from GF(2n)toGF(2m).

As in section 2, let us first consider the case when m = 1. For the even values of n, it is obvious that there exists some balanced Boolean function which is 2n;2 apart from some bent functions.

Thus we have (11)

Note that whenm

=

2,the equality in(11)is achieved, i.e. D(4,1)

=

4.

Therefore we can have the following lower bound.

THEOREM 3.1. Forevenn and m ::; ~,

D(n,m) 2:: (2m _1)(2n- m - 2~-m+I).

The proof of Theorem 3.1 is given by showing the existence of a specific Boolean vector function of S-box which yields (2m -1)(2n- m

_

2~-m+l) as the distance to Affine function. Particularly (for m = ~), here is the construction.

First, construct m bent functions using Nyberg's Construction meth- od[7]. i.e., let Xl = (XI,X2,'" ,X2m-I) and X2 = (X2,X4,oo. ,X2m),

then the Nyberg's m bent functions h(x), i = 1,2, ... , m is given by

liCx) = Ai-I(XI)0 X2·

(6)

where A is the state transition function of a linear feedback shift reg- ister of length m with a linear feedback polynomial.

Next, modify the each Ji as 9i = Ji + hiwhere

m-I

hi(x) = X2i

IT

(X2j+l +1).

j=O

Then wt[hlx)] = 2m- l for all i,and in the position where hlx) = 1,

Xl is automatically0, so that Ilfi)

=

O. Thus the modified function

9i is balanced and in fact G

=

(91,·· . , 9m) is also balanced,and finally it is easy to see that

deL

m Ci(9i+X2i)) = 22m-l - 2m

i=l

for any combination (Cl,C2,··· ,cm). Thus DG = (2m -1)(2m - 2).

EXAMPLE 3.1. When Tt is equal to 4, m is equal.to 2, and

11

=

XlX2 +Xa X4, 12

=

XlX2+X2 Xa+XlX4·

Xl 0 0

o

0 0 0 0 0 1 1 1 1 1 1 1 1

X2 0 0 001 1 1 1 0 0 0 0 1 1 1 1 Xa 0 0 1 100 1 1 0 0 1 1

o

0 1. 1

X4 0 1 010 1 0 1 0 1 0 1

o

1 0 1

11 000 1 0 0 0 1 0 0 0 1 1 1 1 0

12 000 000 1 1 0 1 0 1 1

o

0 1

91 000 1 1 1 0 1 0

o

0 1 111 0

92

o

100 0 1 1 1 0 1 0 1 100 1

91 +X2 000 100 1

o

0

o

0 1 000 1

92 +X4

o

0 0 1 0 0 1

o

0

o

0 0 110 0

91+92+X2+X4

o

0 0 0 0 0 0

o

0

o

0 1 110 1

For oddnand m < ';,we can have a similar lower bound as follows.

THEOREM 3.2. Forn

=

2k+1 and m ~ k,

D(n,m) ~ (2m _1)(2n- m _ 2~-m).

(7)

In DES, there are 8 different S-boxes employed and they are all with 6-bit input and 4-bit output. But in fact, each Boolean vec- torfunction Ji of an S-box Si can be decomposed of FiC'x) = (xo +

l)(xs +l)GiO(Y)+(xo+l)X s G il(Y)+xo(xs+1)Gi2(Y)+xoxsGiaOl)

where Y

=

(Xl, X2, Xa, X4) and each of the four functions Gij(y),

j = 0,1,2,3, can also serve as a balanced Boolean vector function of the 4-bit inputj4-bit output S-box. In other words, the 6-bit inputj4- bit output S-boxes in DES are the combination of four 4-bit inputj4- bit output S-boxes. In the appendix, the functions are Gij, i

=

5,6,

j = 0,1,2,3 tabulated. Also we computed the distance Dij to Affine functions of each of the 32 Boolean vector functions Gij.

4. Appendix

The Boolean vector function FiCx) of the i-th S-box Si is given by

I

GiO(XI,··· ,X4) Xo = 0 Xs = 0

Fi(x) = Gil(XI,'" ,X4) Xo = 0 Xs = 1 G i2 (XI,··· ,X4) Xo = 1 Xs = 0

Gia(XI,··· ,X4) Xo = 1 Xs = 1

and Gij

=

{9ijO, 9ijI, 9ij2, 9ija}. The following is the table of9ijk and Dij = DG;j'

9soo =Xl +X4 +XlXa+X2 X a+Xa X 4+XlXa X4+X2 X aX 4

9S0l =X2+Xa+X4 +XlXa

9S02 =1+Xl+Xa +X4 +XlX4 +X2Xa+X2X4 +XaX4 +XlX2Xa

+XlX2 X 4+XlXaX4 +X2 X a X 4

9S03 =X2+XlXa +XlX4 +X2 X 4+Xa X 4+XlX2 X 4+X2 X aX 4

Dso =7

9Sl0 =1+Xl+X2 +Xa+XlX2 +XaX4 +XlX2 X 4+XlXaX4

9S11 =1+Xa+X4 +XlX2+XlXa +X2X4 +Xa X 4+XlX2Xa +X2 X a X 4

9Sl2 =1 +Xl +X2 +XlXa+X2X4 +XaX4+XlXaX4 9Sla =Xl +X4 +X2 X a+Xa X 4 +XlX2X4 +XlX3 X 4

DSl =8

(8)

9520 =XI +X2 +X2 X 3 +X3 X 4 +XIX2 X 3

9521 =1+x2 +X3 +X4+xlx2 +XIX3 +x3X4

+XIX2 X 3

+

X2 X 3 X 4

9522 =XI+X2 +X4 +XIX2 +XIX3 +XIX2X4 +Xlx3 X 4

9523 =XI +x3 +xlx2+x2 X 4 +Xlx2 x 3+xIX3 X 4

D52 =8

9530 =1+Xl +X2 +XIX4 +X2X4+X3 X 4 +XIX2 X 3

+XIX2 X 4 +XIX3 X 4 +X2 x 3 x 4

9531 =XI +X3 +XIX2 +X2 X 3 +X2 X 4 +XIX2 X 3

9532 =1+x2 +X3 +X4+XIX2 +XIX4 +XIX2 X 4

9533 =1+Xl +X3 +X4+XIX3 +XIX2X3 +XIX2 X 4

D53 =8

9600 =1+Xl +X4 +XIX2 +X2X3 +X3X4 +X2 X 3 X 4

9601 =1+Xl +X2 +X3+x4 +xlx3+x2 x 4

9602 =X3+xlx2 +x2 X 4 +XIX2 x 4 +XIX3 x 4

9603 =x2 +X4 +xlx2+XIX3 +x2X3+xIX2X3 +X2X3 X 4

D60 =9

9610 =1+Xl +X2 +X3 +XIX2 +X2 X 4 +Xlx2 X 3

9611 =xI +x2+X3 +X4 +xlx3 +x2 X 4 +xlx3 x 4 +X2 X 3 X 4

9612 =1+x3 +xIX2 +XIX4 +X2x4 +X3X4+XIX2X4 +xIX3 X 4

9613 =X2+X4 +XIX2 +XIX3 +X3 X 4 +XIX3 X 4

D61 =8

9620 =1+Xl +X2 +XIX2 +X2X3 +X2 X 4+X3X4 +X2X3X4

9621 =X1 +X3 +X4 +XIX2 +X1 X a +X2X4 +xa X 4

+XIX2 X 4 +XIX3 X 4

9622 =XI +X2 +X3 +X4 +XIX2X4

9623 =1+X2 +X4 +XIX2+XIX3 +X2X4 +X3X4 +XIX2X3

D62 =8

(9)

9630 =Xl +X2+XlX3 +x2x4x3X4+XlX2X4 +XlX3 X 4

9631 =1+Xl +x2+x3

-+

X4 +xlx3+xlx2 x 3

9632 =Xl +X3 +x4 +xlx4 +x2 x 4 +xlx3 x 4

9633 =Xl+x2+x4 +x2 x 4 +x3 x 4

D63 =8

References

1. Data Encryption Standard Federal Information Processing Standards Publica- tion, 46, National Bureau. of Standards, 1977.

2. A. F. Webster and S. E. Tavares, On the design of S-boxes, Proc. of Crypto.

'86 (1987), 523-534.

3.A. Shamir" Onthe security of DES, Proc. of Crypto. '85 (1986), 280-283.

4. O. S. Rothaus, Onbent junctions, J. of Combin. Theory, 20A (1976), 300-305.

5. D. C. Gorenstein and N. Zierler,A class of Error-Correcting Codes in GF(pm), J.Soc. Indus. Appl. Math. 9 (1961), 207-214.

6. E. Biham and A. Shamir" Differential Cryptanalysis of DES-like Cryptosys- tems, J. Cryptology 4 (1991), 3-72.

7.K.Nyberg,Perpect Nonlinear S-boxes, Proc. of Eurocrypto. '86 (1986),284-286.

8. C. H. Meyer, Ciphertext/Plaintext and Ciphertext/Key Dependence vs Number of Rounds for the Data Encryption Standard, National Computer Conference (1978), 11-19.

9. E. Biham and A. Shamir, Differential Cryptanalysis of the Full 16-round DES, Proc. of Crypto. '92 (1992), 487-496.

10.E. Biham, New Types of Cryptanalytic Attacks Using Related Keys, Techni- cal Report #753, Computer Science Department, Technion-Istrael Institute of Technology, 1992.

11. E. Biham" personal communication, 1993.

12.K. Kim, S. Park, and S.Lee,Reconstruction of S2DES S-boxes and their Immu- nity to Differential Cryptanalysis, Proc. of the 1993 Korea-Japan workshop on Information Security and Cryptography, Seoul, Korea 24-26 (1993), 282-29l.

13.W. Meier, O. Staffelbach, Nonlinerarity Criteria for Cryptographic Functions, Proc. of Eurocrypt '89 (1989.), 549-562.

Y. O. Sung

Department of Mathematics Education Kongju University

Chungnam 314-701, Korea

(10)

J. H. Jeong

Department of Mathematics Seonam University

Namwon 590-170, Korea C. H. Seo

Institute Defence of Information Systems Seoul 130-650, Korea

참조

관련 문서

Some most frequent words in the text collection forms the dimension words for the vector dimension space and the frequency count of bigrams constructed from the sentences are used

Bain, Functions of the Alloying Elements in Steel, American Society for Metals, 1939, p. Bain, Functions of the Alloying Elements in Steel, American Society for

디지털 시스템에서 시스템을 나타내는 부울식은 논리 게이트 로 구현되기 때문에 부울식을 간략화 한다는 것은 부울식을 나타내는 논리 게이트 수의 감소시켜 비용적인

• The sequences of Gaussian functions play a special role in connection with transforms in the limit.. • The properties which make the Gaussian functions

The index is calculated with the latest 5-year auction data of 400 selected Classic, Modern, and Contemporary Chinese painting artists from major auction houses..

9.6 Calculus Review: Functions of Several Variables 9.7 Gradients of a Scalar Field. Directional Derivative 9.8 Divergence of a Vector Field. 9.9 Curl of a Vector Field 9.9 Curl

In this way, the origin of capital gain tax which is to fulfill the security of internal revenue and property holdback functions has a lot of natural

The eigenvalue problem can be used to identify the limit state of the process, in which the state vector x is reproduced under the multiplication by the.. stochastic matrix