• 검색 결과가 없습니다.

HDR Video Synthesis Using Superpixel-Based Motion Estimation

N/A
N/A
Protected

Academic year: 2022

Share "HDR Video Synthesis Using Superpixel-Based Motion Estimation"

Copied!
2
0
0

로드 중.... (전체 텍스트 보기)

전체 글

(1)

2018 년 한국방송·미디어공학회 하계학술대회

슈퍼픽셀 기반의 움직임 추정을 이용한 HDR 동영상 합성

Tu Van Vo, 이철 부경대학교

tuvovan@pukyong.ac.kr, chullee@pknu.ac.kr

HDR Video Synthesis Using Superpixel-Based Motion Estimation

Tu Van Vo, Chul Lee Pukyong National University

요 약

We propose a novel high dynamic range (HDR) video synthesis algorithm using alternatively exposed low dynamic range (LDR) videos. We first develop a superpixel-based illumination invariant correspondence estimation algorithm. Then, we propose a reliability weight to further improve the quality of the synthesized HDR frame. Experimental results show that the proposed algorithm provides high-quality HDR frames compared to conventional algorithms.

1. 서론

High dynamic range (HDR) imaging is becoming more popular and getting more interests from both academia and industry owing to its faithful representation of the scene.

Since most of the researches on HDR imaging has focused on high-quality synthesis of still images, HDR video generation receives relatively less attention. Although a few specialized HDR camera systems have been developed to capture HDR videos, they are too expensive to be used in practical applications. Instead, software-based approaches have been proposed to be employed in consumer devices.

A common approach to synthesize an HDR video is to capture multiple low dynamic range (LDR) frames taken with different exposure times and then merge them using alignment techniques. The main challenge of this approach is that it requires accurate motion estimation between consecutive frames to obtain high-quality HDR frames. Kang et al. [1]

employed an optical flow-based solution to estimate small motions. However, it may fail to estimate accurate correspondences when complex motions exist. Kalantari et al.

[2] employed a patch-based correspondence estimation scheme, but its computational complexity is too high to be used in practical applications.

In this work, we propose a robust HDR video synthesis algorithm using a superpixel-based illumination invariant motion estimation technique to address the aforementioned motion problems between adjacent frames. We further reduce

artifacts caused by inaccurate correspondence estimation in the synthesized frame by developing a weighted averaging algorithm. Experimental results show that the proposed algorithm provides higher synthesis qualities than the conventional algorithms [2], [3].

2. 제안하는 기법

To acquire an HDR video using a conventional video camera, we first capture input frames with alternating exposure times for each frame. Suppose that we are given three consecutive frames −1, , and +1. Since each frame has limited dynamic range, they may contain poorly-exposed regions, i.e., under- or over-exposed. We first define those regions in the reference frame using a threshold value th. Specifically, a pixel ( ) at location is defined as well- exposed, if th< ( ) < 255 − th.

Since an HDR frame is synthesized by merging alternatively exposed LDR frames, correspondences between the frames should be accurately estimated. We first estimate correspondences between frames −1 and , then merge them to synthesize an HDR frame as done in [4]. However, pixels in a poorly-exposed region contain no information, resulting in failure in correspondence estimation. Therefore, in a poorly-exposed region, instead of estimating the motion field from −1 to directly, we estimate the motion field from −1 to +1 and halve it. In this work, we propose a superpixel-based motion estimation scheme, assuming that, if a pair of pixels in −1 and +1 match, their neighboring pixels are likely to match as well. Specifically, for a pixel at , its neighboring pixels are defined in the intersection of

This work was supported by the National Research Foundation of Korea (NRF) grant funded by the Korea government (MSIT) (No. NRF- 2016R1C1B2010319).

90

(2)

2018 년 한국방송·미디어공학회 하계학술대회

superpixel [5] and the support window . Let −1 and

+1 denote the DASC descriptors [6] of the pixels ’s in −1 and +1, respectively. Then, we define two costs −1=

1

−1∈( −1x−1) ( , ) × −1 and +1=

1

+1+ ∈( +1+ +1+) ( + , + ) × +1+ , where the weight ( , ) and the normalizing parameter are given by ( , ) = exp (−‖ − ‖

1 ) with 1= 8 and = ∑ ∈( ∩ ) ( , ), respectively. The optimal motion vector is obtained to minimize the overall cost

( ) = ‖ −1+12. (1) The motion vector obtained by minimizing the cost in (1) may be inaccurate in some regions, resulting in artifacts in the synthesized HDR frames. To improve the accuracy of the correspondence, we quantify the reliability of the motion vector using the structural similarity between the reference frame and the warped frame −1, which is warped from

−1 to using the estimated motion vector. Specifically, the reliability is defined as

−1( ) = SSIM( ( ), −1( )) (2) where SSIM(⋅,⋅) is the structural similarity index between two blocks centered at . If −1( ) is small, the corresponding motion vector is unreliable so as to assign a small weight to

−1( ) in the HDR synthesis. As −1( ) increases, −1( ) contributes more to the final HDR frame. Therefore, the reliability map effectively reduces artifacts in the synthesized HDR frame caused by inaccurate correspondence estimation.

Then, the final weight ( ) is obtained by multiplying the reliability term and luminance term ( ) [7]

( ) = ( ) × ( ). (3) Finally, we synthesize the HDR frame by weighted averaging the irradiance maps in each frame

ℎ ( ) = ∑ ( ) × ( ( ))

1

=−1

(4)

where ℎ ( ) is the HDR radiance value at location , and (. ) denotes the camera response function [7].

3. 실험 결과

We evaluate the performance of the proposed HDR video synthesis algorithm on the alternatively exposed LDR dataset in Fig. 1. The parameter th is fixed to 12, and the size of the block is × . The number of superpixels in a frame is 2000.

Fig. 1 compares the synthesized results of the 31th frame of the Student sequence obtained by Kalantari et al.’s algorithm [2], Li et al.’s algorithm [3], and the proposed algorithm. In Figs. 1(b) and (c), Kalantari et al.’s algorithm and Li et al.’s algorithm provide artifacts, e.g., missing object structures, due to inaccurate correspondence estimation in poorly-exposed regions. However, we see that the proposed

algorithm in Fig. 1(d) provides the highest-quality result by estimating accurate correspondences using the superpixel- based motion estimation.

4. 결론

We proposed a robust algorithm to synthesize HDR videos from alternatively exposed LDR videos. We first estimated the correspondences between two frames taken with different exposure times via a superpixel-based illumination invariant motion estimation. We also developed a reliability weight to further improve the synthesis performance by reducing artifacts. Experimental results on real datasets show that the proposed algorithm can estimate accurate motions between frames and obtain high-quality HDR videos.

5. 참고문헌

[1] S. B. Kang, M. Uyttendaele, S. Winder, and R. Szeliski, “High dynamic range video,” ACM Trans. Graphics, vol. 22, no. 3, pp.

319-325, Jul. 2003.

[2] K. N. Kalantari, E. Shechtman, S. Darabi, D. Goldman, P. Sen, and D. Goldman, “Patch-based high dynamic range video,” ACM Trans. Graphics, vol. 32, no. 6, 2013.

[3] Y. Li, C. Lee, and V. Monga, "A maximum a posteriori estimation framework for robust high dynamic range video synthesis," IEEE Trans. Image Process., vol. 26, no. 3, pp. 1143-1157, 2017.

[4] T. V. Vo and C. Lee, “Robust HDR video synthesis using superpixel-based illumination invariant motion estimation,” to appear in Proc. IEEE Int. Conf. Consum. Electron.-Asia, Jun. 2018.

[5] R. Achanta, A. Shaji, K. Smith, A. Lucchi, and P. Fua, "SLIC superpixels compared to state-of-the-art superpixel methods,"

IEEE Trans. Pattern Anal. Machine Intelli., vol. 34, no. 11, pp.

2274 - 2282, May 2012.

[6] S. Kim, D. Min, B. Ham, S. Ryu, M. N. Do, and K. Sohn, “DASC:

Dense adaptive self-correlation descriptor for multimodal and multi-spectral correspondence,” in Proc. IEEE Conf. Comput. Vis.

Pattern Recognit., Jun. 2015, pp. 2103-2112.

[7] P. Debevec and J. Malik, "Recovering high dynamic range radiance maps from photographs," Proc. ACM SIGGRAPH, pp.

369-378, Aug. 1997.

[8] LiU HDRv Repository, Available: http://hdrv.org/.

(a) (b)

(c) (d)

Fig. 1. Comparison of HDR video synthesis results for the Student sequence [8]. (a) Ground-truth, (b) Kalantari et al.’s algorithm [2],

(c) Li et al.’s algorithm [3], and (d) proposed algorithm.

91

참조

관련 문서

다양한 번역 작품과 번역에 관한 책을 읽는 것은 단순히 다른 시대와 언어, 문화의 교류를 넘어 지구촌이 서로 이해하고 하나가

[r]

노인 만성폐쇄성폐질환 환자들은 인플루엔자에 대한 합 병증의 위험이 높아서 매년 인플루엔자 예방접종을 받아야 한다. 폐렴구균 예방접종은 논란이 되기는 하지만 평생 1회

노인에서의 수면 무호흡으로 인한 장기적 합병증이 중년 환자와 비교하여 차이가 있는지 여부는, 노인의 수면 무호 흡을 하나의 특정질환으로 간주할 것인지 아니면 노인의

혼합제를 사용하여도 천식 조 절이 되지 않는 경우 경구 약제인 류코트리엔 조절제 그리 고 (또는) 테오필린을 추가하여 사용한다. 류코트리엔은 효 과 면에서는

12) Maestu I, Gómez-Aldaraví L, Torregrosa MD, Camps C, Llorca C, Bosch C, Gómez J, Giner V, Oltra A, Albert A. Gemcitabine and low dose carboplatin in the treatment of

Ross: As my lawfully wedded wife, in sickness and in health, until

• Potential energy: molecular interaction, bonding energy, atomic energy  For ideal gases, potential energy can be ignored.. because they have