⳾㧒 䢮 䢮ἓ㠦㍲ ❻⩂┳㦚 䢲 䢲㣿䞲 㦮⹎₆ 㧊 㧊⹎㰖 㠊⏎䎢㧊㎮㦚 㦚 㥚䞲 㧊 㧊⹎㰖 䌲⁎ ㍺ ㍺Ἒ ῂ ῂ䡚
㔶㥺⹎*, 㞞㰚䡚**, 㧚☯䡗***
*䢎㍲╖䞯ᾦ 䅊䜾䎆Ὃ䞯ὒ
**㩲㭒╖䞯ᾦ ἓ㡗㩫⽊䞯ὒ
***䢎㍲╖䞯ᾦ 䅊䜾䎆㩫⽊Ὃ䞯
e-mail : dhim@ hoseo.edu
Design and Implementation of Deep-Learning-Based Image Tag for Semantic Image Annotation in Mobile Environment
YoonMi Shin*, Jinhyun Ahn**, Dong-Hyuk Im***
*Department of Computer Engineering, Hoseo University
**Department of Management Information Systems, Jeju National University
***Division of Computer and Information Engineering, Hoseo University
殚 檃檃
⳾㧒㦮 ₆㑶 㩚ὒ ㏢㎲⹎❪㠊 ㌂㣿㦮 㯳Ṗ⪲ 㑮㠜㧊 Ⱔ㦖 Ⲗ䕆⹎❪㠊 䆮䎦䁶✺㧊 ㌳㎇♮ἶ 㧞┺. 㧊⩂䞲 Ⱔ㦖 㟧㦮 䆮䎦䁶 㭧㠦㍲ ㌂㣿㧦Ṗ 㤦䞮⓪ 㧊⹎㰖⯒ 䣾㥾㩗㦒⪲ 㺔₆ 㥚䟊 㦮⹎ ₆
㧊⹎㰖 Ỗ㌟㦚 㧊㣿䞲┺. 㧊 Ỗ㌟ ₆⻫㦖 㧊⹎㰖㠦 㦮⹎ 㧞⓪ 㩫⽊✺㦚 㧊㣿䞮㡂 ㌂㣿㧦Ṗ 㺔ἶ 㧦 䞮⓪ 㧊⹎㰖⯒ 㩫䢫䞮Ợ 㺔㦚 㑮 㧞┺. ⽎ 㡆ῂ㠦㍲⓪ ⳾㧒 䢮ἓ㠦㍲ 㧊⹎㰖Ṗ Ṗ㰞 㑮 㧞⓪ 㦮
⹎㩗 㩫⽊⯒ 㠊⏎䎢㧊㎮ 䞮ἶ 㧊㢖 ▪㠊 ⳾㧒㠦 㧞⓪ 㧊⹎㰖㠦 䛣㎇䞲 㠊⏎䎢㧊㎮㦚 㥚䟊 ❻⩂
┳ ₆㑶㦚 㧊㣿䞮㡂 ┺㟧䞲 䌲⁎✺㦚 㧦☯ ㌳㎇䞮☚⪳ ῂ䡚䞮㡖┺. 㧊⩝Ợ ㌳㎇♲ 㠊⏎䎢㧊㎮ 㩫⽊
✺㦖 㦮⹎㩗 ₆ 䌲⁎⯒ 䐋䟊 RDF 䔎Ⰲ䝢⪲ 䢫㧻♲┺. SPARQL 㰞㦮㠊⯒ 㧊㣿䞮㡂 㦮⹎ ₆ 㧊⹎
㰖 Ỗ㌟㦚 䞶 㑮 㧞┺.
1. 昢嵦
⳾㧒 㔲㧻㦮 ╂ὒ ┺㟧䞲 ㏢㎲ ⹎❪㠊㦮 䢲㣿 㯳Ṗ⪲ 㧎䞮㡂 ╖⨟㦮 㧊⹎㰖 䆮䎦䁶Ṗ 㯳Ṗ䞮ἶ 㧞
┺. ➆⧒㍲ 㧊⩂䞲 Ⱔ㦖 㟧㦮 㧊⹎㰖⯒ 㩖㧻䞮ἶ ὖ Ⰲ䞮⓪ ộ㧊 㭧㣪䞮┺[1,2]. 㧊⯒ 㥚䟊 Ⱔ㦖 㟧㦮 㧊⹎
㰖 䆮䎢䁶 ㏣㠦㍲ 䣾㥾㩗㦒⪲ Ỗ㌟ 䞶 㑮 㧞⓪ 㧊⹎
㰖 㠊⏎䎢㧊㎮ ₆⻫㧊 㩲㞞♮㠞┺[3,4]. 㧊⹎㰖 㠊⏎䎢 㧊㎮ ₆⻫㦖 㧊⹎㰖Ṗ Ṗ㰖ἶ 㧞⓪ 㦮⹎ 㩫⽊✺㦚 㧊
⹎㰖㢖 䞾℮ 㩖㧻䞮㡂 㧊⹎㰖⯒ ┺㟧䞮Ợ 䚲䡚䞮⓪
₆⻫㧊┺. 㧊 ₆⻫㦚 䐋䟊 Ⱔ㦖 㟧㦮 㧊⹎㰖 ◆㧊䎆
㏣㠦㍲ ㌂㣿㧦Ṗ 㤦䞮⓪ 㧊⹎㰖⯒ 㩫䢫䧞 㺔㦚 㑮 㧞
┺.
㧊㩚 㡆ῂ㠦㍲⓪ ⳾㧒 䢮ἓ㠦㍲ 㠑㦖 㠊⏎䎢㧊㎮
㩫⽊⯒ 㧊⹎㰖㢖 䞾℮ 㩖㧻䞲┺[5]. ⳾㧒 䢮ἓ㠦㍲
㧊㣿䞶 㔲 㧊⹎㰖㠦 ╖䞲 ㌗䢿 㩫⽊㢖 ㌂㣿㧦Ṗ 㧛⩻
䞲 䌲⁎ 㩫⽊✺㦚 㧊㣿䞮㡂 㠊⏎䎢㧊㎮㦚 䞲┺. ⳾
㧒 䢮ἓ㠦㍲ 㧦☯㦒⪲ 㠑㠊㰚 ㌗䢿 㩫⽊⓪ 㡾䏾⪲㰖 㠎㠊㧎 RDF 䔎Ⰲ䝢㦮 ⁎⧮䝚 䎢㧊䎆⪲ ⼖䢮䞮㡂 㠊
⏎䎢㧊㎮ 䞲┺. ㌂㣿㧦Ṗ 㰗㩧 㧛⩻䞲 䌲⁎✺㦖 DBPedia⯒ 㧊㣿䞮㡂 RDF 䔎Ⰲ䝢⪲ ⳾◎Ⱇ 䞲┺.
⽎ 㡆ῂ㠦㍲⓪ ⳾㧒 ₆₆㠦㍲㦮 㠊⏎䎢㧊㎮ Ỗ㌟ 㔲㓺䎲 ㍺Ἒ[5]⯒ ₆㦒⪲ RNN(Recurrent
Neural Network)[6]㦮 䡫䌲 㭧 One to Many 㔳㦮 ❻
⩂┳ ⳾◎㦚 㧊㣿䞲┺. 㧊 㔳㦖 㧊⹎㰖 䃷㎮ ㌳㎇
㠦㍲ Ⱔ㧊 ㌂㣿♮⓪ ⳾◎⪲㍲ 䞮⋮㦮 㧛⩻Ṩ㦒⪲ 㡂
⩂ Ṳ㦮 㿲⩻Ṩ㦚 㠑㦚 㑮 㧞┺. 㧊 ⳾◎㦚 㿪Ṗ䞮㡂 㧛⩻ 㧊⹎㰖㠦 ὖ⩾♲ 䌲⁎✺㦚 㧦☯㦒⪲ ㌳㎇♮⓪ 䡫䌲⪲ 䢫㧻䞮㡖┺.
2. 洢橎穢 柢枪癢
⽎ 㡆ῂ㠦㍲⓪ ⳾㧒 䢮ἓ, 㧊⹎㰖 㠊⏎䎢㧊㎮ ₆ 㑶㠦 CNN(Convolutional Neural Network)ὒ Multimodal RNN(Multimodal Recurrent Neural Network)㦚 ἆ䞿䞲
⳾◎[7]㦚 㿪Ṗ⪲ ㌂㣿䞲┺.
(⁎Ⱂ 1)㧊⹎㰖 ὖ⩾ 䌲⁎ 㧦☯ ㌳㎇ ὒ㩫 2019년 추계학술발표대회 논문집 제26권 제2호 (2019. 11)
- 895 -
(⁎Ⱂ 1)㦖 㧛⩻ 㧊⹎㰖㢖 ὖ⩾♲ 㧦☯ 䌲⁎ ㌳㎇ ὒ 㩫㦚 ⽊㡂㭖┺. Ⲓ㩖 㧊⹎㰖Ṗ 㧛⩻Ṩ㦒⪲ ✺㠊ṖỢ
♮Ⳋ CNN 㦚 㧊㣿䞮㡂 㧊⹎㰖㦮 䔏㰫㦚 㿪㿲䞮Ợ ♲
┺. 㿪㿲♲ 䔏㰫ⱋ㦖 1 㹾㤦㦮 䡫䌲⪲ ⼖䡫♮⓪◆ 㧊 䡫䌲Ṗ ₆㫊 CNN ⳾◎㦮 㢚㩚 㡆ἆ Ἒ䂋㧊┺. 㢚㩚 㡆ἆ Ἒ䂋㦖 RNN 㦮 䧞✶ ⩞㧊㠊㦮 㽞₆Ṩ㦒⪲ ✺㠊 ṖỢ ♲┺. RNN 㦮 㼁⻞㱎 ㎖㦮 㧛⩻Ṩ㦖 ⶎ㧦䏶䋆㦒
⪲ <s>Ṗ ✺㠊ṖỢ ♲┺. 㧊⯒ 㔲㧧㦒⪲ 㧊⹎㰖㢖 ὖ
⩾♲ 䌲⁎✺㦚 㡞䁷䞮Ợ ♲┺. <e>⧒⓪ ⶎ㧦䏶䋆㧊 ⋮ 㡺Ợ ♮Ⳋ 㡞䁷㦖 㫛⬢♲┺. 㧊⩝Ợ 㠑㠊㰚 䌲⁎✺ὒ
⳾㧒 䢮ἓ㠦㍲ 㔲Ṛ, 㧻㏢ 䌲⁎✺₢㰖 䐋䞿♮㠊 㧊
⹎㰖 䌲⁎ 㩫⽊✺㦚 Ṭ⓪┺.
㥚 ὒ㩫㦚 㿪Ṗ䞲 㩚㼊 㔲㓺䎲 ῂ㫆⓪ (⁎Ⱂ 2)㠦㍲
⽊㡂㭖┺.
(⁎Ⱂ 2)㩚㼊 㔲㓺䎲 ῂ㫆
㌂㣿㧦Ṗ ⳾㧒 䢮ἓ㠦㍲ 㤦䞮⓪ 㧊⹎㰖⯒ 㧛⩻ 䞲
┺. 㧛⩻ 㧊⹎㰖㠦 ╖䟊 Tag Process ὒ㩫㦚 䐋䟊 㔲Ṛ, 㥚䂮 㩫⽊㢖 ❻⩂┳ ⳾◎㦚 䐋䟊 䌲⁎ 㩫⽊✺㦚 㧦☯
㦒⪲ ㌳㎇♲┺. 䌲⁎ ◆㧊䎆 SPO ⼖䢮 ⳾✞㠦㍲ ㌳㎇
♲ 䌲⁎ 㩫⽊✺㦚 㧊㣿䞮㡂 㠊⏎䎢㧊㎮ RDF 䔎Ⰲ䝢䡫 䌲⪲ 䢫㧻䞲┺. 㠊⏎䎢㧊㎮ 䞲 ◆㧊䎆⓪ MySQL ὒ Jena TDB 㠦 㩖㧻♲┺. SPARQL 㰞㦮 㔲 㩖㧻 ♲ 㧊⹎
㰖 ID ⯒ 㺔₆ 㥚䟊 RDF 䔎Ⰲ䝢㦚 ㌂㣿䞮㡂 ⍺㧚✲
⁎⧮䝚 䡫䌲⪲ 㧎◇㕇 䞮㡂 Jena TDB 㠦 㩖㧻䞲┺.
3. 柢枪癢 割笊
(⁎Ⱂ 3)㌂㰚 㾂㡗 (⁎Ⱂ 4)㧊⹎㰖 䌲⁎ Ⰲ㓺䔎 (⁎Ⱂ 3)㦖 ⳾㧒 ₆₆⪲ ㌂㰚 㹣₆ ⡦⓪ ㌂㰚 ❇⪳
㦚 䞮⓪ 䢪Ⳋ㧊┺. 㧊 ὒ㩫㦚 䐋䟊 ㌂㰚㦚 㧛⩻䞮Ợ
♮Ⳋ (⁎Ⱂ 4)㻮⩒ ❻⩂┳ ὒ㩫㦚 䐋䟊 ㌂㰚㠦 ╖䟊
㧦☯㦒⪲ ὖ⩾♲ 䌲䋂 㩫⽊ Ⰲ㓺䔎⯒ ⽊㡂㭖┺. 㿪Ṗ
⪲ ㌂㣿㧦Ṗ 㑮☯㦒⪲ 㧊⹎㰖㠦 ╖䞲 䌲⁎⯒ 㧛⩻䞶 㑮☚ 㧞┺.
4. 冶嵦
⳾㧒 䢮ἓ㦮 㩚㦒⪲ 㧊⹎㰖 䆮䎦䁶Ṗ 㯳Ṗ䞮㡂 㦮⹎㩗㧎 㧊⹎㰖 Ỗ㌟㧊 㭧㣪䟊㪢┺. ⽎ 㡆ῂ㠦㍲⓪
⳾㧒 䢮ἓ㠦㍲ ❻⩂┳ ⳾◎ 㭧 CNN ὒ Multimodal RNN 㦚 㧊㣿䞮㡂 㧊⹎㰖㠦 ╖䞲 䌲⁎ 㩫⽊✺㦚 㧦☯
㦒⪲ ㌳㎇䞲┺. ➆⧒㍲ ㌂㣿㧦Ṗ 㰗㩧 㧊⹎㰖㠦 䌲⁎
⯒ 㧛⩻䞮⓪ ⻞Ệ⪲㤊 ┾㩦㦚 ⽊㢚䞲┺. ⡦䞲 㧦☯㦒
⪲ ㌳㎇ ♲ 䌲⁎⯒ 㧊㣿䞮㡂 䛣䞲 㠊⏎䎢㧊㎮㦚 ῂ
㎇ 䞲┺.
䟻䤚 ὒ㩲⪲⓪ DBPedia 㦮 㑶㠊 㩫⽊⯒ ㌂㣿䞮㰖 㞠ἶ 䌲⁎ Ṛ㠦 㑶㠊 㩫⽊⯒ 㡆ἆ 䞶 㑮 㧞☚⪳ ❻⩂
┳ ⳾◎㦚 㧊㣿䞮㡂 㧊⹎㰖㠦 ╖䞲 㦮⹎㩗 ₆ 䌲⁎
⯒ 㧦☯㦒⪲ 䞶 㑮 㧞☚⪳ 䞶 Ἒ䣣㧊┺.
Acknowledgement
㧊 ⏒ⶎ㦖 2017 ⎚☚ 㩫(⹎⧮㺓㫆ὒ䞯)㦮 㨂㤦 㦒⪲ 䞲ῃ㡆ῂ㨂┾㦮 㰖㤦㦚 㞚 㑮䟟♲ 㡆ῂ (No.NRF-2017R1C1B1003600)㧊Ⳇ, 2018 ⎚☚ 㩫(ᾦ㥷
)㦮 㨂㤦㦒⪲ 䞲ῃ㡆ῂ㨂┾㦮 㰖㤦㦚 㞚 㑮䟟♲
₆㽞㡆ῂ㌂㠛㧚(No. NRF-2018R1D1A1B07048380).
⡦䞲, ⽎ 㡆ῂ⓪ ὒ䞯₆㑶㩫⽊䐋㔶 㩫⽊䐋㔶₆ 㑶㰚䦻㎒䎆㦮 ╖䞯 ICT 㡆ῂ㎒䎆㰖㤦㌂㠛㦮 㡆ῂἆὒ
⪲ 㑮䟟♮㠞㦢 (IITP-2019-2018-0-01417).
焾処怾竒
[1]⏎㔏⹒, and 䢿㧎㭖. "Ⲗ䕆⹎❪㠊 Ỗ㌟ 㔲㓺䎲㦮
㍺Ἒ ῂ䡚." 㩫⽊ὒ䞯䣢⏒ⶎ㰖: ◆㧊䌖㧊 㓺 30.5 (2003): 494-506.
[2]㧊㡺㭖, et al. "㏢㎲ ゛◆㧊䎆⯒ 㧊㣿䞲 㡗䢪 䦻 䟟 㣪㧎 ㍳." 䞲ῃ䆮䎦䁶䞯䣢⏒ⶎ㰖 14.10 (2014): 527-538.
[3]Im, D. H., Park, G. D.: Linked tag: image annotation using semantic relationships between image tags. Multimed Tools Appl. April 2015, vol. 74, Issue 7, pp2273-2287 (2015) [4]Im, Dong-Hyuk, and Geun-Duk Park. "STAG:
semantic image annotation using relationships between tags." 2013 International Conference on Information Science and Applications (ICISA). IEEE, 2013.
[5]⏎䡚▫, ㍲ὧ㤦, and 㧚☯䡗. "⳾㧒 䢮ἓ㠦㍲
㦮⹎ ₆ 㧊⹎㰖 㠊⏎䎢㧊㎮ Ỗ㌟." Ⲗ䕆⹎❪
㠊䞯䣢⏒ⶎ㰖 19.8 (2016): 1498-1504.
[6]ڨۄۆۊۇۊۑڇٻ گۊۈ܀ݫ ڇٻ ۀۏٻ ڼۇډٻ ٽڭۀھېۍۍۀۉۏٻ ۉۀېۍڼۇٻ network based language model." Eleventh annual conference of the international speech communication association. 2010.
[7]Karpathy, Andrej, and Li Fei-Fei. "Deep visual-semantic alignments for generating 2019년 추계학술발표대회 논문집 제26권 제2호 (2019. 11)
- 896 -
image descriptions." Proceedings of the IEEE conference on computer vision and pattern recognition. 2015.
2019년 추계학술발표대회 논문집 제26권 제2호 (2019. 11)
- 897 -