• 검색 결과가 없습니다.

Investigation of Topographic Characteristics of Parcels Using UAV and Machine Learning

N/A
N/A
Protected

Academic year: 2021

Share "Investigation of Topographic Characteristics of Parcels Using UAV and Machine Learning"

Copied!
8
0
0

로드 중.... (전체 텍스트 보기)

전체 글

(1)

https://doi.org/10.7848/ksgpc.2017.35.5.349

Investigation of Topographic Characteristics of Parcels Using UAV and Machine Learning

Lee, Chang Han

1)

·Hong, Il Young

2)

Abstract

In this study, we propose a method to investigate topographic characteristics by applying machine learning which is an artificial intelligence analysis method based on the spatial data constructed using UAV and the training data created through spatial analysis. This method provides an alternative to the subjective judgment and accuracy of spatial data, which is a problem of existing topographic characteristics survey for officially assessed land price. The analysis method of this study is expected to improve the problems of topographic characteristics survey method of existing field researchers and contribute to more accurate decision of officially assessed land price by providing more objective land survey method.

Keywords : Unmanned Aerial Vehicle, Machine Learning, Spatial Analysis, Topographical Characteristics Original article

Received 2017. 09. 25, Revised 2017. 10. 16, Accepted 2017. 10. 27

1) Member, Dept of GIS Engineering, Namseoul University (E-mail: [email protected])

2) Corresponding Author, Member, Dept of GIS Engineering, Namseoul University (E-mail: [email protected])

This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://

1. Introduction

Since the introduction of the officially assessed land price, the land price is estimated by analyzing the characteristics by assessing items. The existing topographic characteristics survey has a problem that objectivity is deteriorated by visiting the parcels and determining through the visual characteristics of the investigator(Lee, 2012). This is caused by subjective judgment at the time of field survey without an instrument or analysis tool. In order to secure objectivity, there have been attempts to apply GIS techniques to the analysis of topographical characteristics(Jung et al., 2015). It is a method to construct the data using the digital topographic map provided by NGII(National Geographic Information Institute) and to investigate the topographic characteristics using the spatial analysis technique.

In recent years, UAV(Unmanned Aerial Vehicle) have become widely used as a tool for on-site surveying. In the case

of surveying with UAV, it has an advantage of securing highly accurate spatial information and immediately reflecting changes in the field. UAV is capable of rapidly constructing topographical data on site and making it possible to build more precise and accurate data compared to existing digital maps. In addition, Machine Learning, which has recently emerged as an artificial intelligence analysis method, has been used in various fields while enabling objective decision making based on training dataset(Shin and Kim, 2016; Song et al., 2016). This method of analysis extends automated applications as an alternative to human decision making.

In this study, we proposed a method to investigate

topographic characteristics by applying machine learning

which is an artificial intelligence analysis method composed

of training data and measurement results using UAV. This

method provides a solution to the subjective judgment and

the accuracy of the spatial data, which is a problem of the

existing topographic survey for the public land price decision.

(2)

Through the case study, the topographical characteristics data were collected using the low-cost UAV, Phantom 3, the training dataset was constructed through the spatial analysis using ArcGIS, and the machine learning analysis tool, TensorFlow was applied for objective decision making.

The analytical method of this study is expected to contribute to more accurate estimation of the public land price in the future by improving the problem of survey method of land characteristics of existing field investigators by providing more objective land survey method.

2. Methodology

The purpose of this study is to propose a survey method of topographical characteristics among the assessing items of the officially assessed land price. The survey method consists of three stages. The first stage is the data collection using an UAV and construction of spatial information. The

second stage is to generate the training dataset using spatial analysis technique and the third stage is to the presentation of assessed results using machine learning. Fig. 1 presents the process and methodology that are used for this study.

First, the spatial information collections consists of the process of aerial photogrammetry using an UAV. We used DJI Phantom 3 as the unmanned aircraft and Pix4D pro was used to process the captured images. The DSM(Digital Surface Model) created by Pix4D pro was constructed by using the ATIN algorithm in the TerraScan program to classify the ground surface and the non-surface into a DEM(Digital Elevation Model).

Second, ArcGIS tools and ModelBuilder were used for spatial analysis for building the training dataset using collected spatial data from the previous stage(Allen, 2011). The spatial characteristics of the parcels were analyzed through the spatial analysis model and the characteristics of the parcels were tested by applying the machine learning method based on the results of this analysis. In order to investigate the topographic characteristics by machine learning, it is necessary to distinguish independent variables from dependent variables.

The dependent variable used the results of the survey of the topographic characteristics of the parcels made by field survey method. The independent variables are variables that can explain the topographic characteristics of the parcel, which is the relationship between elevation, slope, and features.

The relationship between the independent variables, the

Fig. 1. Methodology of topographic characteristics survey Fig. 2. Process of spatial analysis

Surver of elevation and slope

Generate near feature

(3)

topographic features, is the shortest distance between the parcels and features, and the features include the main road, reservoir, stream, and cultivated land. Elevation and slope are known to be important parameters for surveying the topographic of area by many researchers(Weiss, 2001). The main road of the land feature was used as an efficient way to acquire the topographic characteristics of the parcels(Lee, 2012), and the reservoir, stream. The following Fig. 2 shows the process of constructing survey values of the topographic characteristics of the parcels using the elevation and slope of the parcels and the nearby features.

The third step is to construct a model that automatically classifies the topographic features using the training data constructed through spatial analysis. We constructed a model to distinguish multiple classes using multi-variable, and used TensorFlow, a python-based machine learning library. TensorFlow is the machine learning library, available in Google's open source format since 2015, and has been successfully used in areas such as image and speech recognition(Abadi et al., 2016). The TensorFlow provides a specialized feature for computing the multidimensional array of nodes and edges in the form of graphs of data flow.

TensorFlow have the advantage of graphical representation of the flow of data, various computational structures, and python-based interoperability.

3. Spatial Bata Acquisition Using UAV

The study area is shown in Fig. 3(a) and there are some

areas including Doheung-ri and Haam-ri in Asan-si. Forest, farmland, village, and stream are distributed in this area and have various topographic characteristics. The field survey results of the study area were classified as low land, flat land, mild slope, steep slope, which indicate the topographic characteristics of the parcels and there are total 719 parcels which is including with 1 parcel of low land, 182 parcels of flat, 448 parcels of mild slope, 22 parcels of steep slope and 66 parcels such as roads and rivers that do not require survey(See Table 1). In order to investigate the topographic characteristics of the parcel units, the digital cadastral maps were used.

In this study, Phantom3, developed by DJI of China, was used as a rotary wing UAV for the collection of high- resolution spatial data. The UAV images are constructed to DSM in the photogrammetry method.

The accuracy of the position and height of the constructed information is confirmed by the VRS(Virtual Reference

Topographic

characteristics Code Count

Low land 1 1

Flat land 2 182

Mild slope 3 448

Steep slope 4 22

Except research - 66

Total - 719

Table 1. Result of field survey

(a) Satellite image (b) Digital cadastral map (c) Points of VRS survey

Fig. 3. Study area(Asan-si)

(4)

Station), which is the virtual reference point system in Network-RTK(Real Time Kinematic), as the check point(Park and Lee, 2016). The equipment used for the VRS survey is a mobile phone that receives GPS information from Trimble R and the observatory, and the location accuracy of the survey is 2-5 cm(See Table 2.).

The study area is about 1.2km and 0.9km(width and height). The details of the images taken in the study area are shown in Fig. 3(c). Fourteen GCP(Ground Control Points) were used before 3D model and DSM modeling by unmanned aerial photogrammetry. Ten checkpoints were selected for

accuracy evaluation.

We shot the image with using UAV and surveyed the ground reference point with VRS in order to make DSM which can make the DEM. We use the Pix4D pro, photogrammetry software, to adjust the position of the UAV's image and ground reference point. The total images captured by UAV was 622 images, and the aerotriangulation was used after the interior orientation of the camera of the UAV in the Table 3(Kim et al., 2017). As a result of the unmanned aerial photogrammetry, 0.187 pixel and 0.248m of Z error were found on average.

DSM produced by photogrammetry software is a model that include terrain and features such as buildings, trees, etc.

This model is not suitable for surveying topographic features and should be converted to the DEM. This is because DSM has characteristics of artificial and topographical feature and we need to create a DEM that has only topographic features.

TerraScan of Terrssolid Inc., commercial software, Adaptive TIN(ATIN), which is a software algorithm, was used to classify the surface and non-artificial surface areas, and point cloud data of the DSM was used. As a result of comparing ATIN algorithm with other algorithms, qualitative and quantitative results can be used regardless of regional characteristics(Kim and Cho, 2012).

In the study area, the maximum size of the bottom of the building is 50m, the maximum slope is 88 degree close to the slope of the general building and the ground, the angle threshold is 5.5 degree and the distance threshold is 1 m. Non- surface point cloud data removed artificial surface is las2dem of Lastool provided as open source, reads LAS formatted file

Survey method Network-RTK(VRS)

Satellite transmitters Trimble R8(GPS) Satellite count More than five satellite

The coordinate system KGD2002

Regular observatory Cheonan

Accuracy 2-5cm

Table 2. Dataset for survey information

Drone Phantom3 professional

Camera DJI(FC300X)

Image size 4,000 × 3,000(pixel)

Flying height 100m

Over lap End overlap(80%)

×Side lap(70%)

Ground sampling distance 4.5cm

Image count 622 images

Table 3. Dataset for drone information

(a) DSM (b) Ground Points (c) DEM

Fig. 4. Process of generate DEM

(5)

and constructs delaunay triangular by irregular triangular network algorithm to generate DEM as shown in Fig. 4(c).

4. Spatial Analysis

DEM was generated by the unmanned aerial photogrammetry method and spatial information such as road, reservoir, stream, cultivated land, and digital cadastral map were used in the national map. Table 4 shows the spatial information required for the analysis. In the digital cadastral map, a parcel is analyzed to the elevation and slope of the parcels in the DEM, and the shortest distance between the parcels and the features.

In order to analyze on the basis of parcels with the digital cadastral map, the parcels in the study area should be divided into one unit. In order to divide 719 parcels into individual parcels, a 19-digit PNU(Parcel Unique Number) field, which is the unique number of parcels, is used.

As shown in Fig. 5(a), the 2-2 survey parcel is used to obtain elevation and slope of parcels through DEM. The grid size of the DEM is 1m and considering the capacity and accuracy of spatial information. The representative values of the elevation and slope of the parcel is averaged value of elevation and slope values in the parcel, because they can represent the overall elevation and slope of the parcels. In order to use the information of the DEM and slope model as the parcel, the raster is converted into the raster to point tool and the 2-2 parcel elevation and slope average values was calculated by the summary statistics tool. The slope model

is analyzed by the slope tool with neighborhood algorithm and the algorithm is used where the slope is expressed as smooth curves through averaging instead of expressing slope as steep slope(Zhou and Liu, 2004; Ko et al., 2014; Lee and Sung, 2016).

As shown in Fig. 5(b), the shortest distance relation between features of the 2-2 survey parcel is constructed by clipping the spatial information of the digital topographic map and using it as data that can be used in the study area. The range of the constructed feature data is set to the inhabitant area, which is a higher administrative unit than the study area. Generate Near, which is the analysis tool, calculates the shortest distance between a feature and a parcel.

Table 5 below shows the learning materials used for machine learning. The topographic characteristics such as elevation and slope of the parcels and distance variables with road, reservoir, stream, cultivated land are used for the independent variable and the results of field survey were used as dependent variable.

Table 4. Spatial data information

Source Data Use

UAV DSM DEM Elevation

Slope National

Geographic Information Institute

Digital map

Road

The shortest distance Reservoir

Stream Cultivated land Ministry of Land,

Infrastructure, and Transport

Digital cadastral

map Parcels (a) Elevation and slope for 2-2 of parcel

(b) Survey the shortest distance of features nearby 2-2 of parcell Fig. 5. Spatial analysis graphing

Elevation Slope

(6)

5. Machine Learning

In this study, we applied the multinomial logistic regression, which is the supervisory classification analysis of multi-class classification using multidimensional variables.

The following Fig. 6 shows the python code of Fig. 6(a) and training data of Fig. 6(b). In case of TensorFlow, the multinomial logistic regression is implemented with the softmax classifier. The training data was normalized to a value between 1 and 15 to reduce the effect of size difference between variables.

The Fig. 7 shows the result of the analysis and the degree of accuracy is 78.03%, which is relatively low. These results show that there is a consistent result for the Y value used as the dependent variable and it is necessary to modify and add

the training data considering the case of the error parcels.

The following Table 6 shows the summary of the errors that occurred in the machine learning results. On the left side of Table 6, the average value of the corresponding variable values is shown on the right site and there are the number of True when the actual value is equal to the expected value and the number of False when the other case is, at the left side.

The errors were 149 for code 2(flat land) and 13 for code 3(mild slope). In case of code 2, 149 errors were predicted from code 2 to code 3 value. For these erroneous data, the average slope is 6.1, which is much different from the normal value of 0.2, which is close to the average value of slope of code 3, 6.5. In addition, for code 3, the mean slope of the 13 errored data is 3.1, which is smaller than the average of code 3 of 6.4. This can be confirmed by the fact that the majority of the errored data indicates an error in judging the slope. This result can be regarded as an error caused by the Table 5. Training data set for machine learning

ID Field Survey Elevation(m) Slope(°) Road(m) Stream(m) Reservoir(m) Cultivated land(m)

1 02 9.68 0.83 114.83 389.36 320.84 0.00

2 02 4.94 0.55 265.69 395.01 175.85 0.00

3 02 10.00 0.00 213.36 894.55 348.49 65.40

4 03 9.39 3.13 151.58 864.48 288.39 22.90

5 03 10.00 0.00 180.66 892.57 318.88 51.29

6 02 4.37 0.58 123.61 305.89 200.09 0.00

7 04 23.73 11.80 371.11 678.25 254.00 1.36

8 02 6.58 3.00 77.05 814.11 205.55 34.55

9 02 7.14 1.84 71.78 806.58 199.52 34.55

: : :

: :

: :

: :

: :

: :

: :

:

(a) Python code (b) Training data set Fig. 6. Python code and training data set

Fig. 7. Machine learning result

(7)

subjective judgment of the investigator in the selection of the site through the field survey.

Considering these errors, we modified the code to 3 if the slope value of the 149 data that is in error in 2 is more similar to code 3. In the same way, in case of code 3, when the slope value is close to the average value of code 2, the code is updated to 2 to update the learning data. The following Fig. 8 shows the results of machine learning using updated training data after correcting the topographic characteristics of parcels showing errors. As a result of the analysis, the accuracy was 94.1%, which was improved to a fairly accurate level. These results show that the method through machine learning in the repeated decision making considering various variables shows high accuracy.

6. Conclusion

In the case of the topographic characteristics survey based on the existing field survey, the automated method cannot be

done and the result of the investigation of the topographical characteristics due to the process by the arbitrary judgment of the investigator is in error. As an alternative to these errors, this study proposed an improved method of accuracy using UAV and an automated organization method using a machine learning method. It is expected that this approach will improve the accuracy of surveying topographical features through a consistent and automated approach.

Through case studies, we have constructed highly accurate spatial information and confirmed the automation method of topographic characteristics by using the learning model constructed by automated method.

The limitations of the study are firstly the efficiency in the field survey with the conventional rotary wing UAV. This problem will be investigated and evaluated in a wider area if we replace it with fixed-wing UAV in the future study.

Second, the number of training materials and the number of variables used in training are small in the machine learning model. At present, considering only the survey of the topographic characteristics, it is thought that further deep-learning model development will be possible in the future considering the various variables related to the land characteristics survey in a comprehensive manner. In future studies, it will be necessary to collect data on wider area using fixed wing UAV and to apply more various parameters in land characterization.

Acknowledgement

This research was supported a grant from geospatial Fig. 8. Machine learning result with updated

training data

Table 6. Summary of machine learning result(R : Result, T: True, F : False. Avg : Average) Machine Learning Result Elevation Slope Road Stream Reservoir Cultivated

land Avg

Code R 1 2 3 4 Sum Avg Sum

1 T 0 0 0 0 1 Null 0

F 0 0 1 0 1.4 5.7 0.0 0.1 1.7 0.0 8.9

2 T 0 33 0 0 182 1.0 0.2 0.8 7.4 7.7 0.6 17.7

F 0 0 149 0 1.9 6.1 2.6 5.6 8.0 1.2 25.5

3 T 0 0 434 0 447 3.7 6.4 7.7 4.1 5.9 1.3 29.0

F 0 13 0 0 0.9 3.1 0.6 8.7 6.3 0.9 20.4

4 T 0 0 0 0 21 Null 0

F 0 0 21 0 7.5 9.8 4.0 9.8 3.1 0.6 34.9

(8)

information workforce development program funded by the Ministry of Land, Infrastructure and Transport of Korean Government (2015-02-01)

References

Abadi, M., Agarwal, A., Barham, P., Brevdo, E., Chen, Z.,Citro, C., Corrado, G. S., Davis, A., Dean, J., Devin, M.,Ghemawat, S., Goodfellow, I., Harp, A., Irving, G., Isard, M., Jia, Y., Jozefowicz, R., Kaiser, L., Kudlur, M., Levenberg, J., Mane, D., Monga, R., Moore, S., Murray, D., Olah, C., Schuster, M., Shlens, J., Steiner, B., Sutskever, I., Talwar, K., Tucker, P., Vanhoucke, V., Vasudevan, V., Viegas, F., Vinyals, O., and Warden, P. (2016). Tensorflow:

Large-scale machine learning on heterogeneous distributed systems, Google Research, California, https://research.

google.com/pubs/pub45166.html (last date accessed: 25 October 2017).

Allen, D. W. (2011), Getting to Know ArcGIS ModelBuilder:

1 editon, Esri Press, 380 New York Street, Redlands, California.

Jung, W.S., Jung, S.H., Lim, N.Y., Kim, G.H., Lee, S.K., and Choi, S.K. (2015), Extraction of land characteristics using high quality geospatial information, Journal of the Korean Society for Geo-spatial Information Science, Vol. 23, No.

3, pp. 57-67. (in Korean with English abstract)

Kim, E.M. and Cho, D.Y. (2012), Comprehensive comparisons among LIDAR filtering algorithms for the classification of ground and non-ground points, Journal of the Korean Society of Surveying, Geodesy, Photogrammetry and Cartography, Vol. 30, No. 1, pp. 39-48. (in Korean with English abstract)

Kim, E.M., Choi, S.H., and Park, J.H. (2017), Analysis of applicability of orthophoto using 3D mesh on aerial image with large file size, Journal of the Korean Society of Surveying, Geodesy, Photogrammetry and Cartography, Vol. 35, No. 3, pp. 155-166. (in Korean with English abstract)

Ko, S.M., Lee, S.W., Yune, C.Y., and Kim, G.H. (2014), Topographic analysis of landslides in umyeonsan, Journal of the Korean Society of Surveying, Geodesy, Photogrammetry and Cartography, Vol. 32, No. 1, pp. 55-

62. (in Korean with English abstract)

Lee, C.H. and Sung, C.J. (2016), Comparison of slope algorithms for relief characteristic of evaluation, Proceedings of Journal of the Korean Society for Geo- spatial Information Science, 6 October, 2016, Gunsan, Korea, pp. 239-240.

Lee, H.S. (2012), A Study on the Parcel's Height Classification Using GIS Spatial Analysis, Master’s thesis, University of Seoul, Korea, 96p, (in Korean with English abstract) Park, J.H. and Lee, W.H. (2016), Orthophoto and DEM

generation in small slope areas using low specification UAV, Journal of the Korean Society of Surveying, Geodesy, Photogrammetry and Cartography, Vol. 34, No.

3, pp. 283-290. (in Korean with English abstract)

Shin, D.H. and Kim, C.B. (2016), A Study on deep learning input pattern for summer power demand prediction, Journal of Korean Institute of Information Technology, Vol. 14, No. 11, pp. 127-134. (in Korean with English abstract)

Song, S.K., Sin, S.H., Sun, C.Y., Lim, H.J., Lee, M.K., Jo, M.H., Hong, S.K., and Jo, M.S. (2016), Deep learning based weather and sales of small business owners big data analysis, Proceedings of the Autumn Meeting of Korean Meteorological Society, 11 November, Busan, Korea, No.

10, pp. 489-490.(in Korean with English abstract)

Weiss, A. (2001), Topographic position and landforms analysis, In Poster presentation, ESRI user conference, 9 July, 2001, San Diego, California, USA, Vol. 200.

Zhou, Q. and Liu, X. (2004). Error analysis on grid-based slope and aspect algorithms, Photogrammetric Engineering &

Remote Sensing, Vol. 70, No. 8, pp. 957-962.

수치

Fig. 1. Methodology of topographic characteristics survey Fig. 2. Process of spatial analysis
Table 5 below shows the learning materials used for machine  learning. The topographic characteristics such as elevation and  slope of the parcels and distance variables with road, reservoir,  stream, cultivated land are used for the independent variable
Fig. 7. Machine learning result
Table 6. Summary of machine learning result(R : Result, T: True, F : False. Avg : Average) Machine Learning Result Elevation Slope Road Stream Reservoir Cultivated

참조

관련 문서

In this study, using concept mapping of a variety of learning techniques in the area of science, especially biology, has a positive effect on learning

This study the changes in structure and mechanical characteristics by the analysis on mechanical characteristics of the welding part and the post weld

This study examined the achievement, satisfaction and interest of an English grammar class based on flipped learning in order to find out the effect of

Objectives : The purpose of this study was to investigate the effects of general characteristics, health characteristics, mental health characteristics, suicidal

This study is carried out vocabulary learning through image association as a method on efficient vocabulary learning for middle school students and wanted to know if

The purpose of this study is to investigate the effect of peer-mentoring on the achievement and learning attitudes of middle school students toward

data mining - process of applying machine learning, visualization, statistical analysis. knowledge discovery & construction - evaluation & interpretation of the extracted

This study aims to analyze the process and the features of corporate e-Learning as a form of informal learning. For this purpose, the concept of