Classifying Sub-Categories of Apartment Defect Repair Tasks: A Machine Learning Approach (아파트 하자 보수 시설공사 세부공종 머신러닝 분류 시스템에 관한 연구)

  • Kim, Eunhye;Ji, HongGeun;Kim, Jina;Park, Eunil;Ohm, Jay Y.
    • KIPS Transactions on Software and Data Engineering
    • /
    • v.10 no.9
    • /
    • pp.359-366
    • /
    • 2021
  • A number of construction companies in Korea invest considerable human and financial resources to construct a system for managing apartment defect data and for categorizing repair tasks. Thus, this study proposes machine learning models to automatically classify defect complaint text data into one of the subcategories of 'finishing work' (i.e., one of the defect repair tasks). In the proposed models, we employed two word representation methods (Bag-of-words, Term Frequency-Inverse Document Frequency (TF-IDF)) and two machine learning classifiers (Support Vector Machine, Random Forest). In particular, we conducted both binary and multi-class classification tasks to classify 9 subcategories of finishing work: home appliance installation work, paperwork, painting work, plastering work, interior masonry work, plaster finishing work, indoor furniture installation work, kitchen facility installation work, and tiling work. The machine learning classifiers using the TF-IDF representation method and Random Forest classification achieved more than 90% accuracy, precision, recall, and F1 score. We shed light on the possibility of constructing automated defect classification systems based on the proposed machine learning models.
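
The TF-IDF representation mentioned in this abstract can be illustrated with a minimal sketch. It assumes the plain tf × log(N/df) weighting and whitespace tokenization; the abstract does not state the exact weighting or the Korean tokenizer used, and the function name `tfidf_vectors` and the sample complaints are hypothetical.

```python
import math
from collections import Counter

def tfidf_vectors(docs):
    """Return one {term: weight} dict per document using tf * log(N / df)."""
    tokenized = [doc.lower().split() for doc in docs]
    n_docs = len(tokenized)
    # Document frequency: number of documents that contain each term.
    df = Counter(term for tokens in tokenized for term in set(tokens))
    vectors = []
    for tokens in tokenized:
        counts = Counter(tokens)
        total = len(tokens)
        vectors.append({
            term: (count / total) * math.log(n_docs / df[term])
            for term, count in counts.items()
        })
    return vectors

# Hypothetical complaint texts (not taken from the study's data).
complaints = [
    "tile cracked in bathroom",
    "paint peeling in bedroom",
    "tile loose in kitchen",
]
vectors = tfidf_vectors(complaints)
```

A term that occurs in every complaint (here "in") receives weight zero, while a term unique to one complaint receives the largest weight. Vectors of this kind would then be passed to a classifier such as a Random Forest.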

A Study for Possibility to Detect Missing Sidewalk Blocks using Drone (드론을 이용한 보도블럭 탈락 탐지 가능성 연구)

  • Shin, Jung-il
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.22 no.5
    • /
    • pp.34-41
    • /
    • 2021
  • Sidewalks are facilities used for the safe and comfortable passage of pedestrians and are paved with blocks of various materials. Currently, Korea does not have a quantitative survey method for the pavement condition of sidewalks, so it is necessary to develop an efficient survey method. Drones are being used as an efficient survey tool in various fields, but few studies have used them to investigate sidewalks. This study is an initial step toward a method for detecting damage in sidewalk blocks; it limits the scope to missing blocks and investigates whether they can be detected using a drone. For this, sidewalk blocks were artificially removed to simulate a dropout situation, and images were acquired with 0.7-cm resolution using a drone. In the point cloud data obtained through image pre-processing, the elevation of the points showed high variance in the areas where sidewalk blocks were missing. Using this characteristic, an experiment was conducted to detect the missing parts of the sidewalk by applying four thresholds to the variance of the elevation of the points included in each grid cell corresponding to the sidewalk area. As a result, the positive detection ratio was 70-80%, omission errors were 20-30%, and commission errors were lower than 2%. It is judged that the possibility of detecting missing sidewalk blocks is high. This study focused on detecting simulated missing sidewalk blocks in a limited environment. Therefore, it is expected that an efficient and quantitative method of detecting damaged sidewalk blocks can be developed in the future through additional research that takes the actual environment into account.
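
The grid-based variance thresholding described in this abstract might look roughly like the sketch below. The cell size, the threshold value, the sample points, and the function name `detect_missing_cells` are all assumptions for illustration; the abstract only states that four thresholds were applied to the per-grid elevation variance.

```python
from collections import defaultdict
from statistics import pvariance

def detect_missing_cells(points, cell_size, threshold):
    """Return the grid cells whose elevation variance exceeds the threshold.

    points: iterable of (x, y, z) tuples in metres.
    """
    cells = defaultdict(list)
    for x, y, z in points:
        # Assign each point to a square grid cell by its planar position.
        cells[(int(x // cell_size), int(y // cell_size))].append(z)
    return {
        cell for cell, zs in cells.items()
        if len(zs) >= 2 and pvariance(zs) > threshold
    }

# Hypothetical points: one flat cell and one cell spanning a 6 cm deep gap.
points = [
    (0.1, 0.1, 10.00), (0.2, 0.3, 10.01), (0.4, 0.2, 10.00),
    (1.1, 0.1, 10.00), (1.2, 0.3, 9.94), (1.4, 0.2, 10.00), (1.3, 0.4, 9.94),
]
flagged = detect_missing_cells(points, cell_size=1.0, threshold=1e-4)
```

Only the second cell is flagged, because its elevations alternate between the pavement surface and the bottom of the gap.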

Investigation of the Super-resolution Algorithm for the Prediction of Periodontal Disease in Dental X-ray Radiography (치주질환 예측을 위한 치과 X-선 영상에서의 초해상화 알고리즘 적용 가능성 연구)

  • Kim, Han-Na
    • Journal of the Korean Society of Radiology
    • /
    • v.15 no.2
    • /
    • pp.153-158
    • /
    • 2021
  • X-ray image analysis is a very important field for improving the early diagnosis rate and prediction accuracy of periodontal disease. Research on the development and application of artificial intelligence-based algorithms to improve the quality of such dental X-ray images is being widely conducted worldwide. Thus, the aim of this study was to design a super-resolution algorithm for predicting periodontal disease and to evaluate its applicability in dental X-ray images. The super-resolution algorithm was constructed based on convolution layers and ReLU, and an image obtained by up-sampling a low-resolution image by a factor of 2 was used as the input. In addition, 1,500 dental X-ray images were used for deep learning training. Quantitative evaluation of the images used root mean square error and structural similarity, which measure similarity through comparison of two images. The recently developed no-reference metrics, the natural image quality evaluator and the blind/referenceless image spatial quality evaluator, were also analyzed. According to the results, the average similarity and no-reference-based evaluation values were improved by 1.86 and 2.14 times, respectively, compared to the existing bicubic-based upsampling method when the proposed method was used. In conclusion, the super-resolution algorithm for predicting periodontal disease proved useful in dental X-ray images, and it is expected to be highly applicable in various fields in the future.

Hybrid All-Reduce Strategy with Layer Overlapping for Reducing Communication Overhead in Distributed Deep Learning (분산 딥러닝에서 통신 오버헤드를 줄이기 위해 레이어를 오버래핑하는 하이브리드 올-리듀스 기법)

  • Kim, Daehyun;Yeo, Sangho;Oh, Sangyoon
    • KIPS Transactions on Computer and Communication Systems
    • /
    • v.10 no.7
    • /
    • pp.191-198
    • /
    • 2021
  • As training datasets become larger and models become deeper to achieve high accuracy in deep learning, deep neural network training requires a large amount of computation and takes too much time on a single node. Therefore, distributed deep learning has been proposed to reduce the training time by distributing computation across multiple nodes. In this study, we propose a hybrid all-reduce strategy that considers the characteristics of each layer, together with a technique for overlapping communication and computation, for synchronization in distributed deep learning. Since the convolution layer has fewer parameters than the fully-connected layer and is located in the upper part of the network, only a short overlapping time is allowed. Thus, butterfly all-reduce is used to synchronize the convolution layer. On the other hand, the fully-connected layer is synchronized using ring all-reduce. The empirical experiment results on PyTorch show that the proposed method reduced the training time by up to 33% compared to the baseline PyTorch.
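
For readers unfamiliar with ring all-reduce, the following is a single-process simulation of its two phases (scatter-reduce, then all-gather). It is only an illustrative sketch: it does not use PyTorch, it does not show the butterfly variant or the layer overlapping proposed in the paper, and the function name `ring_allreduce` and the sample gradients are hypothetical.

```python
def ring_allreduce(worker_chunks):
    """Sum-reduce across workers; worker_chunks[i][c] is chunk c on worker i.

    Each worker must hold as many chunks as there are workers.
    """
    n = len(worker_chunks)
    data = [[list(chunk) for chunk in chunks] for chunks in worker_chunks]

    # Phase 1 (scatter-reduce): after n - 1 steps, worker i holds the
    # complete sum of chunk (i + 1) % n.
    for step in range(n - 1):
        # Snapshot all messages first so the sends happen "simultaneously".
        sends = [(i, (i - step) % n, list(data[i][(i - step) % n]))
                 for i in range(n)]
        for src, c, payload in sends:
            dst = (src + 1) % n
            data[dst][c] = [a + b for a, b in zip(data[dst][c], payload)]

    # Phase 2 (all-gather): pass the completed chunks around the ring.
    for step in range(n - 1):
        sends = [(i, (i + 1 - step) % n, list(data[i][(i + 1 - step) % n]))
                 for i in range(n)]
        for src, c, payload in sends:
            data[(src + 1) % n][c] = payload
    return data

# Hypothetical gradients: 3 workers, each split into 3 chunks of 2 values.
grads = [
    [[1, 2], [3, 4], [5, 6]],
    [[10, 20], [30, 40], [50, 60]],
    [[100, 200], [300, 400], [500, 600]],
]
reduced = ring_allreduce(grads)
```

Each worker sends only one chunk per step to its neighbour, so the per-step message size stays constant as the number of workers grows, which is one common reason for using the ring variant on layers with many parameters.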

A Study on the Estimation of Multi-Object Social Distancing Using Stereo Vision and AlphaPose (Stereo Vision과 AlphaPose를 이용한 다중 객체 거리 추정 방법에 관한 연구)

  • Lee, Ju-Min;Bae, Hyeon-Jae;Jang, Gyu-Jin;Kim, Jin-Pyeong
    • KIPS Transactions on Software and Data Engineering
    • /
    • v.10 no.7
    • /
    • pp.279-286
    • /
    • 2021
  • Recently, a policy of physical distancing of at least 1 m between people has been in effect to prevent the spread of COVID-19 in public places. In this paper, we propose a method for measuring distances between people in real time and an automated system that, based on the estimated distance, recognizes objects that are within 1 m of each other in stereo images acquired by drones or CCTVs. A problem with existing methods for estimating distances between multiple objects is that they cannot obtain three-dimensional information about objects because they use only one CCTV. This matters because three-dimensional information is necessary to measure distances between people when they are right next to each other or overlap in a two-dimensional image. Furthermore, existing methods use only bounding box information to obtain the coordinates where a person is located. Therefore, in this paper, to obtain the exact two-dimensional coordinates at which a person is located, we extract a person's key points to detect the location, convert it to three-dimensional coordinates using stereo vision and camera calibration, and estimate the Euclidean distance between people. In an experiment evaluating the accuracy of the 3D coordinates and the distance between objects (persons), the average error in estimating the distance between multiple people within 1 m was within 0.098 m.
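
The conversion from a 2D key point to 3D coordinates and the Euclidean distance check can be sketched as follows. This assumes a rectified stereo pair and a pinhole camera model, where depth is focal length times baseline divided by disparity; the camera parameters, pixel positions, and function names are hypothetical, and key-point extraction (AlphaPose in the paper) is not shown.

```python
import math

def pixel_to_3d(u, v, disparity, focal_px, baseline_m, cx, cy):
    """Back-project a left-image pixel to camera coordinates in metres."""
    z = focal_px * baseline_m / disparity   # depth from stereo disparity
    x = (u - cx) * z / focal_px
    y = (v - cy) * z / focal_px
    return (x, y, z)

def distance_m(p, q):
    """Euclidean distance between two 3D points."""
    return math.dist(p, q)

# Hypothetical calibration and key points for two people.
camera = dict(focal_px=1000.0, baseline_m=0.1, cx=640.0, cy=360.0)
person_a = pixel_to_3d(640.0, 360.0, disparity=20.0, **camera)
person_b = pixel_to_3d(800.0, 360.0, disparity=20.0, **camera)
gap = distance_m(person_a, person_b)
too_close = gap < 1.0
```

With these made-up values both people are 5 m from the camera and about 0.8 m apart, so the pair would be flagged as closer than 1 m.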

Deep learning-based Multilingual Sentimental Analysis using English Review Data (영어 리뷰데이터를 이용한 딥러닝 기반 다국어 감성분석)

  • Sung, Jae-Kyung;Kim, Yung Bok;Kim, Yong-Guk
    • The Journal of the Institute of Internet, Broadcasting and Communication
    • /
    • v.19 no.3
    • /
    • pp.9-15
    • /
    • 2019
  • Large global online shopping malls, such as Amazon, offer services in English or in the language of the country where their products are sold. Since many customers purchase products based on product reviews, the shopping malls actively use sentiment analysis to judge the preference for each product from the large amount of review data that customers have written. The results of such analysis can be used in marketing to find potential shoppers. However, it is difficult to apply an English-based sentiment analysis system to the different languages used around the world. In this study, more than 500,000 reviews from the Amazon fine food reviews dataset were used to train a deep learning-based system. First, sentiment analysis evaluation experiments were carried out with three models on English test data. Second, the same data was translated into seven languages (Korean, Japanese, Chinese, Vietnamese, French, German and English) and similar experiments were conducted. The results show that the average accuracy across the seven languages (91.59%) was 2.77% lower than the accuracy for English (94.35%), but it is believed that the results of the experiment can still be used for practical applications.

Bias Characteristics Analysis of Himawari-8/AHI Clear Sky Radiance Using KMA NWP Global Model (기상청 전구 수치예보모델을 활용한 Himawari-8/AHI 청천복사휘도 편차 특성 분석)

  • Kim, Boram;Shin, Inchul;Chung, Chu-Yong;Cheong, Seonghoon
    • Korean Journal of Remote Sensing
    • /
    • v.34 no.6_1
    • /
    • pp.1101-1117
    • /
    • 2018
  • The clear sky radiance (CSR) is one of the baseline products of Himawari-8, which was launched in October 2014. The CSR contributes to numerical weather prediction (NWP) accuracy through data assimilation; in particular, the water vapor channel CSR has a good impact on forecasts in the upper atmosphere. The focus of this study is the quality analysis of the CSR of the Himawari-8 geostationary satellite. We used the operational CSR (or clear sky brightness temperature) products of the JMA (Japan Meteorological Agency) as observation data; for the background field, we employed the CSR simulated using the Radiative Transfer for TOVS (RTTOV) model with the atmospheric state from the global model of the KMA (Korea Meteorological Administration). We investigated data characteristics and analyzed observation minus background statistics of each channel with respect to regional and seasonal variability. Overall results for the analysis period showed that the water vapor channels (6.2, 6.9, and 7.3 μm) had a positive mean bias whereas the window channels (10.4, 11.2, and 12.4 μm) had a negative mean bias. The magnitude of the biases and the uncertainty varied with regional and seasonal conditions, so these should be taken into account when using CSR data. This study is helpful for the pre-processing of Himawari-8/Advanced Himawari Imager (AHI) CSR data assimilation. Furthermore, this study can also contribute to preparing for the utilization of products from the Geo-Kompsat-2A (GK-2A), which will be launched in 2018 by the National Meteorological Satellite Center (NMSC) of KMA.
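
The observation minus background (O-B) statistics referred to in this abstract reduce to a mean bias and a spread per channel. The sketch below assumes paired brightness temperatures in kelvin; the values and the function name `omb_stats` are hypothetical, and neither the RTTOV simulation nor the regional and seasonal stratification is shown.

```python
from statistics import mean, pstdev

def omb_stats(observed, background):
    """Return (mean bias, standard deviation) of observation minus background."""
    diffs = [o - b for o, b in zip(observed, background)]
    return mean(diffs), pstdev(diffs)

# Hypothetical brightness temperatures (K) for one channel.
observed = [240.5, 241.0, 239.8]
background = [240.0, 240.6, 239.5]
bias, spread = omb_stats(observed, background)
```

A positive mean bias, as in this made-up sample, means the observations are on average warmer than the simulated background.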

Semi-automatic Construction of Learning Set and Integration of Automatic Classification for Academic Literature in Technical Sciences (기술과학 분야 학술문헌에 대한 학습집합 반자동 구축 및 자동 분류 통합 연구)

  • Kim, Seon-Wu;Ko, Gun-Woo;Choi, Won-Jun;Jeong, Hee-Seok;Yoon, Hwa-Mook;Choi, Sung-Pil
    • Journal of the Korean Society for information Management
    • /
    • v.35 no.4
    • /
    • pp.141-164
    • /
    • 2018
  • Recently, as the amount of academic literature has increased rapidly and complex research has been actively conducted, researchers have difficulty in analyzing trends in previous research. In order to solve this problem, it is necessary to classify information in units of academic papers. However, in Korea, there is no academic database in which such information is provided. In this paper, we propose an automatic classification system that can classify domestic academic literature into multiple classes. To this end, academic documents in the technical science field written in Korean were first collected and mapped to class 600 of the DDC by using the K-Means clustering technique, in order to construct a learning set capable of multiple classification. As a result, a training set of 63,915 documents in the Korean technical science field was constructed, excluding entries for which metadata does not exist. Using this training set, we implemented and trained a deep learning-based automatic classification engine for academic documents. Experimental results obtained with a manually constructed experimental set showed 78.32% accuracy and 72.45% F1 performance for multiple classification.

Conversion of Camera Lens Distortions between Photogrammetry and Computer Vision (사진측량과 컴퓨터비전 간의 카메라 렌즈왜곡 변환)

  • Hong, Song Pyo;Choi, Han Seung;Kim, Eui Myoung
    • Journal of the Korean Society of Surveying, Geodesy, Photogrammetry and Cartography
    • /
    • v.37 no.4
    • /
    • pp.267-277
    • /
    • 2019
  • Photogrammetry and computer vision are identical in determining the three-dimensional coordinates of images taken with a camera, but the two fields are not directly compatible with each other due to differences in camera lens distortion modeling methods and camera coordinate systems. In general, data processing of drone images is performed by bundle block adjustment using computer vision-based software, and then the plotting of the images is performed by photogrammetry-based software for mapping. In this case, we are faced with the problem of converting the model of camera lens distortions into the formula used in photogrammetry. Therefore, this study described the differences between the coordinate systems and lens distortion models used in photogrammetry and computer vision, and proposed a methodology for converting them. In order to verify the conversion formula of the camera lens distortion models, first, lens distortions were added to virtual, distortion-free coordinates by using the computer vision-based lens distortion models. Then, the distortion coefficients were determined using photogrammetry-based lens distortion models, and the lens distortions were removed from the photo coordinates and compared with the original distortion-free virtual coordinates. The results showed that the root mean square distance was within 0.5 pixels. In addition, epipolar images were generated to determine the accuracy by applying lens distortion coefficients for photogrammetry. The calculated root mean square error of y-parallax was found to be within 0.3 pixels.
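
The verification idea in this abstract (add distortion to distortion-free coordinates, remove it again, and compare) can be illustrated with a radial-only model. This is a sketch under stated assumptions: it uses the polynomial radial model common in computer vision on normalized coordinates with two hypothetical coefficients, removes the distortion by fixed-point iteration, and does not reproduce the paper's conversion formulas or its photogrammetric model.

```python
def distort(x, y, k1, k2):
    """Apply radial distortion to normalized, distortion-free coordinates."""
    r2 = x * x + y * y
    factor = 1.0 + k1 * r2 + k2 * r2 * r2
    return x * factor, y * factor

def undistort(xd, yd, k1, k2, iterations=20):
    """Invert distort() by fixed-point iteration (suits mild distortion)."""
    x, y = xd, yd
    for _ in range(iterations):
        r2 = x * x + y * y
        factor = 1.0 + k1 * r2 + k2 * r2 * r2
        x, y = xd / factor, yd / factor
    return x, y

# Hypothetical coefficients and a hypothetical virtual point.
k1, k2 = -0.1, 0.01
original = (0.3, 0.2)
distorted = distort(*original, k1, k2)
recovered = undistort(*distorted, k1, k2)
round_trip_error = max(abs(a - b) for a, b in zip(original, recovered))
```

The round-trip error plays the role of the comparison against the distortion-free virtual coordinates, here in normalized units rather than pixels.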

Retrieval Biases Analysis on Estimation of GNSS Precipitable Water Vapor by Tropospheric Zenith Hydrostatic Models (GNSS 가강수량 추정시 건조 지연 모델에 의한 복원 정밀도 해석)

  • Nam, JinYong;Song, DongSeob
    • Journal of the Korean Society of Surveying, Geodesy, Photogrammetry and Cartography
    • /
    • v.37 no.4
    • /
    • pp.233-242
    • /
    • 2019
  • The ZHD (Zenith Hydrostatic Delay) model is an important factor, along with the weighted mean temperature, in estimating GNSS (Global Navigation Satellite System) PWV (Precipitable Water Vapor). The ZWD (Zenith Wet Delay) tends to accumulate the ZHD error, so biases from the ZHD affect the precision of GNSS PWV. In this paper, we compared the accuracy of GNSS PWV with radiosonde PWV using three ZHD models: Saastamoinen, Hopfield, and Black. We also adopted both the KWMT (Korean Weighted Mean Temperature) model and the mean temperature observed by radiosonde in the retrieval processing of GNSS PWV. To this end, one year of GNSS observation data from a total of 5 GNSS permanent stations in Korea was processed to produce PWVs, and the GNSS PWVs were compared with radiosonde PWVs to evaluate the biases. The PWV biases using the mean temperature estimated by the KWMT model are smaller than those using the radiosonde mean temperature. We could also confirm that the Saastamoinen ZHD, which is the most widely used model in GNSS meteorology, is not valid in South Korea, because the possibility of biases due to the latitude or height of the GNSS station cannot be excluded.
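
As background for the quantities named in this abstract, the sketch below computes the ZHD with the widely cited form of the Saastamoinen model and converts a ZWD to PWV using the weighted mean temperature. The constants are commonly used literature values and are assumptions here, not values taken from the paper; the function names are hypothetical, and the Hopfield, Black, and KWMT models are not shown.

```python
import math

def saastamoinen_zhd(pressure_hpa, lat_deg, height_km):
    """Zenith hydrostatic delay in metres from surface pressure (hPa)."""
    denom = (1.0
             - 0.00266 * math.cos(2.0 * math.radians(lat_deg))
             - 0.00028 * height_km)
    return 0.0022768 * pressure_hpa / denom

def pwv_from_zwd(zwd_m, tm_kelvin):
    """Convert zenith wet delay to PWV (same length unit as zwd_m)."""
    rho_w = 1000.0    # density of liquid water, kg/m^3
    r_v = 461.5       # specific gas constant of water vapour, J/(kg K)
    k2_prime = 0.221  # refractivity constant, K/Pa (22.1 K/hPa), assumed
    k3 = 3739.0       # refractivity constant, K^2/Pa (3.739e5 K^2/hPa), assumed
    pi_factor = 1.0e6 / (rho_w * r_v * (k3 / tm_kelvin + k2_prime))
    return pi_factor * zwd_m

# Hypothetical station: sea level, 45 degrees latitude, standard pressure.
zhd = saastamoinen_zhd(1013.25, 45.0, 0.0)
ztd = 2.45                       # hypothetical zenith total delay, metres
pwv = pwv_from_zwd(ztd - zhd, tm_kelvin=270.0)
```

Because the ZWD is obtained as the total delay minus the ZHD, any bias in the ZHD carries directly into the PWV, scaled by the conversion factor of roughly 0.15.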