• Title/Summary/Keyword: automatic object detection

Search Result 222, Processing Time 0.026 seconds

Automatic Object Extraction from Electronic Documents Using Deep Neural Network (심층 신경망을 활용한 전자문서 내 객체의 자동 추출 방법 연구)

  • Jang, Heejin;Chae, Yeonghun;Lee, Sangwon;Jo, Jinyong
    • KIPS Transactions on Software and Data Engineering
    • /
    • v.7 no.11
    • /
    • pp.411-418
    • /
    • 2018
  • With the proliferation of artificial intelligence technology, it is becoming important to obtain, store, and utilize scientific data in research and science sectors. A number of methods for extracting meaningful objects such as graphs and tables from research articles have been proposed to eventually obtain scientific data. Existing extraction methods using heuristic approaches are hardly applicable to electronic documents having heterogeneous manuscript formats because they are designed to work properly for some targeted manuscripts. This paper proposes a prototype of an object extraction system which exploits a recent deep-learning technology so as to overcome the inflexibility of the heuristic approaches. We implemented our trained model, based on the Faster R-CNN algorithm, using the Google TensorFlow Object Detection API and also composed an annotated data set from 100 research articles for training and evaluation. Finally, a performance evaluation shows that the proposed system outperforms a comparator adopting heuristic approaches by 5.2%.

Operational Ship Monitoring Based on Integrated Analysis of KOMPSAT-5 SAR and AIS Data (Kompsat-5 SAR와 AIS 자료 통합분석 기반 운영레벨 선박탐지 모니터링)

  • Kim, Sang-wan;Kim, Dong-Han;Lee, Yoon-Kyung
    • Korean Journal of Remote Sensing
    • /
    • v.34 no.2_2
    • /
    • pp.327-338
    • /
    • 2018
  • The possibility of ship detection monitoring at operational level using KOMPSAT-5 Synthetic Aperture Radar (SAR) and Automatic Identification System (AIS) data is investigated. For the analysis, the KOMPSAT-5 SLC images, which are collected from the west coast of Shinjin port and the northern coast of Jeju port are used along with portable AIS data from near the coast. The ship detection algorithm based on HVAS (Human Visual Attention System) was applied, which has significant advantages in terms of detection speed and accuracy compared to the commonly used CFAR (Constant False Alarm Rate). As a result of the integrated analysis, the ship detection from KOMPSAT-5 and AIS were generally consistent except for small vessels. Some ships detected in KOMPSAT-5 but not in AIS are due to the data absence from AIS, while it is clearly visible in KOMPSAT-5. Meanwhile, SAR imagery also has some false alarms due to ship wakes, ghost effect, and DEM error (or satellite orbit error) during object masking in land. Improving the developed ship detection algorithm and collecting reliable AIS data will contribute for building wide integrated surveillance system of marine territory at operational level.

Automatic Camera Pose Determination from a Single Face Image

  • Wei, Li;Lee, Eung-Joo;Ok, Soo-Yol;Bae, Sung-Ho;Lee, Suk-Hwan;Choo, Young-Yeol;Kwon, Ki-Ryong
    • Journal of Korea Multimedia Society
    • /
    • v.10 no.12
    • /
    • pp.1566-1576
    • /
    • 2007
  • Camera pose information from 2D face image is very important for making virtual 3D face model synchronize with the real face. It is also very important for any other uses such as: human computer interface, 3D object estimation, automatic camera control etc. In this paper, we have presented a camera position determination algorithm from a single 2D face image using the relationship between mouth position information and face region boundary information. Our algorithm first corrects the color bias by a lighting compensation algorithm, then we nonlinearly transformed the image into $YC_bC_r$ color space and use the visible chrominance feature of face in this color space to detect human face region. And then for face candidate, use the nearly reversed relationship information between $C_b\;and\;C_r$ cluster of face feature to detect mouth position. And then we use the geometrical relationship between mouth position information and face region boundary information to determine rotation angles in both x-axis and y-axis of camera position and use the relationship between face region size information and Camera-Face distance information to determine the camera-face distance. Experimental results demonstrate the validity of our algorithm and the correct determination rate is accredited for applying it into practice.

  • PDF

Offline In-Hand 3D Modeling System Using Automatic Hand Removal and Improved Registration Method (자동 손 제거와 개선된 정합방법을 이용한 오프라인 인 핸드 3D 모델링 시스템)

  • Kang, Junseok;Yang, Hyeonseok;Lim, Hwasup;Ahn, Sang Chul
    • Journal of the HCI Society of Korea
    • /
    • v.12 no.3
    • /
    • pp.13-23
    • /
    • 2017
  • In this paper, we propose a new in-hand 3D modeling system that improves user convenience. Since traditional modeling systems are inconvenient to use, an in-hand modeling system has been studied, where an object is handled by hand. However, there is also a problem that it requires additional equipment or specific constraints to remove hands for good modeling. In this paper, we propose a contact state change detection algorithm for automatic hand removal and improved ICP algorithm that enables outlier handling and additionally uses color for accurate registration. The proposed algorithm enables accurate modeling without additional equipment or any constraints. Through experiments using real data, we show that it is possible to accomplish accurate modeling under the general conditions without any constraint by using the proposed system.

Detection of Brain Ventricle by Using Wavelet Transform and Automatic Thresholding in MRI Brain Images (MRI 뇌 영상에서 웨이브릿 변환과 자동적인 임계치 설정을 이용한 뇌실 검출)

  • Won, Chul-Ho;Kim, Dong-Hun;Woo, Sang-Hyo;Lee, Jung-Hyun;Kim, Chang-Wook;Chung, Yoon-Su;Cho, Jin-Ho
    • Journal of Korea Multimedia Society
    • /
    • v.10 no.9
    • /
    • pp.1117-1124
    • /
    • 2007
  • In this paper, an algorithm that can define the threshold value automatically proposed in order to detect a brain ventricle in MRI brain images. After the wavelet transform, edge sharpness, which means the average magnitude of detail signals on the contour of the object, was computed by using the magnitude of horizontal and vertical detail signals. The contours of a brain ventricle were detected by increasing the threshold value repeatedly and computing edge sharpness. When the edge sharpness became maximal, the optimal threshold was determined, and the detection of a brain ventricle was accomplished finally. In this paper, we compared the proposed algorithm with the geodesic active contour model numerically and verified the efficiency of the proposed algorithm by applying real MRI brain images.

  • PDF

Automatic detection of periodontal compromised teeth in digital panoramic radiographs using faster regional convolutional neural networks

  • Thanathornwong, Bhornsawan;Suebnukarn, Siriwan
    • Imaging Science in Dentistry
    • /
    • v.50 no.2
    • /
    • pp.169-174
    • /
    • 2020
  • Purpose: Periodontal disease causes tooth loss and is associated with cardiovascular diseases, diabetes, and rheumatoid arthritis. The present study proposes using a deep learning-based object detection method to identify periodontally compromised teeth on digital panoramic radiographs. A faster regional convolutional neural network (faster R-CNN) which is a state-of-the-art deep detection network, was adapted from the natural image domain using a small annotated clinical data- set. Materials and Methods: In total, 100 digital panoramic radiographs of periodontally compromised patients were retrospectively collected from our hospital's information system and augmented. The periodontally compromised teeth found in each image were annotated by experts in periodontology to obtain the ground truth. The Keras library, which is written in Python, was used to train and test the model on a single NVidia 1080Ti GPU. The faster R-CNN model used a pretrained ResNet architecture. Results: The average precision rate of 0.81 demonstrated that there was a significant region of overlap between the predicted regions and the ground truth. The average recall rate of 0.80 showed that the periodontally compromised teeth regions generated by the detection method excluded healthiest teeth areas. In addition, the model achieved a sensitivity of 0.84, a specificity of 0.88 and an F-measure of 0.81. Conclusion: The faster R-CNN trained on a limited amount of labeled imaging data performed satisfactorily in detecting periodontally compromised teeth. The application of a faster R-CNN to assist in the detection of periodontally compromised teeth may reduce diagnostic effort by saving assessment time and allowing automated screening documentation.

Reliable Smoke Detection using Static and Dynamic Textures of Smoke Images (연기 영상의 정적 및 동적 텍스처를 이용한 강인한 연기 검출)

  • Kim, Jae-Min
    • The Journal of the Korea Contents Association
    • /
    • v.12 no.2
    • /
    • pp.10-18
    • /
    • 2012
  • Automatic smoke detection systems using a surveillance camera requires a reliable smoke detection method. When an image sequence is captured from smoke spreading over in the air, not only has each smoke image frame a special texture, called static texture, but the difference between two smoke image frames also has a peculiar texture, called dynamic texture. Even though an object has a static texture similar to that of the smoke, its dynamic texture cannot be similar to that of the smoke if its movement differs from the diffraction action of the smoke. This paper presents a reliable smoke detection method using these two textures. The proposed method first detects change regions using accumulated frame difference, and then picks out smoke regions using Haralick features extracted from two textures.

Crack Detection on the Road in Aerial Image using Mask R-CNN (Mask R-CNN을 이용한 항공 영상에서의 도로 균열 검출)

  • Lee, Min Hye;Nam, Kwang Woo;Lee, Chang Woo
    • Journal of Korea Society of Industrial Information Systems
    • /
    • v.24 no.3
    • /
    • pp.23-29
    • /
    • 2019
  • Conventional crack detection methods have a problem of consuming a lot of labor, time and cost. To solve these problems, an automatic detection system is needed to detect cracks in images obtained by using vehicles or UAVs(unmanned aerial vehicles). In this paper, we have studied road crack detection with unmanned aerial photographs. Aerial images are generated through preprocessing and labeling to generate morphological information data sets of cracks. The generated data set was applied to the mask R-CNN model to obtain a new model in which various crack information was learned. Experimental results show that the cracks in the proposed aerial image were detected with an accuracy of 73.5% and some of them were predicted in a certain type of crack region.

Real-time Moving Object Detection Based on RPCA via GD for FMCW Radar

  • Nguyen, Huy Toan;Yu, Gwang Hyun;Na, Seung You;Kim, Jin Young;Seo, Kyung Sik
    • The Journal of Korean Institute of Information Technology
    • /
    • v.17 no.6
    • /
    • pp.103-114
    • /
    • 2019
  • Moving-target detection using frequency-modulated continuous-wave (FMCW) radar systems has recently attracted attention. Detection tasks are more challenging with noise resulting from signals reflected from strong static objects or small moving objects(clutter) within radar range. Robust Principal Component Analysis (RPCA) approach for FMCW radar to detect moving objects in noisy environments is employed in this paper. In detail, compensation and calibration are first applied to raw input signals. Then, RPCA via Gradient Descents (RPCA-GD) is adopted to model the low-rank noisy background. A novel update algorithm for RPCA is proposed to reduce the computation cost. Finally, moving-targets are localized using an Automatic Multiscale-based Peak Detection (AMPD) method. All processing steps are based on a sliding window approach. The proposed scheme shows impressive results in both processing time and accuracy in comparison to other RPCA-based approaches on various experimental scenarios.

Detection method of objects with a special pattern in satellite images using Histogram Of Gradients (HOG) feature and Support Vector Machine (SVM) classifier (Histogram Of Gradients (HOG) 피쳐와 Support Vector Machine (SVM) 분류기를 이용한 위성영상에서 관심물체 탐색 방법)

  • Lim, Ingeun;Kim, Suhwan;Choi, Jonggook
    • Korean Journal of Remote Sensing
    • /
    • v.30 no.4
    • /
    • pp.537-546
    • /
    • 2014
  • In this paper, we propose a method to detect interesting objects in inaccessible areas using high resolution satellite images. We define the interesting objects as a set of objects which have conceptually similar image patterns, not having exact sizes or shapes. In this paper, we developed a learning and classifier of Support Vector Machine (SVM) that extracts characteristic data for inputted images using Histogram of Gradients (HOG) feature and detects similar objects in other images using the characteristic data. As automatic search of interesting objects in our proposed method, we identify that our method provides reduced time and efforts for manual searching similar objects.