• Title/Summary/Keyword: image analysis algorithm

Search Result 1,483, Processing Time 0.024 seconds

Welfare Interface using Multiple Facial Features Tracking (다중 얼굴 특징 추적을 이용한 복지형 인터페이스)

  • Ju, Jin-Sun;Shin, Yun-Hee;Kim, Eun-Yi
    • Journal of the Institute of Electronics Engineers of Korea SP
    • /
    • v.45 no.1
    • /
    • pp.75-83
    • /
    • 2008
  • We propose a welfare interface using multiple fecial features tracking, which can efficiently implement various mouse operations. The proposed system consist of five modules: face detection, eye detection, mouth detection, facial feature tracking, and mouse control. The facial region is first obtained using skin-color model and connected-component analysis(CCs). Thereafter the eye regions are localized using neutral network(NN)-based texture classifier that discriminates the facial region into eye class and non-eye class, and then mouth region is localized using edge detector. Once eye and mouth regions are localized they are continuously and correctly tracking by mean-shift algorithm and template matching, respectively. Based on the tracking results, mouse operations such as movement or click are implemented. To assess the validity of the proposed system, it was applied to the interface system for web browser and was tested on a group of 25 users. The results show that our system have the accuracy of 99% and process more than 21 frame/sec on PC for the $320{\times}240$ size input image, as such it can supply a user-friendly and convenient access to a computer in real-time operation.

Study of Sensor Technology Analysis and Site Application Model for 3D-based Global Modeling of Construction Field (건설 시공현장의 3D기반 광대역 모델링을 위한 Sensor 기술 분석과 향후 현장적용 모델 연구)

  • Kwon, Hyuk-Do;Koh, Min-Hyeok;Yoon, Su-Won;Kwon, Soon-Wook;Chin, Sang-Yoon;Kim, Yea-Sang
    • Proceedings of the Korean Institute Of Construction Engineering and Management
    • /
    • 2007.11a
    • /
    • pp.938-942
    • /
    • 2007
  • The importance of process improvement under construction has arisen from recent issue, lower productivity in the construction site. The various 3D modeling program is utilized in the procedure of construction as an alternative solution. However, it's still shortage of the consideration about a specific technical application. The purpose of the study in this paper is helpful to improve the productivity of construction site using 3D realization of constructing place as one of extensive modeling technologies, which leads to not only efficient management of construction site allowing people to check the real time situation in the place but also the revitalization of information flow about building process control and prgress, Therefore, I research into modeling algorithm and extensive construction site realization technology. 3D realization of building place would reduce the safety concerns by providing the real time information about construction site, and it could help to access easily to similar project through collecting and appling the database of sites. Furthermore it can be an opportunity to develop the procedure of production in construction industry and to upgrade the image of this field.

  • PDF

Modeling of Visual Attention Probability for Stereoscopic Videos and 3D Effect Estimation Based on Visual Attention (3차원 동영상의 시각 주의 확률 모델 도출 및 시각 주의 기반 입체감 추정)

  • Kim, Boeun;Song, Wonseok;Kim, Taejeong
    • Journal of KIISE
    • /
    • v.42 no.5
    • /
    • pp.609-620
    • /
    • 2015
  • Viewers of videos are likely to absorb more information from the part of the screen that attracts visual attention. This fact has led to the visual attention models that are being used in producing and evaluating videos. In this paper, we investigate the factors that are significant to visual attention and the mathematical form of the visual attention model. We then estimated the visual attention probability using the statistical design of experiments. The analysis of variance (ANOVA) verifies that the motion velocity, distance from the screen, and amount of defocus blur affect human visual attention significantly. Using the response surface modeling (RSM), we created a visual attention score model that concerns the three factors, from which we calculate the visual attention probabilities (VAPs) of image pixels. The VAPs are directly applied to existing gradient based 3D effect perception measurement. By giving weights according to our VAPs, our algorithm achieves more accurate measurement than the existing method. The performance of the proposed measurement is assessed by comparing them with subjective evaluation as well as with existing methods. The comparison verifies that the proposed measurement outperforms the existing ones.

A Study on Water Surface Detection Algorithm using Sentinel-1 Satellite Imagery (Sentinel-1 위성영상을 이용한 수표면 면적 추정 알고리즘에 관한 연구)

  • Lee, Dalgeun;Cheon, Eun Ji;Yun, Hyewon;Lee, Mi Hee
    • Korean Journal of Remote Sensing
    • /
    • v.35 no.5_2
    • /
    • pp.809-818
    • /
    • 2019
  • The Republic of Korea is very vulnerable to damage from storm and flood due to the rainfall phenomenon in summer and the topography of the narrow peninsula. The damage is recently getting worse because of the concentration rainfall. The accurate damage information production and analysis is required to prepare for future disaster. In this study, we analyzed the water surface area changes of Byeokjeong, Sajeom, Subu and Boryeong using Sentinel-1 satellite imagery. The surface area of the Sentinel-1 satellite, taken from May 2015 to August 2019, was preprocessed using RTC and image binarization using Otsu. The water surface area of reservoir was compared with the storage capacity from WAMIS and RIMS. As a result, Subu and Boryeong showed strong correlations of 0.850 and 0.941, respectively, and Byeokjeong and Sajeom showed the normal correlation of 0.651 and 0.657. Thus, SAR satellite imagery can be used to objective data as disaster management.

Analysis of the Cloud Removal Effect of Sentinel-2A/B NDVI Monthly Composite Images for Rice Paddy and High-altitude Cabbage Fields (논과 고랭지 배추밭 대상 Sentinel-2A/B 정규식생지수 월 합성영상의 구름 제거 효과 분석)

  • Eun, Jeong;Kim, Sun-Hwa;Kim, Taeho
    • Korean Journal of Remote Sensing
    • /
    • v.37 no.6_1
    • /
    • pp.1545-1557
    • /
    • 2021
  • Crops show sensitive spectral characteristics according to their species and growth conditions and although frequent observation is required especially in summer, it is difficult to utilize optical satellite images due to the rainy season. To solve this problem, Constrained Cloud-Maximum Normalized difference vegetation index Composite (CC-MNC) algorithm was developed to generate periodic composite images with minimal cloud effect. In thisstudy, using this method, monthly Sentinel-2A/B Normalized Difference Vegetation Index (NDVI) composite images were produced for paddies and high-latitude cabbage fields from 2019 to 2021. In August 2020, which received 200mm more precipitation than other periods, the effect of clouds, was also significant in MODIS NDVI 16-day composite product. Except for this period, the CC-MNC method was able to reduce the cloud ratio of 45.4% of the original daily image to 14.9%. In the case of rice paddy, there was no significant difference between Sentinel-2A/B and MODIS NDVI values. In addition, it was possible to monitor the rice growth cycle well even with a revisit cycle 5 days. In the case of high-latitude cabbage fields, Sentinel-2A/B showed the short growth cycle of cabbage well, but MODIS showed limitations in spatial resolution. In addition, the CC-MNC method showed that cloud pixels were used for compositing at the harvest time, suggesting that the View Zenith Angle (VZA) threshold needsto be adjusted according to the domestic region.

A Study on the Methodology of Early Diagnosis of Dementia Based on AI (Artificial Intelligence) (인공지능(AI) 기반 치매 조기진단 방법론에 관한 연구)

  • Oh, Sung Hoon;Jeon, Young Jun;Kwon, Young Woo;Jeong, Seok Chan
    • The Journal of Bigdata
    • /
    • v.6 no.1
    • /
    • pp.37-49
    • /
    • 2021
  • The number of dementia patients in Korea is estimated to be over 800,000, and the severity of dementia is becoming a social problem. However, no treatment or drug has yet been developed to cure dementia worldwide. The number of dementia patients is expected to increase further due to the rapid aging of the population. Currently, early detection of dementia and delaying the course of dementia symptoms is the best alternative. This study presented a methodology for early diagnosis of dementia by measuring and analyzing amyloid plaques. This vital protein can most clearly and early diagnose dementia in the retina through AI-based image analysis. We performed binary classification and multi-classification learning based on CNN on retina data. We also developed a deep learning algorithm that can diagnose dementia early based on pre-processed retinal data. Accuracy and recall of the deep learning model were verified, and as a result of the verification, and derived results that satisfy both recall and accuracy. In the future, we plan to continue the study based on clinical data of actual dementia patients, and the results of this study are expected to solve the dementia problem.

Speech Recognition Using Linear Discriminant Analysis and Common Vector Extraction (선형 판별분석과 공통벡터 추출방법을 이용한 음성인식)

  • 남명우;노승용
    • The Journal of the Acoustical Society of Korea
    • /
    • v.20 no.4
    • /
    • pp.35-41
    • /
    • 2001
  • This paper describes Linear Discriminant Analysis and common vector extraction for speech recognition. Voice signal contains psychological and physiological properties of the speaker as well as dialect differences, acoustical environment effects, and phase differences. For these reasons, the same word spelled out by different speakers can be very different heard. This property of speech signal make it very difficult to extract common properties in the same speech class (word or phoneme). Linear algebra method like BT (Karhunen-Loeve Transformation) is generally used for common properties extraction In the speech signals, but common vector extraction which is suggested by M. Bilginer et at. is used in this paper. The method of M. Bilginer et al. extracts the optimized common vector from the speech signals used for training. And it has 100% recognition accuracy in the trained data which is used for common vector extraction. In spite of these characteristics, the method has some drawback-we cannot use numbers of speech signal for training and the discriminant information among common vectors is not defined. This paper suggests advanced method which can reduce error rate by maximizing the discriminant information among common vectors. And novel method to normalize the size of common vector also added. The result shows improved performance of algorithm and better recognition accuracy of 2% than conventional method.

  • PDF

Improvement of 2-pass DInSAR-based DEM Generation Method from TanDEM-X bistatic SAR Images (TanDEM-X bistatic SAR 영상의 2-pass 위성영상레이더 차분간섭기법 기반 수치표고모델 생성 방법 개선)

  • Chae, Sung-Ho
    • Korean Journal of Remote Sensing
    • /
    • v.36 no.5_1
    • /
    • pp.847-860
    • /
    • 2020
  • The 2-pass DInSAR (Differential Interferometric SAR) processing steps for DEM generation consist of the co-registration of SAR image pair, interferogram generation, phase unwrapping, calculation of DEM errors, and geocoding, etc. It requires complicated steps, and the accuracy of data processing at each step affects the performance of the finally generated DEM. In this study, we developed an improved method for enhancing the performance of the DEM generation method based on the 2-pass DInSAR technique of TanDEM-X bistatic SAR images was developed. The developed DEM generation method is a method that can significantly reduce both the DEM error in the unwrapped phase image and that may occur during geocoding step. The performance analysis of the developed algorithm was performed by comparing the vertical accuracy (Root Mean Square Error, RMSE) between the existing method and the newly proposed method using the ground control point (GCP) generated from GPS survey. The vertical accuracy of the DInSAR-based DEM generated without correction for the unwrapped phase error and geocoding error is 39.617 m. However, the vertical accuracy of the DEM generated through the proposed method is 2.346 m. It was confirmed that the DEM accuracy was improved through the proposed correction method. Through the proposed 2-pass DInSAR-based DEM generation method, the SRTM DEM error observed by DInSAR was compensated for the SRTM 30 m DEM (vertical accuracy 5.567 m) used as a reference. Through this, it was possible to finally create a DEM with improved spatial resolution of about 5 times and vertical accuracy of about 2.4 times. In addition, the spatial resolution of the DEM generated through the proposed method was matched with the SRTM 30 m DEM and the TanDEM-X 90m DEM, and the vertical accuracy was compared. As a result, it was confirmed that the vertical accuracy was improved by about 1.7 and 1.6 times, respectively, and more accurate DEM generation was possible with the proposed method. If the method derived in this study is used to continuously update the DEM for regions with frequent morphological changes, it will be possible to update the DEM effectively in a short time at low cost.

Design of a Bit-Serial Divider in GF(2$^{m}$ ) for Elliptic Curve Cryptosystem (타원곡선 암호시스템을 위한 GF(2$^{m}$ )상의 비트-시리얼 나눗셈기 설계)

  • 김창훈;홍춘표;김남식;권순학
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.27 no.12C
    • /
    • pp.1288-1298
    • /
    • 2002
  • To implement elliptic curve cryptosystem in GF(2$\^$m/) at high speed, a fast divider is required. Although bit-parallel architecture is well suited for high speed division operations, elliptic curve cryptosystem requires large m(at least 163) to support a sufficient security. In other words, since the bit-parallel architecture has an area complexity of 0(m$\^$m/), it is not suited for this application. In this paper, we propose a new serial-in serial-out systolic array for computing division operations in GF(2$\^$m/) using the standard basis representation. Based on a modified version of tile binary extended greatest common divisor algorithm, we obtain a new data dependence graph and design an efficient bit-serial systolic divider. The proposed divider has 0(m) time complexity and 0(m) area complexity. If input data come in continuously, the proposed divider can produce division results at a rate of one per m clock cycles, after an initial delay of 5m-2 cycles. Analysis shows that the proposed divider provides a significant reduction in both chip area and computational delay time compared to previously proposed systolic dividers with the same I/O format. Since the proposed divider can perform division operations at high speed with the reduced chip area, it is well suited for division circuit of elliptic curve cryptosystem. Furthermore, since the proposed architecture does not restrict the choice of irreducible polynomial, and has a unidirectional data flow and regularity, it provides a high flexibility and scalability with respect to the field size m.

Efficient Deep Learning Approaches for Active Fire Detection Using Himawari-8 Geostationary Satellite Images (Himawari-8 정지궤도 위성 영상을 활용한 딥러닝 기반 산불 탐지의 효율적 방안 제시)

  • Sihyun Lee;Yoojin Kang;Taejun Sung;Jungho Im
    • Korean Journal of Remote Sensing
    • /
    • v.39 no.5_3
    • /
    • pp.979-995
    • /
    • 2023
  • As wildfires are difficult to predict, real-time monitoring is crucial for a timely response. Geostationary satellite images are very useful for active fire detection because they can monitor a vast area with high temporal resolution (e.g., 2 min). Existing satellite-based active fire detection algorithms detect thermal outliers using threshold values based on the statistical analysis of brightness temperature. However, the difficulty in establishing suitable thresholds for such threshold-based methods hinders their ability to detect fires with low intensity and achieve generalized performance. In light of these challenges, machine learning has emerged as a potential-solution. Until now, relatively simple techniques such as random forest, Vanilla convolutional neural network (CNN), and U-net have been applied for active fire detection. Therefore, this study proposed an active fire detection algorithm using state-of-the-art (SOTA) deep learning techniques using data from the Advanced Himawari Imager and evaluated it over East Asia and Australia. The SOTA model was developed by applying EfficientNet and lion optimizer, and the results were compared with the model using the Vanilla CNN structure. EfficientNet outperformed CNN with F1-scores of 0.88 and 0.83 in East Asia and Australia, respectively. The performance was better after using weighted loss, equal sampling, and image augmentation techniques to fix data imbalance issues compared to before the techniques were used, resulting in F1-scores of 0.92 in East Asia and 0.84 in Australia. It is anticipated that timely responses facilitated by the SOTA deep learning-based approach for active fire detection will effectively mitigate the damage caused by wildfires.