• Title/Summary/Keyword: False Region

Video smoke detection with block DNCNN and visual change image

  • Liu, Tong;Cheng, Jianghua;Yuan, Zhimin;Hua, Honghu;Zhao, Kangcheng
    • KSII Transactions on Internet and Information Systems (TIIS) / v.14 no.9 / pp.3712-3729 / 2020
  • Smoke detection is helpful for early fire detection. With its large coverage area and low cost, vision-based smoke detection is the main research direction for outdoor smoke detection. We propose a two-stage smoke detection method that combines a block Deep Normalization and Convolutional Neural Network (DNCNN) with a visual change image. In the first stage, suspected smoke regions are detected in each frame using the block DNCNN. Based on the physical characteristics of smoke diffusion, we introduce the concept of a visual change image, which is constructed from the motion change state of the suspected smoke regions across the video and describes the physical diffusion characteristics of smoke in the time and space domains. In the second stage, a Support Vector Machine (SVM) classifier classifies the Histogram of Oriented Gradients (HOG) features of the visual change images of the suspected smoke regions, which reduces false alarms caused by smoke-like objects such as cloud and fog. Simulation experiments are carried out on two public smoke datasets. The results show that the accuracy and recall of smoke detection are high, and that the false alarm rate is much lower than that of the compared methods.
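
The second stage relies on HOG features. As a rough illustration of what such a feature measures, the sketch below builds a magnitude-weighted histogram of unsigned gradient orientations for a single cell. It is a simplification, not the authors' pipeline: real HOG also interpolates between bins and normalizes over blocks, and the SVM step is omitted. The function name `hog_cell_histogram` and the 9-bin choice are assumptions made for this example.

```python
import math

def hog_cell_histogram(cell, bins=9):
    """Magnitude-weighted histogram of unsigned gradient orientations.

    `cell` is a 2-D list of grayscale values. Border pixels are skipped
    because central differences need both neighbors.
    """
    height, width = len(cell), len(cell[0])
    hist = [0.0] * bins
    bin_width = 180.0 / bins
    for y in range(1, height - 1):
        for x in range(1, width - 1):
            gx = cell[y][x + 1] - cell[y][x - 1]   # horizontal gradient
            gy = cell[y + 1][x] - cell[y - 1][x]   # vertical gradient
            magnitude = math.hypot(gx, gy)
            # Unsigned orientation in [0, 180) degrees.
            angle = math.degrees(math.atan2(gy, gx)) % 180.0
            hist[int(angle // bin_width) % bins] += magnitude
    return hist

# Intensity rises left to right, so every gradient points along 0 degrees.
vertical_edge = [[x * 10 for x in range(5)] for _ in range(5)]
print(hog_cell_histogram(vertical_edge))
```

A classifier such as an SVM would then be trained on vectors like this one, concatenated over all cells of a visual change image.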

2-Stage Adaptive Skin Color Model for Effective Skin Color Segmentation in a Single Image (단일 영상에서 효과적인 피부색 검출을 위한 2단계 적응적 피부색 모델)

  • Do, Jun-Hyeong;Kim, Keun-Ho;Kim, Jong-Yeol
    • 한국HCI학회:학술대회논문집 / 2009.02a / pp.193-196 / 2009
  • Most studies adopt a fixed skin color model to segment the skin color region in a single image. These methods, however, result in low detection rates or high false positive rates, since the distribution of skin color varies with the characteristics of the input image. Effective skin color segmentation therefore needs an adaptive skin color model that changes with the color distribution of the input image. In this paper, we propose a novel adaptive skin color segmentation algorithm consisting of two stages, which achieves both a high detection rate and a low false positive rate.

Extraction of Text Alignment by Tensor Voting and its Application to Text Detection (텐서보팅을 이용한 텍스트 배열정보의 획득과 이를 이용한 텍스트 검출)

  • Lee, Guee-Sang;Dinh, Toan Nguyen;Park, Jong-Hyun
    • Journal of KIISE:Software and Applications / v.36 no.11 / pp.912-919 / 2009
  • A novel algorithm using 2D tensor voting and an edge-based approach is proposed for text detection in natural scene images. Tensor voting is used because characters in a text line usually lie close together on a smooth curve, so the tokens corresponding to the centers of these characters have high curve saliency values. First, a suitable edge-based method finds all possible text regions. Since the false positive rate of the edge-based detection result is high, 2D tensor voting is then applied to remove false positives and keep only text regions. The experimental results show that our method successfully detects text regions in many complex natural scene images.

Analysis of Singing Technique of Mongolian Traditional Singing Called Khoomei (몽골 전통 발성 흐미의 발성 방법 분석에 대한 사례연구)

  • Nam, Do-Hyun;Paik, Jae-Yeon;Hwang, Yoen-Shin;Choi, Hong-Shik
    • Speech Sciences / v.15 no.3 / pp.145-156 / 2008
  • The goal of this study was to investigate the acoustic and physiologic characteristics of two phonation types of 'Khoomei', a traditional singing style of people who live around the Altai mountains or the Mongolia region. Khoomei produces two pitches simultaneously: a high melody pitch is perceived along with a low drone pitch. Sygyt and Kargyraa are the most popular and identifiable styles, and they are recognized as different sounds depending on the method of voice production. Two trained Mongolian singers participated, each having used the technique for at least 5-6 years. The characteristics of their voice production were measured using a flexible fiberscope, stroboscopy, Lx Speech Studio, Spead, and Doctor Speech. In the Sygyt style, very high vocal fold closure (71.50%) was observed, with contact of both the true and false vocal folds and strong breathing support. The singers also showed increased tongue height and harmonics (around 10 dB) with movement of the resonance cavity. In contrast, the Kargyraa sound had a very low pitch, with a relaxed stomach, less laryngeal tension, and lower vocal fold contact (69.50%) than the hard Sygyt sound, and without raising the tongue during phonation. 'Khoomei' phonation can be produced by strong contact of both the true and false vocal folds and by increasing the harmonics.

X-ray Image Segmentation using Multi-task Learning

  • Park, Sejin;Jeong, Woojin;Moon, Young Shik
    • KSII Transactions on Internet and Information Systems (TIIS) / v.14 no.3 / pp.1104-1120 / 2020
  • Chest X-rays are a common way to diagnose lung cancer or pneumonia. In particular, finding lung nodules is the most important problem in the early detection of lung cancer. Recently, many automatic diagnosis algorithms have been studied to find the lung nodules missed by doctors. These algorithms are typically based on a segmentation network such as U-Net. However, false positives that resemble lung nodules but lie outside the lungs can severely degrade performance. In this study, we propose a multi-task learning method that simultaneously learns from lung-region and nodule-labeled data, based on the prior knowledge that lung nodules exist only in the lung. The proposed method significantly reduces false positives outside the lung and improves the recognition of lung nodules to an F1 score of 83.8, compared with 66.6 for single-task learning with the U-Net model. The experimental results on the JSRT public dataset demonstrate the effectiveness of the proposed method compared with other baseline methods.
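
To see why removing false positives moves the F1 score so much, the sketch below computes F1 from detection counts. The counts in the usage lines are invented for illustration and are not taken from the paper; `f1_score` is a plain helper written for this example, not a library call.

```python
def f1_score(tp, fp, fn):
    """F1 = harmonic mean of precision and recall, from raw counts."""
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    if precision + recall == 0.0:
        return 0.0
    return 2 * precision * recall / (precision + recall)

# Hypothetical counts: the same true positives and misses, but with
# most false positives removed in the second case.
with_outside_fp = f1_score(tp=70, fp=50, fn=30)
without_outside_fp = f1_score(tp=70, fp=10, fn=30)
print(round(with_outside_fp, 3), round(without_outside_fp, 3))
```

Recall is unchanged between the two cases; only precision improves, which is the effect the abstract attributes to suppressing detections outside the lung.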

Robust Object Extraction Algorithm in the Sea Environment (해양환경에서 강건한 물표 추적 알고리즘)

  • Park, Jiwon;Jeong, Jongmyeon
    • Journal of the Korean Institute of Intelligent Systems / v.24 no.3 / pp.298-303 / 2014
  • In this paper, we propose a robust object extraction and tracking algorithm for IR image sequences acquired in the sea environment. To extract objects in a size-invariant way, we detect horizontal and vertical edges using the DWT and combine them to generate a saliency map. To extract the object regions, a binarization technique is applied to the saliency map. The correspondences between objects in consecutive frames are defined by calculating the minimum weighted Euclidean distance as a matching measure. Finally, object trajectories are determined by accounting for false correspondences such as entering objects, vanishing objects, and false objects. Experimental results show that the proposed algorithm finds trajectories robustly.
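
The frame-to-frame correspondence step can be sketched as follows. This is a minimal greedy matcher under stated assumptions, not the authors' algorithm: the feature vector `(x, y, area)`, the weights, and the gating threshold `max_dist` are all hypothetical choices made for the example.

```python
import math

def weighted_distance(a, b, weights):
    """Weighted Euclidean distance between two feature vectors."""
    return math.sqrt(sum(w * (p - q) ** 2 for w, p, q in zip(weights, a, b)))

def match_objects(prev, curr, weights, max_dist):
    """Greedy one-to-one matching by increasing weighted distance.

    Returns (matches, vanished, entered): matches maps an index in `prev`
    to an index in `curr`; unmatched previous objects are treated as
    vanished and unmatched current objects as newly entered.
    """
    pairs = sorted(
        (weighted_distance(p, c, weights), i, j)
        for i, p in enumerate(prev)
        for j, c in enumerate(curr)
    )
    matches, used_prev, used_curr = {}, set(), set()
    for dist, i, j in pairs:
        if dist > max_dist:
            break  # all remaining pairs are even farther apart
        if i in used_prev or j in used_curr:
            continue
        matches[i] = j
        used_prev.add(i)
        used_curr.add(j)
    vanished = [i for i in range(len(prev)) if i not in used_prev]
    entered = [j for j in range(len(curr)) if j not in used_curr]
    return matches, vanished, entered

# Hypothetical (x, y, area) features for two consecutive frames.
prev_frame = [(10, 10, 50), (100, 40, 80)]
curr_frame = [(102, 41, 82), (12, 11, 49), (300, 200, 20)]
print(match_objects(prev_frame, curr_frame, weights=(1, 1, 0.1), max_dist=20))
```

The gate on `max_dist` is what separates a genuine correspondence from an entering or vanishing object; a globally optimal assignment could replace the greedy pass if needed.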

Activated Viewport based Surveillance Event Detection in 360-degree Video (360도 영상 공간에서 활성 뷰포트 기반 이벤트 검출)

  • Shim, Yoo-jeong;Lee, Myeong-jin
    • Journal of Broadcast Engineering / v.25 no.5 / pp.770-775 / 2020
  • Since the 360-degree ERP frame structure has location-dependent distortion, existing video surveillance algorithms cannot be applied to 360-degree video. In this paper, an activated viewport based event detection method is proposed for 360-degree video. After activated viewports enclosing object candidates are extracted, objects are detected within those viewports. These objects are tracked in the 360-degree video space for region-based event detection. The proposed method is shown to improve the recall and the false negative rate by more than 30% compared with the conventional method without activated viewports.

Extraction of the shape feature according to the risk area of the segmented tumor region based on the small-animal PET (소동물 PET기반 종양분할영역 위험구간변화에 따른 형태특성추출)

  • Lee, Joung-Min;Kim, Hyeong-Min;Kim, Myoung-Hee
    • Proceedings of the Korean Information Science Society Conference / 2006.06b / pp.376-378 / 2006
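  • This paper presents a method that automatically segments the tumor region in small-animal positron emission tomography (PET) images and analyzes the shape features of the tumor according to the geometric risk area around the segmented contour. For a tumor region detected in a PET image to be reliable, the risk areas of false negatives (FN) and false positives (FP) need to be provided along with it. The tumor region is therefore segmented automatically by Fuzzy C-Means (FCM) clustering on intensity values that reflect radiation-specific characteristics. For the risk area of the segmented tumor region, false negatives and false positives are computed from the membership values of the region shared between clusters. In addition, by varying an arbitrary membership threshold, the change in the tumor's shape features as the risk area changes is observed. Observing these local changes makes it possible to determine the morphological location of the risk area, which makes it easier to identify the location and shape of additional residual cancer associated with the risk area.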
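
The idea of reading a risk area off FCM membership values can be sketched on scalar intensities. The code below is standard Fuzzy C-Means with fuzzifier `m`, but the way the shared region is split into FN-risk and FP-risk pixels (membership just below or just above 0.5) is an assumption made for this illustration, as are the thresholds `low` and `high` and all function names.

```python
def fcm_memberships(values, centers, m=2.0):
    """Membership of each value in each cluster (each row sums to 1)."""
    rows = []
    for v in values:
        dists = [abs(v - c) for c in centers]
        if min(dists) == 0.0:
            # The value coincides with a center: full membership there.
            hits = [1.0 if d == 0.0 else 0.0 for d in dists]
            rows.append([h / sum(hits) for h in hits])
        else:
            inv = [d ** (-2.0 / (m - 1.0)) for d in dists]
            total = sum(inv)
            rows.append([x / total for x in inv])
    return rows

def fcm_1d(values, centers, m=2.0, iters=50):
    """Fuzzy C-Means on scalar intensities; returns (centers, memberships)."""
    centers = list(centers)
    for _ in range(iters):
        u = fcm_memberships(values, centers, m)
        for k in range(len(centers)):
            w = [row[k] ** m for row in u]
            if sum(w) > 0.0:
                centers[k] = sum(wi * v for wi, v in zip(w, values)) / sum(w)
    return centers, fcm_memberships(values, centers, m)

def risk_zones(memberships, tumor, low=0.3, high=0.7):
    """Assumed split of the ambiguous band around membership 0.5."""
    fn_risk = [i for i, r in enumerate(memberships) if low <= r[tumor] < 0.5]
    fp_risk = [i for i, r in enumerate(memberships) if 0.5 <= r[tumor] <= high]
    return fn_risk, fp_risk

# Hypothetical intensities: background near 11, tumor near 89, one ambiguous pixel.
intensities = [10, 11, 12, 48, 88, 89, 90]
centers, u = fcm_1d(intensities, centers=(10.0, 90.0))
print(centers, risk_zones(u, tumor=1))
```

Widening or narrowing the `low`/`high` band plays the role of the membership threshold that the abstract varies to observe how the risk area changes.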

Road Surface Marking Detection for Sensor Fusion-based Positioning System (센서 융합 기반 정밀 측위를 위한 노면 표시 검출)

  • Kim, Dongsuk;Jung, Hogi
    • Transactions of the Korean Society of Automotive Engineers / v.22 no.7 / pp.107-116 / 2014
  • This paper presents camera-based road surface marking detection methods suited to a sensor fusion-based positioning system that consists of a low-cost GPS (Global Positioning System), an INS (Inertial Navigation System), an EDM (Extended Digital Map), and a vision system. The proposed vision system consists of two parts: lane marking detection and RSM (Road Surface Marking) detection. Lane marking detection provides ROIs (Regions of Interest) that are highly likely to contain RSMs. RSM detection generates candidates in these regions and classifies their types. The proposed system focuses on detecting RSMs without false detections and on operating in real time. To ensure real-time operation, the system varies the gating for lane marking detection and switches detection methods according to an FSM (Finite State Machine) that models the driving situation. In addition, a single template matching step extracts features for both lane marking detection and RSM detection, and it is implemented efficiently with a horizontal integral image. Finally, multi-step verification is performed to minimize false detections.
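
The efficiency gain from a horizontal integral image can be sketched as follows, assuming the term means a row-wise running sum so that the sum over any horizontal run of pixels costs two lookups. The top-hat style template in `tophat_response` is a common choice for bright stripes on a darker road and is used here only as a hypothetical example; the abstract does not specify the template.

```python
def horizontal_integral(image):
    """Row-wise prefix sums: result[y][x] is the sum of image[y][:x]."""
    result = []
    for row in image:
        acc, out = 0, [0]
        for value in row:
            acc += value
            out.append(acc)
        result.append(out)
    return result

def segment_sum(integral, y, x0, x1):
    """Sum of image[y][x0:x1] in constant time."""
    return integral[y][x1] - integral[y][x0]

def tophat_response(integral, y, x, width):
    """Bright stripe of `width` pixels starting at x versus its two flanks.

    The caller must keep x - width >= 0 and x + 2 * width within the row.
    """
    center = segment_sum(integral, y, x, x + width)
    left = segment_sum(integral, y, x - width, x)
    right = segment_sum(integral, y, x + width, x + 2 * width)
    return 2 * center - left - right

# One image row with a bright three-pixel marking on a dark road.
row = [10, 10, 10, 200, 200, 200, 10, 10, 10]
integral = horizontal_integral([row])
print(tophat_response(integral, y=0, x=3, width=3))
```

Because every response needs only six lookups regardless of `width`, the same integral image can serve templates of different widths, which fits the single shared matching step described in the abstract.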

An Improved Secure Semi-fragile Watermarking Based on LBP and Arnold Transform

  • Zhang, Heng;Wang, Chengyou;Zhou, Xiao
    • Journal of Information Processing Systems / v.13 no.5 / pp.1382-1396 / 2017
  • In this paper, we analyze a recently proposed semi-fragile watermarking scheme based on local binary pattern (LBP) operators and note that it has a fundamental design flaw. In that scheme, a binary watermark is embedded into image blocks by modifying the neighborhood pixels according to the LBP pattern. However, different image blocks can have the same LBP pattern, which can lead to false detection in the watermark extraction process. In other words, one can intentionally modify the host image without affecting its watermark message. In addition, there is no encryption step before watermark embedding, which introduces another potential security problem. To illustrate these weaknesses, two special copy-paste attacks are proposed in this paper, and several experiments are conducted to demonstrate their effectiveness. To solve these problems, an improved semi-fragile watermarking scheme based on LBP operators is presented. In the watermark embedding process, the central pixel value of each block is taken into account and the Arnold transform is adopted to secure the watermark. Experimental results show that the improved watermarking scheme overcomes the above defects and locates the tampered region effectively.
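
Both ideas in this abstract can be sketched briefly. The first function shows how two different 3x3 blocks yield the same LBP code, which is the collision the analysis points to. The second pair implements the Arnold (cat map) scrambling of a square image and its inverse. The neighbor ordering in `lbp_code` and the use of `rounds` as the scrambling key are assumptions for this example, not details confirmed by the paper.

```python
def lbp_code(block):
    """8-bit LBP of a 3x3 block: each neighbor is compared with the center."""
    center = block[1][1]
    neighbors = [block[0][0], block[0][1], block[0][2], block[1][2],
                 block[2][2], block[2][1], block[2][0], block[1][0]]
    code = 0
    for bit, value in enumerate(neighbors):
        if value >= center:
            code |= 1 << bit
    return code

def arnold(img, rounds=1):
    """Arnold transform on an N x N image: (x, y) -> (x + y, x + 2y) mod N."""
    n = len(img)
    for _ in range(rounds):
        out = [[0] * n for _ in range(n)]
        for y in range(n):
            for x in range(n):
                out[(x + 2 * y) % n][(x + y) % n] = img[y][x]
        img = out
    return img

def arnold_inverse(img, rounds=1):
    """Inverse map: (x, y) -> (2x - y, y - x) mod N."""
    n = len(img)
    for _ in range(rounds):
        out = [[0] * n for _ in range(n)]
        for y in range(n):
            for x in range(n):
                out[(y - x) % n][(2 * x - y) % n] = img[y][x]
        img = out
    return img

# Two different blocks with identical LBP codes.
block_a = [[5, 9, 1], [7, 6, 3], [8, 2, 6]]
block_b = [[50, 90, 10], [70, 60, 30], [80, 20, 60]]
print(lbp_code(block_a) == lbp_code(block_b))

# Scramble a small watermark and recover it.
watermark = [[4 * y + x for x in range(4)] for y in range(4)]
print(arnold_inverse(arnold(watermark, rounds=3), rounds=3) == watermark)
```

Because the LBP code depends only on the ordering of each neighbor relative to the center, any edit that preserves those orderings leaves the code unchanged, which is why the improved scheme also takes the central pixel value into account.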