• 제목/요약/키워드: direction classifier

검색결과 47건 처리시간 0.023초

Video smoke detection with block DNCNN and visual change image

  • Liu, Tong;Cheng, Jianghua;Yuan, Zhimin;Hua, Honghu;Zhao, Kangcheng
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • 제14권9호
    • /
    • pp.3712-3729
    • /
    • 2020
  • Smoke detection is helpful for early fire detection. With its large coverage area and low cost, vision-based smoke detection technology is the main research direction of outdoor smoke detection. We propose a two-stage smoke detection method combined with block Deep Normalization and Convolutional Neural Network (DNCNN) and visual change image. In the first stage, each suspected smoke region is detected from each frame of the images by using block DNCNN. According to the physical characteristics of smoke diffusion, a concept of visual change image is put forward in this paper, which is constructed by the video motion change state of the suspected smoke regions, and can describe the physical diffusion characteristics of smoke in the time and space domains. In the second stage, the Support Vector Machine (SVM) classifier is used to classify the Histogram of Oriented Gradients (HOG) features of visual change images of the suspected smoke regions, in this way to reduce the false alarm caused by the smoke-like objects such as cloud and fog. Simulation experiments are carried out on two public datasets of smoke. Results show that the accuracy and recall rate of smoke detection are high, and the false alarm rate is much lower than that of other comparison methods.

Robust Person Identification Using Optimal Reliability in Audio-Visual Information Fusion

  • Tariquzzaman, Md.;Kim, Jin-Young;Na, Seung-You;Choi, Seung-Ho
    • The Journal of the Acoustical Society of Korea
    • /
    • 제28권3E호
    • /
    • pp.109-117
    • /
    • 2009
  • Identity recognition in real environment with a reliable mode is a key issue in human computer interaction (HCI). In this paper, we present a robust person identification system considering score-based optimal reliability measure of audio-visual modalities. We propose an extension of the modified reliability function by introducing optimizing parameters for both of audio and visual modalities. For degradation of visual signals, we have applied JPEG compression to test images. In addition, for creating mismatch in between enrollment and test session, acoustic Babble noises and artificial illumination have been added to test audio and visual signals, respectively. Local PCA has been used on both modalities to reduce the dimension of feature vector. We have applied a swarm intelligence algorithm, i.e., particle swarm optimization for optimizing the modified convection function's optimizing parameters. The overall person identification experiments are performed using VidTimit DB. Experimental results show that our proposed optimal reliability measures have effectively enhanced the identification accuracy of 7.73% and 8.18% at different illumination direction to visual signal and consequent Babble noises to audio signal, respectively, in comparison with the best classifier system in the fusion system and maintained the modality reliability statistics in terms of its performance; it thus verified the consistency of the proposed extension.

다중 센서를 사용한 주행 환경에서의 객체 검출 및 분류 방법 (A New Object Region Detection and Classification Method using Multiple Sensors on the Driving Environment)

  • 김정언;강행봉
    • 한국멀티미디어학회논문지
    • /
    • 제20권8호
    • /
    • pp.1271-1281
    • /
    • 2017
  • It is essential to collect and analyze target information around the vehicle for autonomous driving of the vehicle. Based on the analysis, environmental information such as location and direction should be analyzed in real time to control the vehicle. In particular, obstruction or cutting of objects in the image must be handled to provide accurate information about the vehicle environment and to facilitate safe operation. In this paper, we propose a method to simultaneously generate 2D and 3D bounding box proposals using LiDAR Edge generated by filtering LiDAR sensor information. We classify the classes of each proposal by connecting them with Region-based Fully-Covolutional Networks (R-FCN), which is an object classifier based on Deep Learning, which uses two-dimensional images as inputs. Each 3D box is rearranged by using the class label and the subcategory information of each class to finally complete the 3D bounding box corresponding to the object. Because 3D bounding boxes are created in 3D space, object information such as space coordinates and object size can be obtained at once, and 2D bounding boxes associated with 3D boxes do not have problems such as occlusion.

신경회로망과 점진적 손상 모델링을 이용한 크리프 기공의 평가 (Estimation of Creep Cavities Using Neural Network and Progressive Damage Modeling)

  • 조석제;정현조
    • 대한기계학회논문집A
    • /
    • 제24권2호
    • /
    • pp.455-463
    • /
    • 2000
  • In order to develop nondestructive techniques for the quantitative estimation of creep damage a series of crept copper samples were prepared and their ultrasonic velocities were measured. Velocities measured in three directions with respect to the loading axis decreased nonlinearly and their anisotropy increased as a function of creep-induced porosity. A progressive damage model was described to explain the void-velocity relationship, including the anisotropy. The comparison of modeling study showed that the creep voids evolved from sphere toward flat oblate spheroid with its minor axis aligned along the stress direction. This model allowed us to determine the average aspect ratio of voids for a given porosity content. A novel technique, the back propagation neural network (BPNN), was applied for estimating the porosity content due to the creep damage. The measured velocities were used to train the BP classifier, and its accuracy was tested on another set of creep samples containing 0 to 0.7 % void content. When the void aspect ratio was used as input parameter together with the velocity data, the NN algorithm provided much better estimation of void content.

감시 영상에서 군중의 탈출 행동 검출 (Detection of Crowd Escape Behavior in Surveillance Video)

  • 박준욱;곽수영
    • 한국통신학회논문지
    • /
    • 제39C권8호
    • /
    • pp.731-737
    • /
    • 2014
  • 본 논문에서는 감시 카메라 환경에서 발생할 수 있는 군중의 비정상 행동 검출 방법을 제안한다. 군중들의 비정상 행동을 산발적으로 퍼지면서 뛰는 행동, 한쪽 방향으로 갑자기 뛰는 행동 두 가지로 정의하였다. 이를 검출하기 위하여 영상에서 움직임 벡터를 추출하여 군중의 비정상 행동 검출에 적합한 서술자 MHOF(Multi-scale Histogram of Optical Flow)와 DCHOF(Directional Change Histogram of Optical Flow)제안하였으며, 이를 이진 분류기인 SVM(Support Vector Machine)을 이용하여 검출하였다. 제안한 방법은 공개 데이터셋인 UMN 데이터와 PETS 2009 데이터를 이용하여 성능을 평가하였고 다른 방법론과의 비교를 통해 제안하는 알고리즘의 우수성을 입증하였다.

MHI의 형태 정보를 이용한 동작 인식 (Gesture Recognition using MHI Shape Information)

  • 김상균
    • 한국컴퓨터정보학회논문지
    • /
    • 제16권4호
    • /
    • pp.1-13
    • /
    • 2011
  • 본 논문에서는 MHI(Motion History Image)의 형태학적 정보를 이용하여 동작을 인식하는 제스처 인식(Gesture Recognition) 시스템을 제안한다. 입력되는 영상으로부터 동작에 관한 정보를 제공하는 MHI를 획득하고, 이 MHI로부터 x, y 각각의 좌표에 대한 기울기(gradient) 영상을 추출한다. 각각의 기울기 영상에 형태 문맥기법(shape context method)을 적용하여 형태 정보를 추출하고, 추출된 형태 정보 값들을 특징 값으로 사용한다. 이렇게 획득한 특징값들을 최종적으로 SVM(Support Vector Machine) 분류기로 학습 및 분류하여 동작을 인식한다. 제안하는 시스템은 MHI의 형태학적인 정보들을 사용함으로써 동작의 방향성을 인식할수 있고 다수 사람의 동작 인식이 가능하다. 뿐만 아니라 간단한 특징 추출 방법으로 높은 인식률의 시스템을 구현하였다.

영상처리 기법을 통한 RBFNN 패턴 분류기 기반 개선된 지문인식 시스템 설계 (Design of Fingerprints Identification Based on RBFNN Using Image Processing Techniques)

  • 배종수;오성권;김현기
    • 전기학회논문지
    • /
    • 제65권6호
    • /
    • pp.1060-1069
    • /
    • 2016
  • In this paper, we introduce the fingerprint recognition system based on Radial Basis Function Neural Network(RBFNN). Fingerprints are classified as four types(Whole, Arch, Right roof, Left roof). The preprocessing methods such as fast fourier transform, normalization, calculation of ridge's direction, filtering with gabor filter, binarization and rotation algorithm, are used in order to extract the features on fingerprint images and then those features are considered as the inputs of the network. RBFNN uses Fuzzy C-Means(FCM) clustering in the hidden layer and polynomial functions such as linear, quadratic, and modified quadratic are defined as connection weights of the network. Particle Swarm Optimization (PSO) algorithm optimizes a number of essential parameters needed to improve the accuracy of RBFNN. Those optimized parameters include the number of clusters and the fuzzification coefficient used in the FCM algorithm, and the orders of polynomial of networks. The performance evaluation of the proposed fingerprint recognition system is illustrated with the use of fingerprint data sets that are collected through Anguli program.

Finger Vein Recognition Based on Multi-Orientation Weighted Symmetric Local Graph Structure

  • Dong, Song;Yang, Jucheng;Chen, Yarui;Wang, Chao;Zhang, Xiaoyuan;Park, Dong Sun
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • 제9권10호
    • /
    • pp.4126-4142
    • /
    • 2015
  • Finger vein recognition is a biometric technology using finger veins to authenticate a person, and due to its high degree of uniqueness, liveness, and safety, it is widely used. The traditional Symmetric Local Graph Structure (SLGS) method only considers the relationship between the image pixels as a dominating set, and uses the relevant theories to tap image features. In order to better extract finger vein features, taking into account location information and direction information between the pixels of the image, this paper presents a novel finger vein feature extraction method, Multi-Orientation Weighted Symmetric Local Graph Structure (MOW-SLGS), which assigns weight to each edge according to the positional relationship between the edge and the target pixel. In addition, we use the Extreme Learning Machine (ELM) classifier to train and classify the vein feature extracted by the MOW-SLGS method. Experiments show that the proposed method has better performance than traditional methods.

이미지 센서와 3축 가속도 센서를 이용한 인간 행동 인식 (Human Activity Recognition using an Image Sensor and a 3-axis Accelerometer Sensor)

  • 남윤영;최유주;조위덕
    • 인터넷정보학회논문지
    • /
    • 제11권1호
    • /
    • pp.129-141
    • /
    • 2010
  • 본 논문에서는 사람의 행동 모니터링을 위한 멀티 센서 기반의 웨어러블 지능형 디바이스를 제안한다. 다중 행동을 인식하기 위해, 이미지 센서와 가속도 센서를 이용하여 행동 인식 알고리즘을 개발하였다. 멀티 센서로부터 얻은 데이터를 분석하기 위해 그리드 기반 옵티컬 플로우 방법을 제안하고 SVM 분류기법을 이용하였다. 이미지 센서로부터 얻은 모션 벡터의 방향과 크기를 이용하였고, 3축 가속도 센서로부터 얻은 데이터에서 FFT의 축과 크기와의 상관관계를 계산하였다. 실험 결과에서 이미지 센서 기반과 3축 가속도 센서기반의 행동 인식률은 각각 55.57 %, 89.97%를 보였으나 제안한 멀티센서기반의 행동인식률은 92.78% 를 보였다.

Forecasting of Various Air Pollutant Parameters in Bangalore Using Naïve Bayesian

  • Shivkumar M;Sudhindra K R;Pranesha T S;Chate D M;Beig G
    • International Journal of Computer Science & Network Security
    • /
    • 제24권3호
    • /
    • pp.196-200
    • /
    • 2024
  • Weather forecasting is considered to be of utmost important among various important sectors such as flood management and hydro-electricity generation. Although there are various numerical methods for weather forecasting but majority of them are reported to be Mechanistic computationally demanding due to their complexities. Therefore, it is necessary to develop and build models for accurately predicting the weather conditions which are faster as well as efficient in comparison to the prevalent meteorological models. The study has been undertaken to forecast various atmospheric parameters in the city of Bangalore using Naïve Bayes algorithms. The individual parameters analyzed in the study consisted of wind speed (WS), wind direction (WD), relative humidity (RH), solar radiation (SR), black carbon (BC), radiative forcing (RF), air temperature (AT), bar pressure (BP), PM10 and PM2.5 of the Bangalore city collected from Air Quality Monitoring Station for a period of 5 years from January 2015 to May 2019. The study concluded that Naive Bayes is an easy and efficient classifier that is centered on Bayes theorem, is quite efficient in forecasting the various air pollution parameters of the city of Bangalore.