Integrated Search | Korea Science

Multi-stage Transformer for Video Anomaly Detection

Viet-Tuan Le;Khuong G. T. Diep;Tae-Seok Kim;Yong-Guk Kim
- Proceedings of the Korea Information Processing Society Conference / Korea Information Processing Society 2023 Fall Conference / pp.648-651 / 2023
Video anomaly detection aims to detect abnormal events. Motivated by the power of transformers recently shown in vision tasks, we propose a novel transformer-based network for video anomaly detection. To capture long-range information in video, we employ a multi-scale transformer as an encoder. A convolutional decoder predicts the future frame from the extracted multi-scale feature maps. The proposed method is evaluated on three benchmark datasets: UCSD Ped2, CUHK Avenue, and ShanghaiTech. The results show that the proposed method outperforms recent methods.
https://doi.org/10.3745/PKIPS.y2023m11a.648

A Single Phase Multi-level Active Power Filter System using Instantaneous Reactive Power Harmonic Detection Method

김수홍;김성민;이강희;김윤호
- The Transactions of the Korean Institute of Power Electronics / Vol. 10, No. 3 / pp.296-301 / 2005
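This paper adapts the instantaneous reactive power method, one of the harmonic detection schemes, for use in a single-phase active power filter. By assuming a virtual circuit, the $\alpha$-$\beta$ transformation is applied to the single-phase system as well, so that instantaneous reactive power detection can be used easily. A multilevel inverter serves as the harmonic compensation inverter and is connected to the input source without a transformer to compensate harmonics. The proposed algorithm is verified through simulation and experiment.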
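As a rough illustration of the idea in this abstract, the sketch below computes instantaneous real and reactive power for a single-phase signal. The abstract says only that a virtual circuit is assumed; the choice here of a quarter-period delay as the fictitious $\beta$-axis signal is an assumption, as are the function name `single_phase_pq` and the sign convention for `q` (conventions differ between references).

```python
import numpy as np

def single_phase_pq(v, i, samples_per_period):
    """Instantaneous p and q from single-phase v and i (hypothetical helper).

    Assumption: the fictitious beta-axis signal is the measured signal
    delayed by a quarter period, and the input spans whole periods so
    that the circular shift is valid.
    """
    quarter = samples_per_period // 4
    v_alpha, i_alpha = v, i
    v_beta = np.roll(v, quarter)  # measured voltage delayed by 90 degrees
    i_beta = np.roll(i, quarter)  # measured current delayed by 90 degrees
    p = v_alpha * i_alpha + v_beta * i_beta
    q = v_beta * i_alpha - v_alpha * i_beta  # one common sign convention
    return p, q
```

For a pure sinusoidal voltage and a current lagging by an angle phi, this sketch gives constant values p = cos(phi) and q = sin(phi) for unit amplitudes.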

A Robust Method for Text Detection in Video

;전승수;류한진;설상훈
- Proceedings of the Korean Information Science Society Conference / Korean Information Science Society 2007 Korea Computer Congress, Vol.34 No.1 (C) / pp.403-406 / 2007
This paper proposes an effective method for text detection in video. First, we apply an edge detection method to the video frame with a relatively low threshold to keep all possible text edge pixels. Second, a multi-frame integration method is applied to remove background pixels that are not stationary over a specific period. Finally, text regions are extracted using a coarse-to-fine projection method. Experimental results demonstrate the effectiveness of the proposed method.
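The three steps in this abstract can be sketched as follows. This is an illustration under stated assumptions, not the authors' implementation: the edge detector is a plain gradient-magnitude threshold, "integration" is taken to mean keeping pixels that are edges in every frame, and all function names and the threshold value are hypothetical.

```python
import numpy as np

def edge_map(frame, threshold=10.0):
    """Binary edge map from gradient magnitude; a low threshold keeps weak edges."""
    gy, gx = np.gradient(frame.astype(float))
    return np.hypot(gx, gy) > threshold

def stationary_edges(frames, threshold=10.0):
    """Keep only pixels that are edges in every frame (assumed integration rule)."""
    maps = np.stack([edge_map(f, threshold) for f in frames])
    return maps.all(axis=0)

def projection_runs(profile, min_count):
    """Return (start, stop) runs where the projection profile is >= min_count."""
    active = np.concatenate(([False], profile >= min_count, [False]))
    changes = np.flatnonzero(active[1:] != active[:-1])
    return list(zip(changes[::2], changes[1::2]))

def text_boxes(edges, min_count=1):
    """Coarse row bands from the row projection, then column runs inside each band."""
    boxes = []
    for top, bottom in projection_runs(edges.sum(axis=1), min_count):
        band = edges[top:bottom]
        for left, right in projection_runs(band.sum(axis=0), min_count):
            boxes.append((int(top), int(bottom), int(left), int(right)))
    return boxes
```

Calling `text_boxes(stationary_edges(frames))` on a short list of grayscale frames returns `(top, bottom, left, right)` boxes with exclusive end indices; edges of objects that move between frames are dropped before projection.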

Voiced-Unvoiced-Silence Detection Algorithm using Perceptron Neural Network

최재승
- The Journal of the Korea Institute of Electronic Communication Sciences / Vol. 6, No. 2 / pp.237-242 / 2011
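This paper proposes a segment detection algorithm that uses a multilayer perceptron neural network to detect voiced, unvoiced, and silence segments in each frame. The network is trained with the power spectrum obtained by the fast Fourier transform and the fast Fourier transform coefficients as inputs. In the experiments, speech with white noise superimposed on the original speech is fed to the neural network, and the detection performance for voiced, unvoiced, and silence segments in each frame is reported. Even when the training data and the evaluation data differed, a detection rate of 92% or higher was obtained for this speech and white noise.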
https://doi.org/10.13067/JKIECS.2011.6.2.237
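To make the input side of this abstract concrete, here is a minimal sketch of per-frame FFT features feeding a three-class perceptron. The frame length, hop size, Hann window, feature layout, and layer shapes are all assumptions, and the function names are hypothetical; the abstract specifies only that the power spectrum and FFT coefficients are the network inputs.

```python
import numpy as np

def frame_features(signal, frame_len=256, hop=128):
    """Per-frame features: power spectrum plus real and imaginary FFT coefficients."""
    window = np.hanning(frame_len)
    n_frames = 1 + (len(signal) - frame_len) // hop
    frames = np.stack([signal[k * hop : k * hop + frame_len] for k in range(n_frames)])
    coeffs = np.fft.rfft(frames * window, axis=1)
    power = np.abs(coeffs) ** 2
    return np.concatenate([power, coeffs.real, coeffs.imag], axis=1)

def mlp_forward(x, w1, b1, w2, b2):
    """One hidden layer, then softmax over (voiced, unvoiced, silence)."""
    hidden = np.tanh(x @ w1 + b1)
    logits = hidden @ w2 + b2
    logits = logits - logits.max(axis=1, keepdims=True)  # numerical stability
    probs = np.exp(logits)
    return probs / probs.sum(axis=1, keepdims=True)
```

The weights would come from training on labeled frames; with untrained weights the output is only a well-formed probability vector per frame.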

Face Detection Using Multi-level Features for Privacy Protection in Large-scale Surveillance Video

이승호;문정익;김형일;노용만
- Journal of Korea Multimedia Society / Vol. 18, No. 11 / pp.1268-1280 / 2015
In video surveillance systems, the exposure of a person's face is a serious threat to personal privacy. To protect personal privacy in large volumes of video, an automatic face detection method is required to locate and mask the person's face. However, in real-world surveillance videos, the effectiveness of existing face detection methods can deteriorate due to large variations in facial appearance (e.g., facial pose, illumination, etc.) or degraded faces (e.g., occluded or low-resolution faces). This paper proposes a new face detection method based on multi-level facial features. In a video frame, different kinds of spatial features are independently extracted and analyzed, and these features complement each other under the aforementioned challenges. Temporal domain analysis is also exploited to consolidate the proposed method. Experimental results show that, compared to competing methods, the proposed method achieves very high recall rates while maintaining acceptable precision rates.
https://doi.org/10.9717/kmms.2015.18.11.1268

Black Ice Detection Platform and Its Evaluation using Jetson Nano Devices based on Convolutional Neural Network (CNN)

Sun-Kyoung KANG;Yeonwoo LEE
- Korean Journal of Artificial Intelligence / Vol. 11, No. 4 / pp.1-8 / 2023
In this paper, we propose a black ice detection platform framework using Convolutional Neural Networks (CNNs). To address the black ice problem, we introduce a real-time early warning platform built on a CNN-based architecture, and, to enhance the accuracy of black ice detection, we apply a multi-scale dilation convolution feature fusion (MsDC-FF) technique. We then establish a specialized experimental platform using a comprehensive dataset of thermal road black ice images for training and evaluation. Experimental results on the real-time black ice detection platform show that our proposed network model performs better than conventional image segmentation models. The proposed platform achieves real-time segmentation of road black ice areas by deploying a road black ice area segmentation network on Jetson Nano edge devices. Using multi-scale dilated convolutions with different dilation rates in parallel gave faster segmentation because the model has fewer parameters. The proposed MsDC-FF Net(2) model had the fastest segmentation speed at 5.53 frames per second (FPS). The platform can thereby encourage safe driving for motorists and provide decision support for road surface management in the road traffic monitoring department.
https://doi.org/10.24225/kjai.2023.11.4.1

Visual inspection algorithm of cold rolled strips by wavelet frame transform

이창수;최종호
- Journal of Institute of Control, Robotics and Systems / Vol. 4, No. 3 / pp.372-377 / 1998
This paper deals with the detection, feature extraction and classification of surface defects in cold rolled strips. Inspection systems are one of the most important fields in factory automation. Defects such as slipmark and dullmark can be effectively detected with a Gaussian matched filter because their shapes are similar to a Gaussian. We justify that the proposed WF (Wavelet Frame) method can be regarded as a multiscale Gaussian matched filter that can be applied to the inspection of cold rolled strips. After a wavelet frame transform, the entropies and moments are computed for each subband, which passes through both a local low pass filter and a nonlinear operator. With these features as input, an MLP (Multi Layer Perceptron) is used as a classifier. The proposed inspection method was applied to real images with defects and showed good performance. The role of each extracted feature is analyzed by KLT (Karhunen-Loeve Transform).

Quasi-Orthogonal STBC with Iterative Decoding in Bit Interleaved Coded Modulation

성창경;김지훈;이인규
- The Journal of Korean Institute of Communications and Information Sciences / Vol. 33, No. 4A / pp.426-433 / 2008
In this paper, we present a method to improve the performance of the four transmit antenna quasi-orthogonal space-time block code (STBC) in a coded system. For the four transmit antenna case, the quasi-orthogonal STBC consists of two symbol groups which are orthogonal to each other, but the symbols within a group are not. In an uncoded system with matched filter detection, constellation rotation can improve the performance. However, in coded systems, its gain is absorbed by the coding gain, especially for lower rate codes. We propose an iterative decoding method to improve the performance of quasi-orthogonal codes in coded systems. With conventional quasi-orthogonal STBC detection, the joint ML detection can be improved by iterative processing between the demapper and the decoder. Simulation results show that the performance improvement is about 2 dB at 1% frame error rate.
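The group structure described in this abstract can be checked numerically. The abstract does not give the code matrix, so the sketch below assumes the widely used Jafarkhani construction built from two Alamouti blocks; the function name `qostbc` and the example QPSK symbols are illustrative only.

```python
import numpy as np

def qostbc(x1, x2, x3, x4):
    """Assumed Jafarkhani-style code matrix: rows are time slots, columns are antennas."""
    c = np.conj
    return np.array([
        [x1,      x2,     x3,     x4],
        [-c(x2),  c(x1), -c(x4),  c(x3)],
        [-c(x3), -c(x4),  c(x1),  c(x2)],
        [x4,     -x3,    -x2,     x1],
    ])

# Example unit-energy QPSK symbols, chosen so the coupling terms are nonzero.
symbols = np.exp(1j * np.pi / 4 * np.array([1, 3, 7, 1]))
code = qostbc(*symbols)

# Gram matrix of the columns: off-diagonal zeros mark orthogonal column pairs.
gram = code.conj().T @ code
```

In this Gram matrix the only nonzero off-diagonal entries couple columns 1 and 4 and columns 2 and 3, so the code splits into two pairs that are orthogonal to each other but not internally, which is why the symbols in a pair need joint detection.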

A New Anchor Shot Detection System for News Video Indexing

Lee, Han-Sung;Im, Young-Hee;Park, Joo-Young;Park, Dai-Hee
- Journal of the Korean Institute of Intelligent Systems / Vol. 18, No. 1 / pp.133-138 / 2008
In this paper, we propose a novel anchor shot detection system, named MASD (Multi-phase Anchor Shot Detection), which is a core preprocessing step for news video analysis. The proposed system is composed of four modules that operate sequentially: 1) a skin color detection module for reducing the candidate face regions; 2) a face detection module for finding the key-frames that contain facial data; 3) a vector representation module for the key-frame images using non-negative matrix factorization; 4) a one-class SVM module for determining the anchor shots using support vector data description. Besides the qualitative analysis, our experiments validate that the proposed system shows accuracy comparable to recently developed methods and a faster detection rate than the others.
https://doi.org/10.5391/JKIIS.2008.18.1.133

Speech detection from broadcast contents using multi-scale time-dilated convolutional neural networks

장병용;권오욱
- Phonetics and Speech Sciences / Vol. 11, No. 4 / pp.89-96 / 2019
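This paper proposes a deep learning model architecture that effectively detects speech segments in broadcast content. It also proposes a multi-scale time-dilated convolution layer for learning the temporal changes of feature vectors. To verify the performance of the proposed model, several comparison models are implemented, and frame-level F-score, precision, and recall are reported. The proposed model and the comparison models were all trained on the same training data: 32 hours of Korean broadcast data spanning various genres (drama, news, documentary, etc.). The proposed model showed the best performance on the Korean broadcast data, with an F-score of 91.7%. It also showed the best performance on British and Spanish broadcast data, with F-scores of 87.9% and 92.6%, respectively. Consequently, the proposed model improved speech segment detection performance by learning the temporal changes of feature vectors.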
https://doi.org/10.13064/KSSS.2019.11.4.089
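The idea of a multi-scale time-dilated convolution can be sketched on a single feature channel. This is an illustration only: the kernel, the dilation rates (1, 2, 4), the zero padding, and the function names are assumptions, and the paper's layer operates on learned multi-channel feature maps rather than one 1-D signal.

```python
import numpy as np

def dilated_conv1d(x, kernel, dilation):
    """Same-length 1-D convolution over time with `dilation` steps between taps.

    Assumes an odd kernel length so the output stays centered on the input.
    """
    k = len(kernel)
    pad = dilation * (k - 1) // 2
    padded = np.pad(x, pad)  # zero padding on both ends
    return sum(kernel[j] * padded[j * dilation : j * dilation + len(x)]
               for j in range(k))

def multi_scale_features(x, kernel, dilations=(1, 2, 4)):
    """Stack the outputs of parallel branches, one per dilation rate."""
    return np.stack([dilated_conv1d(x, kernel, d) for d in dilations])
```

Each branch sees the same number of taps but a different time span, so stacking the branches gives every frame a view of short and long temporal context at once.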
