Search | Korea Science

Dilated convolution and gated linear unit based sound event detection and tagging algorithm using weak label (약한 레이블을 이용한 확장 합성곱 신경망과 게이트 선형 유닛 기반 음향 이벤트 검출 및 태깅 알고리즘)

Park, Chungho;Kim, Donghyun;Ko, Hanseok
- The Journal of the Acoustical Society of Korea
- /
- v.39 no.5
- /
- pp.414-423
- /
- 2020
In this paper, we propose a Dilated Convolution Gate Linear Unit (DCGLU) to mitigate the lack of sparsity and small receptive field problems caused by the segmentation map extraction process in sound event detection with weak labels. In the advent of deep learning framework, segmentation map extraction approaches have shown improved performance in noisy environments. However, these methods are forced to maintain the size of the feature map to extract the segmentation map as the model would be constructed without a pooling operation. As a result, the performance of these methods is deteriorated with a lack of sparsity and a small receptive field. To mitigate these problems, we utilize GLU to control the flow of information and Dilated Convolutional Neural Networks (DCNNs) to increase the receptive field without additional learning parameters. For the performance evaluation, we employ a URBAN-SED and self-organized bird sound dataset. The relevant experiments show that our proposed DCGLU model outperforms over other baselines. In particular, our method is shown to exhibit robustness against nature sound noises with three Signal to Noise Ratio (SNR) levels (20 dB, 10 dB and 0 dB).
https://doi.org/10.7776/ASK.2020.39.5.414 인용 PDF KSCI

Retrieval of Broadcast News Using Audio Content Analysis

Kim, Hyoung-Gook
- The Journal of the Acoustical Society of Korea
- /
- v.26 no.3E
- /
- pp.74-79
- /
- 2007
In this paper, we report our recent work on a indexing and retrieval system of broadcast news using audio content analysis. Key issues addressed in this work are two major parts of the audio indexing system: anchorperson detection based on audio segmentation, and phone-based spoken document retrieval, developed in the framework of the emerging MPEG-7 standard. Experiments are conducted on a database of Britisch broadcast news videos. We discuss the development of the retrieval system, and the evaluation of each part and the retrieval system.
PDF KSCI

ECG Pattern Classification Using Back Propagation Neural Network (역전달 신경회로망을 이용한 심전도 신호의 패턴분류에 관한 연구)

이제석;이정환;권혁제;이명호
- Journal of the Korean Institute of Telematics and Electronics B
- /
- v.30B no.6
- /
- pp.67-75
- /
- 1993
ECG pattern was classified using a back-propagation neural network. An improved feature extractor of ECG is proposed for better classification capability. It is consisted of preprocessing ECG signal by an FIR filter faster than conventional one by a factor of 5. QRS complex recognition by moving-window integration, and peak extraction by quadratic approximation. Since the FIR filter had a periodic frequency spectrum, only one-fifth of usual processing time was required. Also, segmentation of ECG signal followed by quadratic approximation of each segment enabled accurate detection of both P and T waves. When improtant features were extracted and fed into back-propagation neural network for pattern classification, the required number of nodes in hidden and input layers was reduced compared to using raw data as an input, also reducing the necessary time for study. Accurate pattern classification was possible by an appropriate feature selection.
PDF

Automatic Syllable Segmentation Algorithm in Noise Additional Continuous Speech (잡음이 첨가된 연속음성에서의 자동 음절분할 알고리즘)

Kim, Young-Sub;Cha, Young-Dong;Kim, Chang-Keun;Lee, Kwang-Seok;Hur, Kang-In
- Proceedings of the Korea Institute of Convergence Signal Processing
- /
- 2006.06a
- /
- pp.17-20
- /
- 2006
본 논문에서는 잡음이 첨가된 연속음성에서의 자동 음절분할을 위해 기존에 사용되고 있는 특징 파라미터인 단구간 에너지 이외에 잡음에 강인한 특성을 가지고 있는 새로운 특징인 스펙트럼 밀도비교척도와 의사역행렬을 이용한 선형판별함수를 제안한다. 기존에 사용되는 단구간 에너지는 잡음이 없는 환경에서는 좋은 성능을 나타내지만 잡음환경에서는 그렇지 못하다. 반면에 논문에서 제안한 척도들은 반대의 성능을 가지므로 주변잡음의 크기에 따라 각각의 파라미터를 적절한 가중치로 조합하는 음절구간 결정함수와 유한상태 머신을 추가로 사용면 무 잡음 환경뿐만 아니라, 잡음이 첨가된 연속음성에서도 일정수준 이상의 음절구간을 분리해 낼 수 있다.
PDF

Parallel Connected Component Labeling Based on the Selective Four Directional Label Search Using CUDA

Soh, Young-Sung;Hong, Jung-Woo
- Journal of the Institute of Convergence Signal Processing
- /
- v.16 no.3
- /
- pp.83-89
- /
- 2015
Connected component labeling (CCL) is a mandatory step in image segmentation where objects are extracted and uniquely labeled. CCL is a computationally expensive operation and thus is often done in parallel processing framework to reduce execution time. Various parallel CCL methods have been proposed in the literature. Among them are NSZ label equivalence (NSZ-LE) method, modified 8 directional label selection (M8DLS) method, HYBRID1 method, and HYBRID2 method. Soh et al. showed that HYBRID2 outperforms the others and is the best so far. In this paper we propose a new hybrid parallel CCL algorithm termed as HYBRID3 that combines selective four directional label search (S4DLS) with label backtracking (LB). We show that the average percentage speedup of the proposed over M8DLS is around 60% more than that of HYBRID2 over M8DLS for various kinds of images.
PDF KSCI

Restoration of Bi-level Images via Iterative Semi-blind Wiener Filtering (반복 semi-blind 위너 필터링을 이용한 이진영상의 복원)

Kim, Jeong-Tae
- The Transactions of The Korean Institute of Electrical Engineers
- /
- v.57 no.7
- /
- pp.1290-1294
- /
- 2008
We present a novel deblurring algorithm for bi-level images blurred by some parameterizable point spread function. The proposed method iteratively searches unknown parameters in the point spread function and noise-to-signal ratio by minimizing an objective function that is based on the binariness and the difference between two intensity values of restoring image. In simulations and experiments, the proposed method showed improved performance compared with the Wiener filtering based method in terms of bit error rate after segmentation.
PDF KSCI

A Review of 3D Object Tracking Methods Using Deep Learning (딥러닝 기술을 이용한 3차원 객체 추적 기술 리뷰)

Park, Hanhoon
- Journal of the Institute of Convergence Signal Processing
- /
- v.22 no.1
- /
- pp.30-37
- /
- 2021
Accurate 3D object tracking with camera images is a key enabling technology for augmented reality applications. Motivated by the impressive success of convolutional neural networks (CNNs) in computer vision tasks such as image classification, object detection, image segmentation, recent studies for 3D object tracking have focused on leveraging deep learning. In this paper, we review deep learning approaches for 3D object tracking. We describe key methods in this field and discuss potential future research directions.
PDF KSCI

Region-Growing Segmentation Algorithm for Rossless Image Compression to High-Resolution Medical Image (영역 성장 분할 기법을 이용한 무손실 영상 압축)

박정선;김길중;전계록
- Journal of the Institute of Convergence Signal Processing
- /
- v.3 no.1
- /
- pp.33-40
- /
- 2002
In this paper, we proposed a lossless compression algorithm of medical images which is essential technique in picture archive and communication system. Mammographic image and magnetic resonance image in among medical images used in this study, proposed a region growing segmentation algorithm for compression of these images. A proposed algorithm was partition by three sub region which error image, discontinuity index map, high order bit data from original image. And generated discontinuity index image data and error image which apply to a region growing algorithm are compressed using JBIG(Joint Bi-level Image experts Group) algorithm that is international hi-level image compression standard and proper image compression technique of gray code digital Images. The proposed lossless compression method resulted in, on the average, lossless compression to about 73.14% with a database of high-resolution digital mammography images. In comparison with direct coding by JBIG, JPEG, and Lempel-Ziv coding methods, the proposed method performed better by 3.7%, 7.9% and 23.6% on the database used.
PDF

Automated 3D scoring of fluorescence in situ hybridization (FISH) using a confocal whole slide imaging scanner

Ziv Frankenstein;Naohiro Uraoka;Umut Aypar;Ruth Aryeequaye;Mamta Rao;Meera Hameed;Yanming Zhang;Yukako Yagi
- Applied Microscopy
- /
- v.51
- /
- pp.4.1-4.12
- /
- 2021
Fluorescence in situ hybridization (FISH) is a technique to visualize specific DNA/RNA sequences within the cell nuclei and provide the presence, location and structural integrity of genes on chromosomes. A confocal Whole Slide Imaging (WSI) scanner technology has superior depth resolution compared to wide-field fluorescence imaging. Confocal WSI has the ability to perform serial optical sections with specimen imaging, which is critical for 3D tissue reconstruction for volumetric spatial analysis. The standard clinical manual scoring for FISH is labor-intensive, time-consuming and subjective. Application of multi-gene FISH analysis alongside 3D imaging, significantly increase the level of complexity required for an accurate 3D analysis. Therefore, the purpose of this study is to establish automated 3D FISH scoring for z-stack images from confocal WSI scanner. The algorithm and the application we developed, SHIMARIS PAFQ, successfully employs 3D calculations for clear individual cell nuclei segmentation, gene signals detection and distribution of break-apart probes signal patterns, including standard break-apart, and variant patterns due to truncation, and deletion, etc. The analysis was accurate and precise when compared with ground truth clinical manual counting and scoring reported in ten lymphoma and solid tumors cases. The algorithm and the application we developed, SHIMARIS PAFQ, is objective and more efficient than the conventional procedure. It enables the automated counting of more nuclei, precisely detecting additional abnormal signal variations in nuclei patterns and analyzes gigabyte multi-layer stacking imaging data of tissue samples from patients. Currently, we are developing a deep learning algorithm for automated tumor area detection to be integrated with SHIMARIS PAFQ.
https://doi.org/10.1186/s42649-021-00053-y 인용 PDF

An Extraction Method of Glomerulus Region from Renal Tissue Image (신장조직 영상에서 사구체 영역의 추출법)

Kim, Eung-Kyeu
- Journal of the Institute of Convergence Signal Processing
- /
- v.13 no.2
- /
- pp.70-76
- /
- 2012
In this paper, an automatic extraction method of glomerulus region from human renal tissue image is presented. The important information reflecting the state of kidneys richly included in the glomeruli, so it should be the first step to extract the glomerulus region from the renal tissue image for the further quantitative analysis of the renal condition. Especially, there is no clear difference between the glomerulus and other tissues, so the glomerulus region can not be easily extracted from its background by the existing segmentation methods. The outer edge of a glomerulus region is regarded as a common property for the regions of this kind ; a two- dimensional Gaussian distribution is used to convolve with an original image first and then the image is thresholded at this blurred image ; a closed curve corresponding to the outer edge can be obtained by usual pattern processing skills like thinning, branch-cutting, hole-filling etc., Finally, the glomerulus region can be obtained by extracting the area in the original image surrounded by the closed curve. The glomerulus regions are correctly extracted by 85 percentages and experimental results show the proposed method is effective.
PDF KSCI

Search Result 135, Processing Time 0.034 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)