• Title/Summary/Keyword: classification-based segmentation

Search Result 295, Processing Time 0.025 seconds

Speech/Music Classification Based on the Higher-Order Moments of Subband Energy

  • Seo, Jiin Soo
    • Journal of Korea Multimedia Society
    • /
    • v.21 no.7
    • /
    • pp.737-744
    • /
    • 2018
  • This paper presents a study on the performance of the higher-order moments for speech/music classification. For a successful speech/music classifier, extracting features that allow direct access to the relevant speech or music specific information is crucial. In addition to the conventional variance-based features, we utilize the higher-order moments of features, such as skewness and kurtosis. Moreover, we investigate the subband decomposition parameters in extracting features, which improves classification accuracy. Experiments on two speech/music datasets, which are publicly available, were performed and show that the higher-order moment features can improve classification accuracy when combined with the conventional variance-based features.

Texture Classification Based on Gabor-like Feature (유사 가버 특징에 기반한 텍스쳐 분류)

  • Son, Ji-Hoon;Kim, Sung-Young
    • The Journal of Korea Institute of Information, Electronics, and Communication Technology
    • /
    • v.10 no.2
    • /
    • pp.147-153
    • /
    • 2017
  • Efficient texture representation is very important in computer vision fields. The performance of texture classification or/and segmentation can be improved based on efficient texture representation. Gabor filter is a representation method that has long history for texture representation based on multi-scale analysis. Gabor filter shows good performance in texture classification and segmentation but requires much processing time. In this paper, we propose new texture representation method that is also based on multi-scale analysis. The proposed representation can provide similar performance in texture classification but can reduce processing time against Gabor filter. Experimental results show good performance of our method.

Multi-Tasking U-net Based Paprika Disease Diagnosis (Multi-Tasking U-net 기반 파프리카 병해충 진단)

  • Kim, Seo Jeong;Kim, Hyong Suk
    • Smart Media Journal
    • /
    • v.9 no.1
    • /
    • pp.16-22
    • /
    • 2020
  • In this study, a neural network method performing both Detection and Classification of diseases and insects in paprika is proposed with Multi-Tasking U-net. Paprika on farms does not have a wide variety of diseases in this study, only two classes such as powdery mildew and mite, which occur relatively frequently are made as the targets. Aiming to this, a U-net is used as a backbone network, and the last layers of the encoder and the decoder of the U-net are utilized for classification and segmentation, respectively. As the result, the encoder of the U-net is shared for both of detection and classification. The training data are composed of 680 normal leaves, 450 mite-damaged leaves, and 370 powdery mildews. The test data are 130 normal leaves, 100 mite-damaged leaves, and 90 powdery mildews. Its test results shows 89% of recognition accuracy.

Automatic Classification of Continuous Heart Sound Signals Using the Statistical Modeling Approach (통계적 모델링 기법을 이용한 연속심음신호의 자동분류에 관한 연구)

  • Kim, Hee-Keun;Chung, Yong-Joo
    • The Journal of the Acoustical Society of Korea
    • /
    • v.26 no.4
    • /
    • pp.144-152
    • /
    • 2007
  • Conventional research works on the classification of the heart sound signal have been done mainly with the artificial neural networks. But the analysis results on the statistical characteristic of the heart sound signal have shown that the HMM is suitable for modeling the heart sound signal. In this paper, we model the various heart sound signals representing different heart diseases with the HMM and find that the classification rate is much affected by the clustering of the heart sound signal. Also, the heart sound signal acquired in real environments is a continuous signal without any specified starting and ending points of time. Hence, for the classification based on the HMM, the continuous cyclic heart sound signal needs to be manually segmented to obtain isolated cycles of the signal. As the manual segmentation will incur the errors in the segmentation and will not be adequate for real time processing, we propose a variant of the ergodic HMM which does not need segmentation procedures. Simulation results show that the proposed method successfully classifies continuous heart sounds with high accuracy.

Automatic Sputum Color Image Segmentation for Lung Cancer Diagnosis

  • Taher, Fatma;Werghi, Naoufel;Al-Ahmad, Hussain
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.7 no.1
    • /
    • pp.68-80
    • /
    • 2013
  • Lung cancer is considered to be the leading cause of cancer death worldwide. A technique commonly used consists of analyzing sputum images for detecting lung cancer cells. However, the analysis of sputum is time consuming and requires highly trained personnel to avoid errors. The manual screening of sputum samples has to be improved by using image processing techniques. In this paper we present a Computer Aided Diagnosis (CAD) system for early detection and diagnosis of lung cancer based on the analysis of the sputum color image with the aim to attain a high accuracy rate and to reduce the time consumed to analyze such sputum samples. In order to form general diagnostic rules, we present a framework for segmentation and extraction of sputum cells in sputum images using respectively, a Bayesian classification method followed by region detection and feature extraction techniques to determine the shape of the nuclei inside the sputum cells. The final results will be used for a (CAD) system for early detection of lung cancer. We analyzed the performance of a Bayesian classification with respect to the color space representation and quantification. Our methods were validated via a series of experimentation conducted with a data set of 100 images. Our evaluation criteria were based on sensitivity, specificity and accuracy.

A comparative study on UAV pilot license by the classification criteria (무인비행장치 분류기준에 따른 조종 자격제도 비교 연구)

  • Kim, Yongseok;Choi, Sungwon
    • Journal of the Korean Society for Aviation and Aeronautics
    • /
    • v.27 no.1
    • /
    • pp.26-33
    • /
    • 2019
  • It is necessary to establish a UAV pilot license and training system because the number of UAV-related accidents has rapidly risen. Most of accidents are caused by the human factors such as the lack of control skill and aviation knowledge. In this paper, we investigate licensing policy of small UAV pilots and examine the level of UAV licensing system and classification criteria based on comparative analysis of national cases such as USA, UK and China. Recently, the Ministry of Land, Infrastructure and Transport Affairs is planning to improve the safety regulation by taking into account the risk level of the licensing system, which has been classified according to the existing weight and commercial purpose. From the comparative analysis, we suggested a improvement policy for UAV licensing system in the view of pilot license segmentation, beyond Visual Line-of-sight flight and high risk UAV for non-commercial.

Land Cover Classifier Using Coordinate Hash Encoder (좌표 해시 인코더를 활용한 토지피복 분류 모델)

  • Yongsun Yoon;Dongjae Kwon
    • Korean Journal of Remote Sensing
    • /
    • v.39 no.6_3
    • /
    • pp.1771-1777
    • /
    • 2023
  • With the advancements of deep learning, many semantic segmentation-based methods for land cover classification have been proposed. However, existing deep learning-based models only use image information and cannot guarantee spatiotemporal consistency. In this study, we propose a land cover classification model using geographical coordinates. First, the coordinate features are extracted through the Coordinate Hash Encoder, which is an extension of the Multi-resolution Hash Encoder, an implicit neural representation technique, to the longitude-latitude coordinate system. Next, we propose an architecture that combines the extracted coordinate features with different levels of U-net decoder. Experimental results show that the proposed method improves the mean intersection over union by about 32% and improves the spatiotemporal consistency.

Unconstrained Handwritten Numeral Sti-ing Recognition by Using Decision Value Generator (결정값 발생기를 이용한 무제약 필기체 숫자 열의 인식)

  • 김계경;김진호;박희주
    • Journal of Korea Society of Industrial Information Systems
    • /
    • v.6 no.1
    • /
    • pp.82-89
    • /
    • 2001
  • This paper presents recognition of unconstrained handwritten numeral strings using decision value generator, which is combined with both isolated digit identifier and recognizer designed with structural characteristics of digits. Numerical string recognition system is composed of three modules, which are pre-segmentation, segmentation and recognition. Pre-segmentation module classifies a numeral string into sub-images, which are isolated digit, touched digits or broken digit, using confidence value of decision value generator. Segmentation module segments touched digits using reliability value of decision value generator that will separate the leftmost digit from touched string of digits. Segmentation-based and segmentation-free methods have used for classification and segmentation, respectively. To evaluate proposed method, experiments have carried out with handwritten numeral strings of NIST SD19 and higher recognition performance than previous works has obtained with 96.7%.

  • PDF

A Fast Lower Extremity Vessel Segmentation Method for Large CT Data Sets Using 3-Dimensional Seeded Region Growing and Branch Classification

  • Kim, Dong-Sung
    • Journal of Biomedical Engineering Research
    • /
    • v.29 no.5
    • /
    • pp.348-354
    • /
    • 2008
  • Segmenting vessels in lower extremity CT images is very difficult because of gray level variation, connection to bones, and their small sizes. Instead of segmenting vessels, we propose an approach that segments bones and subtracts them from the original CT images. The subtracted images can contain not only connected vessel structures but also isolated vessels, which are very difficult to detect using conventional vessel segmentation methods. The proposed method initially grows a 3-dimensional (3D) volume with a seeded region growing (SRG) using an adaptive threshold and then detects junctions and forked branches. The forked branches are classified into either bone branches or vessel branches based on appearance, shape, size change, and moving velocity of the branch. The final volume is re-grown by collecting connected bone branches. The algorithm has produced promising results for segmenting bone structures in several tens of vessel-enhanced CT image data sets of lower extremities.

A Study on Automatic Vehicle Extraction within Drone Image Bounding Box Using Unsupervised SVM Classification Technique (무감독 SVM 분류 기법을 통한 드론 영상 경계 박스 내 차량 자동 추출 연구)

  • Junho Yeom
    • Land and Housing Review
    • /
    • v.14 no.4
    • /
    • pp.95-102
    • /
    • 2023
  • Numerous investigations have explored the integration of machine leaning algorithms with high-resolution drone image for object detection in urban settings. However, a prevalent limitation in vehicle extraction studies involves the reliance on bounding boxes rather than instance segmentation. This limitation hinders the precise determination of vehicle direction and exact boundaries. Instance segmentation, while providing detailed object boundaries, necessitates labour intensive labelling for individual objects, prompting the need for research on automating unsupervised instance segmentation in vehicle extraction. In this study, a novel approach was proposed for vehicle extraction utilizing unsupervised SVM classification applied to vehicle bounding boxes in drone images. The method aims to address the challenges associated with bounding box-based approaches and provide a more accurate representation of vehicle boundaries. The study showed promising results, demonstrating an 89% accuracy in vehicle extraction. Notably, the proposed technique proved effective even when dealing with significant variations in spectral characteristics within the vehicles. This research contributes to advancing the field by offering a viable solution for automatic and unsupervised instance segmentation in the context of vehicle extraction from image.