Search | Korea Science

Issues and Empirical Results for Improving Text Classification

Ko, Young-Joong;Seo, Jung-Yun
- Journal of Computing Science and Engineering
- /
- v.5 no.2
- /
- pp.150-160
- /
- 2011
Automatic text classification has a long history and many studies have been conducted in this field. In particular, many machine learning algorithms and information retrieval techniques have been applied to text classification tasks. Even though much technical progress has been made in text classification, there is still room for improvement in text classification. In this paper, we will discuss remaining issues in improving text classification. In this paper, three improvement issues are presented including automatic training data generation, noisy data treatment and term weighting and indexing, and four actual studies and their empirical results for those issues are introduced. First, the semi-supervised learning technique is applied to text classification to efficiently create training data. For effective noisy data treatment, a noisy data reduction method and a robust text classifier from noisy data are developed as a solution. Finally, the term weighting and indexing technique is revised by reflecting the importance of sentences into term weight calculation using summarization techniques.
https://doi.org/10.5626/JCSE.2011.5.2.150 인용 PDF KPUBS

An adaptive nonlocal filtering for low-dose CT in both image and projection domains

Wang, Yingmei;Fu, Shujun;Li, Wanlong;Zhang, Caiming
- Journal of Computational Design and Engineering
- /
- v.2 no.2
- /
- pp.113-118
- /
- 2015
An important problem in low-dose CT is the image quality degradation caused by photon starvation. There are a lot of algorithms in sinogram domain or image domain to solve this problem. In view of strong self-similarity contained in the special sinusoid-like strip data in the sinogram space, we propose a novel non-local filtering, whose average weights are related to both the image FBP (filtered backprojection) reconstructed from restored sinogram data and the image directly FBP reconstructed from noisy sinogram data. In the process of sinogram restoration, we apply a non-local method with smoothness parameters adjusted adaptively to the variance of noisy sinogram data, which makes the method much effective for noise reduction in sinogram domain. Simulation experiments show that our proposed method by filtering in both image and projection domains has a better performance in noise reduction and details preservation in reconstructed images.
https://doi.org/10.1016/j.jcde.2014.12.007 인용 PDF

An Autoregressive Parameter Estimation from Noisy Speech Using the Adaptive Predictor (적응예측기를 이용하여 잡음섞인 음성신호로부터 autoregressive 계수를 추산하는 방법)

Koo, Bon-Eung
- The Journal of the Acoustical Society of Korea
- /
- v.14 no.3
- /
- pp.90-96
- /
- 1995
A new method for autoregressive parameter estimation from noisy observation sequence is presented. This method, termed the AP method, is a result of an attempt to make use of the adaptive predictor which is a simple and reliable way of parameter estimation. It is shown theoretically that, for noisy input, the parameter vector computed from the prediction sequence is closer to that of the original sequence than the noisy input sequence is, under the spectral distortion criterion. Simulation results with the Kalman filter as a noise reduction filter and real speech data supported the theory. Roughly speaking, the performance of the parameter set obtained by the AP method is better than noisy one but worse than the EM iteration results. When the simplicity is considered, it could provide a useful alternative to more complicated parameter estimation methods in some applications.
PDF

Eigenvoice Adaptation of Classification Model for Binary Mask Estimation (Eigenvoice를 이용한 이진 마스크 분류 모델 적응 방법)

Kim, Gibak
- Journal of Broadcast Engineering
- /
- v.20 no.1
- /
- pp.164-170
- /
- 2015
This paper deals with the adaptation of classification model in the binary mask approach to suppress noise in the noisy environment. The binary mask estimation approach is known to improve speech intelligibility of noisy speech. However, the same type of noisy data for the test data should be included in the training data for building the classification model of binary mask estimation. The eigenvoice adaptation is applied to the noise-independent classification model and the adapted model is used as noise-dependent model. The results are reported in Hit rates and False alarm rates. The experimental results confirmed that the accuracy of classification is improved as the number of adaptation sentences increases.
https://doi.org/10.5909/JBE.2015.20.1.164 인용 PDF KSCI KPUBS HTML

Noise Reduction Algorithm in Speech by Wiener Filter (위너필터에 의한 음성 중의 잡음제거 알고리즘)

Choi, Jae-Seung
- The Journal of the Korea institute of electronic communication sciences
- /
- v.8 no.9
- /
- pp.1293-1298
- /
- 2013
This paper proposes a noise reduction algorithm using Wiener filter to remove the noise components from the noisy speech in order to improve the speech signal. The proposed algorithm first removes the noise spectrums of white noise from the noisy signal based on the noise reshaping and reduction method at each frame. And this algorithm enhances the speech signal using Wiener filter based on linear predictive coding analysis. In this experiment, experimental results of the proposed algorithm demonstrate using the speech and noise data by Japanese male speaker. Based on measuring the spectral distortion (SD) measure, experiments confirm that the proposed algorithm is effective for the speech by contaminated white noise. From the experiments, the maximum improvement in the output SD values was 4.94 dB better for white noise compared with former Wiener filter.
https://doi.org/10.13067/JKIECS.2013.8.9.1293 인용 PDF KSCI

Dimension Reduction Method of Speech Feature Vector for Real-Time Adaptation of Voice Activity Detection (음성구간 검출기의 실시간 적응화를 위한 음성 특징벡터의 차원 축소 방법)

Park Jin-Young;Lee Kwang-Seok;Hur Kang-In
- Journal of the Institute of Convergence Signal Processing
- /
- v.7 no.3
- /
- pp.116-121
- /
- 2006
In this paper, we propose the dimension reduction method of multi-dimension speech feature vector for real-time adaptation procedure in various noisy environments. This method which reduces dimensions non-linearly to map the likelihood of speech feature vector and noise feature vector. The LRT(Likelihood Ratio Test) is used for classifying speech and non-speech. The results of implementation are similar to multi-dimensional speech feature vector. The results of speech recognition implementation of detected speech data are also similar to multi-dimensional(10-order dimensional MFCC(Mel-Frequency Cepstral Coefficient)) speech feature vector.
PDF

Frequency Estimation of Multiple Sinusoids From MR Method (MR 방법으로부터 다단 정현파의 주파수 추정)

안태천;탁현수;이종범
- Journal of the Korean Institute of Telematics and Electronics B
- /
- v.29B no.2
- /
- pp.18-26
- /
- 1992
MR(Model Reduction) is presented in order to estimate the frequency of multiple sinusoids from the finite noisy data with the white or colored noises. MR, using the reduced rank models, is designed, appling the approximation of linear system to LP(Linear Prediction). The MR method is analyzed. Monte-carlo simulations are conducted for MR and Lp. The results are compared with in terms of mean, root-mean square and relative bias. MR eliminates effectevely the extremeous and exceptional poles appearing in LP and improves the accuracy of LP. Especially, MR gives promising results in short noisy measurements, low SNR's and colored noises. Power spectral density and angular frequency position are showed by figures, for examples. Finally, the new method is utilized to the communication and biomedical systems estimating the characteristics of the signal and the system identification modelling the dynamic systems from experimental data.
PDF

A Study on Data Classification of Raman OIM Hyperspectral Bone Data

Jung, Sung-Hwan
- Journal of Korea Multimedia Society
- /
- v.14 no.8
- /
- pp.1010-1019
- /
- 2011
This was a preliminary research for the goal of understanding between internal structure of Osteogenesis Imperfecta Murine (OIM) bone and its fragility. 54 hyperspectral bone data sets were captured by using JASCO 2000 Raman spectrometer at UMKC-CRISP (University of Missouri-Kansas City Center for Research on Interfacial Structure and Properties). Each data set consists of 1,091 data points from 9 OIM bones. The original captured hyperspectral data sets were noisy and base-lined ones. We removed the noise and corrected the base-lined data for the final efficient classification. High dimensional Raman hyperspectral data on OIM bones was reduced by Principal Components Analysis (PCA) and Linear Discriminant Analysis (LDA) and efficiently classified for the first time. We confirmed OIM bones could be classified such as strong, middle and weak one by using the coefficients of their PCA or LDA. Through experiment, we investigated the efficiency of classification on the reduced OIM bone data by the Bayesian classifier and K -Nearest Neighbor (K-NN) classifier. As the experimental result, the case of LDA reduction showed higher classification performance than that of PCA reduction in the two classifiers. K-NN classifier represented better classification rate, compared with Bayesian classifier. The classification performance of K-NN was about 92.6% in case of LDA.
https://doi.org/10.9717/kmms.2011.14.8.1010 인용 PDF KSCI

A NEW METHOD FOR NORTH-SOUTH ASYMMETRY OF SUN SPOT AREA ANALYSIS

Chang, Heon-Young
- Journal of Astronomy and Space Sciences
- /
- v.24 no.4
- /
- pp.261-268
- /
- 2007
We have studied the temporal variation in the North-South asymmetry of the sunspot area during the period from 1874 to 2007. Though the 9-year periodicity is commonly reported, shorter periodicities is still under study. We employ the cepstrum analysis method to analyze the noisy power spectrum of the North-South asymmetry. We demonstrate that the cleaned power spectrum shows reduction of the spurious back-ground noise level. Some of short period peaks in the power spectrum disappear after deconvolution. It should be, however, pointed out that power spectrum might look less noisy because of a filtering process during deconvolution. We conclude by pointing out that a more sophisticate filtering algorithm is required to produce a precise and reliable periodicity estimate.
https://doi.org/10.5140/JASS.2007.24.4.261 인용 PDF KSCI

EER-ASSL: Combining Rollback Learning and Deep Learning for Rapid Adaptive Object Detection

Ahmed, Minhaz Uddin;Kim, Yeong Hyeon;Rhee, Phill Kyu
- KSII Transactions on Internet and Information Systems (TIIS)
- /
- v.14 no.12
- /
- pp.4776-4794
- /
- 2020
We propose a rapid adaptive learning framework for streaming object detection, called EER-ASSL. The method combines the expected error reduction (EER) dependent rollback learning and the active semi-supervised learning (ASSL) for a rapid adaptive CNN detector. Most CNN object detectors are built on the assumption of static data distribution. However, images are often noisy and biased, and the data distribution is imbalanced in a real world environment. The proposed method consists of collaborative sampling and EER-ASSL. The EER-ASSL utilizes the active learning (AL) and rollback based semi-supervised learning (SSL). The AL allows us to select more informative and representative samples measuring uncertainty and diversity. The SSL divides the selected streaming image samples into the bins and each bin repeatedly transfers the discriminative knowledge of the EER and CNN models to the next bin until convergence and incorporation with the EER rollback learning algorithm is achieved. The EER models provide a rapid short-term myopic adaptation and the CNN models an incremental long-term performance improvement. EER-ASSL can overcome noisy and biased labels in varying data distribution. Extensive experiments shows that EER-ASSL obtained 70.9 mAP compared to state-of-the-art technology such as Faster RCNN, SSD300, and YOLOv2.
https://doi.org/10.3837/tiis.2020.12.009 인용 PDF KSCI HTML

Search Result 31, Processing Time 0.021 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)