Search | Korea Science

Efficient Speaker Identification based on Robust VQ-PCA (강인한 VQ-PCA에 기반한 효율적인 화자 식별)

Lee Ki-Yong
- Journal of Internet Computing and Services
- /
- v.5 no.3
- /
- pp.57-62
- /
- 2004
In this paper, an efficient speaker identification based on robust vector quantizationprincipal component analysis (VQ-PCA) is proposed to solve the problems from outliers and high dimensionality of training feature vectors in speaker identification, Firstly, the proposed method partitions the data space into several disjoint regions by roust VQ based on M-estimation. Secondly, the robust PCA is obtained from the covariance matrix in each region. Finally, our method obtains the Gaussian Mixture model (GMM) for speaker from the transformed feature vectors with reduced dimension by the robust PCA in each region, Compared to the conventional GMM with diagonal covariance matrix, under the same performance, the proposed method gives faster results with less storage and, moreover, shows robust performance to outliers.
PDF

Target Birth Intensity Estimation Using Measurement-Driven PHD Filter

Zhang, Huanqing;Ge, Hongwei;Yang, Jinlong
- ETRI Journal
- /
- v.38 no.5
- /
- pp.1019-1029
- /
- 2016
The probability hypothesis density (PHD) filter is an effective means to track multiple targets in that it avoids explicit data associations between the measurements and targets. However, the target birth intensity as a prior is assumed to be known before tracking in a traditional target-tracking algorithm; otherwise, the performance of a conventional PHD filter will decline sharply. Aiming at this problem, a novel target birth intensity scheme and an improved measurement-driven scheme are incorporated into the PHD filter. The target birth intensity estimation scheme, composed of both PHD pre-filter technology and a target velocity extent method, is introduced to recursively estimate the target birth intensity by using the latest measurements at each time step. Second, based on the improved measurement-driven scheme, the measurement set at each time step is divided into the survival target measurement set, birth target measurement set, and clutter set, and meanwhile, the survival and birth target measurement sets are used to update the survival and birth targets, respectively. Lastly, a Gaussian mixture implementation of the PHD filter is presented under a linear Gaussian model assumption. The results of numerical experiments demonstrate that the proposed approach can achieve a better performance in tracking systems with an unknown newborn target intensity.
https://doi.org/10.4218/etrij.16.0116.0040 인용 PDF KSCI KPUBS

Digitally Modulated Signal Classification based on Higher Order Statistics of Cyclostationary Process (순환정상 프로세스의 고차 통계 특성을 이용한 디지털 변조인식)

Ahn, Woo-Hyun;Nah, Sun-Phil;Seo, Bo-Seok
- Journal of Broadcast Engineering
- /
- v.19 no.2
- /
- pp.195-204
- /
- 2014
In this paper, we propose an automatic modulation classification method for ten digitally modulated baseband signals, such as 2-FSK, 4-FSK, 8-FSK, MSK, BPSK, QPSK, 8-PSK, 16-QAM, 32-QAM, and 64-QAM based on higher order statistics of cyclostationary process. The first order cyclic moments and higher order cyclic cumulants of the signal are used as features of the modulation signals. The proposed method consists of two stages. At the first stage, we classify modulation signals as M-FSK and non-FSK using peaks of the first order cyclic moment. At the next step, we apply the Gaussian mixture model-based classifier to classify non-FSK. Simulation results are demonstrated to evaluate the proposed scheme. The results show high probability of classification even in the presence of frequency and phase offsets.
https://doi.org/10.5909/JBE.2014.19.2.195 인용 PDF KSCI KPUBS

Clustering and classification to characterize daily electricity demand (시간단위 전력사용량 시계열 패턴의 군집 및 분류분석)

Park, Dain;Yoon, Sanghoo
- Journal of the Korean Data and Information Science Society
- /
- v.28 no.2
- /
- pp.395-406
- /
- 2017
The purpose of this study is to identify the pattern of daily electricity demand through clustering and classification. The hourly data was collected by KPS (Korea Power Exchange) between 2008 and 2012. The time trend was eliminated for conducting the pattern of daily electricity demand because electricity demand data is times series data. We have considered k-means clustering, Gaussian mixture model clustering, and functional clustering in order to find the optimal clustering method. The classification analysis was conducted to understand the relationship between external factors, day of the week, holiday, and weather. Data was divided into training data and test data. Training data consisted of external factors and clustered number between 2008 and 2011. Test data was daily data of external factors in 2012. Decision tree, random forest, Support vector machine, and Naive Bayes were used. As a result, Gaussian model based clustering and random forest showed the best prediction performance when the number of cluster was 8.
https://doi.org/10.7465/jkdi.2017.28.2.395 인용 PDF KSCI

Acoustic Model Transformation Method for Speech Recognition Employing Gaussian Mixture Model Adaptation Using Untranscribed Speech Database (미전사 음성 데이터베이스를 이용한 가우시안 혼합 모델 적응 기반의 음성 인식용 음향 모델 변환 기법)

Kim, Wooil
- Journal of the Korea Institute of Information and Communication Engineering
- /
- v.19 no.5
- /
- pp.1047-1054
- /
- 2015
This paper presents an acoustic model transform method using untranscribed speech database for improved speech recognition. In the presented model transform method, an adapted GMM is obtained by employing the conventional adaptation method, and the most similar Gaussian component is selected from the adapted GMM. The bias vector between the mean vectors of the clean GMM and the adapted GMM is used for updating the mean vector of HMM. The presented GAMT combined with MAP or MLLR brings improved speech recognition performance in car noise and speech babble conditions, compared to singly-used MAP or MLLR respectively. The experimental results show that the presented model transform method effectively utilizes untranscribed speech database for acoustic model adaptation in order to increase speech recognition accuracy.
https://doi.org/10.6109/jkiice.2015.19.5.1047 인용 PDF KSCI KPUBS HTML

Active Object Tracking based on stepwise application of Region and Color Information (지역정보와 색 정보의 단계적 적용에 의한 능동 객체 추적)

Jeong, Joon-Yong;Lee, Kyu-Won
- The KIPS Transactions:PartB
- /
- v.19B no.2
- /
- pp.107-112
- /
- 2012
An active object tracking algorithm using Pan and Tilt camera based in the stepwise application of region and color information from realtime image sequences is proposed. To reduce environment noises in input sequences, Gaussian filtering is performed first. An image is divided into background and objects by using the adaptive Gaussian mixture model. Once the target object is detected, an initial search window close to an object region is set up and color information is extracted from the region. We track moving objects in realtime by using the CAMShift algorithm which enables to trace objects in active camera with the color information. The proper tracking is accomplished by controlling the amount of pan and tilt to be placed the center position of object into the middle of field of view. The experimental results show that the proposed method is more effective than the hand-operated window method.
https://doi.org/10.3745/KIPSTB.2012.19B.2.107 인용 PDF KSCI

Estimation of Mixture Numbers of GMM for Speaker Identification (화자 식별을 위한 GMM의 혼합 성분의 개수 추정)

Lee, Youn-Jeong;Lee, Ki-Yong
- Speech Sciences
- /
- v.11 no.2
- /
- pp.237-245
- /
- 2004
In general, Gaussian mixture model(GMM) is used to estimate the speaker model for speaker identification. The parameter estimates of the GMM are obtained by using the expectation-maximization (EM) algorithm for the maximum likelihood(ML) estimation. However, if the number of mixtures isn't defined well in the GMM, those parameters are obtained inappropriately. The problem to find the number of components is significant to estimate the optimal parameter in mixture model. In this paper, to estimate the optimal number of mixtures, we propose the method that starts from the sufficient mixtures, after, the number is reduced by investigating the mutual information between mixtures for GMM. In result, we can estimate the optimal number of mixtures. The effectiveness of the proposed method is shown by the experiment using artificial data. Also, we performed the speaker identification applying the proposed method comparing with other approaches.
PDF

Nonlinear Approximations Using Modified Mixture Density Networks (변형된 혼합 밀도 네트워크를 이용한 비선형 근사)

Cho, Won-Hee;Park, Joo-Young
- Journal of the Korean Institute of Intelligent Systems
- /
- v.14 no.7
- /
- pp.847-851
- /
- 2004
In the original mixture density network(MDN), which was introduced by Bishop and Nabney, the parameters of the conditional probability density function are represented by the output vector of a single multi-layer perceptron. Among the recent modification of the MDNs, there is the so-called modified mixture density network, in which each of the priors, conditional means, and covariances is represented via an independent multi-layer perceptron. In this paper, we consider a further simplification of the modified MDN, in which the conditional means are linear with respect to the input variable together with the development of the MATLAB program for the simplification. In this paper, we first briefly review the original mixture density network, then we also review the modified mixture density network in which independent multi-layer perceptrons play an important role in the learning for the parameters of the conditional probability, and finally present a further modification so that the conditional means are linear in the input. The applicability of the presented method is shown via an illustrative simulation example.
https://doi.org/10.5391/JKIIS.2004.14.7.847 인용 PDF KSCI

Quantitative Assessment Technology of Small Animal Myocardial Infarction PET Image Using Gaussian Mixture Model (다중가우시안혼합모델을 이용한 소동물 심근경색 PET 영상의 정량적 평가 기술)

Woo, Sang-Keun;Lee, Yong-Jin;Lee, Won-Ho;Kim, Min-Hwan;Park, Ji-Ae;Kim, Jin-Su;Kim, Jong-Guk;Kang, Joo-Hyun;Ji, Young-Hoon;Choi, Chang-Woon;Lim, Sang-Moo;Kim, Kyeong-Min
- Progress in Medical Physics
- /
- v.22 no.1
- /
- pp.42-51
- /
- 2011
Nuclear medicine images (SPECT, PET) were widely used tool for assessment of myocardial viability and perfusion. However it had difficult to define accurate myocardial infarct region. The purpose of this study was to investigate methodological approach for automatic measurement of rat myocardial infarct size using polar map with adaptive threshold. Rat myocardial infarction model was induced by ligation of the left circumflex artery. PET images were obtained after intravenous injection of 37 MBq $^{18}F$-FDG. After 60 min uptake, each animal was scanned for 20 min with ECG gating. PET data were reconstructed using ordered subset expectation maximization (OSEM) 2D. To automatically make the myocardial contour and generate polar map, we used QGS software (Cedars-Sinai Medical Center). The reference infarct size was defined by infarction area percentage of the total left myocardium using TTC staining. We used three threshold methods (predefined threshold, Otsu and Multi Gaussian mixture model; MGMM). Predefined threshold method was commonly used in other studies. We applied threshold value form 10% to 90% in step of 10%. Otsu algorithm calculated threshold with the maximum between class variance. MGMM method estimated the distribution of image intensity using multiple Gaussian mixture models (MGMM2, ${\cdots}$ MGMM5) and calculated adaptive threshold. The infarct size in polar map was calculated as the percentage of lower threshold area in polar map from the total polar map area. The measured infarct size using different threshold methods was evaluated by comparison with reference infarct size. The mean difference between with polar map defect size by predefined thresholds (20%, 30%, and 40%) and reference infarct size were $7.04{\pm}3.44%$, $3.87{\pm}2.09%$ and $2.15{\pm}2.07%$, respectively. Otsu verse reference infarct size was $3.56{\pm}4.16%$. MGMM methods verse reference infarct size was $2.29{\pm}1.94%$. The predefined threshold (30%) showed the smallest mean difference with reference infarct size. However, MGMM was more accurate than predefined threshold in under 10% reference infarct size case (MGMM: 0.006%, predefined threshold: 0.59%). In this study, we was to evaluate myocardial infarct size in polar map using multiple Gaussian mixture model. MGMM method was provide adaptive threshold in each subject and will be a useful for automatic measurement of infarct size.
PDF KSCI

Wideband Channel Simulation Algorithm for the Suzuki Fading Channel (Suzuki 페이딩 채널에 대한 광대역 채널 시뮬레이션 알고리즘)

박태준;박상수;김형명
- The Journal of Korean Institute of Communications and Information Sciences
- /
- v.19 no.8
- /
- pp.1493-1502
- /
- 1994
In this paper we propose a new wideband channel simulation algorithm which exactly simulates the Suzuki fading channel, a mixture of short term and long term fading. Proposed algorithm generates the incoming reflected waves as Suzuki distributed random signals and is possible to arbitrarily adjust the correlations among long term fading components of the incoming waves by using the Gaussian-to-lognormal transformation. Proposed algorithm can be applied to the simulation of the system performance.
PDF

Search Result 505, Processing Time 0.025 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)