• Title/Summary/Keyword: Method of Maximum Likelihood

Search Result 999, Processing Time 0.021 seconds

EM Algorithm with Initialization Based on Incremental ${\cal}k-means$ for GMM and Its Application to Speaker Identification (GMM을 위한 점진적 ${\cal}k-means$ 알고리즘에 의해 초기값을 갖는 EM알고리즘과 화자식별에의 적용)

  • Seo Changwoo;Hahn Hernsoo;Lee Kiyong;Lee Younjeong
    • The Journal of the Acoustical Society of Korea
    • /
    • v.24 no.3
    • /
    • pp.141-149
    • /
    • 2005
  • Tn general. Gaussian mixture model (GMM) is used to estimate the speaker model from the speech for speaker identification. The parameter estimates of the GMM are obtained by using the Expectation-Maximization (EM) algorithm for the maximum likelihood (ML) estimation. However the EM algorithm has such drawbacks that it depends heavily on the initialization and it needs the number of mixtures to be known. In this paper, to solve the above problems of the EM algorithm. we propose an EM algorithm with the initialization based on incremental ${\cal}k-means$ for GMM. The proposed method dynamically increases the number of mixtures one by one until finding the optimum number of mixtures. Whenever adding one mixture, we calculate the mutual relationship between it and one of other mixtures respectively. Finally. based on these mutual relationships. we can estimate the optimal number of mixtures which are statistically independent. The effectiveness of the proposed method is shown by the experiment for artificial data. Also. we performed the speaker identification by applying the proposed method comparing with other approaches.

Relationships Among Cow Daily Milk Yield, Calf Body Weight at Birth and 3 Month of Age and Preweaning Daily Gain in Hanwoo (한우 어미 소의 일일 산유량과 송아지의 생시체중, 3개월령 체중 및 포유기 일당증체량에 관한 상관 분석)

  • Hwang, Jeong-Mi;Kim, Si-Dong;Choi, Jae-Gwan;Lee, Chae-Young;Kim, Byung-Wan;Kim, Jong-Bok
    • Journal of Animal Science and Technology
    • /
    • v.50 no.5
    • /
    • pp.591-600
    • /
    • 2008
  • This study was conducted to investigate relationships among cow daily milk yield, calf body weight at birth and 3 month of age and calf preweaning daily gain. Cow daily milk yield(DMILK) of 90 days after postpartum measured by weigh-suckle-weight method, calf birth weight(BW), calf body weight at 3 month of age(WT3), and calf preweaning daily gain from birth to 3 month of age(PDG) were collected from June to November on 2002 and April to November on 2003. All traits studied were considered as cow traits and analysed by WOMBAT packages to obtain A-I REML(average information restricted maximum likelihood) estimates. Heritability estimate of daily milk yield uncorrected for cow-calf separation time(DMILK) was 0.39, which was not differ from 0.36 of heritability estimate corrected on the basis of 24 hour cow-calf separation time(CMILK). Heritability estimates for maternal genetic effects of BW, WT3 and PDG were respectively 0.28, 0.17 and 0.18. Genetic correlations between DMILK and maternal effect of BW, WT3 and PDG were respectively 0.59, 0.79 and 0.68. High genetic correlation coefficients of DMILK with WT3 and PDG indicate that selections for maternal breeding value for WT3 or PDG may be effective to enhance the cow’s milk production ability in Hanwoo.

Estimation of Genetic Parameters for Growth-Related Traits in 1-Year Old of Two Korean Abalone Subspecies, Haliotis discus hannai and H. discus discus, by Using Multiple Traits of Animal Model (다형질 Animal Model에 의한 12개월령 한국산 전북 2 아종의 성장관련형질에 대한 유전모수 추정)

  • Choe, Mi-Kyung;Han, Seock-Jung;Yang, Sang-Geun;Won, Seung-Hwan;Park, Choul-Ji;Yeo, In-Kyu
    • The Korean Journal of Malacology
    • /
    • v.24 no.2
    • /
    • pp.121-130
    • /
    • 2008
  • In other aquaculture species, large improvements in growth have been achieved through selective breeding. Ezo abalone(Haliotis discus hannai) and disk abalone(H. discus discus) are major aquatic animals cultured in Asia, but selective breeding for the promotion of growth with these abalones has not been actively pursued. Recently significant efforts are being made to promote production of these species through selective breeding in Korea. The aims of this work were to estimate the general genetic parameters, heritabilities, and genetic and phenotypic correlations on growth-related traits at 1-year old in two Korean abalone subspecies, H. discus hannai and H. discus discus, by using multiple trait animal model. The data were collected from the records of 1,504 individuals produced from 22 sires and 26 dams in H. discus hannai and 297 individuals produced from 5 sires and 6 dams in H. discus discus, which evaluated by the Genetics and Breeding Research Center, National Fisheries Research & Development Institute(NFRDI). Genetic parameters were estimated for these abalone subspecies raised in Bukjeju branch, NFRDI, from May 20, 2004 to May 16, 2005, respectively. The heritability estimates obtained from restricted maximum likelihood(REML) were higher than expected, ranging from 0.40 to 0.43 for growth traits shell length, shell width and body weight in H. discus hannai and from 0.26 to 0.51 in H. discus discus, respectively. The heritabilities for shell shape and condition factor were lower than others of growth traits such as ranging from 0.09 to 0.19 in H. discus hannai and from 0.10 to 0.23 in H. discus discus, respectively. Genetic and phenotypic were > 0.93 between shell parameters and weight in two abalone species, respectively, indicating that breeding for weight gains could be successfully achieved by selecting for shell length.

  • PDF

Estimation of Genetic Parameters for Growth-related Traits of Two Korean Abalone Subspecies, Haliotis discus hannai and H. discus discus, by using Multiple Traits of Animal Model in Early Growth Period (다형질 Animal Model에 의한 한국산 전복 2 아종의 초기 치패의 성장관련 형질에 대한 유전모수 추정)

  • Choe, Mi-Kyung;Han, Seock-Jung;Yang, Sang-Geun;Won, Seung-Hwan;Park, Choul-Ji;Yeo, In-Kyu
    • The Korean Journal of Malacology
    • /
    • v.23 no.2
    • /
    • pp.217-225
    • /
    • 2007
  • This study was conducted to estimate the general genetic parameters, heritabilities, and genetic and phenotypic correlations on growth-related traits by studying multiple trait animal model in two Korean abalone species, Haliotis discus hannai and H. discus discus. The data was collected from the records of 3,795 individuals produced from 54 sires and 74 dams in Haliotis discus hannai and 399 individuals produced from 7 sires and 7 dams in Haliotis discus discus. The data was evaluated by the Genetics and Breeding Research Center, National Fisheries Research & Development Institute (NFRDI). Genetic parameters were estimated for two abalone species raised in Bukjeju branch, NFRDI, from May 20 to November 1, 2004. The heritability estimates for growth traits of shell length, shell width and body weight obtained from restricted maximum likelihood (REML) were ranging from 0.73 to 0.78 in Haliotis discus hannai, and from 0.87 to 0.89 in H. discus discus. The heritabilities for shell shape and condition factor were ranging from 0.17 to 0.20 in Haliotis discus hannai, and from 0.01 to 0.45 in H. discus discus. Genetic and phenotypic correlations were over than 0.96 between shell parameters and weight in both of abalone subspecies, indicating that breeding for weight gains could successfully be achieved by selecting for shell length.

  • PDF

Rainfall Variations of Temporal Characteristics of Korea Using Rainfall Indicators (강수지표를 이용한 우리나라 강수량의 시간적인 특성 변화)

  • Hong, Seong-Hyun;Kim, Young-Gyu;Lee, Won-Hyun;Chung, Eun-Sung
    • Journal of Korea Water Resources Association
    • /
    • v.45 no.4
    • /
    • pp.393-407
    • /
    • 2012
  • This study suggests the results of temporal and spatial variations for rainfall data in the Korean Peninsula. We got the index of the rainfall amount, frequency and extreme indices from 65 weather stations. The results could be easily understood by drawing the graph, and the Mann-Kendall trend analysis was also used to determine the tendency (up & downward/no trend) of rainfall and temperature where the trend could not be clear. Moreover, by using the FARD, frequency probability rainfalls could be calculated for 100 and 200 years and then compared each other value through the moment method, maximum likelihood method and probability weighted moments. The Average Rainfall Index (ARI) which is meant comprehensive rainfalls risk for the flood could be obtained from calculating an arithmetic mean of the RI for Amount (RIA), RI for Extreme (RIE), and RI for Frequency (RIF) and as well as the characteristics of rainfalls have been mainly classified into Amount, Extremes, and Frequency. As a result, these each Average Rainfall Indices could be increased respectively into 22.3%, 26.2%, and 5.1% for a recent decade. Since this study showed the recent climate change trend in detail, it will be useful data for the research of climate change adaptation.

The Extreme Value Analysis of Deepwater Design Wave Height and Wind Velocity off the Southwest Coast (남서 해역 심해 설계 파고 및 풍속의 극치분석)

  • Kim, Kamg-Min;Lee, Joong-Woo;Lee, Hun;Yang, Sang-Yong;Jeong, Young-Hwan
    • Proceedings of the Korean Institute of Navigation and Port Research Conference
    • /
    • v.29 no.1
    • /
    • pp.245-251
    • /
    • 2005
  • When we design coastal and harbol facilities deepwater design wave and wind speed are the important design parameters. Especially, the analysis of these informations is a vital step for the point of disaster prevention. In this study, we made and an extreme value analysis using a series of deep water significant wave data arranged in the 16 direction and supplied by KORDI real-time wave information system ,and the wind data gained from Wan-Do whether Station 1978-2003. The probability distributions considered in this characteristic analysis were the Weibull, the Gumbel, the Log-Pearson Type III, the Normal, the Lognormal, and the Gamma distribution. The parameter for each distribution was estimated by three methods, i.e. the method of moments, the maximum likelihood, and the method of probability weight moments. Furthermore, probability distributions for the extreme data had been selected by using Chi-square and Kolmogorov-Smirnov test within significant level of 5%, i,e. 95% reliance level. From this study we found that Gumbel distribution is the most proper model for the deep water design wave height off the southwest coast of Korea. However the result shows that the proper distribution made for the selected site is varied in each extreme data set.

  • PDF

Modeling of the Failure Rates and Estimation of the Economical Replacement Time of Water Mains Based on an Individual Pipe Identification Method (개별관로 정의 방법을 이용한 상수관로 파손율 모형화 및 경제적 교체시기의 산정)

  • Park, Su-Wan;Lee, Hyeong-Seok;Bae, Cheol-Ho;Kim, Kyu-Lee
    • Journal of Korea Water Resources Association
    • /
    • v.42 no.7
    • /
    • pp.525-535
    • /
    • 2009
  • In this paper a heuristic method for identifying individual pipes in water pipe networks to determine specific sections of the pipes that need to be replaced due to deterioration. An appropriate minimum pipe length is determined by selecting the pipe length that has the greatest variance of the average cumulative break number slopes among the various pipe lengths used. As a result, the minimum pipe length for the case study water network is determined as 4 m and a total of 39 individual pipe IDs are obtained. The economically optimal replacement times of the individual pipe IDs are estimated by using the threshold break rate of an individual pipe ID and the pipe break trends models for which the General Pipe Break Prediction Model(Park and Loganathan, 2002) that can incorporate the linear, exponential, and in-between of the linear and exponetial failure trends and the ROCOFs based on the modified time scale(Park et al., 2007) are used. The maximum log-likelihoods of the log-linear ROCOF and Weibull ROCOF estimated for the break data of a pipe are compared and the ROCOF that has a greater likelihood is selected for the pipe of interest. The effects of the social costs of a pipe break on the optimal replacement time are also discussed.

Ordinary kriging approach to predicting long-term particulate matter concentrations in seven major Korean cities

  • Kim, Sun-Young;Yi, Seon-Ju;Eum, Young Seob;Choi, Hae-Jin;Shin, Hyesop;Ryou, Hyoung Gon;Kim, Ho
    • Environmental Analysis Health and Toxicology
    • /
    • v.29
    • /
    • pp.12.1-12.8
    • /
    • 2014
  • Objectives Cohort studies of associations between air pollution and health have used exposure prediction approaches to estimate individual-level concentrations. A common prediction method used in Korean cohort studies is ordinary kriging. In this study, performance of ordinary kriging models for long-term particulate matter less than or equal to $10{\mu}m$ in diameter ($PM_{10}$) concentrations in seven major Korean cities was investigated with a focus on spatial prediction ability. Methods We obtained hourly $PM_{10}$ data for 2010 at 226 urban-ambient monitoring sites in South Korea and computed annual average $PM_{10}$ concentrations at each site. Given the annual averages, we developed ordinary kriging prediction models for each of the seven major cities and for the entire country by using an exponential covariance reference model and a maximum likelihood estimation method. For model evaluation, cross-validation was performed and mean square error and R-squared ($R^2$) statistics were computed. Results Mean annual average $PM_{10}$ concentrations in the seven major cities ranged between 45.5 and $66.0{\mu}g/m^3$ (standard deviation=2.40 and $9.51{\mu}g/m^3$, respectively). Cross-validated $R^2$ values in Seoul and Busan were 0.31 and 0.23, respectively, whereas the other five cities had $R^2$ values of zero. The national model produced a higher cross-validated $R^2$ (0.36) than those for the city-specific models. Conclusions In general, the ordinary kriging models performed poorly for the seven major cities and the entire country of South Korea, but the model performance was better in the national model. To improve model performance, future studies should examine different prediction approaches that incorporate $PM_{10}$ source characteristics.

Estimation of Variance Component and Environment Effects on Somatic Cell Scores by Parity in Dairy Cattle (젖소집단의 산차에 따른 체세포점수의 환경효과 및 분산성분 추정)

  • 조광현;나승환;서강석;김시동;박병호;이영창;박종대;손삼규;최재관
    • Journal of Animal Science and Technology
    • /
    • v.48 no.1
    • /
    • pp.39-48
    • /
    • 2006
  • This study utilized test day of somatic cell score data of dairy cattle from 2000 to 2004. The number of data used were 124,635 of first parity, 134,308 of second parity, 77,862 of third parity, 41,787 of forth parity and 37,412 of fifth parity. The data was analyzed by least square mean method using GLM to estimate the effects of calving year, age, lactation stage, parity and season on somatic cell score. Variance component estimation using test day model was determined by using expectation maximization algorithm- restricted maximum likelihood (EM-REML) analysis method. In each parity, somatic cell score was low for younger group and was relatively high in older groups. Likewise, for lactation stage, the score was low in early-lactation and high in late-lactation in first parity and second parity. Nevertheless, for the third, fourth and fifth parity, however, high somatic cell score was observed in mid-lactation. Generally, the score was high in the peak. Although in fourth and fifth parity, the score was low in late-lactation. Environmental effect of season, somatic cell score was generally low from September to November for all parities. The score was high between June and August when the milk production is usually low. The heritability in each parity were 0.05, 0.09, 0.10, 0.05 and 0.05 for parity 1, 2, 3, 4, 5, respectively. Genetic variance value was estimated to be high in second, third and fifth parity in early-lactation and to be low in first and forth parity.

The Comparative Study of NHPP Software Reliability Model Based on Log and Exponential Power Intensity Function (로그 및 지수파우어 강도함수를 이용한 NHPP 소프트웨어 무한고장 신뢰도 모형에 관한 비교연구)

  • Yang, Tae-Jin
    • The Journal of Korea Institute of Information, Electronics, and Communication Technology
    • /
    • v.8 no.6
    • /
    • pp.445-452
    • /
    • 2015
  • Software reliability in the software development process is an important issue. Software process improvement helps in finishing with reliable software product. Infinite failure NHPP software reliability models presented in the literature exhibit either constant, monotonic increasing or monotonic decreasing failure occurrence rates per fault. In this paper, proposes the reliability model with log and power intensity function (log linear, log power and exponential power), which made out efficiency application for software reliability. Algorithm to estimate the parameters used to maximum likelihood estimator and bisection method, model selection based on mean square error (MSE) and coefficient of determination($R^2$), for the sake of efficient model, was employed. Analysis of failure, using real data set for the sake of proposing log and power intensity function, was employed. This analysis of failure data compared with log and power intensity function. In order to insurance for the reliability of data, Laplace trend test was employed. In this study, the log type model is also efficient in terms of reliability because it (the coefficient of determination is 70% or more) in the field of the conventional model can be used as an alternative could be confirmed. From this paper, software developers have to consider the growth model by prior knowledge of the software to identify failure modes which can be able to help.