• Title/Summary/Keyword: posterior allocation

Search Result 8, Processing Time 0.031 seconds

Variational Expectation-Maximization Algorithm in Posterior Distribution of a Latent Dirichlet Allocation Model for Research Topic Analysis

  • Kim, Jong Nam
    • Journal of Korea Multimedia Society
    • /
    • v.23 no.7
    • /
    • pp.883-890
    • /
    • 2020
  • In this paper, we propose a variational expectation-maximization algorithm that computes posterior probabilities from Latent Dirichlet Allocation (LDA) model. The algorithm approximates the intractable posterior distribution of a document term matrix generated from a corpus made up by 50 papers. It approximates the posterior by searching the local optima using lower bound of the true posterior distribution. Moreover, it maximizes the lower bound of the log-likelihood of the true posterior by minimizing the relative entropy of the prior and the posterior distribution known as KL-Divergence. The experimental results indicate that documents clustered to image classification and segmentation are correlated at 0.79 while those clustered to object detection and image segmentation are highly correlated at 0.96. The proposed variational inference algorithm performs efficiently and faster than Gibbs sampling at a computational time of 0.029s.

Generative probabilistic model with Dirichlet prior distribution for similarity analysis of research topic

  • Milyahilu, John;Kim, Jong Nam
    • Journal of Korea Multimedia Society
    • /
    • v.23 no.4
    • /
    • pp.595-602
    • /
    • 2020
  • We propose a generative probabilistic model with Dirichlet prior distribution for topic modeling and text similarity analysis. It assigns a topic and calculates text correlation between documents within a corpus. It also provides posterior probabilities that are assigned to each topic of a document based on the prior distribution in the corpus. We then present a Gibbs sampling algorithm for inference about the posterior distribution and compute text correlation among 50 abstracts from the papers published by IEEE. We also conduct a supervised learning to set a benchmark that justifies the performance of the LDA (Latent Dirichlet Allocation). The experiments show that the accuracy for topic assignment to a certain document is 76% for LDA. The results for supervised learning show the accuracy of 61%, the precision of 93% and the f1-score of 96%. A discussion for experimental results indicates a thorough justification based on probabilities, distributions, evaluation metrics and correlation coefficients with respect to topic assignment.

Bayesian analysis of financial volatilities addressing long-memory, conditional heteroscedasticity and skewed error distribution

  • Oh, Rosy;Shin, Dong Wan;Oh, Man-Suk
    • Communications for Statistical Applications and Methods
    • /
    • v.24 no.5
    • /
    • pp.507-518
    • /
    • 2017
  • Volatility plays a crucial role in theory and applications of asset pricing, optimal portfolio allocation, and risk management. This paper proposes a combined model of autoregressive moving average (ARFIMA), generalized autoregressive conditional heteroscedasticity (GRACH), and skewed-t error distribution to accommodate important features of volatility data; long memory, heteroscedasticity, and asymmetric error distribution. A fully Bayesian approach is proposed to estimate the parameters of the model simultaneously, which yields parameter estimates satisfying necessary constraints in the model. The approach can be easily implemented using a free and user-friendly software JAGS to generate Markov chain Monte Carlo samples from the joint posterior distribution of the parameters. The method is illustrated by using a daily volatility index from Chicago Board Options Exchange (CBOE). JAGS codes for model specification is provided in the Appendix.

Improvements of K-modes Algorithm and ROCK Algorithm (K-모드 알고리즘과 ROCK 알고리즘의 개선)

  • 김보화;김규성
    • The Korean Journal of Applied Statistics
    • /
    • v.15 no.2
    • /
    • pp.381-393
    • /
    • 2002
  • K-modes algorithm and ROCK(RObust Clustering using linKs) algorithm we useful clustering methods for large categorical data. In the paper, we investigate these algorithms and propose improved algorithms of them to correct their weakness. A simulation study shows that the proposed algorithms could increase the performance of data clustering.

Bayesian Method for Modeling Male Breast Cancer Survival Data

  • Khan, Hafiz Mohammad Rafiqullah;Saxena, Anshul;Rana, Sagar;Ahmed, Nasar Uddin
    • Asian Pacific Journal of Cancer Prevention
    • /
    • v.15 no.2
    • /
    • pp.663-669
    • /
    • 2014
  • Background: With recent progress in health science administration, a huge amount of data has been collected from thousands of subjects. Statistical and computational techniques are very necessary to understand such data and to make valid scientific conclusions. The purpose of this paper was to develop a statistical probability model and to predict future survival times for male breast cancer patients who were diagnosed in the USA during 1973-2009. Materials and Methods: A random sample of 500 male patients was selected from the Surveillance Epidemiology and End Results (SEER) database. The survival times for the male patients were used to derive the statistical probability model. To measure the goodness of fit tests, the model building criterions: Akaike Information Criteria (AIC), Bayesian Information Criteria (BIC), and Deviance Information Criteria (DIC) were employed. A novel Bayesian method was used to derive the posterior density function for the parameters and the predictive inference for future survival times from the exponentiated Weibull model, assuming that the observed breast cancer survival data follow such type of model. The Markov chain Monte Carlo method was used to determine the inference for the parameters. Results: The summary results of certain demographic and socio-economic variables are reported. It was found that the exponentiated Weibull model fits the male survival data. Statistical inferences of the posterior parameters are presented. Mean predictive survival times, 95% predictive intervals, predictive skewness and kurtosis were obtained. Conclusions: The findings will hopefully be useful in treatment planning, healthcare resource allocation, and may motivate future research on breast cancer related survival issues.

A Review of Statistical Analysis Methods Applied on Traditional Korean Medicine Research (한의학 연구에 활용된 통계분석 방법에 대한 고찰)

  • Jang, Seon-Il;Yun, Young-Gab;Choi, Kyoung-Ho
    • Herbal Formula Science
    • /
    • v.17 no.1
    • /
    • pp.75-83
    • /
    • 2009
  • Objective : The purpose of this study is to indicate of problems in statistical analysis method of "The Korean Journal of oriental Medical Prescription" and we will be proposed the useful application of the statistical analysis method. Methods : In this paper, we were analysed statistical analysis methodology from published journal articles "The Korean Journal of Oriental Medical Prescription" December, year 2000 to December, year 2008. We were investigated of problems in application of structured analysis methods those journal articles that including statistical analysis techniques and analysis methods. Results : 1. A random allocation of the experimental group and control groups are important factors in the planning process of statistical analysis. However, there are less explanation those journal articles. 2. There are no consideration in specimen size that there will be considerate by the level of significance and statistical test. 3. Many article authors were confused between parametric methods and non-parametric methods that they were applied parametric statistical analysis methods although inapplicable sample size. 4. There were applied the parametric methods consists of t-test instead non-parametric methods in the comparison of average intergroup relations. 5. There were less understanding posterior analysis and were confused with t-test. Conclusion : Our goal was to outline the key methods with a brief discussion of problems(statistical analysis methods), avenues for solutions. we recommend authors to use an appropriate statistical analysis methods for obtaining a more cautions results.

  • PDF

Effectiveness of low-level laser therapy in facilitating maxillary expansion using bone-borne hyrax expander: A randomized clinical trial

  • Abdelwassie, Sara Hassan;Kaddah, Mohammed Amgad;El-Dakroury, Amr Emad;El-Boghdady, Dalia;Abd El-Ghafour, Mohamed;Seifeldin, Nouran Fouad
    • The korean journal of orthodontics
    • /
    • v.52 no.6
    • /
    • pp.399-411
    • /
    • 2022
  • Objective: The objective of this randomized clinical trial was to study the skeletal and dental effects of low-level laser therapy (LLLT) along with a miniscrew-assisted expander (Hyrax) after six months of retention. Methods: After sequence generation, concealed allocation, and implementation, 24 female patients were randomly divided (1:1) into two-groups: bone-borne rapid palatal expansion (BBE) without LLLT (n = 12) and BBE with LLLT (n = 12). Eligibility criteria included female patients aged 10-13 years old with bilateral posterior crossbites. Intraoral and extraoral photographs, cone-beam computed tomography images, and digital study models were obtained before expansion and six months after retention. The 7 mm Hyrax appliance was anchored to four palatal mini-screws, which were activated twice daily for 15 days, then locked and kept in place as a retainer. LLLT was performed in the laser group during expansion and retention, according to the guidelines provided. Results: The records of 24 patients were analyzed. According to the post-retention measurements, both groups showed a significant increase in nasal and maxillary widths and total facial height. In the laser group, the Sella-Nasion-Point A and Point A-Nasion-Point B angles and the interpremolar apical distance were significantly increased. Conclusions: Within the limitations of this study, the results suggest that the parameters and protocol of LLLT do not clinically affect the efficiency of BBE in prepubertal and pubertal patients.

Postoperative Radiotherapy in the Rectal Cancers Patterns of Care Study for the Years of $1998\~1999$ (직장암의 방사선치료에 대한 Patterns of Care Study: $1998{\sim}1999$년도 수술 후 방사선치료 환자들의 특성 및 치료내용에 대한 분석결과)

  • Kim, Jong-Hoon;Oh, Do-Hoon;Kang, Ki-Moon;Kim, Woo-Cheol;Kim, Won-Dong;Kim, Jung, Soo;Kim, June-Sang;Kim, Jin-Hee;Kil, Hak-Jae;Suh, Chang-Ok;Sohn, Seung-Chang;Ahn, Yong-Chan;Yang, Dae-Sik
    • Radiation Oncology Journal
    • /
    • v.23 no.1
    • /
    • pp.22-31
    • /
    • 2005
  • Purpose : To conduct a nationwide survey on the principals in radiotherapy for rectal cancer, and produce a database of Korean Patterns of Care Study. Materials and Methods : We developed web-based Patterns of Care Study system and a national survey was conducted using random sampling based on power allocation methods. Eligible patients were who had postoperative radiotherapy for rectal cancer without gross residual tumor after surgical resection and without previous history of other cancer and radiotherapy to pelvis. Data of patients were Inputted to the web based PCS system by each investigators in 19 institutions. Results : Informations on 309 patients with rectal cancer who received radiotherapy between 1998 and 1999 were collected. Male to female ratio was 59 : 41, and the most common location of tumor was lower rectum ($46\%$). Preoperative CEA was checked in $79\%$ of cases and its value was higher than 6 ng/ml in $32\%$. Pathologic stage were I in $1.5\%$, II in $32\%$, III in $53\%$, and IV in $1.6\%$. Low anterior resection was the most common type of surgery and complete resection was peformed in $95\%$ of cases. Distal resection margin was less than 2 cm in $30\%$, and number of lymph node dissected was less than 12 in $31\%$. Chemotherapy was peformed in $91\%$ and most common regimen was 5-FU and leucovorine ($59\%$). The most common type of field arrangement used for the initial pelvic field was the four field box (Posterior-Right-Left) technique ($65.0\%$), and there was no AP-PA parallel opposing field used. Patient position was prone in $81.2\%$, and the boost field was used in $61.8\%$. To displace bowel outward, pressure modulating devices or bladder filling was used in $40.1\%$. Radiation dose was prescribed to isocenter in $45.3\%$ and to isodose line in 123 cases ($39.8\%$). Percent delivered dose over $90\%$ was achieved in $92.9\%$. Conclusion : We could find the Patterns of Care for the radiotherapy in Korean rectal cancer patients was similar to that of US national survey. The type of surgery and the regimen of chemotherapy were variable according to institutions and the variations of radiation dose and field arrangement were within acceptable range.