• Title/Summary/Keyword: Incomplete Sequence Data

Search Result 10, Processing Time 0.031 seconds

Discrete HMM Training Algorithm for Incomplete Time Series Data (불완전 시계열 데이터를 위한 이산 HMM 학습 알고리듬)

  • Sin, Bong-Kee
    • Journal of Korea Multimedia Society
    • /
    • v.19 no.1
    • /
    • pp.22-29
    • /
    • 2016
  • Hidden Markov Model is one of the most successful and popular tools for modeling real world sequential data. Real world signals come in a variety of shapes and variabilities, among which temporal and spectral ones are the prime targets that the HMM aims at. A new problem that is gaining increasing attention is characterizing missing observations in incomplete data sequences. They are incomplete in that there are holes or omitted measurements. The standard HMM algorithms have been developed for complete data with a measurements at each regular point in time. This paper presents a modified algorithm for a discrete HMM that allows substantial amount of omissions in the input sequence. Basically it is a variant of Baum-Welch which explicitly considers the case of isolated or a number of omissions in succession. The algorithm has been tested on online handwriting samples expressed in direction codes. An extensive set of experiments show that the HMM so modeled are highly flexible showing a consistent and robust performance regardless of the amount of omissions.

A Comparative Study of Assessing Average Bioequivalence in $2{\times}2$ Crossover Design with Missing Observations

  • Park, Sang-Gue;Choi, Ji-Yun
    • Journal of the Korean Data and Information Science Society
    • /
    • v.17 no.1
    • /
    • pp.245-257
    • /
    • 2006
  • A modified Anderson and Hauck(1983) test for analyzing a two-sequence two-period crossover design in bioequivalence trials is proposed when some observations at the second period are missing. It is based on the maximum likelihood estimators of average bioequivalence model and designed for handling missing at random(MAR) situation. The performance of the proposed test is compared to other tests using Monte Carlo simulations.

  • PDF

Characterization of phenotypes and predominant skeletodental patterns in pre-adolescent patients with Pierre-Robin sequence

  • Yang, Il-Hyung;Chung, Jee Hyeok;Lee, Hyeok Joon;Cho, Il-Sik;Choi, Jin-Young;Lee, Jong-Ho;Kim, Sukwha;Baek, Seung-Hak
    • The korean journal of orthodontics
    • /
    • v.51 no.5
    • /
    • pp.337-345
    • /
    • 2021
  • Objective: To investigate the phenotypes and predominant skeletodental pattern in pre-adolescent patients with Pierre-Robin sequence (PRS). Methods: The samples consisted of 26 Korean pre-adolescent PRS patients (11 boys and 15 girls; mean age at the investigation, 9.20 years) treated at the Department of Orthodontics, Seoul National University Dental Hospital between 1998 and 2019. Dental phenotypes, oral manifestation, cephalometric variables, and associated anomalies were investigated and statistically analyzed. Results: Congenitally missing teeth (CMT) were found in 34.6% of the patients (n = 9/26, 20 teeth, 2.22 teeth per patient) with 55.5% (n = 5/9) exhibiting bilaterally symmetric missing pattern. The mandibular incisors were the most common CMT (n = 11/20). Predominant skeletodental patterns included Class II relationship (57.7%), posteriorly positioned maxilla (76.9%) and mandible (92.3%), hyper-divergent pattern (92.3%), high gonial angle (65.4%), small mandibular body length to anterior cranial base ratio (65.4%), linguoversion of the maxillary incisors (76.9%), and linguoversion of the mandibular incisors (80.8%). Incomplete cleft palate (CP) of hard palate with complete CP of soft palate (61.5%) was the most frequently observed, followed by complete CP of hard and soft palate (19.2%) and CP of soft palate (19.2%) (p < 0.05). However, CP severity did not show a significant correlation with any cephalometric variables except incisor mandibular plane angle (p < 0.05). Five craniofacial and 15 extra-craniofacial anomalies were observed (53.8% patients); this implicated the need of routine screening. Conclusions: The results might provide primary data for individualized diagnosis and treatment planning for pre-adolescent PRS patients despite a single institution-based data.

Variation in the number of nucleoli and incomplete homogenization of 18S ribosomal DNA sequences in leaf cells of the cultivated Oriental ginseng (Panax ginseng Meyer)

  • Chelomina, Galina N.;Rozhkovan, Konstantin V.;Voronova, Anastasia N.;Burundukova, Olga L.;Muzarok, Tamara I.;Zhuravlev, Yuri N.
    • Journal of Ginseng Research
    • /
    • v.40 no.2
    • /
    • pp.176-184
    • /
    • 2016
  • Background: Wild ginseng, Panax ginseng Meyer, is an endangered species of medicinal plants. In the present study, we analyzed variations within the ribosomal DNA (rDNA) cluster to gain insight into the genetic diversity of the Oriental ginseng, P. ginseng, at artificial plant cultivation. Methods: The roots of wild P. ginseng plants were sampled from a nonprotected natural population of the Russian Far East. The slides were prepared from leaf tissues using the squash technique for cytogenetic analysis. The 18S rDNA sequences were cloned and sequenced. The distribution of nucleotide diversity, recombination events, and interspecific phylogenies for the total 18S rDNA sequence data set was also examined. Results: In mesophyll cells, mononucleolar nuclei were estimated to be dominant (75.7%), while the remaining nuclei contained two to four nucleoli. Among the analyzed 18S rDNA clones, 20% were identical to the 18S rDNA sequence of P. ginseng from Japan, and other clones differed in one to six substitutions. The nucleotide polymorphism was more expressed at the positions 440-640 bp, and distributed in variable regions, expansion segments, and conservative elements of core structure. The phylogenetic analysis confirmed conspecificity of ginseng plants cultivated in different regions, with two fixed mutations between P. ginseng and other species. Conclusion: This study identified the evidences of the intragenomic nucleotide polymorphism in the 18S rDNA sequences of P. ginseng. These data suggest that, in cultivated plants, the observed genome instability may influence the synthesis of biologically active compounds, which are widely used in traditional medicine.

LSTM Language Model Based Korean Sentence Generation (LSTM 언어모델 기반 한국어 문장 생성)

  • Kim, Yang-hoon;Hwang, Yong-keun;Kang, Tae-gwan;Jung, Kyo-min
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.41 no.5
    • /
    • pp.592-601
    • /
    • 2016
  • The recurrent neural network (RNN) is a deep learning model which is suitable to sequential or length-variable data. The Long Short-Term Memory (LSTM) mitigates the vanishing gradient problem of RNNs so that LSTM can maintain the long-term dependency among the constituents of the given input sequence. In this paper, we propose a LSTM based language model which can predict following words of a given incomplete sentence to generate a complete sentence. To evaluate our method, we trained our model using multiple Korean corpora then generated the incomplete part of Korean sentences. The result shows that our language model was able to generate the fluent Korean sentences. We also show that the word based model generated better sentences compared to the other settings.

A Taxonomic Study of Genus Melanophyllum in Korea (한국산 흑주름버섯속의 분류학적 연구)

  • Seok, Soon Ja;Jin, Yong Joo;Kwon, Soon Woo;Kim, Yang Sup;Kim, Wan Gyu
    • The Korean Journal of Mycology
    • /
    • v.41 no.4
    • /
    • pp.205-211
    • /
    • 2013
  • For the study of species diversity and distribution of Melanophyllum in Korea, M. eyrei and M. haematospermum were identified with unrecorded species and reported species, respectively. Their identification were confirmed based on macroand microscopic descriptions as well as ITS region sequence data. We give a Korean name 'Cheongheukjureumbeoseot' for M. eyrei and provided full description for M. haematospermum due to incomplete previous description.

Confirmation of the Presence of Pleochaeta shiraiana Associated with Powdery Mildew of Celtis sinensis in Korea (팽나무 뒷면흰가루병을 일으키는 병원균 Pleochaeta shiraiana의 존재 확인)

  • Lee, Seung-Kyu;Lee, Sang-Hyun;Cho, Sung-Eun;Park, Ji-Hyun;Shin, Hyeon-Dong
    • Research in Plant Disease
    • /
    • v.17 no.3
    • /
    • pp.391-395
    • /
    • 2011
  • The presence of Pleochaeta shiraiana associated with powdery mildew of Celtis sinensis in Korea is dubious, mainly due to incomplete records of earlier workers. To confirm the occurrence of the powdery mildew and the identity of the causal fungus, morphological characteristics of the anamorph and teleomorph from nine samples deposited in KUS (Herbarium in Korea University) were examined by light microscopy. Identification of the fungus was supported by comparing four ITS sequences from Korean samples with two Japanese data. This is the first confirmed report of the powdery mildew associated with P. shiraiana in Korea.

The Assessment of Risk of Bias on Clinical Studies of Herbal Treatment for Acne (여드름의 한약 치료 임상연구에 대한 비뚤림 위험 평가)

  • Park, Hye-ryun;Roh, Seok-sun
    • Journal of Haehwa Medicine
    • /
    • v.24 no.1
    • /
    • pp.15-24
    • /
    • 2015
  • Objectives : This study was carried out to assess the risk of bias of clinical trials on acne treatment with herbal medicine that have been published in Korea. Methods : 7 electronic databases in Korea were searched for clinical trials on acne treatment. Two independent reviewers selected clinical trials on herbal medicine treatment for acne. Selected studies are categorized according to DAMI(Study Design Algorithm for Medical literature of Intervention). RCTs are assessed according to Cochrane RoB(Risk of Bias), non-randomized studies(Before-after studies) are assessed according to RoBANS(Risk of Bias Assessment tool for Non-randomized Study). Results : After selection process, 25 articles are left. Among 25 articles, 3 RCTs and 4 before-after studies are finally included. In RCTs, the proportion of 'unclear' is high in criteria of 'random sequence generation', 'allocation concealment', and 'blinding'. In before-after studies, 'high' is high in criteria of 'blinding for outcome assessment' and 'incomplete outcome data'. Conclusions : Considering the above results of the assessment, it is necessary to conduct more well designed clinical trials on acne treatment with herbal medicine.

  • PDF

Ceramium riosmenae sp. nov. (Ceramiaceae, Rhodophyta): a new complete corticated species on Gracilaria from Baja California Sur, Mexico

  • Won, Boo-Yeon;Cho, Tae-Oh
    • ALGAE
    • /
    • v.26 no.4
    • /
    • pp.289-297
    • /
    • 2011
  • Ceramium riosmenae sp. nov. is described from Baja California Sur, Mexico based on morphological and molecular data. The new species is characterized by erect thalli only, penetrating rhizoids on Gracilaria, 7-8 periaxial cells, five cortical initials per periaxial cell, complete cortication throughout, an average of 11-12 segments between branching points, rare adventitious branchlets, and cruciate tetrasporangia. Although C. riosmenae sp. nov. is similar to C. interruptum, C. sinicola, and C. codicola reported from Baja California Sur, Mexico in size and habit, it differs from these species in the number of cortical initials, habit, degree of cortication, host, and the shape of rhizoidal tips. C. riosmenae is separated from C. interruptum with interrupted cortication and four cortical initials from C. sinicola with spins near the apex and incomplete cortication near the base and from C. codicola with bulbous rhizoids on Codium. Our rbcL sequences reveal sufficient sequence divergence (2.4-3.9%) between C. riosmenae and C. interruptum, C. sinicola, and C. codicola to warrant species recognition and to separate C. riosmenae from these species on a phylogenetic tree.

Complete Mitochondrial Genome Sequence and Genetic Diversity of Duroc Breed (돼지 Duroc 품종에서 미토콘드리아 유전체 서열의 특성과 집단의 유전적 다양성)

  • Cho, 1.C.;Han, S.H.;Choi, Y.L.;Ko, M.S.;Lee, J.G;Lee, J.H;Jeon, J .T
    • Journal of Animal Science and Technology
    • /
    • v.46 no.6
    • /
    • pp.937-946
    • /
    • 2004
  • Duroc is widely used to improve the meat quality and productivity. To elucidate the phylogenetic relation and the sequence specificity for the maternal property, the complete sequence of mitochondrial genome was determined and the population diversity of Duroc was investigated in this study. The length of mtDNA tested is 16,584-bp. There are several insertion/deletion mutations in the control region and coding regions for tRNA and rRNA, respectively, but not in peptide-coding regions. Four peptide-coding genes(COⅡ, COⅢ, ND3 and ND4) showed incomplete termination codon sequences such as T--, and two(ND2 and ND4L) did alternative initiation codons(AIC), respectively. Especially, the initiation codon sequences of ND2 gene were polymorphic in this population. Polymorphisms were detected in 11-bp duplication motif within control region as well as ND2 and CYTB. Variation patterns observed from the tests on three mtDNA regions were linked completely and then two haplotypes obtained from combining the data dividing this population. Duroc mtDNA is observed at the European pig cluster in the phylogenetic tree, however, the results from the population analyses supported previous opinions. This study suggests that the breed Duroc was mainly originated from the European pig lineage, and Asian lineage was also used to form the pig breed Duroc as maternal progenitors.