• Title/Summary/Keyword: Multi-level classification

Chinese Prosody Generation Based on C-ToBI Representation for Text-to-Speech (음성합성을 위한 C-ToBI기반의 중국어 운율 경계와 F0 contour 생성)

  • Kim, Seung-Won;Zheng, Yu;Lee, Gary-Geunbae;Kim, Byeong-Chang
    • MALSORI / no.53 / pp.75-92 / 2005
  • Prosody modeling is critical in developing text-to-speech (TTS) systems, where speech synthesis is used to automatically generate natural speech. In this paper, we present a prosody generation architecture based on the Chinese Tone and Break Index (C-ToBI) representation. ToBI is a multi-tier representation system based on linguistic knowledge to transcribe events in an utterance. A TTS system that adopts ToBI as an intermediate representation is known to exhibit higher flexibility, modularity and domain/task portability than direct prosody generation TTS systems. However, corpus preparation is very expensive for practical-level performance because a ToBI-labeled corpus has to be constructed manually by many prosody experts, and accurate statistical prosody modeling normally requires a large amount of data. This paper proposes a new method that transcribes C-ToBI labels automatically in Chinese speech. We model Chinese prosody generation as a classification problem and apply conditional Maximum Entropy (ME) classification to it. We empirically verify the usefulness of various natural language and phonology features in building well-integrated features for the ME framework.
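
For reference, a conditional maximum entropy classifier has the standard log-linear form below. Here x is the input context, y a candidate label (in this setting, a C-ToBI label), f_i are feature functions and \lambda_i their learned weights. The symbols are generic textbook notation and are not taken from the paper itself.

```latex
P(y \mid x) = \frac{\exp\left(\sum_i \lambda_i f_i(x, y)\right)}{\sum_{y'} \exp\left(\sum_i \lambda_i f_i(x, y')\right)}
```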

A Multi-Level Integrator with Programming Based Boosting for Person Authentication Using Different Biometrics

  • Kundu, Sumana;Sarker, Goutam
    • Journal of Information Processing Systems / v.14 no.5 / pp.1114-1135 / 2018
  • A multiple classification system based on a new boosting technique is proposed for person authentication. It uses different biometric traits: color face, iris and eye, fingerprints of the right and left hands, handwriting, palm-print, gait (silhouettes) and wrist-vein. The images of the different biometric traits were taken from standard databases such as FEI, UTIRIS, CASIA, IAM and CIE. The system comprises three different super-classifiers that individually perform person identification. The individual classifiers belonging to each super-classifier identify different biometric features, and their conclusions are integrated in their respective super-classifiers. The decisions of the individual super-classifiers are then integrated through a mega-super-classifier, which reaches the final conclusion using programming based boosting. The mega-super-classifier system, which combines different super-classifiers in a compact form, is more reliable than a single classifier or even a single super-classifier system. The system was evaluated with accuracy, precision, recall and F-score metrics, using the holdout method and a confusion matrix, for each of the single classifiers, the super-classifiers and finally the mega-super-classifier. The performance results are appreciable, and the learning and recognition times are fairly reasonable, making the system efficient and effective.
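
The evaluation metrics named in this abstract can all be derived from a confusion matrix built on a holdout set. The following is a minimal sketch of that calculation; the function names and the example matrix are illustrative assumptions, not the authors' code or data.

```python
from typing import Dict, List


def per_class_metrics(confusion: List[List[int]]) -> List[Dict[str, float]]:
    """Return precision, recall and F-score for every class.

    confusion[i][j] counts holdout samples whose true class is i
    and whose predicted class is j.
    """
    n = len(confusion)
    results = []
    for k in range(n):
        tp = confusion[k][k]
        # Samples predicted as k that truly belong to another class.
        fp = sum(confusion[i][k] for i in range(n)) - tp
        # Samples of class k that were predicted as another class.
        fn = sum(confusion[k]) - tp
        precision = tp / (tp + fp) if tp + fp else 0.0
        recall = tp / (tp + fn) if tp + fn else 0.0
        denom = precision + recall
        f_score = 2 * precision * recall / denom if denom else 0.0
        results.append(
            {"precision": precision, "recall": recall, "f_score": f_score}
        )
    return results


def accuracy(confusion: List[List[int]]) -> float:
    """Fraction of holdout samples on the diagonal of the matrix."""
    total = sum(sum(row) for row in confusion)
    correct = sum(confusion[k][k] for k in range(len(confusion)))
    return correct / total if total else 0.0


if __name__ == "__main__":
    example = [[8, 2], [1, 9]]  # hypothetical two-person holdout result
    print(accuracy(example))
    print(per_class_metrics(example))
```

The same functions apply unchanged to a single classifier, a super-classifier or the mega-super-classifier, since each produces one confusion matrix.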

A Study on Korean Printed Character Type Classification And Nonlinear Grapheme Segmentation (한글 인쇄체 문자의 형식 분류 및 비선형적 자소 분리에 관한 연구)

  • Park Yong-Min;Kim Do-Hyeon;Cha Eui-Young
    • Proceedings of the Korean Institute of Information and Communication Sciences Conference / 2006.05a / pp.784-787 / 2006
  • In this paper, we propose a method for nonlinear grapheme segmentation based on Korean printed character type classification. The characters are subdivided into six types based on character type information. The feature vector consists of mesh features, vertical projection features and horizontal projection features, which are extracted from gray-level images. We classify characters into the six types using backpropagation. Character segmentation regions are determined from the character type information. Then, an optimal nonlinear grapheme segmentation path is found using a multi-stage graph search algorithm. The results show that the proposed method is suitable for classifying character types and finding nonlinear character segmentation paths.

Cat Monitoring and Disease Diagnosis System based on Deep Learning (딥러닝 기반의 반려묘 모니터링 및 질병 진단 시스템)

  • Choi, Yoona;Chae, Heechan;Lee, Jonguk;Park, Daihee;Chung, Yongwha
    • Journal of Korea Multimedia Society / v.24 no.2 / pp.233-244 / 2021
  • Recently, several ICT-based cat studies have produced some successful results, according to academic and industry sources. However, research on the level of simply identifying the cat's condition, such as the behavior and sound classification of cats based on images and sound signals, has yet to be found. In this paper, based on veterinary scientific knowledge of cats, a practical and academic cat monitoring and disease diagnosis system is proposed. It monitors the health status of the cat 24 hours a day by automatically categorizing and analyzing the cat's behavior together with location information, using LSTM with a beacon sensor and a Raspberry Pi, which can be built at low cost. The validity of the proposed system is verified through experiments with cats in actual custody (the accuracy of cat behavior classification and location identification was 96.3% and 92.7% on average, respectively). Furthermore, a rule-based disease analysis system based on veterinary knowledge was designed and implemented so that owners can check at home whether their cats have diseases (it can also be used as an auxiliary diagnostic tool by a pet veterinarian).

Performance Evaluation of Various Normalization Methods and Score-level Fusion Algorithms for Multiple-Biometric System (다중 생체 인식 시스템을 위한 정규화함수와 결합알고리즘의 성능 평가)

  • Woo Na-Young;Kim Hak-Il
    • Journal of the Korea Institute of Information Security & Cryptology / v.16 no.3 / pp.115-127 / 2006
  • This paper evaluates various normalization methods and fusion algorithms, in addition to pattern classification algorithms, for multi-biometric systems. Experiments are performed with various normalization functions, fusion algorithms and pattern classification algorithms on the Biometric Scores Set - Release 1 (BSSR1) provided by NIST. The performance results are presented as Half Total Error Rate (HTER). By reporting results on a single database with a single metric, this study provides baseline data for research on improving the performance of multiple-biometric systems.
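
As an illustration of the pipeline this abstract describes, the sketch below normalizes matcher scores, fuses them, and reports HTER, defined as the mean of the false acceptance rate and the false rejection rate. Min-max normalization and the sum rule are common choices used here as assumptions; the abstract does not list the specific functions the paper compares, and all names and values are hypothetical.

```python
from typing import List, Sequence


def min_max(scores: Sequence[float]) -> List[float]:
    """Map scores linearly onto [0, 1] using their own min and max."""
    lo, hi = min(scores), max(scores)
    if hi == lo:
        return [0.0 for _ in scores]
    return [(s - lo) / (hi - lo) for s in scores]


def sum_rule(*matchers: Sequence[float]) -> List[float]:
    """Fuse several matchers by averaging their scores sample by sample."""
    return [sum(values) / len(values) for values in zip(*matchers)]


def hter(
    genuine: Sequence[float], impostor: Sequence[float], threshold: float
) -> float:
    """Half Total Error Rate: (FAR + FRR) / 2 at a given threshold.

    A score at or above the threshold counts as an accepted claim.
    """
    frr = sum(s < threshold for s in genuine) / len(genuine)
    far = sum(s >= threshold for s in impostor) / len(impostor)
    return (far + frr) / 2


if __name__ == "__main__":
    face = min_max([12.0, 30.0, 55.0, 80.0])        # hypothetical raw scores
    finger = min_max([0.10, 0.35, 0.60, 0.90])
    fused = sum_rule(face, finger)
    # Assume the first two samples are impostors and the last two genuine.
    print(hter(genuine=fused[2:], impostor=fused[:2], threshold=0.5))
```

In a real evaluation the normalization parameters would be estimated on a training split and then applied to the test scores, rather than computed from the scores being evaluated.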

An Interpretable Bearing Fault Diagnosis Model Based on Hierarchical Belief Rule Base

  • Boying Zhao;Yuanyuan Qu;Mengliang Mu;Bing Xu;Wei He
    • KSII Transactions on Internet and Information Systems (TIIS) / v.18 no.5 / pp.1186-1207 / 2024
  • Bearings are one of the main components of mechanical equipment and one of the primary components prone to faults. Therefore, conducting fault diagnosis on bearings is a key issue in mechanical equipment research. Belief rule base (BRB) is essentially an expert system that effectively integrates qualitative and quantitative information, demonstrating excellent performance in fault diagnosis. However, class imbalance often occurs in the diagnosis task, which poses challenges to the diagnosis. Models with interpretability can enhance decision-makers' trust in the output results. However, the randomness in the optimization process can undermine interpretability, thereby reducing the level of trustworthiness in the results. Therefore, a hierarchical BRB model based on extreme gradient boosting (XGBoost) feature selection with interpretability (HFS-IBRB) is proposed in this paper. Utilizing a main BRB alongside multiple sub-BRBs allows for the conversion of a multi-classification challenge into several distinct binary classification tasks, thereby leading to enhanced accuracy. By incorporating interpretability constraints into the model, interpretability is effectively ensured. Finally, the case study of the actual dataset of bearing fault diagnosis demonstrates the ability of the HFS-IBRB model to perform accurate and interpretable diagnosis.

Multi-locus Phylogeny Analysis of Korean Isolates of Phytophthora Species Based on Sequence of Ribosomal and Mitochondrial DNA (핵 및 미토콘드리아 DNA 염기서열을 이용한 국내 Phytophthora 속의 Multi-locus phylogeny 분석)

  • Seo, Mun-Won;Song, Jeong-Young;Kim, Hong-Gi
    • The Korean Journal of Mycology / v.38 no.1 / pp.40-47 / 2010
  • To investigate interspecific and intraspecific genetic relationships among 14 Korean Phytophthora species, sequence analyses of nuclear DNA (ypt gene and rDNA-IGS region) and mitochondrial DNA (Cox gene, β-tubulin gene, and EF1A gene) were performed. All 14 Korean Phytophthora species clearly clustered with foreign isolates of each species. As with the foreign isolates, these Korean isolates showed no correlation between molecular classification and morphological classification. P. palmivora KACC 40167, previously reported among the genetic groups of Phytophthora species in Korea, was not consistent with the classification system and therefore requires re-examination in the genetic group analysis. The Korean isolate P. drechsleri KACC 40195 showed a very close relationship with P. cryptogea KACC 40161, with a bootstrap value above 94%, in the P. cryptogea-P. drechsleri complex group. The identification of these isolates is still unclear because P. cryptogea and P. drechsleri were not differentiated in this study. On the other hand, P. parasitica and P. nicotianae need to be unified into a single species, since they clustered into one group at 99 to 100% sequence homology. Compared with the sequences of foreign isolates, the Korean isolates were newly divided into ten groups in the phylogenetic system. These results provide useful information for understanding the genetic diversity of Phytophthora species in Korea.

Feasibility of fully automated classification of whole slide images based on deep learning

  • Cho, Kyung-Ok;Lee, Sung Hak;Jang, Hyun-Jong
    • The Korean Journal of Physiology and Pharmacology / v.24 no.1 / pp.89-99 / 2020
  • Although microscopic analysis of tissue slides has been the basis for disease diagnosis for decades, intra- and inter-observer variabilities remain issues to be resolved. The recent introduction of digital scanners has made it possible to use deep learning in the analysis of tissue images because many whole slide images (WSIs) are accessible to researchers. In the present study, we investigated the possibility of a deep learning-based, fully automated, computer-aided diagnosis system with WSIs from a stomach adenocarcinoma dataset. Three different convolutional neural network architectures were tested to determine the best architecture for the tissue classifier. Each network was trained to classify small tissue patches as normal or tumor. Based on the patch-level classification, tumor probability heatmaps can be overlaid on tissue images. We observed three different tissue patterns: clear normal, clear tumor and ambiguous cases. We suggest that longer inspection time can be assigned to ambiguous cases than to clear normal cases, increasing the accuracy and efficiency of histopathologic diagnosis by pre-evaluating the status of the WSIs. When the classifier was tested with a completely different WSI dataset, the performance was not optimal because of the different tissue preparation quality. Including a small amount of data from the new dataset in training substantially improved performance on the new dataset. These results indicate that a WSI dataset should include tissues prepared under many different preparation conditions in order to construct a generalized tissue classifier. Thus, multi-national/multi-center datasets should be built for the application of deep learning in real-world medical practice.
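
The pre-evaluation idea in this abstract (turn patch-level tumor probabilities into a slide-level label so that ambiguous slides get more inspection time) could be sketched as follows. The abstract gives no decision rule, so the tumor-fraction criterion and the thresholds `patch_threshold`, `low` and `high` are purely hypothetical placeholders.

```python
from typing import List


def tumor_fraction(heatmap: List[List[float]], patch_threshold: float = 0.5) -> float:
    """Fraction of patches whose tumor probability reaches the threshold.

    heatmap[r][c] is the classifier's tumor probability for the patch
    at grid position (r, c) of the whole slide image.
    """
    patches = [p for row in heatmap for p in row]
    return sum(p >= patch_threshold for p in patches) / len(patches)


def triage(heatmap: List[List[float]], low: float = 0.05, high: float = 0.5) -> str:
    """Label a slide from its heatmap (hypothetical thresholds)."""
    fraction = tumor_fraction(heatmap)
    if fraction <= low:
        return "clear normal"
    if fraction >= high:
        return "clear tumor"
    return "ambiguous"  # candidate for longer pathologist inspection


if __name__ == "__main__":
    slide = [[0.9, 0.1], [0.1, 0.1], [0.1, 0.1], [0.1, 0.1]]
    print(triage(slide))
```

Any real thresholds would have to be calibrated on validated slides; the sketch only shows where such a rule would sit between the patch classifier and the reviewing pathologist.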

Ovarian Cancer Microarray Data Classification System Using Marker Genes Based on Normalization (표준화 기반 표지 유전자를 이용한 난소암 마이크로어레이 데이타 분류 시스템)

  • Park, Su-Young;Jung, Chai-Yeoung
    • Journal of the Korea Institute of Information and Communication Engineering / v.15 no.9 / pp.2032-2037 / 2011
  • Marker genes are defined as genes whose expression level characterizes a specific experimental condition. Genes whose expression levels differ significantly between groups are highly informative about the studied phenomenon. In this paper, the system first normalizes the data with the most widely used of the normalization methods proposed so far, and then detects marker genes by ranking genes according to statistics. It then compares and analyzes the performance of each normalization method using a multi-layer perceptron neural network. Applying the multi-layer perceptron algorithm to a microarray data set containing eight marker genes, selected with the ANOVA method after Lowess normalization, gave the highest classification accuracy, 99.32%, and the lowest prediction error estimate.
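
Ranking genes by a statistic, as this abstract describes, can be illustrated with the one-way ANOVA F statistic: genes whose between-group variance is large relative to their within-group variance rank highest. This is a minimal sketch on already-normalized values; the function names and example data are assumptions, and the paper's exact procedure is not given in the abstract.

```python
from typing import Dict, List, Sequence


def anova_f(groups: Sequence[Sequence[float]]) -> float:
    """One-way ANOVA F statistic for one gene across sample groups."""
    k = len(groups)
    if k < 2:
        raise ValueError("ANOVA needs at least two groups")
    n = sum(len(g) for g in groups)
    grand_mean = sum(sum(g) for g in groups) / n
    means = [sum(g) / len(g) for g in groups]
    # Variation of group means around the grand mean.
    ss_between = sum(len(g) * (m - grand_mean) ** 2 for g, m in zip(groups, means))
    # Variation of samples around their own group mean.
    ss_within = sum((x - m) ** 2 for g, m in zip(groups, means) for x in g)
    if ss_within == 0:
        return float("inf") if ss_between > 0 else 0.0
    return (ss_between / (k - 1)) / (ss_within / (n - k))


def top_marker_genes(expression: Dict[str, List[List[float]]], count: int) -> List[str]:
    """Return the `count` genes with the largest F statistic.

    expression maps a gene name to its expression values, one list per group.
    """
    ranked = sorted(expression, key=lambda gene: anova_f(expression[gene]), reverse=True)
    return ranked[:count]


if __name__ == "__main__":
    data = {
        "gene_a": [[1.0, 2.0, 3.0], [4.0, 5.0, 6.0]],  # differs between groups
        "gene_b": [[1.0, 2.0, 3.0], [1.0, 2.0, 3.0]],  # identical in both groups
    }
    print(top_marker_genes(data, count=1))
```

The selected genes would then serve as input features for the classifier, which in the abstract is a multi-layer perceptron.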

Constitutional Classification between Tae-eumin and Soyangin Types by Measurement of the Friction Coefficient on the Skin of the Human Hand (손등 피부 마찰계수를 이용한 태음인과 소양인 간의 체질구별)

  • Song, Han-Wook;Park, Yon-Kyu
    • Journal of the Institute of Electronics Engineers of Korea SC / v.47 no.5 / pp.52-61 / 2010
  • The use of the friction coefficient is known to provide good discrimination ability in the classification of human constitutions, which are used in alternative medicine. In this study, a system that uses a multi-axis load cell and a hemi-circular probe is designed. The equipment consists of a sensor (load cell type, manufactured by the authors), an x-axis linear-bush guide motorized mobile stage that supports the hand being analyzed, and a signal conditioner. Using the proposed system, the friction coefficients from different constitutions were compared, and the relative repeatability error for the friction coefficient measurement was determined to be less than 2 %. The direction along the ring finger line was determined to be the optimum measurement region for a constitutional diagnosis between Tae-eumin and Soyangin types using the proposed system. There were some differences in the friction coefficient between the two constitutions, as reported in ancient literature. The proposed system is applicable to a quantitative constitutional diagnosis between Tae-eumin and Soyangin types within an acceptable level of uncertainty.