• Title/Summary/Keyword: Korean corpus


A Korean menu-ordering sentence text-to-speech system using conformer-based FastSpeech2 (콘포머 기반 FastSpeech2를 이용한 한국어 음식 주문 문장 음성합성기)

  • Choi, Yerin;Jang, JaeHoo;Koo, Myoung-Wan
    • The Journal of the Acoustical Society of Korea / v.41 no.3 / pp.359-366 / 2022
  • In this paper, we present a Korean menu-ordering sentence text-to-speech (TTS) system using conformer-based FastSpeech2. The Conformer is a convolution-augmented transformer that was originally proposed for speech recognition. By combining two different structures, it extracts both local and global features better. It comprises two half-step feed-forward modules, one at the front and one at the end, sandwiching a multi-head self-attention module and a convolution module. We introduce the Conformer to Korean TTS because it is known to work well in Korean speech recognition. To compare a transformer-based TTS model with a Conformer-based one, we train both FastSpeech2 and Conformer-based FastSpeech2. We collected a phoneme-balanced data set and used it to train our models. This corpus comprises not only general conversation but also menu-ordering conversation consisting mainly of loanwords, and it is intended to address the degradation of current Korean TTS models on loanwords. When speech was synthesized with Parallel WaveGAN, the Conformer-based FastSpeech2 achieved superior performance, with a MOS of 4.04. We confirm that model performance improved when the structure was changed from transformer to Conformer in Korean TTS.
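The module ordering described in this abstract (a feed-forward module at each end, sandwiching self-attention and convolution) can be sketched in plain Python. This is only a structural sketch, not the authors' implementation: the residual connections, the 0.5 scaling of the feed-forward outputs, and the final normalization follow the commonly cited Conformer design and are assumptions here, and every module is passed in as a hypothetical callable rather than a trained layer.

```python
from typing import Callable, List

Vector = List[float]
Module = Callable[[Vector], Vector]


def add(a: Vector, b: Vector) -> Vector:
    """Element-wise sum, used for the residual connections."""
    return [x + y for x, y in zip(a, b)]


def scale(a: Vector, factor: float) -> Vector:
    """Multiply every element by a constant."""
    return [x * factor for x in a]


def conformer_block(
    x: Vector,
    ffn_front: Module,
    self_attention: Module,
    convolution: Module,
    ffn_end: Module,
    layer_norm: Module,
) -> Vector:
    """Apply the sandwich ordering: FFN, attention, convolution, FFN."""
    x = add(x, scale(ffn_front(x), 0.5))  # half-step feed-forward (assumed 0.5)
    x = add(x, self_attention(x))         # global context
    x = add(x, convolution(x))            # local context
    x = add(x, scale(ffn_end(x), 0.5))    # half-step feed-forward (assumed 0.5)
    return layer_norm(x)                  # assumed final normalization


# Stub modules, purely for illustration.
identity: Module = lambda v: list(v)
zeros: Module = lambda v: [0.0 for _ in v]

print(conformer_block([1.0, 2.0], identity, zeros, zeros, identity, identity))
```

Passing the modules as arguments keeps the sketch independent of any deep learning library; in a real model each callable would be a trained layer operating on a sequence of frames rather than a single vector.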

The Effects of the Administration on Oriental Medicine, Antaeeum, in the Pregnant Rat and Their Fetuses (안태음이 임신랫드와 태자에 미치는 영향)

  • Kim, Chang-Seok;Park, Hae-Mo;Lee, Sun-Dong;Lee, Jang-Woo;Kim, Pan-Gyi;Shin, Heon-Tae
    • Journal of Environmental Health Sciences / v.33 no.4 / pp.306-316 / 2007
  • This study aimed to determine the effects of the oriental herbal medicine Antaeeum on rat dams and their offspring. Antaeeum was administered by gavage to female Sprague-Dawley rats at a dose of 5 mg/kg/day for 3 weeks during the gestation period. Dams were sacrificed on the 20th day of gestation, and their major internal and reproductive organs were examined. A random sample of live fetuses on the 20th day of gestation was examined under a stereo microscope. The other offspring were fixed in 95% ethanol for skeletal examination. The fixed fetuses were stained with alcian blue and alizarin red S to observe skeletal variations or malformations. Maternal body weight of Antaeeum-treated dams tended to increase compared with control dams. There were no significant differences in the weights or findings of internal and reproductive organs. The relative spleen weight of treated dams was significantly decreased compared with the control (p<0.05). There were no significant changes between the two groups in blood chemistry and hematological values. There were no significant changes in the numbers of corpora lutea, implantations, and live fetuses, or in implantation rate, delivery rate, late resorption rate, and sex ratio. However, the Antaeeum-treated group showed a lower early resorption rate than the control dams. Fetal body weight and the number of fetuses per dam in the Antaeeum-treated group were higher than those of the control group. Fetuses of dams treated with Antaeeum showed no external malformations. Vertebral and sternal variations were observed in the Antaeeum group, but compared with the control, those variations were not significant. The ossification counts of the ribs and of the cervical, thoracic, and lumbar vertebrae were normal. Fetuses of Antaeeum-treated dams showed no significant difference in the number of caudal vertebrae (P>0.01).
From these results, it can be concluded that Antaeeum showed no toxic effects on the maternal side, especially on body weight, early resorption rate, and number of live fetuses. There were also no significant changes in maternal organ weights (except the spleen), hematological data, or reproductive organs. Although skeletal variations were observed in the vertebrae and sternum, Antaeeum did not induce significant changes in bone malformation.

The Effects of the Administration on Oriental Medicine, Dalsaengtang, in the Pregnant Rat and Their Fetuses (달생탕이 랫드의 모체와 태자에 미치는 영향에 대한 연구)

  • Park, Hae-Mo;Kim, Chang-Seok;Lee, Sun-Dong;Lee, Jang-Woo;Ryu, Jae-Hong;Kim, Pan-Gyi
    • Journal of Environmental Health Sciences / v.32 no.4 s.91 / pp.342-352 / 2006
  • The experiment was undertaken to evaluate the effects of the herbal medicine Dalsaengtang on pregnant rats and their fetuses. Female Sprague-Dawley rats were orally administered Dalsaengtang at a dose of 5 mg/kg/day for 20 days. Pregnant rats were sacrificed on the 20th day of gestation, and their internal and reproductive organs were examined. A random sample of live fetuses on the 20th day of gestation was fixed in 95% ethanol. To observe skeletal malformations, fetuses were stained with alcian blue and alizarin red S. Maternal body weight in the Dalsaengtang-treated group tended to increase compared with that of the control group. The relative liver and kidney weights of the Dalsaengtang-treated group were also increased compared with those of the control group. There were no significant changes between the two groups in blood chemistry and hematological values. There were no significant changes in the numbers of corpora lutea, implantations, and live fetuses, or in implantation rate, delivery rate, late resorption rate, and sex ratio. However, the Dalsaengtang-administered group showed a lower early resorption rate than the control group. As for the sex ratio, females outnumbered males in the control group, whereas males outnumbered females in the Dalsaengtang-administered group. Neonatal body weight and the number of fetuses in the Dalsaengtang group were increased compared with those of the control group. Fetuses of dams treated with Dalsaengtang showed no external malformations. Vertebral and sternal variations were observed in the Dalsaengtang group, but compared with the control, those variations were not significant. The numbers of ribs and of cervical, thoracic, and lumbar vertebrae were normal. The numbers of sacral and caudal vertebrae were increased. Fetuses showed a significant difference in the number of caudal vertebrae (P<0.01). From these results, it can be concluded that Dalsaengtang showed no toxic effects on maternal body weight, early resorption rate, or number of live fetuses.
There were no significant changes in organ weight, hematological data, or reproductive organs. Although skeletal variations were observed in the vertebrae and sternum, Dalsaengtang did not produce significant changes in bone malformation.

Studies on Recovery and Cryopreservation of Embryos in Korean Native Swine (재래돼지에서 수정란의 회수 및 동결보존에 관한 연구)

  • Son D. S.;Yeon S. H.;Hur T. Y.;Kang S. J.;Suh G. H.;Choi S. H.;Ryu I. S.;Lee K. S.;Park C. S.
    • Journal of Embryo Transfer / v.19 no.3 / pp.257-264 / 2004
  • To safely preserve Korean Native Pigs (KNP) as an animal genetic resource and as a means of maintaining genetic diversity, we investigated the optimal hormone levels for superovulating gilts and sought to establish cryopreservation methods for embryos. The results were as follows. 1. The numbers of ovulated corpora lutea (CL) and follicles were 12.4, 13.6, and 30.0 with hCG 500 IU combined with PMSG 500, 750, and 1,000 IU, respectively, and 23.3 with hCG 750 IU and PMSG 1,000 IU. The combination of PMSG 1,000 IU and hCG 500 IU gave the highest number, but the differences from the other groups were not significant. The recovery rate of embryos relative to ovulated CL was 59.4-79.2%. 2. The recovery of morula-stage embryos was significantly higher on Day 4 after insemination than on Day 5 (P<0.01), whereas the recovery of blastocyst-stage embryos was significantly higher on Day 5 than on Day 4 (P<0.05). 3. The survival rate of expanded blastocysts was 23.5% after conventional freezing with 1.4 M glycerol.

Pregnancy Rate of In Vitro Produced Korean Cattle Embryos according to Transport Time Course

  • Park, Hyo-Young;Kim, Eun-Young;Kim, Young-Hun;Mun, Seong-Ho;Oh, Chang-Eon;Han, Young-Joon;Kim, Nam-Hyung;Lee, Sung-Soo;Ko, Moon-Suck;Riu, Key Zung;Park, Se-Pill
    • Reproductive and Developmental Biology / v.33 no.4 / pp.257-262 / 2009
  • This study investigated the pregnancy rate of IVM/IVF/IVC Korean cattle (government-registered) embryos according to transport time. For embryo production, oocytes recovered from slaughtered excellent-grade cows and highly motile frozen-thawed bull semen (purchased from LIMC, KPN#497) were used. In vitro produced embryos were cultured in CR1aa medium for 8 days, and some of them were frozen. The average cleavage rate (>2-cell) was 83.0% (308/371) and the blastocyst rate at day 8 was 34.7% (107/308). Among the in vitro produced day-8 blastocysts, the healthiest embryos were transferred fresh on the day of production, and some frozen embryos were directly transferred on an appropriate day. Because the embryos were produced in a laboratory and embryo transfer (ET) was planned in 10 areas of a remote island (Jeju), the embryos were transported from the laboratory by airplane. We therefore examined the pregnancy rate of recipient cows according to embryo transport time before ET. Among the 44 recipient cows that received embryos, the overall pregnancy rate was 40.9% (18/44); all 18 cows calved [single, 94% (17/18); twin, 6% (1/18)], and the total embryo implantation rate was 26% (19/66). With transport time compared against a 6 hr baseline, the pregnancy rate in the ET group that required less than 4 hr (60%, 9/15) was significantly higher than in the group that required more than 6 hr (26.3%, 5/19). In direct ET of frozen embryos, the pregnancy rate was 40% (4/10). However, no clear effect of temperature, pH, or recipient corpus luteum quality on pregnancy rate could be identified. When cell death in embryos was measured by TUNEL staining according to storage time in a thermos (straw container) before ET, the apoptotic index increased in a storage time-dependent manner. These results demonstrate that long-distance transfer of IVM/IVF/IVC embryos is possible and that embryo transport time is very important for the pregnancy rate in field trials.
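The percentages in this abstract can be rechecked from the fractions it reports. The short sketch below does only that arithmetic; the dictionary keys are labels chosen here for illustration, and the counts are copied from the abstract. Most recomputed values agree with the reported ones, but 19/66 comes out at about 28.8% rather than the reported 26%, so one of those two figures may be a misprint in the source.

```python
def rate(numerator: int, denominator: int) -> float:
    """Return a percentage rounded to one decimal place."""
    return round(100.0 * numerator / denominator, 1)


# (numerator, denominator) pairs as printed in the abstract.
reported_fractions = {
    "cleavage": (308, 371),
    "day-8 blastocyst": (107, 308),
    "overall pregnancy": (18, 44),
    "single calving": (17, 18),
    "twin calving": (1, 18),
    "embryo implantation": (19, 66),
    "pregnancy, transport < 4 hr": (9, 15),
    "pregnancy, transport > 6 hr": (5, 19),
    "pregnancy, frozen direct ET": (4, 10),
}

for name, (numerator, denominator) in reported_fractions.items():
    print(f"{name}: {numerator}/{denominator} = {rate(numerator, denominator)}%")
```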

Effects of Geiji-Bokryung-Hwan on eNOS, nNOS, Caveolin-1 and bFGF Protein Expressions and the Endothelial Cells of the Corpus Cavernosum in Hypercholesterolemic Rat

  • Kim Jae-Woo;Son Soo-Gon;Sa Eun-Ho;Kim Cherl-Ho;Park Won-Hwan
    • Journal of Physiology & Pathology in Korean Medicine / v.20 no.1 / pp.174-180 / 2006
  • We examined the effect of Geiji-Bokryung-Hwan (GBH) on erectile function in a rat model of hypercholesterolemic erectile dysfunction. GBH, a drug preparation consisting of five herbs, Cinnamomi Ramulus (Geiji), Poria Cocos (Bokryung), Moutan Cortex Radicis (Mokdanpi), Paeoniae Radix (Jakyak), and Persicae Semen (Doin), is a traditional Korean herbal medicine that is widely used in the treatment of atherosclerosis-related disorders. In this study, 3-month-old Sprague-Dawley rats were used. The 6 control rats were fed a normal diet and the other 18 rats were fed a 1% cholesterol diet for 3 months. After 1 month, GBH was added to the drinking water of the treatment group of 12 rats but not of the cholesterol-only group of 6 rats. Of the 12 rats, 6 received 30 mg/kg per day (group 1) and 6 received 60 mg/kg per day (group 2) of GBH. At 3 months, erectile function was evaluated with cavernous nerve electrostimulation in all animals. Penile tissues were collected for electron microscopy and for Western blot analysis of endothelial nitric oxide synthase (eNOS), neuronal nitric oxide synthase (nNOS), basic fibroblast growth factor (bFGF), and caveolin-1. Systemic arterial pressure was not significantly different between the animals fed the 1% cholesterol diet and the controls. Conversely, erectile function was not impaired in the herbal medicine-treated rats. Electron microscopy showed many caveolae with fingerlike processes in the cavernous smooth muscle and endothelial cell membranes in control and treated rats but not in the cholesterol-only group. Western blot showed differences among groups in eNOS, nNOS, caveolin-1, and bFGF protein expression in penile tissue. The increases in eNOS and nNOS protein expression induced by the high-cholesterol diet were significantly reduced in groups 1 and 2. Interestingly, caveolin-1 and bFGF protein expression was significantly higher in groups 1 and 2 than in the cholesterol-only and control groups.

Exploiting Chunking for Dependency Parsing in Korean (한국어에서 의존 구문분석을 위한 구묶음의 활용)

  • Namgoong, Young;Kim, Jae-Hoon
    • KIPS Transactions on Software and Data Engineering / v.11 no.7 / pp.291-298 / 2022
  • In this paper, we present a method for dependency parsing with chunking in Korean. Dependency parsing is the task of determining the governor of every word in a sentence. In Korean, the syntactic governor is generally determined first, and the syntactic structure must then be transformed into a semantic structure for further processing such as semantic analysis in natural language processing. Deciding whether to use the syntactic or the semantic governor is a notorious problem. For example, the syntactic governor of the word "먹고 (eat)" in the sentence "밥을 먹고 싶다 (would like to eat)" is "싶다 (would like to)", which is an auxiliary verb and therefore cannot be a semantic governor. To mitigate this problem, we propose performing Korean dependency parsing after chunking, which is the process of segmenting a sentence into constituents. A constituent is a word or a group of words that functions as a single unit within a dependency structure and is called a chunk in this paper. Compared to traditional dependency parsing, the proposed method has several advantages: (1) the number of input units in parsing is reduced, so parsing can be faster; (2) the effectiveness of parsing can be improved by considering the relation between the head words of two chunks. In experiments on the Sejong dependency corpus, the UAS and LAS of the proposed method are 86.48% and 84.56%, respectively, and the number of input units is reduced by about 22%p.
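The two scores reported in this abstract are the standard unlabeled and labeled attachment scores (commonly abbreviated UAS and LAS): the share of words whose predicted governor is correct, and the share whose governor and relation label are both correct. A minimal sketch of that computation follows. The example annotation of "밥을 먹고 싶다" and the label names are hypothetical illustrations, not taken from the Sejong corpus or from the paper.

```python
from typing import List, Tuple

# One (governor_index, relation_label) pair per word; index 0 means the root.
Arc = Tuple[int, str]


def attachment_scores(gold: List[Arc], predicted: List[Arc]) -> Tuple[float, float]:
    """Return (UAS, LAS) as fractions between 0 and 1."""
    if not gold or len(gold) != len(predicted):
        raise ValueError("gold and predicted must be non-empty and equally long")
    head_hits = sum(g[0] == p[0] for g, p in zip(gold, predicted))
    labeled_hits = sum(g == p for g, p in zip(gold, predicted))
    return head_hits / len(gold), labeled_hits / len(gold)


# Hypothetical analysis of the three words 밥을 / 먹고 / 싶다.
gold_arcs = [(2, "NP_OBJ"), (3, "VP"), (0, "ROOT")]
predicted_arcs = [(2, "NP_OBJ"), (3, "VP_MOD"), (0, "ROOT")]  # one wrong label

uas, las = attachment_scores(gold_arcs, predicted_arcs)
print(f"UAS = {uas:.2%}, LAS = {las:.2%}")
```

When parsing runs over chunks instead of words, the same function applies with one arc per chunk, which is where the reduction in input units comes from.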

Generalization of error decision rules in a grammar checker using Korean WordNet, KorLex (명사 어휘의미망을 활용한 문법 검사기의 문맥 오류 결정 규칙 일반화)

  • So, Gil-Ja;Lee, Seung-Hee;Kwon, Hyuk-Chul
    • The KIPS Transactions:PartB / v.18B no.6 / pp.405-414 / 2011
  • Korean grammar checkers typically detect context-dependent errors by employing heuristic rules that are manually formulated by a language expert. These rules are appended each time a new error pattern is detected. However, such grammar checkers are not consistent. To resolve this shortcoming, we propose a new method for generalizing error decision rules to detect such errors. For this purpose, we use an existing thesaurus, KorLex, which is the Korean version of Princeton WordNet. KorLex has hierarchical word senses for nouns but does not contain any information about the relationships between cases in a sentence. Using the Tree Cut Model and the MDL (minimum description length) model based on information theory, we extract noun classes from KorLex and generalize error decision rules from these noun classes. To verify the accuracy of the new method experimentally, we extracted from a large corpus the nouns used as objects of four commonly confused predicates, and subsequently extracted noun classes from these nouns. We found that the number of error decision rules generalized from these noun classes decreased to about 64.8%. In conclusion, the precision of our grammar checker exceeds that of conventional ones by 6.2%.
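The abstract does not spell out how the Tree Cut Model and MDL are combined, so the sketch below assumes the widely used formulation in which a cut through the noun hierarchy is scored by a parameter cost of (k/2)·log2|S| for k free parameters plus a data cost of the negative log-likelihood, with each class spreading its probability evenly over its leaf nouns. The class names and counts are toy English data, not KorLex entries.

```python
import math
from typing import Dict, List


def description_length(cut: Dict[str, List[str]], leaf_counts: Dict[str, int]) -> float:
    """Score a tree cut; lower means a better generalization under MDL.

    cut maps each class in the cut to the leaf nouns it covers.
    leaf_counts maps a noun to how often it was observed with the predicate.
    """
    sample_size = sum(leaf_counts.get(n, 0) for leaves in cut.values() for n in leaves)
    if sample_size == 0:
        raise ValueError("no observations under this cut")
    free_parameters = len(cut) - 1
    parameter_cost = free_parameters / 2 * math.log2(sample_size)
    data_cost = 0.0
    for leaves in cut.values():
        class_count = sum(leaf_counts.get(n, 0) for n in leaves)
        if class_count == 0:
            continue  # unobserved classes add nothing to the data cost
        noun_probability = (class_count / sample_size) / len(leaves)
        data_cost -= class_count * math.log2(noun_probability)
    return parameter_cost + data_cost


# Toy counts of nouns seen as the argument of one predicate.
counts = {"swallow": 0, "crow": 2, "eagle": 2, "bird": 4, "bug": 0, "bee": 2, "insect": 0}
birds = ["swallow", "crow", "eagle", "bird"]
insects = ["bug", "bee", "insect"]

coarse_cut = {"ANIMAL": birds + insects}
finer_cut = {"BIRD": birds, "INSECT": insects}

for name, cut in [("coarse", coarse_cut), ("finer", finer_cut)]:
    print(name, round(description_length(cut, counts), 2))
```

The cut with the smallest description length would be kept as the set of noun classes from which a generalized rule is written, which is how a shorter rule list can cover the same nouns.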

Pivot Discrimination Approach for Paraphrase Extraction from Bilingual Corpus (이중 언어 기반 패러프레이즈 추출을 위한 피봇 차별화 방법)

  • Park, Esther;Lee, Hyoung-Gyu;Kim, Min-Jeong;Rim, Hae-Chang
    • Korean Journal of Cognitive Science / v.22 no.1 / pp.57-78 / 2011
  • Paraphrasing is the act of writing a text using other words without altering the meaning. Paraphrases can be used in many fields of natural language processing. In particular, paraphrases can be incorporated in machine translation to improve the coverage and quality of translation. Recent approaches to paraphrase extraction utilize bilingual parallel corpora, which consist of aligned sentence pairs. In these approaches, paraphrases are identified from the word alignment result by means of pivot phrases, which are phrases in one language to which two or more phrases in the other language are connected. However, word alignment is itself a very difficult task, so there can be many alignment errors. Moreover, alignment errors can lead to the selection of incorrect pivot phrases. In this study, we propose a paraphrase extraction method that discriminates good pivot phrases from bad ones. Each pivot phrase is weighted according to its reliability, which is scored by considering lexical and part-of-speech information. The experimental results show that the proposed method achieves higher precision and recall in paraphrase extraction than the baseline. We also show that the extracted paraphrases can increase the coverage of Korean-English machine translation.
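The pivot idea in this abstract can be sketched as follows: a candidate paraphrase is scored by summing, over every pivot phrase aligned to the source phrase, the product of the two translation probabilities, scaled by a reliability weight for that pivot. The abstract does not give the actual scoring formula, so the multiplicative weight here is an assumption, and all phrases, probabilities, and weights are made-up toy values.

```python
from collections import defaultdict
from typing import Dict

ProbTable = Dict[str, Dict[str, float]]


def paraphrase_scores(
    source_phrase: str,
    source_to_pivot: ProbTable,
    pivot_to_source: ProbTable,
    pivot_weight: Dict[str, float],
) -> Dict[str, float]:
    """Score paraphrase candidates of source_phrase through weighted pivots."""
    scores: Dict[str, float] = defaultdict(float)
    for pivot, p_pivot in source_to_pivot.get(source_phrase, {}).items():
        weight = pivot_weight.get(pivot, 1.0)  # unweighted pivots count fully
        for candidate, p_candidate in pivot_to_source.get(pivot, {}).items():
            if candidate != source_phrase:  # a phrase is not its own paraphrase
                scores[candidate] += weight * p_pivot * p_candidate
    return dict(scores)


# Toy alignment probabilities with Korean pivots for the English phrase "big".
to_pivot = {"big": {"큰": 0.5, "대단한": 0.5}}
from_pivot = {
    "큰": {"big": 0.5, "large": 0.5},
    "대단한": {"big": 0.5, "great": 0.5},
}
reliability = {"큰": 1.0, "대단한": 0.5}  # hypothetical reliability scores

print(paraphrase_scores("big", to_pivot, from_pivot, reliability))
```

Lowering the weight of a doubtful pivot reduces the score of every paraphrase reached only through it, which is the effect the reliability weighting is meant to have on alignment errors.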


Three-Phase English Syntactic Analysis for Improving the Parsing Efficiency (영어 구문 분석의 효율 개선을 위한 3단계 구문 분석)

  • Kim, Sung-Dong
    • KIPS Transactions on Software and Data Engineering / v.5 no.1 / pp.21-28 / 2016
  • The performance of an English-Korean machine translation system depends heavily on its English parser. The parser in this paper is part of a rule-based English-Korean MT system, which includes many syntactic rules and performs chart-based parsing. Because of the many syntactic rules, the parser generates too many structures, so much time and memory are required. The rule-based parser has difficulty analyzing and translating long sentences that include commas because they cause high parsing complexity. In this paper, we propose a 3-phase parsing method with sentence segmentation to efficiently translate the long sentences that commonly appear. Each phase of the syntactic analysis applies its own independent syntactic rules in order to reduce parsing complexity. For this purpose, we classify the syntactic rules into 3 classes and design a 3-phase parsing algorithm. In particular, the syntactic rules in the 3rd class are for sentence structures composed with commas. We present an automatic method for acquiring 3rd-class rules from the syntactic analysis of a corpus, with which we aim to continuously improve the coverage of the parsing. The experimental results show that the proposed 3-phase parsing method is superior to the prior parsing method, which uses only intra-sentence segmentation, in terms of parsing speed and memory efficiency while maintaining translation quality.