• Title/Summary/Keyword: Korean Corpus

Search Result 1,201, Processing Time 0.039 seconds

A Study on the Performance Improvement of Machine Translation Using Public Korean-English Parallel Corpus (공공 한영 병렬 말뭉치를 이용한 기계번역 성능 향상 연구)

  • Park, Chanjun;Lim, Heuiseok
    • Journal of Digital Convergence
    • /
    • v.18 no.6
    • /
    • pp.271-277
    • /
    • 2020
  • Machine translation refers to software that translates a source language into a target language, and has been actively researching Neural Machine Translation through rule-based and statistical-based machine translation. One of the important factors in the Neural Machine Translation is to extract high quality parallel corpus, which has not been easy to find high quality parallel corpus of Korean language pairs. Recently, the AI HUB of the National Information Society Agency(NIA) unveiled a high-quality 1.6 million sentences Korean-English parallel corpus. This paper attempts to verify the quality of each data through performance comparison with the data published by AI Hub and OpenSubtitles, the most popular Korean-English parallel corpus. As test data, objectivity was secured by using test set published by IWSLT, official test set for Korean-English machine translation. Experimental results show better performance than the existing papers tested with the same test set, and this shows the importance of high quality data.

Relaxing Effects of Acanthopanacis Cortex through NO Production and PDE-5 Inhibition in Corpus Cavernosum (오가피의 NO 생성과 PDE-5 억제를 통한 음경해면체 이완효과)

  • Kim, Ho Hyun;Park, Sun Young
    • Journal of Physiology & Pathology in Korean Medicine
    • /
    • v.31 no.1
    • /
    • pp.52-58
    • /
    • 2017
  • This study was aimed to examine relaxing effects of Acanthopanacis cortex(AC) through nitric oxide(NO) production and phosphodiesterase type 5(PDE-5) inhibition in corpus cavernosum. In order to define the relaxation effects of AC extract, rabbit corpus cavernous tissues were prepared in $2{\times}2{\times}8mm$ sized strip. AC extract ($0.01-3.0mg/m{\ell}$) were treated in contracted strips induced by phenylephrine(PE) and $N{\omega}$-nitro-L-arginine (L-NNA) was treated before AC extract-treated. And calcium chloride($Ca^{2+}$) 1 mM was infused into precontracted strips after pretreatment of AC extract in $Ca^{2+}-free$ krebs-ringer solution. When AC extract was applied to human umbilical vein endothelial cell(HUVEC), cell viability was measured by MTT assay, and NO concentration was measured by Griess reagent system. Ratio of smooth muscles to collagen fibers and eNOS, PDE-5 positive reaction were measured by histochemical and immunohistochemical process on mice corpus cavernosum. AC extract significantly affected relaxion of the cavernous strips, and the pretreatment of L-NNA inhibited AC extract-induced relaxation. Contraction induced by the addition of $Ca^{2+}$ was inhibited by treatment with the AC extract in $Ca^{2+}-free$ solution. In AC group, NO concentration, ratio of smooth muscle to collagen fibers, and eNOS positive reaction were increased, PDE-5 positive reaction was decreased compared to PE group. As a result of the above experiment, it was thought that AC extract inhibits the inflow of extracellular $Ca^{2+}$ by activating cGMP through the increase of eNOS / NO and the decrease of PDE-5 which inhibits cGMP activity, in the corpus cavernosum.

Effects of Oja-Shingiwhan in Contracted Corpus Cavernosum Smooth Muscle (五子腎氣丸이 음경해면체 평활근의 수축에 미치는 영향)

  • Park, Jeong Su;Ahn, Sang Hyun;Park, Sun Young
    • Journal of Physiology & Pathology in Korean Medicine
    • /
    • v.30 no.5
    • /
    • pp.308-313
    • /
    • 2016
  • The purpose of this study is to investigate the effects of Oja-Shingiwhan(OS) in contracted corpus cavernosum smooth muscle and its mechanism. To evaluate the relaxation of OS in contracted corpus cavernosum, OS was treated in strips which were precontracted with phenylephrine(PE). To examine its mechanism, OS was treated into corporal strips contracted by PE after pretreatment of Nω-nitro-L-arginine(L-NNA) and compared with non-pretreatment of L-NNA. In calcium chloride(Ca2+)-free krebs solution, Ca2+ 1 mM was treated into corporal strips contracted by PE after pretreatment of OS and compared with non-pretreatment of OS. action were measured by histochemical, immunohistochemical methods. OS significantly affected on the relaxation of corporal strips, and the relaxation effects were inhibited by pretreatment of L-NNA. Contractions induced by Ca2+ influx were inhibited by pretreatment of OS in Ca2+-free krebs solution. OS increased eNOS positive reaction in corpus cavernosum, but decreased PDE-5 positive reaction. These result suggest that the effect of OS in contracted corpus cavernosum smooth muscle are shown by suppressing extracellular Ca2+ influx and increase of eNOS, NO production and decrease of PDE-5.

Morphological Observations of Ovaries in Relation to Infertility in Slaughtered Cows in Kyungnam Province 1. Appearance of follicles and corpus luteums in cow ovaries (경남지방의 도태우에 불임과 관련된 난소의 형태학적 관찰 1. 난포와 황체의 출현에 대하여)

  • 양재훈;표병민;서득록;고필옥;강정부;김종섭;곽수동
    • Journal of Veterinary Clinics
    • /
    • v.19 no.2
    • /
    • pp.147-152
    • /
    • 2002
  • Ovaries from total 192 slaughtered cows, 154 Korean native cows and 38 dairy cows were collected during the slaughtering process in Kimhae, Changyoung and Yangsan abattoirs in Kyungnam province from January 2001 to January 2002. Rates of pregnant and non-pregnant and ovarian findings were invested. Rates of pregnant cows in 192 slaughtered cows were 12.5% (24 cows) and in difference of cow breeds, 11.0% (17 cows) in 154 Korean native cows and 18.4% (7 cows) in 38 dairy cows from total 192 cows, respectively. Ages of fetuses in pregnant Korean native cows were mostly less than 4 months and ages of fetuses in dairy cows were mostly about 7-8 months. Cows which each diameter of follicles and corpus luteums in same cow was more than 5-6 mm in diameter were 69.8% (134 cows) in total 192 slaughtered cows and in difference of cow breeds, 64.7% (11 cows) in 17 Korean native cows and 57.1% (4 cows) in 7 dairy cows. Mean diameter of foliicles and corpus luteums in Korean native cows are 13.7$\pm$5.6$\times$ 11.2$\pm$4.6mm and 17.5$\pm$4.6$\times$14.6$\pm$4.0 mm in non-pregnat cows, and are 11.0$\pm$4.8$\times$9.1 $\pm$ 2.6mm and 21.2$\pm$2.9$\times$18.3$\pm$ 2.7 mm in pregnant cows, respectively. Mean diameter of follicles and corpus luteums in dairy cows are 15.8$\pm$7.1 $\times$ 14.3$\pm$ 6.0 mm and 20.3$\pm$5.9$\times$16.9$\pm$ 5.8 mm in non-pregnant cows, and are 10.1 $\pm$ 3.0$\times$9.2$\pm$2.3 mm and 23.0$\pm$ 1.7$\times$20.1 $\pm$ 1.3 mm in pregnant cows, respectivley. The above findings indicate that the co-appearance rate of follicles and corpus luteums in same cows are higher in both pregnant and non-pregnant cows. Compared in pregnant and non-pregnant cow ovaries, mean size of follicles are smaller in pregnant cows but size of corpus luteums are more larger in pregnant cows than in non-pregnant cows. Correlation of the follicle size (Y) and corpus luteum size (X) in same cows developed each other in inversive size. Those correlative formulas appeared to be Y = -0.2022X+17.175 in Korean native cows and Y= -0.5754 X+24.153 in dairy cows.

Improvement of Korean Homograph Disambiguation using Korean Lexical Semantic Network (UWordMap) (한국어 어휘의미망(UWordMap)을 이용한 동형이의어 분별 개선)

  • Shin, Joon-Choul;Ock, Cheol-Young
    • Journal of KIISE
    • /
    • v.43 no.1
    • /
    • pp.71-79
    • /
    • 2016
  • Disambiguation of homographs is an important job in Korean semantic processing and has been researched for long time. Recently, machine learning approaches have demonstrated good results in accuracy and speed. Other knowledge-based approaches are being researched for untrained words. This paper proposes a hybrid method based on the machine learning approach that uses a lexical semantic network. The use of a hybrid approach creates an additional corpus from subcategorization information and trains this additional corpus. A homograph tagging phase uses the hypernym of the homograph and an additional corpus. Experimentation with the Sejong Corpus and UWordMap demonstrates the hybrid method is to be effective with an increase in accuracy from 96.51% to 96.52%.

The significance of corpus callosal size in the estimation of neurologically abnormal infants (신경학적인 결함이 있었던 영아의 예후 판단에서 뇌량 크기의 중요성)

  • Yu, Seung Taek;Lee, Chang Woo
    • Clinical and Experimental Pediatrics
    • /
    • v.51 no.11
    • /
    • pp.1205-1210
    • /
    • 2008
  • Purpose : The development of the corpus callosum occupies the entire period of cerebral formation. The myelination pattern on magnetic resonance imaging (MRI) is very useful to evaluate neurologic development and to predict neurologic outcome in high risk infants. The thickness of the corpus callosum is believed to depend on the myelination process. It is possible to calculate the length and thickness of the corpus callosum on MRI. Thus, we can quantitatively evaluate the development of the corpus callosum. We investigated the clinical significance of measuring various portions of the corpus callosum in neonate with neurologic disorders such as hypoxic brain damage and seizure disorder. Methods : Forty-two neonates were evaluated by brain MRI. We measured the size of the genu, body, transitional zone, splenium, and length of the corpus callosum. Each measurement was divided by the total length of the corpus callosum to obtain its corrected size. The ratio of corpus callosal length and the anteroposterior diameter of the brain was also measured. Results : There was no statistical significance in the sample size of each part of the corpus callosum. However, the corrected size or the ratio of body of the corpus callosum correlated with periventricular leukomalacia and hypoxic ischemic encephalopathy. Conclusion : The abnormal size of the corpus callosum showed a good correlation with periventricular leukomalacia and hypoxic ischemic encephalopathy in neonates. We can predict clinical neurological problems by estimation of the corpus callosum in the neonatal period.

A Case of Radial Nerve Palsy Treated with Additional Scolopendrae Corpus Herbal-Acupuncture (오공(蜈蚣) 약침(藥鍼)을 병행한 요골신경마비 치험 1례(例))

  • Lee, Yoon-Kyung;Lim, Seong-Chul;Jung, Tae-Young;Han, Sang-Won;Seo, Jung-Chul
    • Journal of Pharmacopuncture
    • /
    • v.8 no.2
    • /
    • pp.91-95
    • /
    • 2005
  • Objective : The purpose of this study is to report the patient with radial nerve palsy, who improved by Scolopendrae Corpus Herbal-Acupuncture and other Oriental medical treatments. Methods : The patient was managed by Scolopendrae Corpus Herbal-Acupuncture, body acupuncture, physical theraphy and herbal medicine. We took picture of the patient's wrist and checked the power of muscles. Result : After 4 week treatment, the movement and power of wrist was restored to nearly normal range. Conclusions : The results suggest that combination of Scolopendrae Corpus Herbal-Acupuncture and other Oriental medical treatments is good method for treatment of radial nerve palsy. But further studies are required to concretely prove the effectiveness of this methods.

Vocabulary Coverage Improvement for Embedded Continuous Speech Recognition Using Knowledgebase (지식베이스를 이용한 임베디드용 연속음성인식의 어휘 적용률 개선)

  • Kim, Kwang-Ho;Lim, Min-Kyu;Kim, Ji-Hwan
    • MALSORI
    • /
    • v.68
    • /
    • pp.115-126
    • /
    • 2008
  • In this paper, we propose a vocabulary coverage improvement method for embedded continuous speech recognition (CSR) using knowledgebase. A vocabulary in CSR is normally derived from a word frequency list. Therefore, the vocabulary coverage is dependent on a corpus. In the previous research, we presented an improved way of vocabulary generation using part-of-speech (POS) tagged corpus. We analyzed all words paired with 101 among 152 POS tags and decided on a set of words which have to be included in vocabularies of any size. However, for the other 51 POS tags (e.g. nouns, verbs), the vocabulary inclusion of words paired with such POS tags are still based on word frequency counted on a corpus. In this paper, we propose a corpus independent word inclusion method for noun-, verb-, and named entity(NE)-related POS tags using knowledgebase. For noun-related POS tags, we generate synonym groups and analyze their relative importance using Google search. Then, we categorize verbs by lemma and analyze relative importance of each lemma from a pre-analyzed statistic for verbs. We determine the inclusion order of NEs through Google search. The proposed method shows better coverage for the test short message service (SMS) text corpus.

  • PDF

Quantifying L2ers' phraseological competence and text quality in L2 English writing (L2 영어 학습자들의 연어 사용 능숙도와 텍스트 질 사이의 수치화)

  • Kwon, Junhyeok;Kim, Jaejun;Kim, Yoolae;Park, Myung-Kwan;Song, Sanghoun
    • 한국어정보학회:학술대회논문집
    • /
    • 2017.10a
    • /
    • pp.281-284
    • /
    • 2017
  • On the basis of studies that show multi-word combinations, that is the field of phraseology, this study aims to examine relationship between the quality of text and phraseological competence in L2 English writing, following Yves Bestegen et al. (2014). Using two different association scores, t-score and Mutual Information(MI), which are opposite ways of measuring phraseological competence, in terms of scoring frequency and infrequency, bigrams from L2 writers' text scored based on a reference corpus, GloWbE (Corpus of Global Web based English). On a cross-sectional approach, we propose that the quality of the essays and the mean MI score of the bigram extracted from YELC, Yonsei English Learner Corpus, correlated to each other. The negative scores of bigrams are also correlated with the quality of the essays in the way that these bigrams are absent from the reference corpus, that is mostly ungrammatical. It indicates that increase in the proportion of the negative scored bigrams debases the quality of essays. The conclusion shows the quality of the essays scored by MI and t-score on cross-sectional approach, and application to teaching method and assessment for second language writing proficiency.

  • PDF

A Corpus-based Analysis of EFL Learners' Use of Hedges in Cross-cultural Communication

  • Min, Su-Jung
    • English Language & Literature Teaching
    • /
    • v.16 no.4
    • /
    • pp.91-106
    • /
    • 2010
  • This study examines the use of hedges in cross-cultural communication between EFL learners in an e-learning environment. The study analyzes the use of hedges in a corpus of an interactive web with a bulletin board system through which college students of English at Japanese and Korean universities interacted with each other discussing the topics of local and global issues. It compares the use of hedges in the students' corpus to that of a native English speakers' corpus. The result shows that EFL learners tend to use relatively smaller number of hedges than the native speakers in terms of the frequencies of the total tokens. It further reveals that the learners' overuse of a single versatile high-frequency hedging item, I think, results in relative underuse of other hedging devices. This indicates that due to their small repertoire of hedges, EFL learners' overuse of a limited number of hedging items may cause their speech or writing to become less competent. Based on the result and interviews with the learners, the study also argues that hedging should be understood in its social contexts and should not be understood just as a lack of conviction or a mark of low proficiency. Suggestions were made for using computer corpora in understanding EFL learners' language difficulties and helping them develop communicative and pragmatic competence.

  • PDF