• Title/Summary/Keyword: Korean corpus

Search Result 1,199, Processing Time 0.029 seconds

A Study of Research on Methods of Automated Biomedical Document Classification using Topic Modeling and Deep Learning (토픽모델링과 딥 러닝을 활용한 생의학 문헌 자동 분류 기법 연구)

  • Yuk, JeeHee;Song, Min
    • Journal of the Korean Society for information Management
    • /
    • v.35 no.2
    • /
    • pp.63-88
    • /
    • 2018
  • This research evaluated differences of classification performance for feature selection methods using LDA topic model and Doc2Vec which is based on word embedding using deep learning, feature corpus sizes and classification algorithms. In addition to find the feature corpus with high performance of classification, an experiment was conducted using feature corpus was composed differently according to the location of the document and by adjusting the size of the feature corpus. Conclusionally, in the experiments using deep learning evaluate training frequency and specifically considered information for context inference. This study constructed biomedical document dataset, Disease-35083 which consisted biomedical scholarly documents provided by PMC and categorized by the disease category. Throughout the study this research verifies which type and size of feature corpus produces the highest performance and, also suggests some feature corpus which carry an extensibility to specific feature by displaying efficiency during the training time. Additionally, this research compares the differences between deep learning and existing method and suggests an appropriate method by classification environment.

Enhancement of a language model using two separate corpora of distinct characteristics

  • Cho, Sehyeong;Chung, Tae-Sun
    • Journal of the Korean Institute of Intelligent Systems
    • /
    • v.14 no.3
    • /
    • pp.357-362
    • /
    • 2004
  • Language models are essential in predicting the next word in a spoken sentence, thereby enhancing the speech recognition accuracy, among other things. However, spoken language domains are too numerous, and therefore developers suffer from the lack of corpora with sufficient sizes. This paper proposes a method of combining two n-gram language models, one constructed from a very small corpus of the right domain of interest, the other constructed from a large but less adequate corpus, resulting in a significantly enhanced language model. This method is based on the observation that a small corpus from the right domain has high quality n-grams but has serious sparseness problem, while a large corpus from a different domain has more n-gram statistics but incorrectly biased. With our approach, two n-gram statistics are combined by extending the idea of Katz's backoff and therefore is called a dual-source backoff. We ran experiments with 3-gram language models constructed from newspaper corpora of several million to tens of million words together with models from smaller broadcast news corpora. The target domain was broadcast news. We obtained significant improvement (30%) by incorporating a small corpus around one thirtieth size of the newspaper corpus.

Effects of Shingi-whan on the Male Reproductive and Sexual Function : Enhancing Spermatogenesis, Reducing Testicular Toxicity, and Relaxing Smooth Muscle of Corpus Cavernosum (신기환(腎氣丸)의 남성 생식기능 및 성기능 개선효과 : 정자생성 촉진, 고환독성 완화 및 음경해면체 평활근의 이완)

  • Seo, Il-Bok;Park, Sun-Young
    • The Korea Journal of Herbology
    • /
    • v.30 no.3
    • /
    • pp.55-61
    • /
    • 2015
  • Objectives : This study aimed to investigate the effects of Shingi-whan(SG) on the male reproductive and sexual function, so we measured the spermatogenesis and the testicular toxicity in mice and the vasorelaxation in isolated rabbit corpus cavernosum smooth muscle. Methods : To evaluate effect on the spermatogenesis in mice, we prepared two groups, control group and SG group that was orally administered SG(1,000mg/kg) for 20 days, and compared. To analyze testicular toxicity in mice, we also prepared two groups, doxo group that was injected with doxorubicin (3mg/kg) on three times and doxo + SG group that was injected with doxorubicin and SG for 20 days, and compared. To investigate sexual function of SG in mice, we prepared three groups, normal group and aging elicited group consisting of 18-month-old mice, SG treated aging group that was orally administered SG for 60 days, and compared using histochemical staining on mice corpus cavernous tissues. In order to define the relaxation effects of SG, rabbit corpus cavernous tissues were prepared in $2{\times}2{\times}6mm$ sized strip. Then the dose-dependent relaxation responses of SG at 0.01-3.0 mg/ml in contracted strips induced by phenylephrine were measured. Results : The sperm density in dutus epididymis and the diameter of seminiferous tubules of SG group was significantly increased when compared to control group. The testicular weight and the diameter and height of epithelial layer of seminiferous tubules of doxo + SG group was significantly increased when compared to doxo group. The cavernous strips were significantly relaxed by SG extract In SG treated aging group, ratio of smooth muscles to collagen fibers and red blood cell count in venous sinus was increased as compared to aging elicited group. Conclusions : Our findings have shown that SG extract have effect on spermatogenesis and mitigating effect on doxo-induced testicular toxicity. Further, it also have the vasorelaxant effect on rabbit corpus cavernosum.

A Case of Feeling of Cold on Legs Treated with Bee Venom and Scolopendrae Corpus Herbal Acupuncture (봉약침(蜂藥鍼), 오공약침료법(蜈蚣藥鍼療法)을 가미(加味)한 하지부(下肢部) 냉증(冷症) 치험 1례)

  • Lee, Yoon-Kyoung;Lim, Seong-Chul;Jung, Tae-Young;Han, Sang-Won;Seo, Jung-Chul
    • Journal of Pharmacopuncture
    • /
    • v.8 no.3
    • /
    • pp.129-135
    • /
    • 2005
  • Objective : This study was designed to investigate the effect of bee venom and Scolopendrae Corpus herbal acupuncture on the feeling of cold on legs. Methods : The patient was managed by bee venom and Scolopendrae Corpus herbal acupuncture, body acupuncture and herbal medicine. The following points were selected : BL40, BL57, BL60; SP6. After bee venom and Scolopendrae Corpus herbal acupuncture treatment, body acupuncture was performed at the same points. We evaluated the patient through Visual Analogue Scale(VAS) and Digital Infrared Thermal Imaging(D.I.T.I). Results : After 12 times of treatment, the patient showed that clinical symptoms was decreased, VAS changed from 10 to 3 and there was also improvement change on D.I.T.I.. Conclusions : According to the results, bee venom and Scolopendrae Corpus herbal acupuncture may have the effects on the feeling of cold on legs. But further studies and required to prove the effects of this methods.

The Relaxation Effects of Alpiniae Oxyphyllae Fructus on Isolated Corpus Cavernosum Smooth Muscle (益智仁의 음경해면체 평활근 이완효과)

  • Park, Sun-Young
    • The Korea Journal of Herbology
    • /
    • v.30 no.4
    • /
    • pp.71-79
    • /
    • 2015
  • Objectives : These present study was designed to investigate the relaxation effects of Alpiniae Oxyphyllae Fructus(AOF) on isolated corpus cavernosum smooth muscle.Methods : Rabbit corpus cavernous tissues were prepared in strip. Then relaxation responses of AOF at 0.01-3 ㎎/㎖ in contracted strips induced by phenylephrine(PE) were measured. To evaluate mechanisms, indomethacin(IM) tetraethylammonium chloride(TEA), Nω-nitro-L-arginine(L-NNA), methylene blue(MB) were treated before AOF extract(0.1-3 ㎎/㎖) infused into precontracted strips induced by PE. And 1 mM Ca2+was infused into precontracted strips after pretreatment of AOF extract(3 ㎎/㎖) in Ca2+-free krebs-ringer solution. NO concentration was measured by Griess reagent system. Ratio of smooth muscles to collagen fibers and eNOS positive reaction were measured by histocheminal and immunohistochemical process.Results : The cavernous strips were significantly relaxed by AOF extract 0.1, 0.3, 1, 3 ㎎/㎖ and the pretreatment with IM 10 μM,L-NNA 100 μM, MB 10 μM inhibited relaxation of AOF compared to non-pretreatment, but the pretreatment with TEA 100 μM didn't affect relaxation of AOF. In a Ca2+-free solution, pretreatment with AOF reduced increase on contraction of strips by Ca2+supply than non-pretreatment. On HUVEC, NO concentration was increased. On corpus cavernosum of penis in Spontaneous Hypertensive Rat, ratio of smooth muscles to collagen fibers and eNOS positive reaction in AOF group were increased compared to PE groupConclusions : Taken this results, we can suggest that AOF extract exerts a relaxation effects on rabbit corpus cavernosum smooth muscle in part by suppressing influx of extracellular Ca2+throughout prostacyclin, the NO-cGMP system.

Modification of Gene Expression of Connexins in the Rat Corpus Epididymis by Estradiol Benzoate or Flutamide Exposure at the Early Neonatal Age

  • Lee, Ki-Ho
    • Development and Reproduction
    • /
    • v.19 no.2
    • /
    • pp.69-77
    • /
    • 2015
  • Cell-cell direct communication through channel-forming molecules, connexin (Cx), is essential for a tissue to exchange signaling molecules between neighboring cells and establish unique functional characteristics during postnatal development. The corpus epididymis is a well-known androgen-responsive tissue and involves in proper sperm maturation. In the present research, it was attempted to determine if expression of Cx isoforms in the corpus epididymis in the adult is modulated by exposure to estrogenic or anti-androgenic compound during the early postnatal period. The neonatal male rats at 7 days of age were subcutaneously injected by estradiol benzoate (EB) at low-dose ($0.015{\mu}g/kg$ body weight) or high-dose ($1.5{\mu}g/kg$ body weight) or flutamide (Flu) at low-dose ($500{\mu}g/kg$ body weight) or high-dose (50 mg/kg body weight). The corpus epididymis collected at 4 months of age was subjected to evaluate expressional changes of Cx isoforms by quantitative real-time PCR. Treatment of low-dose EB resulted in increases of Cx32, Cx37, and Cx45 transcript levels, while exposure to high-dose EB decreased expression of Cx26, Cx30.3, Cx31, Cx31.1, Cx32, Cx40, Cx43, and Cx45. Treatments of Flu caused significant decreases of expression of all examined Cx isoforms, except Cx37 and Cx43 shown no expressional change with high-dose Flu treatment. These findings imply that expression of most Cx isoforms present in the corpus epididymis would be transcriptionally regulated by actions of androgen and/or estrogen during postnatal period.

Named Entity and Event Annotation Tool for Cultural Heritage Information Corpus Construction (문화유산정보 말뭉치 구축을 위한 개체명 및 이벤트 부착 도구)

  • Choi, Ji-Ye;Kim, Myung-Keun;Park, So-Young
    • Journal of the Korea Society of Computer and Information
    • /
    • v.17 no.9
    • /
    • pp.29-38
    • /
    • 2012
  • In this paper, we propose a named entity and event annotation tool for cultural heritage information corpus construction. Focusing on time, location, person, and event suitable for cultural heritage information management, the annotator writes the named entities and events with the proposed tool. In order to easily annotate the named entities and the events, the proposed tool automatically annotates the location information such as the line number or the word number, and shows the corresponding string, formatted as both bold and italic, in the raw text. For the purpose of reducing the costs of the manual annotation, the proposed tool utilizes the patterns to automatically recognize the named entities. Considering the very little training corpus, the proposed tool extracts simple rule patterns. To avoid error propagation, the proposed patterns are extracted from the raw text without any additional process. Experimental results show that the proposed tool reduces more than half of the manual annotation costs.

ToBI and beyond: Phonetic intonation of Seoul Korean ani in Korean Intonation Corpus (KICo)

  • Ji-eun Kim
    • Phonetics and Speech Sciences
    • /
    • v.16 no.1
    • /
    • pp.1-9
    • /
    • 2024
  • This study investigated the variation in the intonation of Seoul Korean interjection ani across different meanings ("no" and "really?") and speech levels (Intimate and Polite) using data from Korean Intonation Corpus (KICo). The investigation was conducted in two stages. First, IP-final tones in the dataset were categorized according to the K-ToBI convention (Jun, 2000). While significant relationships were observed between the meaning of ani and its IP-final tones, substantial overlap between groups was notable. Second, the F0 characteristics of the final syllable of ani were analyzed to elucidate the apparent many-to-many relationships between intonation and meaning/speech level. Results indicated that these seemingly overlapping relationships could be significantly distinguished. Overall, this study advocates for a deeper analysis of phonetic intonation beyond ToBI-based categorical labels. By examining the F0 characteristics of the IP-final syllable, previously unclear connections between meaning/speech level and intonation become more comprehensible. Although ToBI remains a valuable tool and framework for studying intonation, it is imperative to explore beyond these categories to grasp the "distinctiveness" of intonation, thereby enriching our understanding of prosody.

The Study on the Principles of Selecting Korean Particle 'Ka' and 'Nun' Using Korean-English Parallel Corpus (한영 병렬 말뭉치를 이용한 한국어 조사 '가'와 '는'의 선택 원리 연구)

  • Yoo, Hyun-Kyung;An, Ye-Ri;Yang, Su-Hyang
    • Language and Information
    • /
    • v.11 no.1
    • /
    • pp.1-23
    • /
    • 2007
  • This study aims to research into the meaning of Korean particle 'ka' and 'nun' inductively by examining the correspondences of those particles and English articles on the Korean-English parallel corpus. The correspondences were checked in three ways: semantically, syntactically and pragmatically. This study found that when the semantic or syntactic tier is not salient, the pragmatic tier is activated and particles are selected according to the pragmatic elements such as the amount of information or the change of topic. However, if the meaning of the particles is salient or if there is any syntactic motive, particles are selected in accordance with the semantic or syntactic elements. Former studies which focused on one of those three tiers cannot properly explain such correspondences on the Korean-English parallel corpus. This study shows that semantic, syntactic and pragmatic tiers hierarchically affect the selection of a particle and that the selection process is also related to speaker's intention. This dimensional analysis of particles is expected to contribute to theoretical studies and applied studies like Korean language education as well.

  • PDF

Morphological Observations of Ovaries in Relation to Infertility in Slaughtered Cows in Kyungnam Province 2. Incidences and Morphological Findings of Ovarian Cysts (경남지방의 도태우에 불임과 관련된 난소의 형태학적 관찰 2. 난소낭종의 발생과 낭종형태에 대하여)

  • 곽수동;표병민;양재훈;김철호;서득록;고필옥;강정부
    • Journal of Veterinary Clinics
    • /
    • v.19 no.2
    • /
    • pp.153-158
    • /
    • 2002
  • Ovaries from total 192 slaughtered cows(154 Korean native cows and 38 Holstein cows) were collected during the slaughtering process in Kimhae, Changyoung and Yangsan abattoirs in Kyungnam province from January 2001 to January 2002. In order to investigate incidence of the ovarian cysts, anatomical, histological observations were performed and also TUNEL methods and PCNA antibody by immunogistochemical methods for diagnostic accuracy of cysts in a few ovaries were applied. Apoptotic positive cells by TUNEL method appeared not or a few in cystic walls but appeared more number in normal large follicular walls and the proliferative positive cells by PCNA antibody appeared numerous in normal large follicular walls but not or a few in cystic walls. The incident rates of ovarian cysts were 19.5% in Korean native cows and 18.4% in Holstein cows. The incident rates of ovarian cysts in Holstein cows were lower than that of Koran native cows. The incident rates of follicular cysts and luteal cysts in Korean native cows were 11.7% and 7.8% respectively. The incident rates of follicular cysts and luteal cysts in Holstein cows were 10.5% and 7.9%, respectively. Higher incidence proportions of ovarian cysts according to seasons in Korean native cows were ordered as spring (29.8%), autumn (21.4%) winter (14.3%) and summer (6.7%). Rates of cows with single cyst and multiple cysts were 63.3%(19 heads /30 heads) and 36.7%(11 heads/30 heads) in 30 cystic Korean native cows, respectively. Cystic cows with corpus luteums were 50.0%(15 heads) in 30 Korean native cows and 42.9%(3 heads) in 7 dairy cows, respectively. Among 15 cystic Korean native cows with corpus luteums, rates of cows with single corpus luteum were 66.7%(10 heads) and rates of multiple corpus luteum were 33.3%(5 heads ), respectively. The average diameter of cysts and corpus luteums in cystic ovaries were 21.0$\times$17.1 mm and 18.1$\times$13.8 mm in 30 Korean native cows and 20.6$\times$17.7 mm and 19.3 $\times$ 14.9 mm in 7 Holstein cows, respectively. So the average sizes of cysts in cystic ovaries were larger than those of corpus luteums.