• Title/Summary/Keyword: corpora

A Word Embedding used Word Sense and Feature Mirror Model (단어 의미와 자질 거울 모델을 이용한 단어 임베딩)

  • Lee, JuSang;Shin, JoonChoul;Ock, CheolYoung
    • KIISE Transactions on Computing Practices / v.23 no.4 / pp.226-231 / 2017
  • Word representation, an important area of machine-learning-based natural language processing (NLP), is a method that represents a word not as text but as a distinguishable symbol. Existing word embedding methods employ a large number of corpora so that words that occur near each other in text are positioned close together. However, corpus-based word embedding needs several corpora because of the frequency of word occurrence and the increasing number of words. In this paper, word embedding is done using dictionary definitions and semantic relationship information (hypernyms and antonyms). Words are trained using the feature mirror model (FMM), a modified Skip-Gram (Word2Vec). Words with similar senses have similar vectors. Furthermore, it was possible to distinguish the vectors of antonyms.
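
The idea of training on dictionary definitions rather than running text can be illustrated with a minimal sketch. It is not the feature mirror model, whose details the abstract does not give; it only shows how Skip-Gram-style (target, context) training pairs might be built when each headword's "context" is its own definition. The function name `definition_pairs` and the toy dictionary are assumptions made for illustration.

```python
from typing import Dict, List, Tuple


def definition_pairs(dictionary: Dict[str, str]) -> List[Tuple[str, str]]:
    """Pair each headword with every token of its definition.

    Hypothetical helper: the definition plays the role that a sliding
    window over corpus text plays in ordinary Skip-Gram training.
    """
    pairs: List[Tuple[str, str]] = []
    for headword, definition in dictionary.items():
        for token in definition.lower().split():
            if token != headword:  # skip self-referential definitions
                pairs.append((headword, token))
    return pairs


# Toy dictionary (invented for illustration, not from the paper).
toy_dictionary = {
    "hot": "having a high temperature",
    "cold": "having a low temperature",
}
pairs = definition_pairs(toy_dictionary)
```

In this toy case "hot" and "cold" share three of their four context tokens, which hints at why antonyms tend to end up with similar vectors unless extra relationship information, such as the antonym links the abstract mentions, is used to separate them.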

Compilation of the Yonsei English Learner Corpus (YELC) 2011 and Its Use for Understanding Current Usage of English by Korean Pre-university Students (한국 예비 대학생의 영어 사용 특성 파악을 위한 대규모 공개 영어 학습자 코퍼스 구축 및 분석)

  • Rhee, Seok-Chae;Jung, Chae Kwan
    • The Journal of the Korea Contents Association / v.14 no.11 / pp.1019-1029 / 2014
  • In recent years, researchers have become increasingly interested in the creation and pedagogical use of English learner corpora. Many studies have shown that learner corpora can not only make a significant contribution to second language acquisition research but also contribute to the construction and evaluation of language tests by advancing our understanding of English learners. So far, however, little attention has been paid to corpora of Korean EFL (English as a foreign language) learners. The Yonsei English Learner Corpus (YELC 2011) is a specialized, monolingual, and synchronic Korean EFL learner corpus that was developed by Yonsei University from 2011 to 2012. Over 3,000 Korean high school graduates (or equivalents) who had been accepted by Yonsei University for further study participated in this project. The corpus consists of 6,572 written texts (1,085,828 words) at nine different English proficiency levels. In this paper, we describe its compilation and, more specifically, how we converted a text archive into a corpus. After introducing this process of corpusization, we report notable insights into the specific linguistic features that characterize Korean learners of English at different proficiency levels. This study also discusses the potential uses of YELC 2011, which is now freely available for research purposes.

Roles of Conceptus Secretory Proteins in Establishment and Maintenance of Pregnancy in Ruminants

  • Bazer, Fuller W.;Song, Gwon-Hwa;Thatcher, William W.
    • Asian-Australasian Journal of Animal Sciences / v.25 no.1 / pp.1-16 / 2012
  • Reproduction in ruminant species is a highly complex biological process requiring a dialogue between the developing conceptus (embryo-fetus and associated placental membranes) and the maternal uterus, which must be established during the peri-implantation period for pregnancy recognition signaling and regulation of gene expression by uterine epithelial and stromal cells. The uterus provides a microenvironment in which molecules secreted by uterine epithelia and transported into the uterine lumen constitute histotroph, also known as the secretome, which is required for growth and development of the conceptus and for receptivity of the uterus to implantation by the elongating conceptus. Pregnancy recognition signaling is required to sustain the functional lifespan of the corpora lutea for production of progesterone, which is essential for uterine functions supportive of implantation and placentation required for successful outcomes of pregnancy. It is within the peri-implantation period that most embryonic deaths occur in ruminants, due to deficiencies attributed to uterine functions or failure of the conceptus to develop appropriately, signal pregnancy recognition, and/or undergo implantation and placentation. The endocrine status of the pregnant ruminant and her nutritional status are critical for successful establishment and maintenance of pregnancy. The challenge is to understand the complexity of key mechanisms that are characteristic of successful reproduction in humans and animals and to use that knowledge to enhance fertility and reproductive health of ruminant species in livestock enterprises.

Ovarian Response and Profile of Plasma Sex Steroids in Goats Against Combined Administration of FSH and LH Isolated from the Pituitaries of Buffaloes

  • Taru Sharma, G.;Pande, J.K.;Sanwal, P.C.;Varshney, V.P.
    • Asian-Australasian Journal of Animal Sciences / v.10 no.5 / pp.514-518 / 1997
  • This study was designed to record the ovarian response to combined administration of heterologous buffalo FSH (buFSH) and LH (buLH) in goats. The impact of such a treatment on ovarian structures and on the plasma profile of the ovarian sex steroids (estradiol-$17\beta$ and progesterone) was studied. The buFSH and buLH were isolated from buffalo pituitaries by a procedure involving ethanolic extraction and acetone precipitation, followed by metaphosphoric acid-ammonium sulphate fractionation. Both gonadotrophin preparations were found to be biologically active and potent. There was an increase in the total number of follicles in the treated group ($12.66 \pm 1.24$) compared with the control group ($8.50 \pm 2.06$). However, the percentage of large follicles was reduced from $51.48 \pm 6.37$ to $23.74 \pm 5.93$ following the treatment. The number of corpora lutea (C.L.) was significantly higher in the treated group ($2.33 \pm 0.47$ C.L.) than in the control group (1 C.L.). The peak plasma estradiol-$17\beta$ level was much higher in the treated group ($17.16 \pm 9.52$ pg/ml) than in the control group ($7.22 \pm 1.67$ pg/ml). A similar trend was observed for progesterone levels (higher in the treated group). This study thus indicated that combined administration of heterologous buffalo FSH and LH to goats accelerated the development of larger follicles nearing the ovulation stage. This follicle population was subsequently reduced, leading to the formation of the increased number of corpora lutea observed in this study.

Rule Construction for Determination of Thematic Roles by Using Large Corpora and Computational Dictionaries (대규모 말뭉치와 전산 언어 사전을 이용한 의미역 결정 규칙의 구축)

  • Kang, Sin-Jae;Park, Jung-Hye
    • The KIPS Transactions:PartB / v.10B no.2 / pp.219-228 / 2003
  • This paper presents an efficient method for constructing rules that determine thematic roles from syntactic relations in Korean language processing. This process is one of the core tasks of semantic analysis and an important issue to be solved in natural language processing. It is problematic to write rules for determining thematic roles using only general linguistic knowledge and experience, since the final result may differ according to the subjective views of researchers, and it is impossible to construct rules that cover all cases. Our method, however, is objective and efficient because it draws on large corpora, which contain practical usages of the Korean language, and on case frames in the Sejong Electronic Lexicon of Korean, which is being developed by dozens of Korean linguistic researchers. To determine thematic roles more accurately, our system uses syntactic relations, semantic classes, morpheme information, and the position of double subjects. In particular, using semantic classes increases the applicability of the rules.

Anatomical and Histological Features and Ovarian Hormone Analysis of Ovarian Cysts in Korean Native Cow and Dairy Cow (한우(韓牛) 및 유우(乳牛)의 난소난종(卵巢囊腫)에 관한 해부조직학적(解剖組織學的) 소견(所見) 및 난소(卵巢)호르몬 분석(分析))

  • Kang, Byung-kyu;Choi, Han-sun;Chung, Young-ki
    • Korean Journal of Veterinary Research / v.27 no.1 / pp.141-151 / 1987
  • A total of 1200 Korean native cow and 240 dairy cow genitalia were collected during the slaughtering process at abattoirs in Seoul and Kwang Ju and were examined from July 1985 to March 1986. Ovarian follicles were classified as cystic if the diameter was greater than 2.5 cm or if follicles were multiple. In order to investigate the ovarian cysts, anatomical and histological examinations were performed. In addition, progesterone and estrogen levels in different types of cystic follicular fluid and in serum were measured by radioimmunoassay. The results are summarized as follows: 1. The incidence of ovarian cysts was 2.0% in Korean native cows and 7.9% in dairy cows. 2. Regarding the distribution of cysts among the left, right and both ovaries, the right ovary was affected most often. The frequency was 45.8% in right ovaries, 33.4% in left ovaries and 20.8% in both ovaries in Korean native cows. In dairy cows, by contrast, the frequency was 42.1% in right ovaries, 31.8% in both ovaries and 26.3% in left ovaries. 3. Six specimens (25.0%) from Korean native cows and six specimens (31.6%) from dairy cows were associated with corpora lutea in both ovaries. 4. Luteinization of the theca layer was most pronounced in groups 2Aa (71.4%) and 2Ba (38.5%), which were associated with no granulosa cells and with corpora lutea in the same cystic ovaries. 5. A correlation between progesterone concentrations in cystic fluid and serum was found only in groups 2Aa and 2Ab (r=0.86). Progesterone and estrogen concentrations in cystic fluid were closely related to the degree of degeneration of the granulosa cell layer. Cystic follicles consisting of thickened theca and degenerated granulosa cell layers contained a large amount of progesterone and a small amount of estrogen. In conclusion, various types of ovarian cysts with various levels of progesterone and estrogen were observed in Korean native cows.

Hypertrophical Changes of the Corpus Allatum Caused by Ovariectomy in Blattella germanica (난소제거 바퀴에서 알라타체의 이상비대화에 관한 연구)

  • Han, Sung-Sik;Kim, Kil-Heung;Scha, Coby
    • Applied Microscopy / v.28 no.1 / pp.91-105 / 1998
  • The present study investigates the hypertrophic changes of the corpus allatum (corpora allata, CA) after ovariectomy in Blattella germanica. In particular, it focuses on the ultrastructure under normal and ovariectomized conditions and on the factors that induce the hypertrophy. The ultrastructure of the CA from a newly emerged adult is similar to that of the last larval stage, which has stopped secreting juvenile hormone. The CA is composed of undifferentiated cells exhibiting an electron-lucent matrix, a few mitochondria and little smooth endoplasmic reticulum. The karyoplasm occupies most of the cytoplasm. Electron-dense materials fill the intercellular spaces, and gap junctions are also found. Almost no ultrastructural changes were noticed during the seven days before oviposition. However, considerable structural changes were detected soon after oviposition. Mitochondria increase dramatically in number and in cristae, and change to a filamentous form with high electron density. In addition, Golgi complexes, microtubules, and polysomes also increase. After an oviposition, the total volume of the CA decreases again. After ovariectomy, the volume of the CA increases continuously (hypertrophy). The morphology of the CA at an early stage after ovary removal is similar to its structure during secondary egg maturation. Large, electron-dense globules are observed in the cytoplasm of ovariectomized CA cells, and they remain in those cells for a long period of time. Yet this hypertrophy occurs only in specific cells. The hypertrophy is caused by hollowing of part of the CA cells and later filling of that site with polysomes. At 42 days after ovariectomy, the nuclear membranes disappear in the CA cells, which thus exhibit prokaryote-like features. Some results of the current study will contribute to establishing a model that explains unusual changes accompanying certain treatments in insects and, further, in other animals.

Lipid and Carbohydrate Contents in the Adult Hemolymph during Flight of the Oriental Tobacco Budworm (Helicoverpa assulta (Guenee)) (비행중인 담배나방의 혈림프내 지질과 탄수화물의 함량변화)

  • 정진교;부경생
    • Korean journal of applied entomology / v.31 no.4 / pp.329-337 / 1992
  • Studies were carried out to investigate changes in lipid and carbohydrate contents in the hemolymph of Oriental tobacco budworm (Helicoverpa assulta (Guenee)) adults during flight, and hormonal effects on the mobilization of energy sources in the hemolymph. Within a few minutes after the onset of flight, both sexes showed a rapid increase in lipid content, and the high level was maintained for about 2 hours. Carbohydrate content in the hemolymph, in contrast, showed almost no change during flight, apart from a slight increase during the first 10 min of flight in males only. Synthetic adipokinetic (Lom-AKH- n) and hypertrehalosemic (Bld-HrTH) hormones and a brain/corpora cardiaca extract of H. assulta adults elevated lipid and carbohydrate contents in the hemolymph, and the effect was much more pronounced for lipid. These results suggest that lipid is the main fuel for flight activity and that lipid mobilization is under hormonal control. This study also showed that both adipokinetic and hypertrehalosemic factors may exist in H. assulta and that these factors may have structures similar to those of Mas-AKH, Hez-HrTH, Lom-AKH- n or Pea-HrTH.

An Effective Estimation method for Lexical Probabilities in Korean Lexical Disambiguation (한국어 어휘 중의성 해소에서 어휘 확률에 대한 효과적인 평가 방법)

  • Lee, Ha-Gyu
    • The Transactions of the Korea Information Processing Society / v.3 no.6 / pp.1588-1597 / 1996
  • This paper describes an estimation method for lexical probabilities in Korean lexical disambiguation. In the stochastic approach to lexical disambiguation, lexical probabilities and contextual probabilities are generally estimated on the basis of statistical data extracted from corpora. For Korean, it is desirable to apply lexical probabilities at the level of word phrases, because sentences are spaced in units of word phrases. However, Korean word phrases are so varied in form that lexical probabilities often cannot be estimated directly for a given word phrase, even when fairly large corpora are used. To overcome this problem, this research defines a similarity between word phrases from the lexical analysis point of view and proposes an estimation method for Korean lexical probabilities based on that similarity. In this method, when a lexical probability for a word phrase cannot be estimated directly, it is estimated indirectly through a word phrase similar to the given one. Experimental results show that the proposed approach is effective for Korean lexical disambiguation.
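
The indirect estimation step can be sketched as follows. This is a minimal illustration under stated assumptions, not the paper's method: the abstract does not define its similarity measure, so `suffix_similarity` (shared-suffix length) is a stand-in, and the romanized word phrases and tag labels are invented toy data.

```python
from collections import Counter, defaultdict
from typing import Dict


def suffix_similarity(a: str, b: str) -> int:
    """Hypothetical similarity: length of the common suffix of a and b."""
    n = 0
    for x, y in zip(reversed(a), reversed(b)):
        if x != y:
            break
        n += 1
    return n


class LexicalModel:
    """Relative-frequency estimates of P(analysis | word phrase)."""

    def __init__(self) -> None:
        self.counts: Dict[str, Counter] = defaultdict(Counter)

    def observe(self, phrase: str, analysis: str) -> None:
        self.counts[phrase][analysis] += 1

    def probability(self, phrase: str, analysis: str) -> float:
        if not self.counts:
            return 0.0  # nothing observed yet
        source = phrase
        if phrase not in self.counts:
            # Unseen phrase: estimate indirectly through the most
            # similar phrase that was observed in the corpus.
            source = max(self.counts,
                         key=lambda p: suffix_similarity(p, phrase))
        analyses = self.counts[source]
        return analyses[analysis] / sum(analyses.values())


# Toy observations (invented for illustration).
model = LexicalModel()
for _ in range(3):
    model.observe("jib-e", "N+LOC")
model.observe("jib-e", "N+OTHER")
model.observe("meok-da", "V+END")

seen = model.probability("jib-e", "N+OTHER")
unseen = model.probability("hakkyo-e", "N+LOC")
```

Here "hakkyo-e" never occurs in the toy data, so its probability is borrowed from "jib-e", the observed phrase that shares the longest suffix with it.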

The Method of Color Image Processing Using Adaptive Saturation Enhancement Algorithm (적응형 채도 향상 알고리즘을 이용한 컬러 영상 처리 기법)

  • Yang, Kyoung-Ok;Yun, Jong-Ho;Cho, Hwa-Hyun;Choi, Myung-Ryul
    • The KIPS Transactions:PartB / v.14B no.3 s.113 / pp.145-152 / 2007
  • In this paper, we propose an automatic extraction model for unknown translations and implement an unknown translation extraction system using the proposed model. The proposed model is a phrase-alignment model that incorporates three component models: a phrase-boundary model, a language model, and a translation model. Using the proposed model, we implement a system for extracting unknown translations, which consists of three parts: construction of parallel corpora, alignment of Korean and English words, and extraction of unknown translations. To evaluate the performance of the proposed system, we have established a reference corpus for extracting unknown translations, which comprises 2,220 parallel sentences including about 1,500 unknown translations. Through several experiments, we have observed that the proposed model is very useful for extracting unknown translations. In the future, research on objective evaluation and on building high-quality parallel corpora should be carried out, and work on improving the performance of unknown translation extraction should continue.