• Title/Abstract/Keywords: corpus-based

Search results: 568 items (processing time: 0.026 s)

Comparison of Active Contour and Active Shape Approaches for Corpus Callosum Segmentation

  • Adiya, Enkhbolor;Izmantoko, Yonny S.;Choi, Heung-Kook
    • 한국멀티미디어학회논문지 / Vol. 16, No. 9 / pp. 1018-1030 / 2013
  • The corpus callosum is the largest connective structure in the brain, and its shape and size are correlated with sex, age, brain growth and degeneration, handedness, musical ability, and neurological diseases. Manually segmenting the corpus callosum from brain magnetic resonance (MR) images is time-consuming, error-prone, and operator-dependent. In this paper, two semi-automatic segmentation methods are presented: an active contour model-based approach and an active shape model-based approach. We tested these methods on an MR image of the human brain and found that the active contour approach had better segmentation accuracy but was slower than the active shape approach.

가상 예제와 Edit-distance 자질을 이용한 SVM 기반의 단백질명 인식 (SVM-based Protein Name Recognition using Edit-Distance Features Boosted by Virtual Examples)

  • Yi, Eun-Ji;Lee, Gary-Geunbae;Park, Soo-Jun
    • 한국생물정보학회: Conference Proceedings / 한국생물정보시스템생물학회, Proceedings of the 2nd Annual Conference, 2003 / pp. 95-100 / 2003
  • In this paper, we propose solutions to two of the main difficulties in named entity recognition in the biomedical domain: the large number of spelling variants and the lack of annotated corpora for training. To handle spelling variants, we propose using edit distance as a feature for the SVM. To address the lack of annotated data, we propose using virtual examples to expand the annotated corpus automatically, which is fast, efficient, and easy. The experimental results show that introducing edit distance yields some improvement in protein name recognition performance, and that the model trained on the corpus expanded with virtual examples outperforms the model trained on the original corpus. With the proposed methods, we achieve an F-measure of 75.80 (71.89% precision, 80.15% recall) in protein name recognition on the GENIA corpus (ver. 3.0).
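The edit-distance feature mentioned in this abstract can be illustrated with a minimal sketch. It assumes plain Levenshtein distance and a hypothetical lexicon of known protein names; the function names `edit_distance` and `min_normalized_distance`, the lowercasing, and the length normalization are illustrative assumptions, not the authors' actual feature definition.

```python
from typing import Iterable


def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance: minimum number of single-character
    insertions, deletions, and substitutions turning a into b."""
    # prev[j] holds the distance between the processed prefix of a and b[:j].
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]  # distance from a[:i] to the empty string
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(
                prev[j] + 1,         # delete ca
                curr[j - 1] + 1,     # insert cb
                prev[j - 1] + cost,  # substitute (or match)
            ))
        prev = curr
    return prev[-1]


def min_normalized_distance(token: str, lexicon: Iterable[str]) -> float:
    """Hypothetical feature: smallest length-normalized edit distance
    between a token and any entry of a lexicon (0.0 means exact match)."""
    return min(
        edit_distance(token.lower(), name.lower()) / max(len(token), len(name), 1)
        for name in lexicon
    )


if __name__ == "__main__":
    # The lexicon entries below are made up for illustration only.
    lexicon = ["IL-2", "p53"]
    print(min_normalized_distance("IL2", lexicon))
```

A value such as this could be appended to a token's feature vector so that a spelling variant close to a known name is scored similarly to the name itself. Separately, the reported F-measure is consistent with the harmonic mean of the reported precision and recall: 2 × 71.89 × 80.15 / (71.89 + 80.15) ≈ 75.80.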


백강잠(白殭蠶)이 남성 골다공증에 미치는 영향 (Effects of Bombycis Corpus on Male Osteoporosis)

  • 김호현;안상현;박선영
    • 동의생리병리학회지 / Vol. 33, No. 1 / pp. 56-62 / 2019
  • To investigate the effect of Bombycis Corpus on male osteoporosis, we used dual-energy X-ray absorptiometry (DEXA) and histochemical methods. The animals were male ICR mice aged 8 weeks and 50 weeks. The 8-week-old mice served as the control group, and the 50-week-old mice were assigned to the aging group and the Bombycis Corpus group (BC group). The aging group received 0.5 ml of distilled water once a day for 6 months. The BC group received Bombycis Corpus (0.78 g/kg) dissolved in distilled water once a day for 6 months. As a result, Bombycis Corpus decreased bone loss, increased bone density by reducing the aging-related loss of bone matrix in the femur, and increased osteoblast-induced osteopontin (OPN) and osteocalcin (OPC) positive reactions. In addition, administration of Bombycis Corpus decreased the receptor activator of nuclear factor kappa B ligand (RANKL) positive reaction, increased the osteoprotegerin (OPG) positive reaction, and decreased the matrix metalloproteinase-3 (MMP-3) and 8-hydroxy-2'-deoxyguanosine (8-OHdG) positive reactions. Taken together, Bombycis Corpus increases osteoblast activity, inhibits osteoclast function, inhibits bone tissue degradation, and inhibits bone loss due to oxidative stress. Bombycis Corpus was observed to reduce aging-related bone loss and increase bone density, thereby improving male osteoporosis. Therefore, Bombycis Corpus may be useful as a preventive and therapeutic agent for male osteoporosis.

A Corpus-based Analysis of EFL Learners' Use of Discourse Markers in Cross-cultural Communication

  • Min, Sujung
    • 영어어문교육 / Vol. 17, No. 3 / pp. 177-194 / 2011
  • This study examines the use of discourse markers in cross-cultural communication between EFL learners in an e-learning environment. It analyzes discourse markers in a corpus drawn from an interactive website with a bulletin board system, through which college students of English at Japanese and Korean universities discussed local and global issues with each other. It compares the use of discourse markers in the learners' corpus with that in a corpus of native English speakers. The results indicate that discourse markers are useful interactional devices for structuring and organizing discourse. EFL learners are found to use referentially and cognitively functional discourse markers more frequently and other markers relatively rarely. Native speakers are found to use a wider variety of discourse markers for different functions. Suggestions are made for using computer corpora to understand EFL learners' language difficulties and to help them become more interactionally competent speakers.


Citation Practices in Academic Corpora: Implications for EAP Writing

  • Min, Su-Jung
    • 영어어문교육 / Vol. 10, No. 3 / pp. 113-126 / 2004
  • Explicit reference to the work of other authors is an essential feature of most academic research writing. Corpus analysis of academic text can reveal much about what writers actually do and why they do so. The application of corpus tools in language education has been well documented by many scholars (Pedersen, 1995; Swales, 1990; Thompson, 2000). They demonstrate how computer technology can assist in the effective analysis of corpus-based data. For teaching purposes, this recent research provides insights in the area of English for Academic Purposes (EAP). The need for such support is evident when students have to use appropriate citations in their writing. Using Swales' (1990) division of citation forms into integral and non-integral and Thompson and Tribble's (2001) classification scheme, this paper codifies academic texts in a corpus. The texts are academic research articles from different disciplines. The results lead to a comparison of citation practices in different disciplines. Finally, it is argued that the information obtained in this study is useful for EAP writing courses in EFL countries.


한국어 대용량발화말뭉치의 단모음분석 (Monophthong Analysis on a Large-scale Speech Corpus of Read-Style Korean)

  • 윤태진;강윤정
    • 말소리와 음성과학 / Vol. 6, No. 3 / pp. 139-145 / 2014
  • The paper describes methods for conducting vowel analysis on a large-scale corpus with the aid of forced alignment and optimal formant ceiling methods. The 'Read Style Corpus of Standard Korean' is used to build the forced alignment system, and a subset of the corpus is used for processing and extracting features for vowel analysis based on optimal formant ceilings. The results of the vowel analysis are reliable and comparable to results obtained using traditional analytical methods. The findings indicate that the methods adopted for the analysis can be extended to more fine-grained analysis without time-consuming manual labeling and without loss of accuracy or reliability.

Comparison Thai Word Sense Disambiguation Method

  • Modhiran, Teerapong;Kruatrachue, Boontee;Supnithi, Thepchai
    • 제어로봇시스템학회: Conference Proceedings / 제어로봇시스템학회, ICCAS 2004 / pp. 1307-1312 / 2004
  • Word sense disambiguation is one of the most important problems in natural language processing research areas such as information retrieval and machine translation. Many approaches can be employed to resolve word ambiguity with a reasonable degree of accuracy. These strategies are knowledge-based, corpus-based, and hybrid. This paper focuses on the corpus-based strategy. Its purpose is to compare three well-known machine learning techniques, SNoW, SVM, and Naive Bayes, for word sense disambiguation in the Thai language. Ten ambiguous words are selected and tested with word and POS features. The results show that the SVM algorithm gives the best results for Thai WSD, with an accuracy of approximately 83-96%.


English Conditional Inversion: A Construction-Based Approach

  • Kim, Jong-Bok
    • 한국언어정보학회지:언어와정보 / Vol. 15, No. 1 / pp. 13-29 / 2011
  • Conditional sentences can also be formed by inversion of subject and auxiliary, but only in a limited set of environments. Based on corpus investigations, this paper addresses the grammatical constraints on conditional inversion and how inverted conditionals behave differently from regular conditional clauses. Our corpus search reveals many different types of conditional inversion constructions, indicating the difficulty of deriving inverted conditionals from movement operations. In this paper, we provide a construction-based approach to the inverted conditional construction. The paper shows that the optimal way of describing the general as well as the idiosyncratic properties of inverted conditional constructions is an account in the spirit of construction grammar, in which a grammar is a repertory of constructions forming a network connected by links of inheritance.


L2 영어 학습자들의 연어 사용 능숙도와 텍스트 질 사이의 수치화 (Quantifying L2ers' phraseological competence and text quality in L2 English writing)

  • 권준혁;김재준;김유래;박명관;송상헌
    • 한국정보과학회 언어공학연구회: Conference Proceedings (Hangul and Korean Information Processing) / 한국정보과학회 언어공학연구회, 29th Conference on Hangul and Korean Information Processing, 2017 / pp. 281-284 / 2017
  • Building on studies of multi-word combinations, that is, the field of phraseology, this study examines the relationship between text quality and phraseological competence in L2 English writing, following Yves Bestgen et al. (2014). Two association scores, t-score and Mutual Information (MI), measure phraseological competence in opposite ways, favoring frequent and infrequent combinations respectively. Using both, bigrams from L2 writers' texts are scored against a reference corpus, GloWbE (Corpus of Global Web-based English). Using a cross-sectional approach, we propose that the quality of the essays and the mean MI score of the bigrams extracted from YELC (Yonsei English Learner Corpus) are correlated. Negatively scored bigrams are also correlated with essay quality, in that these bigrams are absent from the reference corpus and are thus mostly ungrammatical. This indicates that an increase in the proportion of negatively scored bigrams lowers the quality of essays. The conclusion discusses essay quality as scored by MI and t-score under the cross-sectional approach, along with applications to teaching methods and to the assessment of second language writing proficiency.
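The contrast between the two association scores can be made concrete with a short sketch. The abstract does not give formulas, so this assumes the definitions commonly used in collocation research: with observed bigram frequency O and expected frequency E = f(w1) · f(w2) / N, MI = log2(O / E) and t = (O − E) / √O. The function name `association_scores` and all counts below are hypothetical and are not taken from GloWbE or YELC.

```python
import math
from typing import Tuple


def association_scores(bigram_count: int, w1_count: int, w2_count: int,
                       corpus_size: int) -> Tuple[float, float]:
    """Return (MI, t-score) for a bigram, assuming the common definitions
    MI = log2(O / E) and t = (O - E) / sqrt(O), with E = f(w1) * f(w2) / N."""
    if bigram_count <= 0:
        # Both scores are undefined for bigrams unseen in the reference corpus.
        raise ValueError("bigram_count must be positive")
    expected = w1_count * w2_count / corpus_size
    mi = math.log2(bigram_count / expected)
    t_score = (bigram_count - expected) / math.sqrt(bigram_count)
    return mi, t_score


if __name__ == "__main__":
    # Hypothetical counts in a reference corpus of one million tokens.
    rare_pair = association_scores(10, 20, 20, 1_000_000)
    frequent_pair = association_scores(20_000, 100_000, 100_000, 1_000_000)
    # MI ranks the rare, exclusive pair higher;
    # the t-score ranks the frequent pair higher.
    print(rare_pair, frequent_pair)
```

Under these assumed definitions, MI rewards word pairs that occur almost exclusively together even when they are rare, while the t-score rewards pairs that are frequent in absolute terms, which matches the abstract's description of the two measures as complementary.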


L2 영어 학습자들의 연어 사용 능숙도와 텍스트 질 사이의 수치화 (Quantifying L2ers' phraseological competence and text quality in L2 English writing)

  • 권준혁;김재준;김유래;박명관;송상헌
    • 한국어정보학회: Conference Proceedings / 한국어정보학회, 29th Conference on Hangul and Korean Information Processing, 2017 / pp. 281-284 / 2017
  • Building on studies of multi-word combinations, that is, the field of phraseology, this study examines the relationship between text quality and phraseological competence in L2 English writing, following Yves Bestgen et al. (2014). Two association scores, t-score and Mutual Information (MI), measure phraseological competence in opposite ways, favoring frequent and infrequent combinations respectively. Using both, bigrams from L2 writers' texts are scored against a reference corpus, GloWbE (Corpus of Global Web-based English). Using a cross-sectional approach, we propose that the quality of the essays and the mean MI score of the bigrams extracted from YELC (Yonsei English Learner Corpus) are correlated. Negatively scored bigrams are also correlated with essay quality, in that these bigrams are absent from the reference corpus and are thus mostly ungrammatical. This indicates that an increase in the proportion of negatively scored bigrams lowers the quality of essays. The conclusion discusses essay quality as scored by MI and t-score under the cross-sectional approach, along with applications to teaching methods and to the assessment of second language writing proficiency.
