• Title/Summary/Keyword: Corpus-based Study

Search Result 204, Processing Time 0.022 seconds

Color Recommendation for Text Based on Colors Associated with Words

  • Liba, Saki;Nakamura, Tetsuaki;Sakamoto, Maki
    • Journal of Korea Society of Industrial Information Systems
    • /
    • v.17 no.1
    • /
    • pp.21-29
    • /
    • 2012
  • In this paper, we propose a new method to select colors representing the meaning of text contents based on the cognitive relation between words and colors, Our method is designed on the previous study revealing the existence of crucial words to estimate the colors associated with the meaning of text contents, Using the associative probability of each color with a given word and the strength of color association of the word, we estimate the probability of colors associated with a given text. The goal of this study is to propose a system to recommend the cognitively plausible colors for the meaning of the input text. To build a versatile and efficient database used by our system, two psychological experiments were conducted by using news site articles. In experiment 1, we collected 498 words which were chosen by the participants as having the strong association with color. Subsequently, we investigated which color was associated with each word in experiment 2. In addition to those data, we employed the estimated values of the strength of color association and the colors associated with the words included in a very large corpus of newspapers (approximately 130,000 words) based on the similarity between the words obtained by Latent Semantic Analysis (LSA). Therefore our method allows us to select colors for a large variety of words or sentences. Finally, we verified that our system cognitively succeeded in proposing the colors associated with the meaning of the input text, comparing the correct colors answered by participants with the estimated colors by our method. Our system is expected to be of use in various types of situations such as the data visualization, the information retrieval, the art or web pages design, and so on.

A Study on the Development of English Inflectional Morphemes Based on the CHILDES Corpus (CHILDES 코퍼스를 기반으로 한 아동의 영어 굴절형태소 발달 연구)

  • Min, Myung Sook;Jun, Jongsup;Lee, Sun-Young
    • Korean Journal of Cognitive Science
    • /
    • v.24 no.3
    • /
    • pp.203-235
    • /
    • 2013
  • The goal of this paper is to test the findings about English-speaking children's acquisition of inflectional morphemes in the literature using a large-scale database. For this, we obtained a 4.7-million-word corpus from the CHILDES (Child Language Data Exchange System) database, and analyzed 1,630 British and American children's uses of English derivational morphemes up to age 7. We analyzed the type and token frequencies, type per token ratio (TTR), and the lexical diversity (D) for such inflectional morphemes as the present progressive -ing, the past tense -(e)d, the comparative and superlative -er/est with reference to children's nationality and age groups. To sum up our findings, the correlations between the D value and children's age varied from morpheme to morpheme; e.g. we found no correlation for -ing, a marginal correlation for -ed, and a strong correlation for -er/-est. Our findings are consistent with Brown's (1973) classical observation that children learn progressive forms earlier than the past tense marker. In addition, overgeneralization errors were frequently found for -ed, but rarely for -ing, showing a U-shaped developmental pattern at ages 2-3. Finally, American children showed higher D scores than British children, which showed that American children used inflectional morphemes for more word types compared with British children. The present study has its significance in testing the earlier findings in the literature by setting up well-defined methodology for analyzing the entire CHILDES database.

  • PDF

A Corpus-based Study of Translation Universals in English Translations of Korean Newspaper Texts (한국 신문의 영어 번역에 나타난 번역 보편소의 코퍼스 기반 분석)

  • Goh, Gwang-Yoon;Lee, Younghee (Cheri)
    • Cross-Cultural Studies
    • /
    • v.45
    • /
    • pp.109-143
    • /
    • 2016
  • This article examines distinctive linguistic shifts of translational English in an effort to verify the validity of the translation universals hypotheses, including simplification, explicitation, normalization and leveling-out, which have been most heavily explored to date. A large-scale study involving comparable corpora of translated and non-translated English newspaper texts has been carried out to typify particular linguistic attributes inherent in translated texts. The main findings are as follows. First, by employing the parameters of STTR, top-to-bottom frequency words, and mean values of sentence lengths, the translational instances of simplification have been detected across the translated English newspaper corpora. In contrast, the portion of function words produced contrary results, which in turn suggests that this feature might not constitute an effective test of the hypothesis. Second, it was found that the use of connectives was more salient in original English newspaper texts than translated English texts, being incompatible with the explicitation hypothesis. Third, as an indicator of translational normalization, lexical bundles were found to be more pervasive in translated texts than in non-translated texts, which is expected from and therefore support the normalization hypothesis. Finally, the standard deviations of both STTR and mean sentence lengths turned out to be higher in translated texts, indicating that the translated English newspaper texts were less leveled out within the same corpus group, which is opposed to what the leveling-out hypothesis postulates. Overall, the results suggest that not all four hypotheses may qualify for the label translation universals, or at least that some translational predictors are not feasible enough to evaluate the effectiveness of the translation universals hypotheses.

Development of AI-based Real Time Agent Advisor System on Call Center - Focused on N Bank Call Center (AI기반 콜센터 실시간 상담 도우미 시스템 개발 - N은행 콜센터 사례를 중심으로)

  • Ryu, Ki-Dong;Park, Jong-Pil;Kim, Young-min;Lee, Dong-Hoon;Kim, Woo-Je
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.20 no.2
    • /
    • pp.750-762
    • /
    • 2019
  • The importance of the call center as a contact point for the enterprise is growing. However, call centers have difficulty with their operating agents due to the agents' lack of knowledge and owing to frequent agent turnover due to downturns in the business, which causes deterioration in the quality of customer service. Therefore, through an N-bank call center case study, we developed a system to reduce the burden of keeping up business knowledge and to improve customer service quality. It is a "real-time agent advisor" system that provides agents with answers to customer questions in real time by combining AI technology for speech recognition, natural language processing, and questions & answers for existing call center information systems, such as a private branch exchange (PBX) and computer telephony integration (CTI). As a result of the case study, we confirmed that the speech recognition system for real-time call analysis and the corpus construction method improves the natural speech processing performance of the query response system. Especially with name entity recognition (NER), the accuracy of the corpus learning improved by 31%. Also, after applying the agent advisor system, the positive feedback rate of agents about the answers from the agent advisor was 93.1%, which proved the system is helpful to the agents.

Syllable-based Korean POS Tagging Based on Combining a Pre-analyzed Dictionary with Machine Learning (기분석사전과 기계학습 방법을 결합한 음절 단위 한국어 품사 태깅)

  • Lee, Chung-Hee;Lim, Joon-Ho;Lim, Soojong;Kim, Hyun-Ki
    • Journal of KIISE
    • /
    • v.43 no.3
    • /
    • pp.362-369
    • /
    • 2016
  • This study is directed toward the design of a hybrid algorithm for syllable-based Korean POS tagging. Previous syllable-based works on Korean POS tagging have relied on a sequence labeling method and mostly used only a machine learning method. We present a new algorithm integrating a machine learning method and a pre-analyzed dictionary. We used a Sejong tagged corpus for training and evaluation. While the machine learning engine achieved eojeol precision of 0.964, the proposed hybrid engine achieved eojeol precision of 0.990. In a Quiz domain test, the machine learning engine and the proposed hybrid engine obtained 0.961 and 0.972, respectively. This result indicates our method to be effective for Korean POS tagging.

Developing and Pre-Processing a Dataset using a Rhetorical Relation to Build a Question-Answering System based on an Unsupervised Learning Approach

  • Dutta, Ashit Kumar;Wahab sait, Abdul Rahaman;Keshta, Ismail Mohamed;Elhalles, Abheer
    • International Journal of Computer Science & Network Security
    • /
    • v.21 no.11
    • /
    • pp.199-206
    • /
    • 2021
  • Rhetorical relations between two text fragments are essential information and support natural language processing applications such as Question - Answering (QA) system and automatic text summarization to produce an effective outcome. Question - Answering (QA) system facilitates users to retrieve a meaningful response. There is a demand for rhetorical relation based datasets to develop such a system to interpret and respond to user requests. There are a limited number of datasets for developing an Arabic QA system. Thus, there is a lack of an effective QA system in the Arabic language. Recent research works reveal that unsupervised learning can support the QA system to reply to users queries. In this study, researchers intend to develop a rhetorical relation based dataset for implementing unsupervised learning applications. A web crawler is developed to crawl Arabic content from the web. A discourse-annotated corpus is generated using the rhetorical structural theory. A Naïve Bayes based QA system is developed to evaluate the performance of datasets. The outcome shows that the performance of the QA system is improved with proposed dataset and able to answer user queries with an appropriate response. In addition, the results on fine-grained and coarse-grained relations reveal that the dataset is highly reliable.

The Effect that the Application of Time-Based Electrolysis Has on Acute Ischemia

  • Lee, Jung Sook;Song, Young Wha;Kim, Sung Won
    • Journal of International Academy of Physical Therapy Research
    • /
    • v.6 no.2
    • /
    • pp.846-851
    • /
    • 2015
  • This neurological damage accelerates the infection reaction of cells and apoptosis at the time of reperfusion after ischemia occurs. BCL-2/BCL-2 allogeneic begeminum has a function of suppressing the apoptosis of cells, and thus it is inferred that the susceptibility of cells to apoptosis is determined by the amount of allogeneic begeminum present which is determined based on the amount of BAX. Ischemia was induced in SD mice by occluding the common carotid artery for 5 minutes, after which blood was re-perfused. NEES was applied to acupuncture points, at 12, 24, and 48 hours post-ischemia on the joksamri, Hapgok. Protein expression was investigated through BAX antibody immuno-reactive cells in the cerebral nerve cells and Western blotting. The results were as follows: In the present study as well, as a result of observation of the change in the number of the BAX reaction cells after the inducement of GI, there was the aspect of most of the BAX reaction cells being observed in the corpus striatum area of the GI group 24 hours after the inducement of ischemia. This revealed the same results as those of previous studies in which the change in the number of BAX reaction cells occurred in all areas while ischemia was in progress. The change in the expression of BAX protein after 24 hours showed that there was a very significant reduction in the NEES group compared to the GI group (p<.01). As a result, a greatest amount of change in the number of BAX immunoreactive cells related to apoptosis 24 hours after ischemia appeared in the NEES group. This study that ischemia increases the expression of BAX that induces apoptosis. Thus, it is determined that ischemia is the main cause of the apoptosis of neurons, and this study reveals that low frequency needle electrode electrical stimulation has the effect of blocking the apoptosis of neurons by reducing protein related to the apoptosis of cells that has increased after ischemia has occurred.

Authorship Attribution in Korean Using Frequency Profiles (빈도 정보를 이용한 한국어 저자 판별)

  • Han, Na-Rae
    • Korean Journal of Cognitive Science
    • /
    • v.20 no.2
    • /
    • pp.225-241
    • /
    • 2009
  • This paper presents an authorship attribution study in Korean conducted on a corpus of newspaper column texts. Based on the data set consisting of a total of 160 columns written by four columnists of Chosun Daily, the approach utilizes relative frequencies of various lexical units in Korean such as fully inflected words, morphemes, syllables and their bigrams in an attempt to establish authorship of a blind text selected from the set. Among these various lexical units, "the morpheme" is found to be most effective in predicting who among the four potential candidates authored a text, reporting accuracies of over 93%. The results indicate that quantitative and statistical techniques in authorship attribution and computational stylistics can be successfully applied to Korean texts.

  • PDF

The Effect of NEES on the Occurrence of Caspase-3 in the Cerebellum of Rats with Transient Global Ischemia

  • Lee, Jung Sook;Song, Young Wha;Kim, Sung Won
    • Journal of International Academy of Physical Therapy Research
    • /
    • v.5 no.2
    • /
    • pp.718-722
    • /
    • 2014
  • The cerebellum is known to control balance, equilibrium, and muscle tone. If the cerebellum becomes damaged, the body is unable to retain its balancing functions or involuntary muscle movement. This is why, in stroke patients, there is a high risk of functional disability, as well as a myriad of other disabilities secondary to stroke. Ischemia was induced in SD mice by occluding the common carotid artery for 5 minutes, after which blood was reperfused. Needle electrode electrical stimulation(NEES) was applied to acupuncture points, at 12, 24, and 48 hours post-ischemia on the joksamri. Protein expression was investigated through caspase-3 antibody immuno-reactive cells in the cerebral nerve cells and Western blotting. The results were as follows: The number of caspase-3 reactive cells in the corpus cerebellum 12 and 24 hours post-ischemia was significantly (p<.05) smaller in the NEES group compared to the GI group. caspase-3 expression 12 and 24 hours post-ischemia was significantly(p<.05) smaller in the NEES group compared to the GI group. Based on these results, NEES seems to have a significant effect on Caspase-3 in the cerebellum in an ischemic state at 12 and 24 hours post ischemia, NEES delays the occurrence of early stage apoptosis-inducing Caspase-3, delaying and inhibiting apoptosis. Further systematic studies will have to be conducted in relation to the application of this study's results on stroke patients.

A study of Meridian Scraping Therapy (괄사요법(刮痧療法)에 대한 고찰(考察))

  • Yu, Jin-Ho;Koh, Hyung-Kyun;Kang, Sung-Keel
    • Journal of Acupuncture Research
    • /
    • v.17 no.3
    • /
    • pp.45-56
    • /
    • 2000
  • Objectives : Meridian Scraping Therapy(MST: Kwal-Sa: 刮痧) has not well known in Korea. But it's a kind of Traditional Physical Therapy wide spreaded in China and Taiwan. So I want to introduce this new Therapy by summarizing the basic theories and practical manipulations. Conclusions : To analyse the meaning of the character Kwal-Sa, Kwal(刮) means scraping the skin by some tools and Sa(痧) means originally the red spots on skin of measles but in this therapy it's the red spot on skin after the treatment. Meridian Scraping Therapy is based on the meridiology and holism which are the basic theory of Oriental Medicine. The main effects of Meridian Scraping Therapy are draining of pathogenic factors, promoring the flow of ki and blood, tonify yang, balancing the harmony of intemal organs and enhancing the immuno-ability in body. Rhinoceros corpus is common matrial of MST-plate which is the tool of manipulation. And by this plate several manipulation is performed according to the symptom and the locations of pain. The merits of MST are that easily learned by practician, the effects of the treatment are very quickly manifest and no significant side-effects. And additionally this therapy performed not only for treat diseases but prevent it and preserve health.

  • PDF