• Title/Summary/Keyword: Linguistic

Feasibility Test and Design of Korean Translation Memory System (한국어 번역 메모리 시스템의 실현성 분석 및 설계)

  • Ryu, Cheol;Roh, Yoon-Hyung;Lee, Ki-Young;Choi, Sung-Kwon;Park, Sang-Kyu
    • Annual Conference on Human and Language Technology
    • /
    • 2001.10d
    • /
    • pp.281-287
    • /
    • 2001
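  • A translation memory system searches a large translation memory of previously translated results for the sentences most similar to the user's input sentence and presents them in order of similarity, helping subsequent translation work proceed more efficiently. Compared with machine translation systems, it is a more feasible application of natural language processing. The key elements of a translation memory system are generally the organization of the translation memory and the definition of the similarity measure. Abroad, many commercial systems have already been developed and greatly help reduce the time and cost of translation work, but in Korea, research on organizing a Korean translation memory and on similarity measures between Korean sentences remains insufficient. This paper therefore defines an efficient method for organizing a translation memory and a similarity measure between sentences for Korean, and discusses the feasibility of a translation memory system for Korean.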
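
The retrieval step described in this abstract can be sketched as follows. This is a minimal illustration only: the abstract does not state the paper's similarity measure, so a character-level ratio from Python's `difflib.SequenceMatcher` is used as a stand-in, and the memory entries, `similarity`, and `search_memory` are hypothetical names chosen for the example.

```python
from difflib import SequenceMatcher

# Hypothetical translation memory: (source sentence, stored translation) pairs.
TRANSLATION_MEMORY = [
    ("I like apples.", "나는 사과를 좋아한다."),
    ("I like green apples.", "나는 초록 사과를 좋아한다."),
    ("The weather is nice today.", "오늘은 날씨가 좋다."),
]


def similarity(a: str, b: str) -> float:
    """Character-level similarity in [0, 1]; a stand-in, not the paper's measure."""
    return SequenceMatcher(None, a, b).ratio()


def search_memory(query, memory, top_k=2):
    """Return the top_k (score, source, translation) entries, most similar first."""
    scored = [(similarity(query, src), src, tgt) for src, tgt in memory]
    scored.sort(key=lambda item: item[0], reverse=True)
    return scored[:top_k]


if __name__ == "__main__":
    for score, src, tgt in search_memory("I like red apples.", TRANSLATION_MEMORY):
        print(f"{score:.2f}  {src}  ->  {tgt}")
```

A real system for Korean would likely compare morphemes or word units rather than raw characters; that choice is exactly the "similarity measure" question the abstract raises.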

Classification of Characters in Movie by Correlation Analysis of Genre and Linguistic Style

  • You, Eun-Soon;Song, Jae-Won;Park, Seung-Bo
    • Journal of the Korea Society of Computer and Information
    • /
    • v.24 no.1
    • /
    • pp.49-55
    • /
    • 2019
  • Despite the remarkable development of AI, character dialogue created by AI is unnatural compared with human-written dialogue and cannot properly reveal a character's personality. The purpose of this paper is to classify characters by linguistic style and to investigate the relation between specific linguistic styles and personality. We analyzed the dialogues of 92 characters selected from a total of 60 movies in four genres (romantic comedy, action, comedy, and horror/thriller) using Linguistic Inquiry and Word Count (LIWC), a text analysis software. As a result, we confirmed that each genre has a distinctive language style. In particular, we found that emotional tone and analytical thinking are the two important features for classification. They proved to be very important features, as precision and recall are over 78% for romantic comedy and action. However, the precision and recall were 66% and 50% for comedy and horror/thriller, where the impact of these features on classification was smaller than for romantic comedy and action. Characters in romantic comedies deal with affection between men and women and show much higher emotional tone than analytical thinking. Characters in the action genre, who need rational judgment to carry out missions, show much greater analytical thinking than emotional tone. Additionally, our analysis suggests that comedy and horror/thriller contain many kinds of characters and that characters often change their personalities during the story.
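
For readers unfamiliar with the per-genre precision and recall figures quoted in this abstract, the sketch below shows how such figures are computed from true and predicted labels. The label lists are made up for illustration and are not the paper's data; `precision_recall` is a hypothetical helper.

```python
def precision_recall(true_labels, predicted_labels, target):
    """Precision and recall for one class, given parallel label lists."""
    pairs = list(zip(true_labels, predicted_labels))
    tp = sum(1 for t, p in pairs if t == target and p == target)  # correctly predicted
    fp = sum(1 for t, p in pairs if t != target and p == target)  # wrongly predicted as target
    fn = sum(1 for t, p in pairs if t == target and p != target)  # missed target items
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    return precision, recall


# Made-up genre labels for six characters (illustrative only).
true_genres = ["romcom", "romcom", "action", "action", "comedy", "horror"]
pred_genres = ["romcom", "action", "action", "action", "romcom", "horror"]

if __name__ == "__main__":
    for genre in ("romcom", "action", "comedy", "horror"):
        p, r = precision_recall(true_genres, pred_genres, genre)
        print(f"{genre}: precision={p:.2f} recall={r:.2f}")
```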

Fillers in the Hong Kong Corpus of Spoken English (HKCSE)

  • Seto, Andy
    • Asia Pacific Journal of Corpus Research
    • /
    • v.2 no.1
    • /
    • pp.13-22
    • /
    • 2021
  • The present study employed an analytical framework that synthesises quantitative and qualitative analyses, using the specially designed software SpeechActConc, to examine speech acts in business communication. The naturally occurring data from the audio recordings and the prosodic transcriptions of the business sub-corpora of the HKCSE (prosodic) were manually annotated with a speech act taxonomy to determine the frequency of fillers, the co-occurring patterns of fillers with other speech acts, and the linguistic realisations of fillers. The discoursal function of fillers, to sustain the discourse or to hold the floor, has diverse linguistic realisations, ranging from a sound (e.g. 'uhuh') and a word (e.g. 'well') to sounds (e.g. 'um er') and words, namely a phrase (e.g. 'sort of') and a clause (e.g. 'you know'). Some are even combinations of sound(s) and word(s) (e.g. 'and um', 'yes er um', 'sort of erm'). Among the top five most frequent linguistic realisations of fillers, 'er' and 'um' are the most common ones, found in all six genres with relatively higher percentages of occurrence. The remaining more frequent realisations consist of a clause ('you know'), a word ('yeah') and a sound ('erm'). These common forms are syntactically simpler than the less frequent realisations found in the genres. The co-occurring patterns of fillers and other speech acts are diverse. The more common co-occurring speech acts with fillers include informing and answering. The findings show that fillers are not only frequently used by speakers in spontaneous conversation but also mostly represented in sounds or non-linguistic realisations.

A new computational approach to stability analysis of linguistic fuzzy control systems - Part 1: Affine modeling of fuzzy system (컴퓨터 연산을 통한 언어형 퍼지 제어 시스템의 새로운 안정도 해석: 1부 - 퍼지 시스템의 어핀 모델링)

  • 김은태;박순형;박민용
    • Proceedings of the IEEK Conference
    • /
    • 2001.06c
    • /
    • pp.169-172
    • /
    • 2001
  • In recent years, many studies on the modeling of fuzzy systems have been conducted. In this paper, a new computational approach to modeling linguistic fuzzy systems is proposed. The fuzzy system is modeled as a combination of affine systems. The proposed method can be used in a rigorous stability analysis of fuzzy systems, including the linguistic fuzzy controller.

An Interval Approach to Fuzzy Pattern Recognition

  • Karbou, Fatiha;Karbou, Fatima;Karbou, M.
    • Proceedings of the Korean Institute of Intelligent Systems Conference
    • /
    • 1998.06a
    • /
    • pp.278-283
    • /
    • 1998
  • The interval approach to coding linguistic expressions brings us closer to human thinking. What seems "weak" to one person can appear very weak to another person, or to the same person in other circumstances. However, the use of intervals is not restricted to coding linguistic expressions. Indeed, intervals can facilitate the solution of several problems.
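
One way to picture the interval coding mentioned in this abstract is to let each person map a linguistic term to a numeric interval. The sketch below is an assumption-laden illustration, not the authors' method: the 0 to 10 scale, the interval bounds, and the names `CODINGS` and `label_for` are all hypothetical.

```python
# Hypothetical per-person codings of linguistic terms as half-open
# intervals [low, high) on an assumed 0-10 intensity scale.
CODINGS = {
    "person_a": {"very weak": (0.0, 2.0), "weak": (2.0, 4.0)},
    "person_b": {"very weak": (0.0, 3.5), "weak": (3.5, 5.0)},
}


def label_for(value, coding):
    """Return the term whose interval contains value, or None if no term applies."""
    for term, (low, high) in coding.items():
        if low <= value < high:
            return term
    return None


if __name__ == "__main__":
    # The same measured value is described differently by the two people.
    for person, coding in CODINGS.items():
        print(person, "calls 3.0:", label_for(3.0, coding))
```

With these made-up intervals, a value of 3.0 is "weak" for one person and "very weak" for the other, which is the situation the abstract describes.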

Fuzzy Fault Tree Analysis with Natural Language

  • Onisawa, Takehisa
    • Journal of the Korean Institute of Intelligent Systems
    • /
    • v.7 no.1
    • /
    • pp.5-15
    • /
    • 1997
  • This paper presents a fault tree analysis that uses natural language and fuzzy theory rather than probability. The reliability estimate of each basic event and the dependence level estimate among subsystems are expressed by linguistic terms. Analysis results are also expressed in natural language. The meaning of linguistic terms is expressed by a fuzzy set. In the presented approach, parametrized operations on fuzzy sets are considered so that the analyst's subjectivity can be introduced into the analysis. The paper gives the Chernobyl accident as an example of fuzzy fault tree analysis using linguistic terms.
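
The two ideas in this abstract, linguistic terms as fuzzy sets and parametrized fuzzy operations, can be sketched as below. The abstract does not say which membership shapes or which parametrized operation the paper uses, so this example assumes triangular membership functions and the Dubois-Prade parametrized conjunction as stand-ins; the term names and numbers are hypothetical.

```python
def triangular(x, left, peak, right):
    """Membership degree of x in a triangular fuzzy set (requires left < peak < right)."""
    if x <= left or x >= right:
        return 0.0
    if x <= peak:
        return (x - left) / (peak - left)
    return (right - x) / (right - peak)


# Hypothetical linguistic terms for a failure-possibility scale in [0, 1].
TERMS = {
    "low": (0.0, 0.2, 0.5),
    "medium": (0.2, 0.5, 0.8),
    "high": (0.5, 0.8, 1.0),
}


def dubois_prade_and(a, b, alpha):
    """Parametrized conjunction a*b / max(a, b, alpha), with alpha in [0, 1].

    alpha = 0 gives min(a, b); alpha = 1 gives the product a*b, so alpha
    lets an analyst tune how strict the combination is.
    """
    denom = max(a, b, alpha)
    return 0.0 if denom == 0 else a * b / denom


if __name__ == "__main__":
    x = 0.35
    for term, params in TERMS.items():
        print(term, round(triangular(x, *params), 3))
    for alpha in (0.0, 0.5, 1.0):
        print("alpha", alpha, "->", dubois_prade_and(0.5, 0.8, alpha))
```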

Development of Linguistic Contents for Contextual Dialogue

  • Moon, Sang-Ho
    • Journal of information and communication convergence engineering
    • /
    • v.8 no.1
    • /
    • pp.116-121
    • /
    • 2010
  • New teaching and learning methods using educational content are gradually becoming widespread with the advancement of information and communication technology. In this paper, we design and implement linguistic content, as a form of educational content, for studying essential expressions used in various real-life situations. Specifically, the linguistic content runs in web environments and has suitable animations for learning essential expressions in several foreign languages through contextual dialogues. Useful functions are also included to reinforce what users have learned.

Crowdfunding Scams: The Profiles and Language of Deceivers

  • Lee, Seung-hun;Kim, Hyun-chul
    • Journal of the Korea Society of Computer and Information
    • /
    • v.23 no.3
    • /
    • pp.55-62
    • /
    • 2018
  • In this paper, we propose a model to detect crowdfunding scams, which have been reportedly occurring over the last several years, based on their project information and linguistic features. To this end, we first collect and analyze crowdfunding scam projects, and then reveal which specific project-related information and linguistic features are particularly useful in distinguishing scam projects from non-scams. Our proposed model built with the selected features and Random Forest machine learning algorithm can successfully detect scam campaigns with 84.46% accuracy.

The Theory of Linguistic Semantic Interpretation Rule using Fuzzy Definition (퍼지 논리를 이용한 컴퓨터 언어해석 구현 규칙의 이용법)

  • 진현수
    • Proceedings of the IEEK Conference
    • /
    • 2003.11b
    • /
    • pp.227-230
    • /
    • 2003
  • We cannot precisely distinguish the meanings of everyday language features such as “big”, “small”, and “beautiful”. However, to study artificial linguistic interfaces and convert natural language into a digital binary representation, we must define the basic conversion process. Using sum-of-products fuzzy theory together with observable numerical values, we can establish reasoning rules for the input language. The fuzzy theory should then be converted into general result rules.

An Image Retrieval System with Adjustment for Human Subjectivity

  • Fukushima, Shigenobu;Ralescu, Anca
    • Proceedings of the Korean Institute of Intelligent Systems Conference
    • /
    • 1993.06a
    • /
    • pp.1309-1312
    • /
    • 1993
  • We present a flexible retrieval system for face photographs based on their linguistic descriptions in terms of fuzzy predicates. While natural for describing a face, linguistic expressions are also subjective, which affects the retrieval result. Thus, the capability of a retrieval system to adjust to different users becomes very important. In this research we use fuzzy logic techniques for describing image data, for retrieval inference, and for adjustment to a new user. Experimental results of the adjustment are also included.
