• Title/Summary/Keyword: Compound Word

Environment for Translation Domain Adaptation and Continuous Improvement of English-Korean Machine Translation System

  • Kim, Sung-Dong;Kim, Namyun
    • International Journal of Internet, Broadcasting and Communication / v.12 no.2 / pp.127-136 / 2020
  • This paper presents an environment for a rule-based English-Korean machine translation system that supports translation domain adaptation and continuous improvement of translation quality. Both purposes require a corpus from which the information needed for translation can be acquired. The environment consists of a corpus construction part and a translation knowledge extraction part. The corpus construction part crawls news articles from several newspaper sites. The extraction part builds translation knowledge such as newly created words, compound words, collocation information, and distributional word representations. For translation domain adaptation, a corpus for the domain must be built and the translation knowledge constructed from that corpus. For continuous improvement, the corpus needs to be continuously expanded and the translation knowledge enhanced from the expanded corpus. The proposed web-based environment is expected to facilitate the tasks of domain adaptation and translation system improvement.

Reinterpretation of Snowpiercer : Posthuman, Cyborg, and the New World

  • Kim, Hye Yoon
    • International journal of advanced smart convergence / v.9 no.1 / pp.29-36 / 2020
  • We aim to reinterpret Bong Joon-ho's Snowpiercer through the theory of posthumanism. "Posthuman" is a compound word of 'post' and 'human', meaning transcendent-man. However, we would like to extend the meaning of posthuman or cyborg to cover not only a "new human figure, or transcendent-man" but also a "human living in a digital age of converged technology". With this extended meaning, we can find the posthuman not only in science fiction movies but also in our real world. Using the extended meaning, we also reinterpret the elements of the film as cyberspace and as posthumans or cyborgs. Moreover, by examining these "cyborg figures" in Bong Joon-ho's Snowpiercer (2013), we argue that the film criticizes posthumanism in a reality where people are losing their humanity through combination with machines. The director appears to assert the collapse of the current system and to suggest a new human generation as the solution.

Conceptual Extraction of Compound Korean Keywords

  • Lee, Samuel Sangkon
    • Journal of Information Processing Systems / v.16 no.2 / pp.447-459 / 2020
  • After reading a document, people form a concept of the information they consumed and merge multiple words into keywords that represent the material. With that in mind, this study suggests a smarter and more efficient keyword extraction method in which scholarly journals serve as the basis for establishing production rules. The rules are based on the concept information of words appearing in a document, so that author-provided keywords remain functional even when they do not appear in the body of the document. This study also presents a new way to determine the importance of each keyword and to exclude irrelevant keywords. To check the validity of the extracted keywords, titles and abstracts of journals about natural language and auditory language were collected for analysis. A comparison of author-provided keywords with the keywords produced by the developed system showed that the system was highly useful, with an accuracy rate of up to 96%.

Extended document format map service for mobile device (모바일 기기를 위한 확장 문서 포맷의 맵 서비스)

  • Kim, Jung Sook
    • Journal of Korea Society of Digital Industry and Information Management / v.6 no.4 / pp.83-94 / 2010
  • Mobile network infrastructure is nearing completion with the development of hardware and software for mobile devices. Networking on mobile devices has evolved toward telematics, which extends well beyond the existing concept. Telematics is a compound word formed from the words "telecommunication" and "informatics". It refers to control and monitoring services that use mobile device resources. These services respond to user requests from mobile devices over wired or wireless networks, and the server that offers content and network services collects management information from the mobile devices. Map service is one of the services preferred by many telematics users. However, mobile map service faces limits in traffic and information sharing. It is therefore very important to supply information to both the service provider and the terminal user. In this paper, we design a new interactive sketch map that makes effective use of routes and spatial information, and we provide an extended document format, defined with an extensible and dynamic clustering scheme, that makes the map service portable across mobile devices.

The Philosophy and Linguistics of Dao : the Ancient Chinese Philosophy and Language (도의 철학과 도의 언어학 -고대 중국의 철학과 언어-)

  • 정재현
    • Lingua Humanitatis / v.5 / pp.109-126 / 2003
  • The aim of this paper is to elucidate ancient Chinese philosophy and linguistics through the concept of the Dao. Ancient Chinese thought developed together with ancient Chinese theories of language and the linguistic features of Classical Chinese, and the concept of the Dao served as an intermediary among them. The Dao that ancient Chinese philosophers sought has several characteristics: ethical normativity, wholeness, dynamicity, and non-reducibility. Linguistic studies of the period reveal the same characteristics. The following linguistic features of Classical Chinese are the cause and/or the effect of such Dao-based philosophy and linguistics: no explicit subject-predicate sentence structure, no parts of speech, heavy reliance on word order and context to determine meaning, no explicit distinction between compound words and sentences, the pictographic or ideographic nature of Chinese graphs, and the absence of a copula.

A study on the development for an air transportation cultural index (항공교통문화지수 개발에 관한 연구)

  • Lee, K.S.
    • Journal of the Korean Society for Aviation and Aeronautics / v.13 no.3 / pp.61-72 / 2005
  • The main purpose of this study is to develop an air transportation cultural index that can estimate the level of air transportation culture. Generally speaking, air transportation culture, a compound word of 'air transportation' and 'culture', is a substantial entity consisting of knowledge, art, morality, legality, cultivation, customs, and so on, arising from the aircraft operation sector, the airport operation/management sector, and the user sector. In its primary scope it is classified into the aircraft operation sector (relating to flight operation), the airport operation/management sector, and the user sector. The research and analysis took approximately four months, from June 2004 to October 2004. To evaluate the index, detailed items for the three categories were chosen and quantified. The grade for each item was derived from the calculation formula for the air transportation cultural index by applying weight values. The final grade of Korea's air transportation cultural index was 63.19 points.

Improvement of retrieval system and generation of compound noun using word weight method (단어 가중치 값을 이용한 복합명사 제한적 확장 및 검색 성능 개선)

  • Kim, Hyun-Jin;Lee, Chung-Hee;Hur, Jeong;Jang, Myeong-Gil
    • Proceedings of the Korea Information Processing Society Conference / 2002.11a / pp.603-606 / 2002
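  • Efficient index term extraction is a major factor in automatic indexing and information retrieval engines. Compound nouns in particular, which account for a large part of the index set, have long been regarded as a major problem in both indexing and retrieval. This paper proposes a method that extracts index terms by expanding compound nouns around those component single words that have high word weights, so that compound nouns are expanded only in a restricted manner. For retrieval, the same weight values are applied to the nouns that appear in the query to improve retrieval effectiveness.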
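
The restricted expansion described in this abstract can be illustrated roughly as follows. This is a minimal sketch under stated assumptions, not the authors' implementation: the function name `restricted_expansions`, the example weights, and the rule "keep only contiguous sub-phrases that contain the highest-weight component" are all hypothetical choices made for illustration.

```python
from typing import Dict, List


def restricted_expansions(components: List[str],
                          weights: Dict[str, float],
                          default_weight: float = 0.0) -> List[str]:
    """Return index-term candidates for one compound noun.

    Assumption (not from the paper): only contiguous sub-phrases that
    contain the highest-weight component are kept, which limits the
    number of generated index terms.
    """
    if not components:
        return []
    # Position of the component with the highest word weight
    # (the first one wins on ties).
    head = max(range(len(components)),
               key=lambda i: weights.get(components[i], default_weight))
    terms = []
    for start in range(head + 1):
        for end in range(head, len(components)):
            terms.append(" ".join(components[start:end + 1]))
    return terms


# Hypothetical weights for the components of one compound noun.
example_weights = {"information": 0.2, "retrieval": 0.9, "system": 0.1}
example_terms = restricted_expansions(
    ["information", "retrieval", "system"], example_weights)
```

With these illustrative weights the sketch produces four candidates, all containing "retrieval", instead of the six contiguous sub-phrases an unrestricted expansion would generate.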

Research Trends Analysis of Machine Learning and Deep Learning: Focused on the Topic Modeling (머신러닝 및 딥러닝 연구동향 분석: 토픽모델링을 중심으로)

  • Kim, Chang-Sik;Kim, Namgyu;Kwahk, Kee-Young
    • Journal of Korea Society of Digital Industry and Information Management / v.15 no.2 / pp.19-28 / 2019
  • The purpose of this study is to examine trends in machine learning and deep learning research in journals indexed in the Web of Science database. To this end, we used the abstracts of 20,664 articles published between 1990 and 2017 whose titles include the terms 'machine learning', 'deep learning', and 'artificial neural network'. Topic modeling analysis identified twenty major research topics: classification accuracy, machine learning, optimization problem, time series model, temperature flow, engine variable, neuron layer, spectrum sample, image feature, strength property, extreme machine learning, control system, energy power, cancer patient, descriptor compound, fault diagnosis, soil map, concentration removal, protein gene, and job problem. A time-series linear regression analysis showed that all identified topics in machine learning research were 'hot' topics.
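
The trend test mentioned at the end of this abstract can be sketched as a least-squares fit of a topic's yearly share against the publication year. This is only an illustration under stated assumptions: the names `ols_slope` and `classify_trend`, the tolerance `tol`, and the sample shares are hypothetical, and the sketch labels a topic by the sign of the slope alone, whereas the study may also have applied a significance test.

```python
from typing import List


def ols_slope(xs: List[float], ys: List[float]) -> float:
    """Slope of the ordinary least-squares line fitted to (xs, ys)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    if sxx == 0:
        raise ValueError("xs must contain at least two distinct values")
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    return sxy / sxx


def classify_trend(years: List[float], shares: List[float],
                   tol: float = 1e-9) -> str:
    """Label a topic 'hot', 'cold', or 'flat' from the sign of the slope.

    Assumption: only the slope sign is used; no p-value is computed.
    """
    slope = ols_slope(years, shares)
    if slope > tol:
        return "hot"
    if slope < -tol:
        return "cold"
    return "flat"


# Hypothetical yearly topic shares, for illustration only.
sample_years = [2015, 2016, 2017]
sample_shares = [0.01, 0.02, 0.03]
sample_label = classify_trend(sample_years, sample_shares)
```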

A Study on Fun Elements of Web 2.0 Blog Widget (Web 2.0 블로그 위젯의 재미 요소에 대한 연구)

  • Choi, Sung-Kyu;Kim, Kee-Sung;Jang, Seok-Hyun;Whang, Min-Cheol
    • 한국HCI학회:학술대회논문집 / 2009.02a / pp.785-790 / 2009
  • Widgets are instruments for expressing a user's character and highlighting the value of blogs. The word is a compound of 'window' and 'gadget'; widgets are small functional programs displayed on the screen as graphical user interface (GUI) tools, a kind of service that shows what the user wants to see. Across operating systems, the Web, and mobile platforms, widgets offer information delivery, convenience, and efficiency. However, widgets have never satisfied users, because they have focused on transmitting information and representing status rather than on fun. This study identifies the fun elements that interest users and categorizes them by widget type. Because the fun elements of widgets have never been defined, we use fun elements from the design and product fields together with emotional words representative of affectivity. We then administered an online questionnaire to blog users. The widgets were selected by popularity from among domestic and Japanese widgets. The questionnaire used a 5-point scale based on user preferences to identify the elements that are fun.

Collection and Extraction Algorithm of Field-Associated Terms (분야연상어의 수집과 추출 알고리즘)

  • Lee, Sang-Kon;Lee, Wan-Kwon
    • The KIPS Transactions:PartB / v.10B no.3 / pp.347-358 / 2003
  • A field-associated term is a single or compound word that may occur in any document and that makes it possible to recognize the field of a text using common human knowledge. For example, a human recognizes the corresponding field of a text on encountering a word such as 'pitcher' or 'election'. We propose an efficient method for constructing field-associated terms (FTs) in specialized fields in order to determine the field of a text. The document classification scheme can be fixed from a well-classified document database or corpus. Focusing on the field in question, we discuss the levels and stability ranks of field-associated terms. To construct a balanced FT collection, we first construct single-word FTs. From the collections, the FTs' levels and stability ranks can be constructed automatically. We propose a new FT extraction algorithm for document classification that uses each FT's concentration rate and occurrence frequency.
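
One way to picture the concentration-rate criterion is the sketch below. It rests on assumptions that are not in the abstract: the concentration rate is taken to be the share of a term's occurrences that fall in one field, and the thresholds `min_rate` and `min_freq`, the function names, and the toy corpus are hypothetical.

```python
from typing import Dict, List, Optional

# Field name -> list of tokenized documents.
Corpus = Dict[str, List[List[str]]]


def field_counts(term: str, corpus: Corpus) -> Dict[str, int]:
    """Count how often `term` occurs in the documents of each field."""
    return {field: sum(doc.count(term) for doc in docs)
            for field, docs in corpus.items()}


def concentration_rates(term: str, corpus: Corpus) -> Dict[str, float]:
    """Share of the term's occurrences per field (assumed definition)."""
    counts = field_counts(term, corpus)
    total = sum(counts.values())
    if total == 0:
        return {field: 0.0 for field in counts}
    return {field: n / total for field, n in counts.items()}


def associated_field(term: str, corpus: Corpus,
                     min_rate: float = 0.8,
                     min_freq: int = 2) -> Optional[str]:
    """Return the field a term is associated with, or None.

    A term qualifies only if it is both concentrated in one field
    (rate >= min_rate) and frequent enough there (count >= min_freq).
    """
    if not corpus:
        return None
    counts = field_counts(term, corpus)
    rates = concentration_rates(term, corpus)
    best = max(rates, key=rates.get)
    if rates[best] >= min_rate and counts[best] >= min_freq:
        return best
    return None


# Hypothetical toy corpus, for illustration only.
toy_corpus: Corpus = {
    "sports": [["pitcher", "threw", "ball"], ["pitcher", "won"]],
    "politics": [["election", "vote"], ["election", "pitcher"]],
}
```

In this toy corpus 'election' occurs only in the politics documents and so qualifies as a field-associated term, while 'pitcher' is spread over both fields and falls below the assumed 0.8 threshold.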