• Title/Summary/Keyword: Natural languages

Search Result 130, Processing Time 0.036 seconds

An Advanced Search that Converts Natural Language into the Logic Advanced Search and with Developed History Search Method (자연어의 논리식으로의 변환을 이용한 고급검색 및 이를 활용한 히스토리 검색)

  • Lee, Daehong;Yu, Hansuk;Park, Sangwon
    • KIPS Transactions on Software and Data Engineering
    • /
    • v.9 no.6
    • /
    • pp.195-204
    • /
    • 2020
  • Nowadays there are over 1.6 billion web pages and it is hard to get necessary results that user wants. Most search engines allow you to search with logical form to get accurate results. However, normal users are not familiar to search information as logical form. Therefore, they search in natural language rather than in complicated logical form. In this paper there are some suggestions to improve quality of searching results, converting natural language input by the user into logical form which can able to use advanced search engine. Users tend to make short searches due to the 'Simplicity' which is one of the features of the search form. Therefore we suggest history retrieval method; advanced version of previous suggestion to provide convenience to the normal users. We had improvement on accuracy of the search results converting natural languages to logical form and also can contain every keyword without missing any keywords using searching methods on this paper. It is expected that these search methods will contribute to the development of search engines.

A Model for Post-processing of Speech Recognition Using Syntactic Unit of Morphemes (구문형태소 단위를 이용한 음성 인식의 후처리 모델)

  • 양승원;황이규
    • Journal of Korea Society of Industrial Information Systems
    • /
    • v.7 no.3
    • /
    • pp.74-80
    • /
    • 2002
  • There are many researches on post-processing methods for the Korean continuous speech recognition enhancement using natural language processing techniques. It is very difficult to use a formal morphological analyzer for improving the speech recognition because the analysis technique of natural language processing is mainly for formal written languages. In this paper, we propose a speech recognition enhancement model using syntactic unit of morphemes. This approach uses the functional word level longest match which dose not consider spacing words. We describe the post-processing mechanism for the improving speech recognition by using proposed model which uses the relationship of phonological structure information between predicates md auxiliary predicates or bound nouns that are frequently occurred in Korean sentences.

  • PDF

The Loom-LAG for syntax analysis Adding a language-independent level to LAG

  • Schulze, Markus
    • Proceedings of the Korean Society for Language and Information Conference
    • /
    • 2002.02a
    • /
    • pp.411-420
    • /
    • 2002
  • The left-associative grammar model (LAG) has been applied successfully to the morphologic and syntactic analysis of various european and asian languages. The algebraic definition of the LAG is very well suited for the application to natural language processing as it inherently obeys de Saussure's second law (de Saussure, 1913, p. 103) on the linear nature of language, which phrase-structure grammar (PSG) and categorial grammar (CG) do not. This paper describes the so-called Loom-LAGs (LLAG) -a specialization of LAGs for the analysis of natural language. Whereas the only means of language-independent abstraction in ordinary LAG is the principle of possible continuations, LLAGs introduce a set of more detailed language-independent generalizations that form the so-called loom of a Loom-LAG. Every LLAG uses the very smut loom and adds the language-specific information in the form of a declarative description of the language -much like an ancient mechanised Jacquard-loom would take a program-card providing the specific pattern for the cloth to be woven. The linguistic information is formulated declaratively in so-called syntax plans that describe the sequential structure of clauses and phrases. This approach introduces the explicit notion of phrases and sentence structure to LAG without violating de Saussure's second law iud without leaving the ground of the original algebraic definition of LAG, LLAGS can in fact be shown to be just a notational variant of LAG -but one that is much better suited for the manual development of syntax grammars for the robust analysis of free texts.

  • PDF

The Analysis of the Matrix of Greg Lynn's Digital Space Design based on the Natural Elements (그레그 린의 자연기반 디지털 공간디자인 매트릭스 분석)

  • Lee Hanna;Park Hyun-Ok;Lee Jongsook
    • Korean Institute of Interior Design Journal
    • /
    • v.14 no.1
    • /
    • pp.37-44
    • /
    • 2005
  • Currently, the space design has been expressed the space in kinetic design by digital technology. To look into the concept of digital design, there is the tendency to pursue the harmony of the nature. The digital space designer, Greg Lynn who has been paid attention by international researchers. To compared with the reputation of his works, the information about him has been limited to us. The purpose of this study was to investigate the Greg Lynn's digital design matrix toward the design process in his representative 11 works in his website; www.glform.com. The contents analyses methods were used in this study. Greg Lynn's internet website survey was carried out in the respects of thinking method, space formative language and animate form. The major results of this study are as follows: \circled1 Lynn's design concept and digital methodology were affected by Paolo Soleri and Peter Eisenman: natural architectural concept and digital animate form \circled2 Lynn's space formative languages were 10 items; blob, blob, fold, strand, shred, flower, skin, teeth, branch and lattice \circled3 Lynn's digital design matrix was divided into 3 types; MS(Mass + Structure), PC(Path + Circulation) and FD(Form + Detail) \circled4 According to the analysis of longitudinal, his works have been changed from the MS and PC to FD. This research will be a basic reference to understand digital space design.

Automatic Ontology Generation from Natural Language Sentences Using Predicate Ontology (서술어 온톨로지를 이용한 자연어 문장으로부터의 온톨로지 자동 생성)

  • Min, Young-Kun;Lee, Bog-Ju
    • Journal of Korea Multimedia Society
    • /
    • v.13 no.9
    • /
    • pp.1263-1271
    • /
    • 2010
  • Ontologies, the important implementation tools for semantic web, are widely used in various areas such as search, reasoning, and knowledge representation. Developing well-defined ontologies, however, requires a lot of resources in terms of time and materials. There have been efforts to construct ontologies automatically to overcome these problems. In this paper, ontologies are automatically constructed from the natural languages sentences directly. To do this, the analysis of morphemes and a sentence structure is performed at first. then, the program finds predicates inside the sentence and the predicates are transformed to the corresponding ontology predicates. For matching the corresponding ontology predicate from a predicate in the sentence, we develop the "predicate ontology". An experimental comparison between human ontology engineer and the program shows that the proposed system outperforms the human engineer in an accuracy.

Mapping between CoreNet and SUMO through WordNet (WordNet을 매개로 한 CoreNet-SUMO의 매핑)

  • Kang, Sin-Jae;Kang, In-Su;Nam, Se-Jin;Choi, Key-Sun
    • Journal of the Korean Institute of Intelligent Systems
    • /
    • v.21 no.2
    • /
    • pp.276-282
    • /
    • 2011
  • CoreNet is a valuable resource to use in the domain of natural language processing including Korean-Chinese-Japanese multilingual text analysis, and translation among natural languages. CoreNet is mapped to SUMO in order to encourage its application in broader fields and enhance its international status as a multilingual lexical semantic network. To do this, indirect and direct mapping methodologies are used. Through the indirect mapping among CoreNet-KorLex-PWN-SUMO, we alleviate the difficulty of translating CoreNet concept terms in Korean into SUMO concepts in English, and maximize recall of SUMO concepts corresponding to the concept of CoreNet.

Hangeul Stem Extraction Algorithm for Text Mining Based on Natural Language Processing (자연어 처리 기반 텍스트 마이닝을 위한 한글 어간 추출 알고리즘)

  • Choi, Ki-won;Choi, Seong-hun;Jo, Sang-hyeon;Kim, Hee-cheol
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2017.05a
    • /
    • pp.718-721
    • /
    • 2017
  • Natural language processing, which is the basis of text mining, differs depending on the type of language. Especially, Hangeul, which has relatively high freedom of expression compared to other languages, has various forms of words depending on the use of ending. The part that does not change in these various forms of words is called the stem. For effective text mining, it is essential to extract words and unify various types of words. Therefore, this paper proposes an extraction algorithm for Hangul word for effective text mining of Hangul document.

  • PDF

Epistemologico-Historic Foundations of Linguistic Relativity (언어상대성 원칙의 역사 인식론적 토대 -문화 언어학을 위한 서설-)

  • 김성도
    • Lingua Humanitatis
    • /
    • v.2 no.1
    • /
    • pp.7-42
    • /
    • 2002
  • This paper reexamines ideas about linguistic relativity in the light of new interest in the theoretical climate. The original idea is based on the incommensurability of the semantic structures of different languages. On this view, language, thought, culture are deeply interconnected, so that each language might be associated with it a distinctive world view. Throughout this work I utilize the historico-epistemological standpoint to dissect the conceptual structure of this principle. In the introduction I will of for a justification of choice of the theme. Section 1 will address some essential definition of the linguistic principle and insist on the necessity to elaborate a typological spectrum of relativism and universalism. In the second section some important landmarks of linguistic relativity were marked from Plato to Humboldt via Condillac and Herder. 1 will subdivide the relativity hypothesis into 3 theses which are interlated. In the final section the epistemological structure of the linguistic principle will be analysed in some detail by providing my exposition of Sapir-Whorf hypothesis. By way of conclusion I will present the works of Wierzbicka who demonstrated the lexicons of different languages suggest different conceptual universes. By rejecting analytical tools derived from the English language she proposed instead a natural semantic metalanguage based on lexical universals, which is made up of universal semantic primitives. In this paper we attempted to construct a general problematics of linguistic relativity, focolizing on the Sapir-Whorf hypothesis. We devided this very problematic question into its ontological and epistemological dimensions. In particular the ambivalance of Whorf's relativity is discussed in some detail. Also, an archeological survey of this subtle question on the relation between language, thinking and culture was provided. (from Aristotle to Humboldt, via Condillac and Nitzche). In conclusion this investigation underlines the necessity of preparing the cultural linguistics to enlarge the scope of contempory linguistics.

  • PDF

The Verification of the Transfer Learning-based Automatic Post Editing Model (전이학습 기반 기계번역 사후교정 모델 검증)

  • Moon, Hyeonseok;Park, Chanjun;Eo, Sugyeong;Seo, Jaehyung;Lim, Heuiseok
    • Journal of the Korea Convergence Society
    • /
    • v.12 no.10
    • /
    • pp.27-35
    • /
    • 2021
  • Automatic post editing is a research field that aims to automatically correct errors in machine translation results. This research is mainly being focus on high resource language pairs, such as English-German. Recent APE studies are mainly adopting transfer learning based research, where pre-training language models, or translation models generated through self-supervised learning methodologies are utilized. While translation based APE model shows superior performance in recent researches, as such researches are conducted on the high resource languages, the same perspective cannot be directly applied to the low resource languages. In this work, we apply two transfer learning strategies to Korean-English APE studies and show that transfer learning with translation model can significantly improves APE performance.

Automatic Extraction of Metadata Information for Library Collections

  • Yang, Gi-Chul;Park, Jeong-Ran
    • International Journal of Advanced Culture Technology
    • /
    • v.6 no.2
    • /
    • pp.117-122
    • /
    • 2018
  • As evidenced through rapidly growing digital repositories and web resources, automatic metadata generation is becoming ever more critical, especially considering the costly and complex operation of manual metadata creation. Also, automatic metadata generation is apt to consistent metadata application. In this sense, metadata quality and interoperability can be enhanced by utilizing a mechanism for automatic metadata generation. In this article, a mechanism of automatic metadata extraction called ExMETA is introduced in order to alleviate issues dealing with inconsistent metadata application and semantic interoperability across ever-growing digital collections. Conceptual graph, one of formal languages that represent the meanings of natural language sentences, is utilized for ExMETA as a mediation mechanism that enhances the metadata quality by disambiguating semantic ambiguities caused by isolation of a metadata element and its corresponding definition from the relevant context. Hence, automatic metadata generation by using ExMETA can be a good way of enhancing metadata quality and semantic interoperability.