• Title/Summary/Keyword: Natural language process

Search Result 243, Processing Time 0.027 seconds

Inverse Document Frequency-Based Word Embedding of Unseen Words for Question Answering Systems (질의응답 시스템에서 처음 보는 단어의 역문헌빈도 기반 단어 임베딩 기법)

  • Lee, Wooin;Song, Gwangho;Shim, Kyuseok
    • Journal of KIISE
    • /
    • v.43 no.8
    • /
    • pp.902-909
    • /
    • 2016
  • Question answering system (QA system) is a system that finds an actual answer to the question posed by a user, whereas a typical search engine would only find the links to the relevant documents. Recent works related to the open domain QA systems are receiving much attention in the fields of natural language processing, artificial intelligence, and data mining. However, the prior works on QA systems simply replace all words that are not in the training data with a single token, even though such unseen words are likely to play crucial roles in differentiating the candidate answers from the actual answers. In this paper, we propose a method to compute vectors of such unseen words by taking into account the context in which the words have occurred. Next, we also propose a model which utilizes inverse document frequencies (IDF) to efficiently process unseen words by expanding the system's vocabulary. Finally, we validate that the proposed method and model improve the performance of a QA system through experiments.

A Study on Image Generation from Sentence Embedding Applying Self-Attention (Self-Attention을 적용한 문장 임베딩으로부터 이미지 생성 연구)

  • Yu, Kyungho;No, Juhyeon;Hong, Taekeun;Kim, Hyeong-Ju;Kim, Pankoo
    • Smart Media Journal
    • /
    • v.10 no.1
    • /
    • pp.63-69
    • /
    • 2021
  • When a person sees a sentence and understands the sentence, the person understands the sentence by reminiscent of the main word in the sentence as an image. Text-to-image is what allows computers to do this associative process. The previous deep learning-based text-to-image model extracts text features using Convolutional Neural Network (CNN)-Long Short Term Memory (LSTM) and bi-directional LSTM, and generates an image by inputting it to the GAN. The previous text-to-image model uses basic embedding in text feature extraction, and it takes a long time to train because images are generated using several modules. Therefore, in this research, we propose a method of extracting features by using the attention mechanism, which has improved performance in the natural language processing field, for sentence embedding, and generating an image by inputting the extracted features into the GAN. As a result of the experiment, the inception score was higher than that of the model used in the previous study, and when judged with the naked eye, an image that expresses the features well in the input sentence was created. In addition, even when a long sentence is input, an image that expresses the sentence well was created.

A Study on Interior Design Process by approaching Typological Method (유형학적 접근방식에 의한 실내디자인 과정에 관한 연구 (II))

  • 한경희;이선민
    • Korean Institute of Interior Design Journal
    • /
    • no.21
    • /
    • pp.165-172
    • /
    • 1999
  • For the useful method capable of modern expression on traditional residence architecture, a study was performed on the methodological establishment and possibility of typological method could be examinated to interior design process by typological method. First of all, through the establishment verbal of our Korean traditional architecture and further investigation of environmental and cultural idealogical facts, it could be extracted from natural instinct, duality, continuance, flexibility and transitiov. In second process, based on these results, it could be framed and described the individual typological language and, for the sake of drawing for visual and spatial typology, it was made by sketch in terms and view of possibile guidance of prototype, transforming and application method. from these results of investigated sketches, it cold be used for criteria of application method as the parts of visual and spatial typological elements to have an applicable expression of it/s traditionality. Based on above facts, for the subjects of spatial system, form & shape system, circulation system, order system, decoration system, color & material system in interior design fields, we cold propose the practical possibility through the consideration of application method for built-in meaning that could be adaptable for the interior design practices. These facts were extracted from the based on visual & spatial typology, as above mentiov. Also, through preparing and suggesting the criteria of evaluation and measurement of design quality , we could propose the applicable methodology for further & basically Korean traditional embodiment.

  • PDF

A Validity Verification of Human Error Probability using a Fuzzy Model (퍼지모델을 이용한 인적오류확률의 타당성 검증)

  • Jang, Tong-Il;Lee, Yong-Hee;Lim, Hyeon-Kyo
    • Journal of the Korean Society of Safety
    • /
    • v.21 no.3 s.75
    • /
    • pp.137-142
    • /
    • 2006
  • Quantification of error possibility, in an HRA process, should be performed so that the result of the qualitative analysis can be utilized in other areas in conjunction with overall safety estimation results. And also, the quantification is an essential process to analyze the error possibility in detail and to obtain countermeasures for the errors through screening procedures. In previous studies for the quantification of error possibility, nominal values were assigned by the experts' judgements and utilized as corresponding probabilities. The values assigned by experts' experiences and judgements, however, require verifications on their reliability. In this study, the validity of new error possibility values in new MCR design was verified by using the Onisawa's model which utilizes fuzzy linguistic values to estimate human error probabilities. With the model of error probabilities are represented as analyst's estimations and natural language expression instead of numerical values. As results, the experts' estimation values about error probabilities are well agreed to the existing error probability estimation model. Thus, it was concluded that the occurrence probabilities of errors derived from the human error analysis process can be assessed by nominal values suggested in the previous studies. It is also expected that our analysis method can supplement the conventional HRA method because the nominal values are based on the consideration of various influencing factors such as PSFs.

Untold story about why King Sejong invented the Korean alphabet

  • JUNG, Sanggyu
    • Journal of Koreanology Reviews
    • /
    • v.1 no.1
    • /
    • pp.1-23
    • /
    • 2022
  • HunMinJeongEum, meaning "the right sound to teach the people," was created in 1443 CE by King Sejong the Great, the fourth king of the Joseon Dynasty. In today's modern language, this letter, called Hangeul, is internationally recognized for its linguistic science. However, it is hard to find a comprehensive study on the fact that King Sejong himself created Hangeul, the Confucian perspective on natural disasters and democracy revealed in the process of writing, the independent efforts emphasized from a certain period, and the achievements of King Sejong, who shared the sorrow of the people and carried out national policies despite the extreme opposition of the nobility. Accordingly, I analyzed the consonants of HunMinJeongEum and looked at the essence of humanity and oriental philosophy (Yin-Yang Five Elements, Sangsu Philosophy, Hado). Surprisingly, different meanings from previous studies and interpretations were found, and King Sejong's "Da Vinci Code," which was left behind in the process of making the consonant, is reinterpreted and revealed. King Sejong's achievements were all connected as one. This is the root of democracy in the Republic of Korea today, and this is why King Sejong was selected as the most beloved and respected historical figure by the Korean people. This study will start with more people's understanding of the fundamental perception and philosophy of the world in Asia, including Korea, to reinterpret and reveal the hardships and great achievements experienced by a leader of a country in the process of creating korean alphabet, and to emphasize democracy, which is an important value for Asians and Westerners' mutual respect and co-prosperity.

Research on R&D Planning Through NLP Analysis of Patent Information: Focusing on Display Technology (특허정보의 NLP 분석을 통한 R&D 계획수립 방안 연구: 디스플레이 기술 분석을 중심으로)

  • Kim, Jung-Heui;Kim, Young-Min
    • Journal of the Korean Society of Industry Convergence
    • /
    • v.25 no.5
    • /
    • pp.817-826
    • /
    • 2022
  • Patent information describes the history of technological progress in the relevant field, so it can be usefully used to identify trends in technological development and change and to establish R&D development strategies. This study proposes a method to identify the needs and problems of technology development at the planning stage of the R&D process and to analyze core technologies through patent analysis using Natural Language Processing(NLP) technology. As a big data source, collected patent documents registered in Google Patents for foldable technology, the latest technology in the display industry, and then extracted keywords using NLP analyzer. By classifying the extracted keywords into needs and problems for technology development, developed technology and materials, identified the needs of the market and customers and analyzed the technologies being researched and developed. Unlike previous studies that performed patent analysis, this methodology is different in that it can quickly and conveniently analyze the latest technology trends from big data called patents even if you do not have specialized knowledge and skills in the text mining. This study contributes to the digitalization of the R&D process based on data analysis.

Rule Construction for Determination of Thematic Roles by Using Large Corpora and Computational Dictionaries (대규모 말뭉치와 전산 언어 사전을 이용한 의미역 결정 규칙의 구축)

  • Kang, Sin-Jae;Park, Jung-Hye
    • The KIPS Transactions:PartB
    • /
    • v.10B no.2
    • /
    • pp.219-228
    • /
    • 2003
  • This paper presents an efficient construction method of determination rules of thematic roles from syntactic relations in Korean language processing. This process is one of the main core of semantic analysis and an important issue to be solved in natural language processing. It is problematic to describe rules for determining thematic roles by only using general linguistic knowledge and experience, since the final result may be different according to the subjective views of researchers, and it is impossible to construct rules to cover all cases. However, our method is objective and efficient by considering large corpora, which contain practical osages of Korean language, and case frames in the Sejong Electronic Lexicon of Korean, which is being developed by dozens of Korean linguistic researchers. To determine thematic roles more correctly, our system uses syntactic relations, semantic classes, morpheme information, position of double subject. Especially by using semantic classes, we can increase the applicability of the rules.

Automation of Service Level Agreement based on Active SLA (Active SLA 기반 서비스 수준 협약의 자동화)

  • Kim, Sang-Rak;Kang, Man-Mo;Bae, Jae-Hak
    • The Journal of the Institute of Internet, Broadcasting and Communication
    • /
    • v.13 no.4
    • /
    • pp.229-237
    • /
    • 2013
  • As demand for IT services increase, which are based on SOA and cloud computing, service level agreements (SLAs) have received more attention in the parties concerned. An SLA is usually a paper contract written in natural language. SLA management tools which are commercially available, implement SLAs implicitly in the application with a procedural language. This makes automation of SLA management difficult. It is also laborious to maintain contract management systems because changes in a contract give rise to extensive modifications in the source code. We see the source of the trouble is the existence of documentary SLAs (paper contracts) and corresponding executable SLAs (contracts coded in the procedural language). In this paper, to resolve the current SLA management problems we propose an active SLM (Active Service Level Management) system, which is based on the active SLA (Active Service Level Agreement). In the proposed system, the separated management and processing of dual SLAs can be unified into a single process with the introduction of active SLAs (ASLAs).

Design and Implementation of Vocal Sound Variation Rules for Korean Language (한국어 음운 변동 처리 규칙의 설계 및 구현)

  • Lee, Gye-Young
    • The Transactions of the Korea Information Processing Society
    • /
    • v.5 no.3
    • /
    • pp.851-861
    • /
    • 1998
  • Korean language is to be characterized by the rich vocal sound variation. In order to increase the probability of vocal sound recognition and to provide a natural vocal sound synthesis, a systematic and thorough research into the characteristics of Korean language including its vocal sound changing rules is required. This paper addresses an effective way of vocal sound recognition and synthesis by providing the design and implementation of the Korean vocal sound variation rule. The regulation we followed for the design of the vocal sound variation rule is the Phonetic Standard(Section 30. Chapter 7) of the Korean Orthographic Standards. We have first factor out rules for each regulations, then grouped them into 27 groups for eaeh final-consonant. The Phonological Change Processing System suggested in the paper provides a fast processing ability for vocal sound variation by a single application of the rule. The contents of the process for information augmented to words or the stem of innected words are included in the rules. We believe that the Phonological Change Processing System will facilitate the vocal sound recognition and synthesis by the sentence. Also, this system may be referred as an example for similar research areas.

  • PDF

Visualizing Unstructured Data using a Big Data Analytical Tool R Language (빅데이터 분석 도구 R 언어를 이용한 비정형 데이터 시각화)

  • Nam, Soo-Tai;Chen, Jinhui;Shin, Seong-Yoon;Jin, Chan-Yong
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2021.05a
    • /
    • pp.151-154
    • /
    • 2021
  • Big data analysis is the process of discovering meaningful new correlations, patterns, and trends in large volumes of data stored in data stores and creating new value. Thus, most big data analysis technology methods include data mining, machine learning, natural language processing, and pattern recognition used in existing statistical computer science. Also, using the R language, a big data tool, we can express analysis results through various visualization functions using pre-processing text data. The data used in this study was analyzed for 21 papers in the March 2021 among the journals of the Korea Institute of Information and Communication Engineering. In the final analysis results, the most frequently mentioned keyword was "Data", which ranked first 305 times. Therefore, based on the results of the analysis, the limitations of the study and theoretical implications are suggested.

  • PDF