• Title/Summary/Keyword: Natural language process

Search Result 247, Processing Time 0.028 seconds

Korean Information Summary System for National R&D Projcet Information Summary (국가R&D과제정보 요약을 위한 한국어 정보요약 시스템)

  • Lee, Jong-Won;Kim, Tae-Hyun;Shin, Dong-Gu;Jo, Woo-Seung
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2022.10a
    • /
    • pp.72-74
    • /
    • 2022
  • The National Science and Technology Knowledge Information Service (NTIS) provides information on national R&D projects. Project information consists of meta-information such as 'project name', 'project performance institution', 'research manager name', and text explaining projects such as 'research goal', 'research content', and 'expected effect'. There is a problem that it takes a lot of time to find the desired project information by checking all of the "research goals" or "research contents" in the list of results of searching for 1 million project information. To solve this problem, this paper proposes a project information summary system that summarizes the parts consisting of long texts within the national R&D project information. By analyzing the linguistic characteristics of the Korean language, a preprocessor was built and a project information summary model based on natural language processing technology was developed to process preprocessed text information. Through this, project information composed of long sentences is provided in a compressed and summarized form, which will help users to easily and quickly infer the overall content with the summary information alone.

  • PDF

Intrusion Detection System based on Packet Payload Analysis using Transformer

  • Woo-Seung Park;Gun-Nam Kim;Soo-Jin Lee
    • Journal of the Korea Society of Computer and Information
    • /
    • v.28 no.11
    • /
    • pp.81-87
    • /
    • 2023
  • Intrusion detection systems that learn metadata of network packets have been proposed recently. However these approaches require time to analyze packets to generate metadata for model learning, and time to pre-process metadata before learning. In addition, models that have learned specific metadata cannot detect intrusion by using original packets flowing into the network as they are. To address the problem, this paper propose a natural language processing-based intrusion detection system that detects intrusions by learning the packet payload as a single sentence without an additional conversion process. To verify the performance of our approach, we utilized the UNSW-NB15 and Transformer models. First, the PCAP files of the dataset were labeled, and then two Transformer (BERT, DistilBERT) models were trained directly in the form of sentences to analyze the detection performance. The experimental results showed that the binary classification accuracy was 99.03% and 99.05%, respectively, which is similar or superior to the detection performance of the techniques proposed in previous studies. Multi-class classification showed better performance with 86.63% and 86.36%, respectively.

A Study on the Development Methodology for User-Friendly Interactive Chatbot (사용자 친화적인 대화형 챗봇 구축을 위한 개발방법론에 관한 연구)

  • Hyun, Young Geun;Lim, Jung Teak;Han, Jeong Hyeon;Chae, Uri;Lee, Gi-Hyun;Ko, Jin Deuk;Cho, Young Hee;Lee, Joo Yeoun
    • Journal of Digital Convergence
    • /
    • v.18 no.11
    • /
    • pp.215-226
    • /
    • 2020
  • Chatbot is emerging as an important interface window for business. This change is due to the continued development of chatbot-related research from NLP to NLU and NLG. However, the reality is that the methodological study of drawing domain knowledge and developing it into a user-friendly interactive interface is weak in the process of developing chatbot. In this paper, in order to present the process criteria of chatbot development, we applied it to the actual project based on the methodology presented in the previous paper and improved the development methodology. In conclusion, the productivity of the test phase, which is the most important step, was improved by 33.3%, and the number of iterations was reduced to 37.5%. Based on these results, the "3 Phase and 17 Tasks Development Methodology" was presented, which is expected to dramatically improve the trial and error of the chatbot development.

A Study on the Variation of Residential Architectural Space Designed by Richard Meier (리차드 마이어의 주거 건축에 나타난 공간 설계 변화 연구)

  • Lee, Sun-Min;Joh, Eun-Ran;Hur, Bum-Pall
    • Korean Institute of Interior Design Journal
    • /
    • v.20 no.2
    • /
    • pp.131-138
    • /
    • 2011
  • This study describes development process of residential architectural space in the Richard Meier's residential works through chronological analysis. By analyzing house layout, floor plan, elevation, materials and colors and light entering of his residential works, changes in his architectural space has been studied. Richard Meier's early work is designed as three-dimensional shape, and places emphasis on geometric form. However, recently completed residential works by Richard Meier focuses on the horizontal sun shading louvers, which has ability. In addition, teak and a special UV protective glass are used to control natural environment. Through this research, it was found that Richard Meier has maintained his architectural language in terms of the design layout of the house and planar with colors that appear on the initial project. Using light input methods and materials used for active development, it was found that many design elements were applied and developed.

A Study on the Architectural Characteristics at Wolfsburg Church (볼프스부르크 교회의 건축 특성에 관한 연구)

  • Chung, Tae-Yong
    • Korean Institute of Interior Design Journal
    • /
    • v.17 no.1
    • /
    • pp.93-101
    • /
    • 2008
  • One of the big difference between Aalto and other modem architects lies in an attitude that Aalto went beyond the principle of dogmatic functionalism. As an architect of Northern Europe, Aalto made unique architecture that showed both the language of modem architecture and vernacular architecture. He presented the best example which showed modernity in architecture without losing the meaning of the context including site, region and cultural tradition. With this background, Aalto developed his own typological design approach. As Wolfsburg church was designed in his prime of time, it showed the origin and evolutionary process of religious architecture as well as general features of Aalto's design. Clustered site plan, dynamic space of interior, the adequate control of natural light are representative characteristics found in Aalto's religious architecture. Wolfsburg church also demonstrate these features in more specific way including the continuity of wall and ceiling that showed spatial dynamics, free opening of wall which reflects on functions, and careful adjust of natural light. As a result, Aalto showed his typological design strategy that added new design ideas to the accumulated design vocabularies in the design of Wolfsburg church.

A study on the aspect-based sentiment analysis of multilingual customer reviews (다국어 사용자 후기에 대한 속성기반 감성분석 연구)

  • Sungyoung Ji;Siyoon Lee;Daewoo Choi;Kee-Hoon Kang
    • The Korean Journal of Applied Statistics
    • /
    • v.36 no.6
    • /
    • pp.515-528
    • /
    • 2023
  • With the growth of the e-commerce market, consumers increasingly rely on user reviews to make purchasing decisions. Consequently, researchers are actively conducting studies to effectively analyze these reviews. Among the various methods of sentiment analysis, the aspect-based sentiment analysis approach, which examines user reviews from multiple angles rather than solely relying on simple positive or negative sentiments, is gaining widespread attention. Among the various methodologies for aspect-based sentiment analysis, there is an analysis method using a transformer-based model, which is the latest natural language processing technology. In this paper, we conduct an aspect-based sentiment analysis on multilingual user reviews using two real datasets from the latest natural language processing technology model. Specifically, we use restaurant data from the SemEval 2016 public dataset and multilingual user review data from the cosmetic domain. We compare the performance of transformer-based models for aspect-based sentiment analysis and apply various methodologies to improve their performance. Models using multilingual data are expected to be highly useful in that they can analyze multiple languages in one model without building separate models for each language.

Improving the Professional Competence of a Specialist in Poland by Implementing Multimedia Technologies

  • Kravchenko, Tetiana;Varga, Lesia;Lypchanko-Kovachyk, Oksana;Chinchoy, Alexander;Yevtushenko, Nataliia;Syladii, Ivan;Kuchai, Oleksandr
    • International Journal of Computer Science & Network Security
    • /
    • v.22 no.9
    • /
    • pp.51-58
    • /
    • 2022
  • The article emphasizes the features of the modern education system in Poland, reveals the peculiarities of improving the professional competence of a specialist in Poland through the implementation of multimedia technologies. Various forms of innovations implemented in improving the professional competence of a specialist are listed: improvement (rationalization), modernization, innovation. The forms of professional improvement through the introduction of computer technologies in general and multimedia technologies, in particular, primarily include various professional courses, qualification, preparatory, methodological conferences, seminars, postgraduate studies, foreign and state internships. At the same time, the main direction is self-education. The subject of professional improvement in the application of computer technologies by specialists is the updating of existing knowledge, exchange of professional experience, planning, as well as discussion of innovative works in which specialists participate. Professional growth of specialists can occur both during work and in higher education institutions during their studies. Modernization of computer technologies, especially multimedia ones, is a necessary condition for the functioning of specialists in modern society, since specialists are at the center of the educational process, during the improvement of professional competence. The main functions of the educational process necessary for improving the professional competence of specialists through the implementation of multimedia technologies are revealed. These functions not only contribute to the professional improvement of specialists, but also affect their solutions and optimize the maintenance of contacts between specialists. The importance of creating conditions that are consistent with the modern needs of innovative education is emphasized.

A Method for Learning the Specialized Meaning of Terminology through Mixed Word Embedding (혼합 임베딩을 통한 전문 용어 의미 학습 방안)

  • Kim, Byung Tae;Kim, Nam Gyu
    • The Journal of Information Systems
    • /
    • v.30 no.2
    • /
    • pp.57-78
    • /
    • 2021
  • Purpose In this study, first, we try to make embedding results that reflect the characteristics of both professional and general documents. In addition, when disparate documents are put together as learning materials for natural language processing, we try to propose a method that can measure the degree of reflection of the characteristics of individual domains in a quantitative way. Approach For this study, the Korean Supreme Court Precedent documents and Korean Wikipedia are selected as specialized documents and general documents respectively. After extracting the most similar word pairs and similarities of unique words observed only in the specialized documents, we observed how those values were changed in the process of embedding with general documents. Findings According to the measurement methods proposed in this study, it was confirmed that the degree of specificity of specialized documents was relaxed in the process of combining with general documents, and that the degree of dissolution could have a positive correlation with the size of general documents.

An Approach to Linguistic Instruction Based Learning and Its Application to Helicopter Flight Control

  • M.Sugeno;Park, G.K.
    • Proceedings of the Korean Institute of Intelligent Systems Conference
    • /
    • 1993.06a
    • /
    • pp.1082-1085
    • /
    • 1993
  • In this paper, we notice the fact that a human learning process is characterized by a process under a natural language environment, and discuss an approach of learning based on indirect linguistic instructions. An instruction is interpreted through some meaning elements and each trend. Fuzzy evaluation rule are constructed for the searched meaning elements of the given instruction, and the performance of a system to be learned is improved by the evaluation rules. In this paper, we propose a framework of learning based on indirect linguistic instruction based learning using fuzzy theory: FULLINS(FUzzy-Learning based on Linguistic IN-Struction). The validity of FULLINS is shown by applying it to helicopter flight control.

  • PDF

Determination of Thematic Roles according to Syntactic Relations Using Rules and Statistical Models in Korean Language Processing (한국어 전산처리에서 규칙과 확률을 이용한 구문관계에 따른 의미역 결정)

  • 강신재;박정혜
    • Journal of Korea Society of Industrial Information Systems
    • /
    • v.8 no.1
    • /
    • pp.33-42
    • /
    • 2003
  • This paper presents an efficient determination method of thematic roles from syntactic relations using rules and statistical model in Korean language processing. This process is one of the main core of semantic analysis and an important issue to be solved in natural language processing. It is problematic to describe rules for determining thematic roles by only using general linguistic knowledge and experience, since the final result may be different according to the subjective views of researchers, and it is impossible to construct rules to cover all cases. However, our hybrid method is objective and efficient by considering large corpora, which contain practical usages of Korean language, and case frames in the Sejong Electronic Lexicon of Korean, which is being developed by dozens of Korean linguistic researchers. To determine thematic roles more correctly, our system uses syntactic relations, semantic classes, morpheme information, position of double subject. Especially by using semantic classes, we can increase the applicability of our system.

  • PDF