• Title/Summary/Keyword: POS-System

Search Result 178, Processing Time 0.036 seconds

A Comparative Study on Optimal Feature Identification and Combination for Korean Dialogue Act Classification (한국어 화행 분류를 위한 최적의 자질 인식 및 조합의 비교 연구)

  • Kim, Min-Jeong;Park, Jae-Hyun;Kim, Sang-Bum;Rim, Hae-Chang;Lee, Do-Gil
    • Journal of KIISE:Software and Applications
    • /
    • v.35 no.11
    • /
    • pp.681-691
    • /
    • 2008
  • In this paper, we have evaluated and compared each feature and feature combinations necessary for statistical Korean dialogue act classification. We have implemented a Korean dialogue act classification system by using the Support Vector Machine method. The experimental results show that the POS bigram does not work well and the morpheme-POS pair and other features can be complementary to each other. In addition, a small number of features, which are selected by a feature selection technique such as chi-square, are enough to show steady performance of dialogue act classification. We also found that the last eojeol plays an important role in classifying an entire sentence, and that Korean characteristics such as free order and frequent subject ellipsis can affect the performance of dialogue act classification.

A Study on the Development of Barcode Laser Scanner Using Optical Information Processing (광 정보처리를 이용한 바코드 레이저 스캐너 개발연구)

  • Shin, Kwang-Yong;Ihm, Jong-Tae;Eun, Jae-Jung;Kim, Nam;Park, Han-Kyu
    • Journal of the Korean Institute of Telematics and Electronics
    • /
    • v.26 no.1
    • /
    • pp.69-77
    • /
    • 1989
  • A hologram scanner for POS bar code symb9ol readers has been developed. This system is composed of scanning optics, optical detector, video signal circuitary and preprocessor. In contrast to conventional scanners using polygonal mirrors, which complicate the scanning optics, the hologram scanner developed in this research was made up with simple optics and higher reading performance was achieved. And in order to read abar code symbol omnidirectionally with highdensity scan patterns, the new real time decoding technique was proposed. The advantage of this technique is less hardware and lower clock rate. High speed processing and improved readability for tilted symbol was confirmed experimentally.

  • PDF

Analysis of Partial Discharge Characteristics in SF6 Gas Insulation (SF6 가스절연에서 부분방전의 특성분석)

  • Kim, Sun-Jae;Wang, Guoming;Park, Seo-Jun;Kil, Gyung-Suk;An, Chang-Hwan
    • Journal of the Korean Institute of Electrical and Electronic Material Engineers
    • /
    • v.29 no.7
    • /
    • pp.429-434
    • /
    • 2016
  • This paper deals with the characteristics of partial discharge (PD) for the purpose of a condition based maintenance (CBM) of gas insulated switchgears (GIS) in power equipment. Four types of electrode systems such as a protrusion on enclosure (POE), a particle on spacer (POS), a free particle (FP) and a Floating were designed and fabricated. PD pulses were measured using UHF sensor with a frequency range of 300 MHz~1.4 GHz and a DAQ with a sampling rate of 250 MS/s. Discharge inception voltage (DIV), discharge extinction voltage (DEV), and phase resolved partial discharge (PRPD) were analyzed depending on electrode systems. The average DIV in the POS was 28.8 kV. It was about 1.7 times higher than that in the FP, which was the lowest value of 17.2 kV. The FP shuffled and jumped at the applied voltage of 23.5 kV. Over 95% of PD pulses in the POE were generated in the negative polarity ($181^{\circ}{\sim}360^{\circ}$) of applied voltage. The results showed the phase (${\Phi}$)-magnitude (dBm) of PD pulses by UHF sensor, a cluster was formed separately depending on electrode systems.

Study on the Improvement of OFDM/64QAM Modem (OFDM/64QAM방식의 모뎀 설계)

  • Park, Jin-Soo
    • Journal of Advanced Navigation Technology
    • /
    • v.16 no.1
    • /
    • pp.158-162
    • /
    • 2012
  • In this paper, we propose a wireless modem, which used OFDM/64QAM method and the ISM band with 2.4GHz radio frequency. In this paper proposed the case of a modem, the main program to process the baseband processor, processing speed, operating voltage, and reliability should be ensured. So we have designed with Ralink's RT2870, witch was used for Wi-Fi solution. The RT2870 provides full support for wireless LAN standard, and supports various modulation formats, 2.4GHz and 5GHz bands, both of which support chip. In this paper, we also output the modulated signal transmitted wirelessly to the 2.4GHz band RF RT2850 chip processing was applied and using 40MHz band 2.422 ~ 2.462GHz wireless bands were designed to occupy. By applying bi-directional transmission between wireless transmitter and receiver, it can be effectively connected with any kinds of wireless LAN with 2.4GHz ISM band. Therefore it could economically be used as peripheral equipments for POS system or personal wireless device based on Android platform.

Performance Comparison Analysis on Named Entity Recognition system with Bi-LSTM based Multi-task Learning (다중작업학습 기법을 적용한 Bi-LSTM 개체명 인식 시스템 성능 비교 분석)

  • Kim, GyeongMin;Han, Seunggnyu;Oh, Dongsuk;Lim, HeuiSeok
    • Journal of Digital Convergence
    • /
    • v.17 no.12
    • /
    • pp.243-248
    • /
    • 2019
  • Multi-Task Learning(MTL) is a training method that trains a single neural network with multiple tasks influences each other. In this paper, we compare performance of MTL Named entity recognition(NER) model trained with Korean traditional culture corpus and other NER model. In training process, each Bi-LSTM layer of Part of speech tagging(POS-tagging) and NER are propagated from a Bi-LSTM layer to obtain the joint loss. As a result, the MTL based Bi-LSTM model shows 1.1%~4.6% performance improvement compared to single Bi-LSTM models.

Syntactic and Semantic Disambiguation for Interpretation of Numerals in the Information Retrieval (정보 검색을 위한 숫자의 해석에 관한 구문적.의미적 판별 기법)

  • Moon, Yoo-Jin
    • Journal of the Korea Society of Computer and Information
    • /
    • v.14 no.8
    • /
    • pp.65-71
    • /
    • 2009
  • Natural language processing is necessary in order to efficiently perform filtering tremendous information produced in information retrieval of world wide web. This paper suggested an algorithm for meaning of numerals in the text. The algorithm for meaning of numerals utilized context-free grammars with the chart parsing technique, interpreted affixes connected with the numerals and was designed to disambiguate their meanings systematically supported by the n-gram based words. And the algorithm was designed to use POS (part-of-speech) taggers, to automatically recognize restriction conditions of trigram words, and to gradually disambiguate the meaning of the numerals. This research performed experiment for the suggested system of the numeral interpretation. The result showed that the frequency-proportional method recognized the numerals with 86.3% accuracy and the condition-proportional method with 82.8% accuracy.

A Study on Efficient Return Logistics of Fashion Industry (패션기업의 반품물류 효율화 방안에 관한 연구)

  • Kim, Han-Seong;Hwang, Dae Sung;Lee, Jae-Gun;Kim, Tae-Won;Kang, Kyung-Sik
    • Journal of the Korea Safety Management & Science
    • /
    • v.18 no.1
    • /
    • pp.159-166
    • /
    • 2016
  • Logistics had only been perceived as a supporting means or auxiliary of corporate activity, but is now surfacing as a priority sector for competitiveness in the infinite competition of the global market. This study looks into the current status of logistics and sets forth the improvement tasks by analyzing the problems of the returned goods logistics. In order to improve the manual process of returned goods, the POS system (point-of-sales system) was implemented and DAS (Digital Assorting System) and SORTER System were synchronized for utilization, which cut the prime cost of the once expensive returned goods logistics, and analyzed the efficiency of establishing automated logistics system for efficiency of returned goods logistics in the aspects of one-stop BPR (Business Process Re-engineering).

Sentiment Analysis of Korean Reviews Using CNN: Focusing on Morpheme Embedding (CNN을 적용한 한국어 상품평 감성분석: 형태소 임베딩을 중심으로)

  • Park, Hyun-jung;Song, Min-chae;Shin, Kyung-shik
    • Journal of Intelligence and Information Systems
    • /
    • v.24 no.2
    • /
    • pp.59-83
    • /
    • 2018
  • With the increasing importance of sentiment analysis to grasp the needs of customers and the public, various types of deep learning models have been actively applied to English texts. In the sentiment analysis of English texts by deep learning, natural language sentences included in training and test datasets are usually converted into sequences of word vectors before being entered into the deep learning models. In this case, word vectors generally refer to vector representations of words obtained through splitting a sentence by space characters. There are several ways to derive word vectors, one of which is Word2Vec used for producing the 300 dimensional Google word vectors from about 100 billion words of Google News data. They have been widely used in the studies of sentiment analysis of reviews from various fields such as restaurants, movies, laptops, cameras, etc. Unlike English, morpheme plays an essential role in sentiment analysis and sentence structure analysis in Korean, which is a typical agglutinative language with developed postpositions and endings. A morpheme can be defined as the smallest meaningful unit of a language, and a word consists of one or more morphemes. For example, for a word '예쁘고', the morphemes are '예쁘(= adjective)' and '고(=connective ending)'. Reflecting the significance of Korean morphemes, it seems reasonable to adopt the morphemes as a basic unit in Korean sentiment analysis. Therefore, in this study, we use 'morpheme vector' as an input to a deep learning model rather than 'word vector' which is mainly used in English text. The morpheme vector refers to a vector representation for the morpheme and can be derived by applying an existent word vector derivation mechanism to the sentences divided into constituent morphemes. By the way, here come some questions as follows. What is the desirable range of POS(Part-Of-Speech) tags when deriving morpheme vectors for improving the classification accuracy of a deep learning model? Is it proper to apply a typical word vector model which primarily relies on the form of words to Korean with a high homonym ratio? Will the text preprocessing such as correcting spelling or spacing errors affect the classification accuracy, especially when drawing morpheme vectors from Korean product reviews with a lot of grammatical mistakes and variations? We seek to find empirical answers to these fundamental issues, which may be encountered first when applying various deep learning models to Korean texts. As a starting point, we summarized these issues as three central research questions as follows. First, which is better effective, to use morpheme vectors from grammatically correct texts of other domain than the analysis target, or to use morpheme vectors from considerably ungrammatical texts of the same domain, as the initial input of a deep learning model? Second, what is an appropriate morpheme vector derivation method for Korean regarding the range of POS tags, homonym, text preprocessing, minimum frequency? Third, can we get a satisfactory level of classification accuracy when applying deep learning to Korean sentiment analysis? As an approach to these research questions, we generate various types of morpheme vectors reflecting the research questions and then compare the classification accuracy through a non-static CNN(Convolutional Neural Network) model taking in the morpheme vectors. As for training and test datasets, Naver Shopping's 17,260 cosmetics product reviews are used. To derive morpheme vectors, we use data from the same domain as the target one and data from other domain; Naver shopping's about 2 million cosmetics product reviews and 520,000 Naver News data arguably corresponding to Google's News data. The six primary sets of morpheme vectors constructed in this study differ in terms of the following three criteria. First, they come from two types of data source; Naver news of high grammatical correctness and Naver shopping's cosmetics product reviews of low grammatical correctness. Second, they are distinguished in the degree of data preprocessing, namely, only splitting sentences or up to additional spelling and spacing corrections after sentence separation. Third, they vary concerning the form of input fed into a word vector model; whether the morphemes themselves are entered into a word vector model or with their POS tags attached. The morpheme vectors further vary depending on the consideration range of POS tags, the minimum frequency of morphemes included, and the random initialization range. All morpheme vectors are derived through CBOW(Continuous Bag-Of-Words) model with the context window 5 and the vector dimension 300. It seems that utilizing the same domain text even with a lower degree of grammatical correctness, performing spelling and spacing corrections as well as sentence splitting, and incorporating morphemes of any POS tags including incomprehensible category lead to the better classification accuracy. The POS tag attachment, which is devised for the high proportion of homonyms in Korean, and the minimum frequency standard for the morpheme to be included seem not to have any definite influence on the classification accuracy.

Coreference Resolution for Korean using Mention Pair with SVM (SVM 기반의 멘션 페어 모델을 이용한 한국어 상호참조해결)

  • Choi, Kyoung-Ho;Park, Cheon-Eum;Lee, Changki
    • KIISE Transactions on Computing Practices
    • /
    • v.21 no.4
    • /
    • pp.333-337
    • /
    • 2015
  • In this paper, we suggest a Coreference Resolution system for Korean using Mention Pair with SVM. The system introduced in this paper, also be able to extract Mention from document which is including automatically tagged name entity information, dependency trees and POS tags. We also built a corpus, including 214 documents with Coreference tags, referencing online news and Wikipedia for training the system and testing the system's performance. The corpus had 14 documents from online news, along with 200 question-and-answer documents from Wikipedia. When we tested the system by corpus, the performance of the system was extracted by MUC-F1 55.68%, B-cube-F1 57.19%, and CEAFE-F1 61.75%.

A Study on Automatic Expansion of Dialogue Examples Using Logs of a Dialogue System (대화시스템의 로그를 이용한 대화예제의 자동 확충에 관한 연구)

  • Hong, Gum-Won;Lee, Jeong-Hoon;Shin, Jung-Hwi;Lee, Do-Gil;Rim, Hae-Chang
    • 한국HCI학회:학술대회논문집
    • /
    • 2009.02a
    • /
    • pp.257-262
    • /
    • 2009
  • This paper studies an automatic expansion of dialogue examples using the logs of an example-based dialogue system. Conventional approaches to example-based dialogue system manually construct dialogue examples between humans and a Chatbot, which are labor intensive and time consuming. The proposed method automatically classifies natural utterance pairs and adds them into dialogue example database. Experimental results show that lexical, POS and modality features are useful for classifying natural utterance pairs, and prove that the dialogue examples can be automatically expanded using the logs of a dialogue system.

  • PDF