• Title/Summary/Keyword: 단어 분리

Search Result 112, Processing Time 0.032 seconds

A Study on Using Rhetoric for Graphical Ideation Tools (수사법을 활용한 그래픽 발상툴 연구)

  • Han, Ki-Beom;Kim, Maeng-Ho
    • The Journal of the Korea Contents Association
    • /
    • v.16 no.10
    • /
    • pp.598-607
    • /
    • 2016
  • The purpose of this study is to suggest necessity of idea expression method suitable for people in our country and deduct Graphic Ideation method by grasping problems of existing idea expression method. The idea expression method (association stimulating method, conceptual shifting method, information combination method) used by many graphic designers is effective in suggesting initial keyword, but has difficulty in the course of deducting the concept. Though deduction of core keyword is important to develop as a concept, the course of separation, combination in keyword connection play an important role, and most of idea expression methods are unavailable for suggesting concrete method for the course. Also as most of idea expression methods were developed and delivered in English-speaking world, it is suitable in English-speaking world culture which has thinking focused on words, but people in our country, which have thinking focusing on narration, cannot consider difference in language thinking due to limitation in idea for each stage. This study deducted idea expression method suitable for emotion of people in our country by proving the value of this idea expression method with style of suggesting and demonstrating 4 hypotheses in order to make the course for easy connection, separation, combination of keyword deducted by existing idea expression method, as well as suggesting idea expression method design based on these hypotheses. This idea expression method used rhetoric so that it is suitable for people our country who are strong for narration expression.

Sentiment Analysis of Korean Reviews Using CNN: Focusing on Morpheme Embedding (CNN을 적용한 한국어 상품평 감성분석: 형태소 임베딩을 중심으로)

  • Park, Hyun-jung;Song, Min-chae;Shin, Kyung-shik
    • Journal of Intelligence and Information Systems
    • /
    • v.24 no.2
    • /
    • pp.59-83
    • /
    • 2018
  • With the increasing importance of sentiment analysis to grasp the needs of customers and the public, various types of deep learning models have been actively applied to English texts. In the sentiment analysis of English texts by deep learning, natural language sentences included in training and test datasets are usually converted into sequences of word vectors before being entered into the deep learning models. In this case, word vectors generally refer to vector representations of words obtained through splitting a sentence by space characters. There are several ways to derive word vectors, one of which is Word2Vec used for producing the 300 dimensional Google word vectors from about 100 billion words of Google News data. They have been widely used in the studies of sentiment analysis of reviews from various fields such as restaurants, movies, laptops, cameras, etc. Unlike English, morpheme plays an essential role in sentiment analysis and sentence structure analysis in Korean, which is a typical agglutinative language with developed postpositions and endings. A morpheme can be defined as the smallest meaningful unit of a language, and a word consists of one or more morphemes. For example, for a word '예쁘고', the morphemes are '예쁘(= adjective)' and '고(=connective ending)'. Reflecting the significance of Korean morphemes, it seems reasonable to adopt the morphemes as a basic unit in Korean sentiment analysis. Therefore, in this study, we use 'morpheme vector' as an input to a deep learning model rather than 'word vector' which is mainly used in English text. The morpheme vector refers to a vector representation for the morpheme and can be derived by applying an existent word vector derivation mechanism to the sentences divided into constituent morphemes. By the way, here come some questions as follows. What is the desirable range of POS(Part-Of-Speech) tags when deriving morpheme vectors for improving the classification accuracy of a deep learning model? Is it proper to apply a typical word vector model which primarily relies on the form of words to Korean with a high homonym ratio? Will the text preprocessing such as correcting spelling or spacing errors affect the classification accuracy, especially when drawing morpheme vectors from Korean product reviews with a lot of grammatical mistakes and variations? We seek to find empirical answers to these fundamental issues, which may be encountered first when applying various deep learning models to Korean texts. As a starting point, we summarized these issues as three central research questions as follows. First, which is better effective, to use morpheme vectors from grammatically correct texts of other domain than the analysis target, or to use morpheme vectors from considerably ungrammatical texts of the same domain, as the initial input of a deep learning model? Second, what is an appropriate morpheme vector derivation method for Korean regarding the range of POS tags, homonym, text preprocessing, minimum frequency? Third, can we get a satisfactory level of classification accuracy when applying deep learning to Korean sentiment analysis? As an approach to these research questions, we generate various types of morpheme vectors reflecting the research questions and then compare the classification accuracy through a non-static CNN(Convolutional Neural Network) model taking in the morpheme vectors. As for training and test datasets, Naver Shopping's 17,260 cosmetics product reviews are used. To derive morpheme vectors, we use data from the same domain as the target one and data from other domain; Naver shopping's about 2 million cosmetics product reviews and 520,000 Naver News data arguably corresponding to Google's News data. The six primary sets of morpheme vectors constructed in this study differ in terms of the following three criteria. First, they come from two types of data source; Naver news of high grammatical correctness and Naver shopping's cosmetics product reviews of low grammatical correctness. Second, they are distinguished in the degree of data preprocessing, namely, only splitting sentences or up to additional spelling and spacing corrections after sentence separation. Third, they vary concerning the form of input fed into a word vector model; whether the morphemes themselves are entered into a word vector model or with their POS tags attached. The morpheme vectors further vary depending on the consideration range of POS tags, the minimum frequency of morphemes included, and the random initialization range. All morpheme vectors are derived through CBOW(Continuous Bag-Of-Words) model with the context window 5 and the vector dimension 300. It seems that utilizing the same domain text even with a lower degree of grammatical correctness, performing spelling and spacing corrections as well as sentence splitting, and incorporating morphemes of any POS tags including incomprehensible category lead to the better classification accuracy. The POS tag attachment, which is devised for the high proportion of homonyms in Korean, and the minimum frequency standard for the morpheme to be included seem not to have any definite influence on the classification accuracy.

The effect of syntatic and pragmatic Constraints on Sentential Representaition and Memory Accessibility (통사적 제약과 화용적 제약이 문장의 표상과 기억접근에 미치는 효과)

  • Kim, Sung-Il;Lee, Jae-Ho
    • Korean Journal of Cognitive Science
    • /
    • v.6 no.2
    • /
    • pp.97-116
    • /
    • 1995
  • This study was conducted to investigate how syntaction and pragmatic constraints influence the sentential representation and memory accessibility. In order to seperate the syntactic constraints from the pragmatic constraint from the pragmatic constraints,the syntactic role of constituent in the sentence (subject or object) and the order of mention(first or second) were manipulted.After each sentence was presented by RSVP procedure,the probe recognition time was measured to investigate memory accessibility.In Experiment 1,in which SOA interval was 255ms,it was found that the subject of a sentece were more accessible than the object and participants first in a sentence were more accessible than participants mentioned later.However, in Experiment 2,in which SOA interval was 1540ms,it was found that participants mentioned first in a sentence were more accessible than participants mentioned later while there was no significant difference between the subject and object of a sentece.These results suggest that the syntactic and pragmatic constraints have an independent effect on the initial senential representation at the early stage of constructing representation,but as time passes only the pragmatic constraints influence sentential representation.These results also support a theoretical position which assumes that sentential representation is constructed through the process of convergent statisfaction of multiple constraints.

  • PDF

발효 콩 추출물의 항돌연변이원성 효과

  • 이효진;문선영;전윤영;최승필;이득식;함승시
    • Proceedings of the Korean Society of Postharvest Science and Technology of Agricultural Products Conference
    • /
    • 2003.04a
    • /
    • pp.146.2-147
    • /
    • 2003
  • 콩 발효식품은 예로부터 단백질 식품원으로서 뿐만 아니라, 식생활에서 없어서는 안되는 매우 중요한 식품 중의 하나였다. 발효식품에 대한 연구가 부진하였던 과거에는 콩 발효식품은 하나의 식품군으로서의 중요성을 가질 뿐 큰 관심의 대상은 아니었다. 그러나 최근에는 많은 연구자들이 콩 발효시 생성되는 기능성 성분 및 생리활성 효과를 점차 밝혀냄으로서 주목을 받기 시작하였다. 따라서 본 실험에서도 콩 발효에 의한 생리활성 효과를 알아보기 위해 Ames법에 의한 항돌연변이원 효과를 실험하였다. 콩 발효는 국산콩을 이용하여 메주에서 분리한 Bacillus sp. 와 Aspergillus sp.를 복합 발효시켜 동결건조 후, 분쇄하여 실험에 사용하였다. 제조된 발효 콩 분말은 일반분석을 행하였으며, 70% 에탄올로 3회 추출하여 감압농축 후, hexane, chloroform ethyl acetate, butanol 및 aqueous로 분획하여 동결 건조시킨 후, S. typhimurium TA98 및 TA100 균주를 이용한 유전자 복귀 돌연변이 시험을 실시하였다. 그 결과, 70% 에탄을 추출물과 각각의 분획물 자체의 돌연변이원성은 없었다. 또한 항돌연변이원 실험에서는 발암물질로서 직접 돌연변이원인 4NQO와 MNNG, 간접 돌연변이원인 Trp-P-1을 이용하였다. 특히 이들 발암물질 중 MNNG(0.4 $\mu\textrm{g}$/plate)의 경우 TA100 균주에서 ehtyl acetate 분획물에서 다른 분획물보다 높은 86.6%의 억제 효과를 나타내었으며, 대부분의 분획물에서도 70%이상의 억제효과를 나타내었다. 또한 각 분획물에서 농도 의존적으로 억제효과 역시 높았으며, 분획물에 따라 서로 다른 억제효과를 나타내었다.아 저장할 때 대비 저온저장고에서는 111일 동안에 11.7%의 중량감모가 발생하였으나, 신기술투입 저온저장고에서는 5.6%의 중량감모만이 발생하여 약 50%의 중량감모를 줄일 수 있었으며, 배의 색깔이나 경도도 대비구 보다 우수하였다. 4. 배를 비닐로 포장하여 대비 저온저장고에 저장한 경우와 비닐로 포장하지 않고 신기술투입 저온저장고에 저장한 경우를 비교할 때 11월~다음해 1월 까지는 중량감모, 과피색깔 및 경도에 큰 차이가 없었으나, 2월부터는 비닐로 포장하여 대비 저온저장고에 저장한 배의 품질변화가 급격히 증가되어 중량감모, 과피색깔 및 경도가 신기술 투입시 보다 급속하게 나빠졌다.를 저장 25일 경과시까지 유지하였다. 수확 시 높은 품온을 갖고 있는 과일을 산지에서 예냉 처리를 한 후 저온 냉장차를 이용하여 유통한다면 관행 유통 구조보다 고품질의 포도를 유통시킬 수 있는 것으로 사료되며 앞으로는 완숙된 고 당도(12.0~15.0Bx)$^{\circ}$ 포도를 수확 한 즉시 예냉 처리하고 저온 유통한다면 보다 신선한 과일을 소비자에게 전달 할 수 있을 것이다.갈변물질이 생성되었다. 이와 같은 결과로 볼 때, BAAG의 처리는 BAAC의 경우보다 가격은 저렴하면서도 항균력은 우수한 천연 항균복합제재로써 농산물 식품원료에 적용하여 선도유지 기간을 연장할 수 있는 효과를 기대할 수 있었다. 과일 등의 포장제로서 이용할 가능성을 확인하였다.로 [-wh] 겹의문사는 복수 의미를 지닐 수 없 다. 그러면 단수 의미는 어떻게 생성되는가\ulcorner 본 논문에서는 표면적 형태에도 불구하고 [-wh]의미의 겹의문사는 병렬적 관계의 합성어가 아니라 내부구조를 지니지 않은 단순한 단어(minimal $X^{0}$

  • PDF

A Development of Telephone for the Hearing Impaired to Improve Listening Ability of Telephone Speech (난청인의 통화 청취도 향상을 위한 전화기 개발)

  • 이상민;송철규;이영묵;김원기
    • Journal of Biomedical Engineering Research
    • /
    • v.18 no.4
    • /
    • pp.457-466
    • /
    • 1997
  • We developed a new hearing aid telephone which helps the hearing impaired person to improve the listening ability of telephone speech. Recently, the hearing impaired person and the elderly who has hearing loss have been continuously increased and their desire for participating society as a producer has been increased also. So they strong1y want the hearing aid devices which make compensation fortheir handicap. The hearing aid telephone is one of the basic aid devices that helps the hearing impaired to communicate well with other poeple and to acquire easily useful information through the phone. We analyze the hearing ability of the hearing impaired, design the new model of the hearing aid telephone and test the telephone in three fields-electrical, word perception, user test. Our new tolephone has lour band pass filter channels and the center frequencies of these filters are 500, 1000, 2000, 3000Hz which are considered psychoacoustic factors and telephone line characteristics. The hearing impaired can adjust the total gain characteristics of receiving sound to his hearing ability by setting four volumes in the telelphone. This procedure is called fitting which is a very important factor for the hearing impaired to take meaning of speech. The total gain of this telephone is over 20dB from 250Hz to 3200Hz range. From the results of the tests we certify that our new model is better for the hearing impaired to understand the meaning or telephone speech than the old general models. The next step of developing the hearing aid telephone is to study about compressing sidetone and noise, dividing frequency bands, selecting hearing aid pattern and compensating psychoacoustic loudness. we expect that the advanced hearing aid telephone can be developed by the research about speech perception characteristics of the hearing impaired in engineering and clinical side.

  • PDF

Development of Music Classification of Light and Shade using VCM and Beat Tracking (VCM과 Beat Tracking을 이용한 음악의 명암 분류 기법 개발)

  • Park, Seung-Min;Park, Jun-Heong;Lee, Young-Hwan;Ko, Kwang-Eun;Sim, Kwee-Bo
    • Journal of the Korean Institute of Intelligent Systems
    • /
    • v.20 no.6
    • /
    • pp.884-889
    • /
    • 2010
  • Recently, a music genre classification has been studied. However, experts use different criteria to classify each of these classifications is difficult to derive accurate results. In addition, when the emergence of a new genre of music genre is a newly re-defined. Music as a genre rather than to separate search should be classified as emotional words. In this paper, the feelings of people on the basis of brightness and darkness tries to categorize music. The proposed classification system by applying VCM(Variance Considered Machines) is the contrast of the music. In this paper, we are using three kinds of musical characteristics. Based on surveys made throughout the learning, based on musical attributes(beat, timbre, note) was used to study in the VCM. VCM is classified by the trained compared with the results of the survey were analyzed. Note extraction using the MATLAB, sampled at regular intervals to share music via the FFT frequency analysis by the sector average is defined as representing the element extracted note by quantifying the height of the entire distribution was identified. Cumulative frequency distribution in the entire frequency rage, using the difference in Timbre and were quantified. VCM applied to these three characteristics with the experimental results by comparing the survey results to see the contrast of the music with a probability of 95.4% confirmed that the two separate.

A Critical Review of Waterfront Maintenance Plan for Urban Development - Focused on Waterfront Development - (도시개발에 따른 친수공간 정비계획의 재고찰 - 워터프론트개발 사례를 중심으로 -)

  • Choi, Jin-Do;Park, Sung-Je;Ryu, Si-Saeng
    • Proceedings of the Korea Water Resources Association Conference
    • /
    • 2012.05a
    • /
    • pp.965-969
    • /
    • 2012
  • 도시의 개발을 통한 발전방향은 지역별 기술력을 바탕으로 시대의 흐름과 문화, 경제, 도시정책 등에 따라 다양하다. 특히, 워터프론트는 도시와 가장 근접한 친수공간으로 단어자체에 도시'라는 개념이 포함되어 있다. 즉 도시가 큰 강이나 바다, 호수 등과 접하고 있는 공간을 말한다. 우리나라는 도시계획차원에서 도시 수변공간(urban waterfront)으로 설정하고, 레크레이션, 공원, 경관형성, 환경오염 저감, 정서함양, 생산 등의 다양한 기능을 갖고 있는 매우 공공성이 높은 도시지역의 주요 공간지역으로 의미를 부여하고 있다. 그러나 워터프론트를 개발계획을 수립하는 과정에서 교통계획, 홍수 등의 재해시설, 환경 등에 대한 평가가 제대로 이루어지지 않고 있으며(한국일보, 2011), 개발 계획 추진은 주민 공감대 형성, 사업타당성 검토 없는 '밀어부치기식' 개발 지상주의로 전락하고 있다. 따라서 본 연구는 국내 워터프론트 개발사업 계획을 추진하고 있는 지역으로 인천 송도지역과 부산의 수산시장으로 유명한 자갈치시장 일대의 개발계획, 마산 도시재생지구의 항만 재개발계획 사례를 분석하여 개발계획의 문제점과 언론상에 비춰지는 개발의 현 실태, 지역개발의 효과 등에 대한 시사점을 도출하였다. 사례지역을 비롯한 대부분의 기존 워터프론트 개발은 경제성을 위해 규모가 큰 상업시설 위주로 개발이 이루어지고 있으며, 주거시설이나 문화시설을 비롯한 다양한 시설의 구성이 부족한 특수성을 갖추어 가고 있다. 또한, 기존의 유휴공간을 재개발하면서 도시와의 관계를 제대로 설정하지 못하여 도시와 분리된 폐쇄공간으로 개발이 이루어지고 있으며, 도시구조에 통합되지 못하고 있는 실정이다. 무엇보다 워터프론트 개발사업에 관계된 여러 집단들, 즉 중앙정부, 지방자치단체, 개발주체, 시민 등 이들 사이에 합의 도출의 어려움이 많아 좋은 계획안이 무산되거나 사업이 지체 혹은 중단되는 사례가 많았다. 워터프론트 개발에 대한 지역민사회의 충분한 공감대가 결코 형성되지 않았음에도 오히려 요식적 여론수렴 절차를 강조하고 있으며, 친환경적 도시개발이라는 사업의 목적과 맞지 않는 계획이 많았다. 특히, 관련 사업 중 항만재개발사업에는 막대한 초기 투자비가 소요되어 재개발사업을 위한 자금의 확보가 어려운 경우가 많이 있었으며, 도시의 장기발전계획과 통합된 장기적인 개발전략이 필요한데 이를 소홀히 하는 경우가 있었다. 본 연구를 통해 기존 워터프론트 개발계획의 문제점과 향후 정비계획이 추구해야 하는 친수구역의 관리방법, 주민참여 방안의 대안제시가 이루어 질 수 있었으며, 이를 바탕으로 국내 친수공간 정비계획을 재 고찰 할 수 있는 기회가 되었으면 한다.

  • PDF

Hangeul detection method based on histogram and character structure in natural image (다양한 배경에서 히스토그램과 한글의 구조적 특징을 이용한 문자 검출 방법)

  • Pyo, Sung-Kook;Park, Young-Soo;Lee, Gang Seung;Lee, Sang-Hun
    • Journal of the Korea Convergence Society
    • /
    • v.10 no.3
    • /
    • pp.15-22
    • /
    • 2019
  • In this paper, we proposed a Hangeul detection method using structural features of histogram, consonant, and vowel to solve the problem of Hangul which is separated and detected consonant and vowel The proposed method removes background by using DoG (Difference of Gaussian) to remove unnecessary noise in Hangul detection process. In the image with the background removed, we converted it to a binarized image using a cumulative histogram. Then, the horizontal position histogram was used to find the position of the character string, and character combination was performed using the vertical histogram in the found character image. However, words with a consonant vowel such as '가', '라' and '귀' are combined using a structural characteristic of characters because they are difficult to combine into one character. In this experiment, an image composed of alphabets with various backgrounds, an image composed of Korean characters, and an image mixed with alphabets and Hangul were tested. The detection rate of the proposed method is about 2% lower than that of the K-means and MSER character detection method, but it is about 5% higher than that of the character detection method including Hangul.

Korean Morphological Analysis Method Based on BERT-Fused Transformer Model (BERT-Fused Transformer 모델에 기반한 한국어 형태소 분석 기법)

  • Lee, Changjae;Ra, Dongyul
    • KIPS Transactions on Software and Data Engineering
    • /
    • v.11 no.4
    • /
    • pp.169-178
    • /
    • 2022
  • Morphemes are most primitive units in a language that lose their original meaning when segmented into smaller parts. In Korean, a sentence is a sequence of eojeols (words) separated by spaces. Each eojeol comprises one or more morphemes. Korean morphological analysis (KMA) is to divide eojeols in a given Korean sentence into morpheme units. It also includes assigning appropriate part-of-speech(POS) tags to the resulting morphemes. KMA is one of the most important tasks in Korean natural language processing (NLP). Improving the performance of KMA is closely related to increasing performance of Korean NLP tasks. Recent research on KMA has begun to adopt the approach of machine translation (MT) models. MT is to convert a sequence (sentence) of units of one domain into a sequence (sentence) of units of another domain. Neural machine translation (NMT) stands for the approaches of MT that exploit neural network models. From a perspective of MT, KMA is to transform an input sequence of units belonging to the eojeol domain into a sequence of units in the morpheme domain. In this paper, we propose a deep learning model for KMA. The backbone of our model is based on the BERT-fused model which was shown to achieve high performance on NMT. The BERT-fused model utilizes Transformer, a representative model employed by NMT, and BERT which is a language representation model that has enabled a significant advance in NLP. The experimental results show that our model achieves 98.24 F1-Score.

Digital Transformation: Using D.N.A.(Data, Network, AI) Keywords Generalized DMR Analysis (디지털 전환: D.N.A.(Data, Network, AI) 키워드를 활용한 토픽 모델링)

  • An, Sehwan;Ko, Kangwook;Kim, Youngmin
    • Knowledge Management Research
    • /
    • v.23 no.3
    • /
    • pp.129-152
    • /
    • 2022
  • As a key infrastructure for digital transformation, the spread of data, network, artificial intelligence (D.N.A.) fields and the emergence of promising industries are laying the groundwork for active digital innovation throughout the economy. In this study, by applying the text mining methodology, major topics were derived by using the abstract, publication year, and research field of the study corresponding to the SCIE, SSCI, and A&HCI indexes of the WoS database as input variables. First, main keywords were identified through TF and TF-IDF analysis based on word appearance frequency, and then topic modeling was performed using g-DMR. With the advantage of the topic model that can utilize various types of variables as meta information, it was possible to properly explore the meaning beyond simply deriving a topic. According to the analysis results, topics such as business intelligence, manufacturing production systems, service value creation, telemedicine, and digital education were identified as major research topics in digital transformation. To summarize the results of topic modeling, 1) research on business intelligence has been actively conducted in all areas after COVID-19, and 2) issues such as intelligent manufacturing solutions and metaverses have emerged in the manufacturing field. It has been confirmed that the topic of production systems is receiving attention once again. Finally, 3) Although the topic itself can be viewed separately in terms of technology and service, it was found that it is undesirable to interpret it separately because a number of studies comprehensively deal with various services applied by combining the relevant technologies.