• Title/Summary/Keyword: sentence similarity measurement method


Sentence Similarity Measurement Method Using a Set-based POI Data Search (집합 기반 POI 검색을 이용한 문장 유사도 측정 기법)

  • Ko, EunByul;Lee, JongWoo
    • KIISE Transactions on Computing Practices / v.20 no.12 / pp.711-716 / 2014
  • With growing interest in plagiarism detection and intelligent file content search, the demand for measuring the similarity between two sentences is increasing. Sentence similarity measurement has been studied from various directions, including n-gram, edit-distance and LSA, but each of these methods has its own advantages and disadvantages. In this paper, we propose a new sentence similarity measurement method that approaches the problem from another direction. The proposed method uses the set-based POI data search, which improves search performance over the existing hard matching method when the data includes inversion, omission, insertion and revision of characters. Using this method, we are able to measure the similarity between two sentences more accurately and more quickly. We modified the data loading and text search algorithms of the set-based POI data search, and added a word operation algorithm and a similarity measure between two sentences expressed as a percentage. The experimental results show that our sentence similarity measurement method performs better than n-gram and the set-based POI data search.
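
To make the n-gram baseline named in the abstract concrete, the following is a minimal sketch of a character n-gram similarity expressed as a percentage. It is not the paper's set-based POI method, whose details the abstract does not give; the function names, the choice of bigrams, and the use of the Dice coefficient are all assumptions made for illustration.

```python
from collections import Counter


def char_ngrams(text: str, n: int = 2) -> Counter:
    """Count overlapping character n-grams, ignoring case and whitespace."""
    s = "".join(text.lower().split())
    return Counter(s[i:i + n] for i in range(len(s) - n + 1))


def ngram_similarity(a: str, b: str, n: int = 2) -> float:
    """Dice coefficient over n-gram multisets, as a percentage in [0, 100]."""
    ga, gb = char_ngrams(a, n), char_ngrams(b, n)
    total = sum(ga.values()) + sum(gb.values())
    if total == 0:
        # Both inputs are shorter than n characters: fall back to exact match.
        same = "".join(a.lower().split()) == "".join(b.lower().split())
        return 100.0 if same else 0.0
    shared = sum((ga & gb).values())  # multiset intersection
    return 200.0 * shared / total


print(ngram_similarity("night", "nacht"))  # one shared bigram ("ht") out of 4 + 4
```

Because the score depends only on shared character sequences, a single inserted or omitted character lowers it gradually rather than to zero, which is the kind of tolerance to character-level changes the abstract is concerned with.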

Implementation of A Plagiarism Detecting System with Sentence and Syntactic Word Similarities (문장 및 어절 유사도를 이용한 표절 탐지 시스템 구현)

  • Maeng, Joosoo;Park, Ji Su;Shon, Jin Gon
    • KIPS Transactions on Software and Data Engineering / v.8 no.3 / pp.109-114 / 2019
  • The similarity detection method used in most plagiarism detection systems relies on the frequency of shared words obtained through morphological analysis. However, this method is limited in how accurately it can detect the degree of similarity, especially when similar words concerning the same topic are used, when sentences are excerpted partially and separately, or when the postpositions and endings of words are similar. To overcome this problem, we have designed and implemented a plagiarism detection system that provides more reliable similarity information by measuring sentence similarity and syntactic word similarity in addition to the conventional word similarity. We compared our system with a conventional system that uses only word similarity. The comparative experiment has shown that our system can detect plagiarized documents whether or not the conventional system can detect them.

Implementation of a Spam Message Filtering System using Sentence Similarity Measurements (문장유사도 측정 기법을 통한 스팸 필터링 시스템 구현)

  • Ou, SooBin;Lee, Jongwoo
    • KIISE Transactions on Computing Practices / v.23 no.1 / pp.57-64 / 2017
  • Short message service (SMS) is one of the most important communication methods for people who use mobile phones. However, illegal advertising spam messages exploit people because SMS can be used without the need for friend registration. Recently, spam message filtering systems that use machine learning have been developed, but they have disadvantages such as requiring many calculations. In this paper, we implemented a spam message filtering system that uses the set-based POI search algorithm and sentence similarity and does not require servers. This algorithm can judge whether the input query is a spam message using only its letter composition, without any server computing. Therefore, we can filter a spam message even when the input text has been intentionally modified. We added a specific preprocessing option aimed at enabling spam filtering. The experimental results show that our spam message filtering system performs better than the original set-based POI search algorithm. We evaluated the proposed system through extensive simulation. According to the simulation results, the proposed system filters text messages with high accuracy, including messages that cannot be filtered by the 3 major telecom companies.
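
The idea of judging a message from its letter composition alone, after a preprocessing step, can be sketched as follows. This is an illustrative stand-in, not the set-based POI search algorithm from the paper: the preprocessing rule, the bigram-set Jaccard score, the threshold value and all function names are assumptions.

```python
import re


def normalize(msg: str) -> str:
    """Hypothetical preprocessing: lowercase and drop everything but letters and digits."""
    return re.sub(r"[\W_]+", "", msg.lower())


def bigram_set(msg: str) -> set:
    """Set of character bigrams of the normalized message."""
    s = normalize(msg)
    return {s[i:i + 2] for i in range(len(s) - 1)}


def jaccard(a: set, b: set) -> float:
    """Jaccard similarity of two sets; 0.0 when both are empty."""
    if not a and not b:
        return 0.0
    return len(a & b) / len(a | b)


def is_spam(msg: str, known_spam: list, threshold: float = 0.5) -> bool:
    """Flag msg if it is similar enough to any known spam sample (assumed threshold)."""
    grams = bigram_set(msg)
    return any(jaccard(grams, bigram_set(s)) >= threshold for s in known_spam)


known = ["free loan call now"]
print(is_spam("F.r.e.e  l-o-a-n!! call now", known))  # obfuscated copy of a known sample
print(is_spam("see you at lunch tomorrow", known))    # unrelated message
```

Everything here runs locally on the text itself, which mirrors the abstract's point that no server-side computation is needed; how the actual system stores and searches its spam samples is not stated in the abstract.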

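A Study on the 'Doctrine on Five Elements' Motion and Six Kinds of Natural Factors' (運氣學說) through Wang Bing's Commentary (王氷 注本) on 'The Seven Great Chapters' of 'The Yellow Emperor's Internal Classic Su Wen' (『황제내경소문(黃帝內經素問)·칠편대론(七篇大論)』 왕빙 주본(注本)을 통(通)한 운기학설(運氣學說) 관(關)한 연구(硏究))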

  • Kim, Gi-Uk;Park, Hyeon-Guk
    • The Journal of Dong Guk Oriental Medicine / v.4 / pp.109-140 / 1995
  • As considered in the main text, our investigation of the 'Doctrine on five elements' motion and six kinds of natural factors(運氣學說)' through 'Wang Bing's Commentary(王氷 注本)' on 'The seven great chapters in The Yellow Emperor's Internal Classic Su Wen'("黃帝內經素問 七篇大論") yields the following. (1) Regarding the theory that Wang Bing supplemented 'The seven great chapters("七篇大論")' and his scholarship as an interpreter: judging from the character 'forget(亡)' used for 'The missing chapters("素問遺篇")', 'Bonbyung-ron("本病論")' and 'Jabeob-ron("刺法論")', 'The seven great chapters("七篇大論")' must be a supplementary work by Wang Bing. In addition, his commentary quotes some forty books, including medical, Taoist, Confucian and miscellaneous books, and quotations from the 'Su Wen("素問")' and 'Ling Shu("靈樞")' scriptures make up most of it. Following the method of interpreting scripture by scripture, he edited the order of the 'Internal Classic("內經")' handed down from ancient times, and when he supplemented the commentary he did so with an exhaustive scholarly mind, observing natural phenomena in practice and writing down the pathology and the methods of treatment. We found that the book incorporates the study of the 'Doctrine on five elements' motion and six kinds of natural factors(運氣學說)'. (2) When we compare and analyze the similar phrases of 'The seven great chapters in The Yellow Emperor's Internal Classic Su Wen'("黃帝內經素問·七篇大論") through 'Wang Bing's Commentary(王氷 注本)', 'The seven great chapters("七篇大論")' describe the 'five elements(五行)' and 'heaven's regular movement(天道運行)' in a more organized way than 'Emyangengsangdae-ron("陰陽應象大論")'. Also, in the 'Ounhangdae-ron("五運行大論")', the sentences repeated from 'Emyangengsangdae-ron("陰陽應象大論")' are omitted because they are long.
And in the 'Youkmijidae-ron("六微旨大論")', the 'Cheonjin ideology(天眞思想)' based on the 'Sanggocheonjin-ron("上古天眞論")' and 'Sagijosindae-ron("四氣調神大論")' is described; in the 'Gikyobyeondae-ron("氣交變大論")', syndromes and symptoms are explained in more detail than in 'Janggibeobsi-ron("藏氣法時論")' and 'Okgijinjang-ron("玉機眞藏論")'; in the 'Osangieongdae-ron("五常政大論")', the concept of the 'five elements(五行)' from the 'Gemgwejineon-ron("金櫃眞言論")' is expanded into 'the five elements' motion concept(五運槪念)'; and in the 'Youkwonjeonggidae-ron("六元正紀大論")', the function of 'The five elements' motion and six kinds of natural factors(運氣)' is mainly explained, whereas, unlike 'Emyangengsangdae-ron("陰陽應象大論")', no systematic pathology is presented. In the 'Jijinyodae-ron("至眞要大論")', the changes of atmosphere that correspond to treatment principles are explained using 'The three Yin and Yang(三陰三陽)' as a more advanced concept. Therefore there is much similarity between the phrases of 'Emyangengsangdae-ron("陰陽應象大論")' and the 'chapters of addition(補缺之篇)'. In general, the doctrine that 'The seven great chapters("七篇大論")' were added by Wang Bing(王氷) is supported because they contain more profound concepts than the other chapters. (3) When we study Wang Bing's(王氷) 'Pattern on five elements motion and six kinds of natural factors(運氣格局)' in 'The seven great chapters("七篇大論")': in the 'Cheonwongi-dae-ron("天元紀大論")', together with the 'Cheonjin ideology(天眞思想)', the concepts of 'Owang(旺)'·'Sang(相)'·'Sa(死)'·'Su(囚)'·'Hu(休)' and 'Cheonbu(天符)'·'Sehwoi(歲會)' are measured in time and space against the concept of 'Three Sum(三合)', and the concept of 'Taeulcheonbu(太乙天符)' is explained.
In the 'Ounhangdae-ron("五運行大論")', 'The calendar Signs five Sum(天干五合)' is compared to the concepts of 'couples(夫婦)' and 'weak-strong(柔强)', and in the 'Youkmijidae-ron("六微旨大論")', 'the relationship of obedience and disobedience(順逆關係)', which follows changes in 'energy status(氣位)' and the 'monarch-minister(君相)' position, is mentioned. In the 'Gikyobyeondae-ron("氣交變大論")', the concepts of 'Sang-duk(相得)' and 'Pyungsang(平常)' are emphasized but concrete measurement is mentioned. In the 'Osangieongdae-ron("五常政大論")', a detailed explanation of the twenty-three forms of the 'system of the five elements' motion(五運體系)' and of 'routine-contrary treatment(正治·反治)' with 'chill-fever-warm-cold(寒·熱·溫·凉)' is given according to 'analysing and differentiating pathological conditions in accordance with the eight principal syndromes(八綱辨證)'. In the 'Youkwonjeonggidae-ron("六元正紀大論")', Wang Bing does not mention the concept of 'Jungwun(中運)' that is seen in the original classic. Since the concepts of 'Jungwun, Dongcheonbu, Dongsehae and Taeulcheonbu(中運, 同天符, 同歲會, 太乙天符)' appear in the new corrective edition, Wang Bing seems to have used only the concepts of 'Daewun, Juwun, and Gaekwun(大運, 主運, 客運)'. In the 'Jijinyodae-ron("至眞要大論")', Wang Bing added detailed commentary on pathology and treatment doctrine by explaining the numerous manifestations of 'Sebo, sufficiency, deficiency(歲步, 有餘, 不足)', and on the relation of 'victory-defeat(勝復)' he argued clearly that it is not a matter of mechanical estimation.
(4) As for Wang Bing's originality in the study of 'the theory of Doctrine on five elements' motion and six kinds of natural factors(運氣學說)', he emphasized 'The idea of Jeongindogi and Health preserving(全眞導氣·養生思想)' by adding 'Wang Bing's Commentary(王氷 注本)' to 'The seven great chapters("七篇大論")', explained 'The theory of Doctrine on five elements' motion and six kinds of natural factors(運氣學說)' clearly, simplified and expanded the meaning of 'man, as a microcosm, is connected with the macrocosm(天人相應)', and with the 'Atmosphere theory(大氣論)' also explained the meaning of the 'rising and falling mechanism(升降氣機)'. On the sentence 'By examining the pathology, take care of your health(審察病機 無失氣宜)', he explained the pathology of 'heart-kidney-water-fire(心腎水火)' and suggested the doctrine and management of prescription. In estimation and treatment, by proposing a two-way estimation of 'asthenia and sthenia(虛實)', he demonstrated 'contrary treatment(反治)', the treatment principle of 'falling heart fire tonifying kidney water(降心火益腎水)', and the 'two class of chill and fever(寒熱二綱)'. The sentence 'inside and outside in the illness and so inner and outer in the treatment(病有中外 治有表裏)' suggests the 'two class of superficies and interior(表裏二綱)' according to the position of disease. Thus Wang Bing, as an excellent theorist, introduced the 'Cheonjin ideology(天眞思想)' and, as a clinician, put medical science into practice. With these accomplishments, mainly written in 'The theory of Doctrine on five elements' motion and six kinds of natural factors(運氣學說)' of 'The seven great chapters("七篇大論")', he interpreted the ancient medical scriptures, expanded their meaning, and ultimately contributed to the development of the study of 'Korean Oriental Medicine(韓醫學)'.
