• Title/Summary/Keyword: number word

Search Result 697, Processing Time 0.024 seconds

Performance Improvement of Context-Sensitive Spelling Error Correction Techniques using Knowledge Graph Embedding of Korean WordNet (alias. KorLex) (한국어 어휘 의미망(alias. KorLex)의 지식 그래프 임베딩을 이용한 문맥의존 철자오류 교정 기법의 성능 향상)

  • Lee, Jung-Hun;Cho, Sanghyun;Kwon, Hyuk-Chul
    • Journal of Korea Multimedia Society
    • /
    • v.25 no.3
    • /
    • pp.493-501
    • /
    • 2022
  • This paper is a study on context-sensitive spelling error correction and uses the Korean WordNet (KorLex)[1] that defines the relationship between words as a graph to improve the performance of the correction[2] based on the vector information of the word embedded in the correction technique. The Korean WordNet replaced WordNet[3] developed at Princeton University in the United States and was additionally constructed for Korean. In order to learn a semantic network in graph form or to use it for learned vector information, it is necessary to transform it into a vector form by embedding learning. For transformation, we list the nodes (limited number) in a line format like a sentence in a graph in the form of a network before the training input. One of the learning techniques that use this strategy is Deepwalk[4]. DeepWalk is used to learn graphs between words in the Korean WordNet. The graph embedding information is used in concatenation with the word vector information of the learned language model for correction, and the final correction word is determined by the cosine distance value between the vectors. In this paper, In order to test whether the information of graph embedding affects the improvement of the performance of context- sensitive spelling error correction, a confused word pair was constructed and tested from the perspective of Word Sense Disambiguation(WSD). In the experimental results, the average correction performance of all confused word pairs was improved by 2.24% compared to the baseline correction performance.

An improved spectrum mapping applied to speaker adaptive Kroean word recognition

  • Matsumoto, Hiroshi;Lee, Yong-Ju;Kim, Hoi-Rim;Kido, Ken'iti
    • Proceedings of the Acoustical Society of Korea Conference
    • /
    • 1994.06a
    • /
    • pp.1009-1014
    • /
    • 1994
  • This paper improves the previously proposed spectral mapping method for supervised speaker adaptation in which a mapped spectrum is interpolated from speaker difference vectors at typical spectra based on a minimized distortion criterion. In estimating these difference vectors, it is important to find an appropriate number of typical points. The previous method empirically adjusts the number of typical points, while the present method optimizes the effective number by rank reduction of normal equation. This algorithm was applied to a supervised speaker adaptation for Korean word recognition using the templates form a prototype male speaker. The result showed that the rank reduction technique not only can automatically determine an optimal number of code vectors, but also slightly improves the recognition scores compared with those obtained by the previous method.

  • PDF

A Study on the eWOM and Selecting Movie According to Online Media and Replies (온라인 매체와 댓글에 따른 영화 구전의도 및 관람의도에 관한 연구)

  • Yu, Dengsheng;Lim, Gyoo Gun
    • Journal of Information Technology Services
    • /
    • v.14 no.2
    • /
    • pp.177-193
    • /
    • 2015
  • A great number of customers, who want to watch movies usually check out online reviews before choosing what to watch a movie. The most representative online media that customers consult are portal sites and SNS (Social Network Service). Although there have been numerous studies on online eWOM (e-Word of Mouth) and the effects of online media in businesses, it remains a question that which media is best for WOM (Word of Mouth) when selecting movies. This research examines customer's intention for consulting eWOM and for watching movies according to the number and tendency of online replies. We have compared portal sites and SNS about information of movie. The study shows that a large number of positive replies can affect the intention for WOM and choosing movies. Facebook has more influence than portal sites when choosing what to watch when replies consist of large and positive comments. However, there is no difference between the two types of media when they consist of negative comments.

An Exploratory Study on the Effects of Psychological Distance and Message Type on Word-of-Mouth in Firm's Facebook (회사 페이스북 메시지의 심리적 거리와 메시지 유형이 구전에 미치는 영향에 대한 탐색적 연구)

  • Lee, Seongwon
    • The Journal of Information Systems
    • /
    • v.29 no.2
    • /
    • pp.71-94
    • /
    • 2020
  • Purpose With the development of Social Network Service(SNS) and mobile devices, many companies have been using the Facebook as a Word-of-Mouth(WOM) channel. This study examines the effects of psychological distance and message type on WOM using the Facebook's real messages. And the moderating effect of the message type on the relationship between psychological distance and WOM was also analyzed. Design/methodology/approach A content analysis was used as a research method. A total 7,483 messages were collected from 50 companies' Facebook Fanpage (based on the ranking of socialbakers.com) and content analysis was conducted using human coding. As the influencing variables, the message type and psychological distance and the number of 'Likes', 'Share', and 'Comment' were used as the dependent variable. The R3.4.4 was used to perform descriptive statistics, cross-tab analysis, and analysis of variance(ANOVA). Findings First, a larger proportion of Facebook messages have close psychological distance for all message types(information, advertisement, event, and customer relationship). Second, 'Like' and 'Comment' number were significantly higher in messages of close psychological distance. Third, the effects of psychological distance on 'Like', 'Share', and 'Comment' number were different according to message type. However, 'advertisement' message type had significantly more numbers for all WOMs('Like', 'Share', and 'Comment') in messages with close psychological distance.

A Word Line Ramping Technique to Suppress the Program Disturbance of NAND Flash Memory

  • Lee, Jin-Wook;Lee, Yeong-Taek;Taehee Cho;Lee, Seungjae;Kim, Dong-Hwan;Wook-Ghee, Hahn;Lim, Young-Ho;Suh, Kang-Deog
    • JSTS:Journal of Semiconductor Technology and Science
    • /
    • v.1 no.2
    • /
    • pp.125-131
    • /
    • 2001
  • When the program voltage is applied to a word line, a part of the boosted channel charge in inhibited bit lines is lost due to the coupling between the string select line (SSL) and the adjacent word line. This phenomenon causes the program disturbance in the cells connected to the inhibited bit lines. This program disturbance becomes more serious, as the word line pitch is decreased. To reduce the word line coupling, the rising edge of the word-line voltage waveform was changed from a pulse step into a ramp waveform with a controlled slope. The word-line ramping circuit was composed of a timer, a decoder, a 8 b D/A converter, a comparator, and a high voltage switch pump (HVSP). The ramping voltage was generated by using a stepping waveform. The rising time and the stepping number of the word-line voltage for programming were set to $\mutextrm{m}-$ and 8, respectively,. The ramping circuit was used in a 512Mb NAND flash memory fabricated with a $0.15-\mutextrm{m}$ CMOS technology, reducing the SSL coupling voltage from 1.4V into a value below 0.4V.

  • PDF

A Study on the Reduction of Common Words to Classify Causes of Marine Accidents (해양사고 원인을 분류하기 위한 공통단어의 축소에 관한 연구)

  • Yim, Jeong-Bin
    • Journal of Navigation and Port Research
    • /
    • v.41 no.3
    • /
    • pp.109-118
    • /
    • 2017
  • The key word (KW) is a set of words to clearly express the important causations of marine accidents; they are determined by a judge in a Korean maritime safety tribunal. The selection of KW currently has two main issues: one is maintaining consistency due to the different subjective opinion of each judge, and the second is the large number of KW currently in use. To overcome the issues, the systematic framework used to construct KW's needs to be optimized with a minimal number of KW's being derived from a set of Common Words (CW). The purpose of this study is to identify a set of CW to develop the systematic KW construction frame. To fulfill the purpose, the word reduction method to find minimum number of CW is proposed using P areto distribution function and Pareto index. A total of 2,642 KW were compiled and 56 baseline CW were identified in the data sets. These CW, along with their frequency of use across all KW, are reported. Through the word reduction experiments, an average reduction rate of 58.5% was obtained. The estimated CW according to the reduction rates was verified using the Pareto chart. Through this analysis, the development of a systematic KW construction frame is expected to be possible.

A postprocessing method for korean optical character recognition using eojeol information (어절 정보를 이용한 한국어 문자 인식 후처리 기법)

  • 이영화;김규성;김영훈;이상조
    • Journal of the Korean Institute of Telematics and Electronics C
    • /
    • v.35C no.2
    • /
    • pp.65-70
    • /
    • 1998
  • In this paper, we will to check and to correct mis-recognized word using Eojeol information. First, we divided into 16 classes that constituents in a Eojeol after we analyzed Korean statement into Eojeol units. Eojeol-Constituent state diagram constructed these constitutents, find the Left-Right Connectivity Information. As analogized the speech of connectivity information, reduced the number of cadidate words and restricted case of morphological analysis for mis-recognition Eojeol. Then, we improved correction speed uisng heuristic information as the adjacency information for Eojeol each other. In the correction phase, construct Reverse-Order Word Dictionary. Using this, we can trace word dictionary regardless of mis-recongnition word position. Its results show that improvement of recognition rate from 97.03% to 98.02% and check rate, reduction of chadidata words and morpholgical analysis cases.

  • PDF

A Study on the Vowel Duration of the Buckeye Corpus (벅아이 코퍼스의 모음 길이 연구)

  • Chung, Hyejung;Yoon, Kyuchul
    • Phonetics and Speech Sciences
    • /
    • v.7 no.4
    • /
    • pp.103-110
    • /
    • 2015
  • The purpose of this study is to assess the vowel property by examining the vowel duration of the American English vowles found in the Buckeye corpus[6]. The vowel durations were analyzed in terms of various linguistic factors including the number of syllables of the word containing the vowel, the location of the vowel in a word, types of stress, function versus content word, the word frequency in the corpus and the speech rate calculated from the three consecutive words. The findings from this work agreed mostly with those from earlier studies, but with some exceptions. The relationship between the speech rate and the vowel duration proved non-linear.

Development of Spatio-Temporal Neural Network for Connected Korean Digits Recognition (한국어 연결 숫자음 인식을 위한 시공간 신경회로망의 개발)

  • 이종식
    • Proceedings of the Acoustical Society of Korea Conference
    • /
    • 1995.06a
    • /
    • pp.69-72
    • /
    • 1995
  • In this paper, a new approach for Korean connected digits recognition using the spatio-temporal neural network is reported. The data of seven digits phone numbers are used in the recognition of connected words, and in the initial experiment, digit recognition rate of 28% was achieved. In this paper, to increase recognition rate, two different approaches are analyzed. In the first system, to compensate the STNN's own defect and to emphasize the Korean word's phonic characters, the starting point of phone is pointed by comparing the average magnitude and zero-crossing rate and the ending point is pointed by comparing only zero-crossing rate. The digit recoginiton rate increased to 61%. Also, in the second system, to consider fact that same word's phone is varied severally, the number of STNN's of each word is increased from one to five, and then the varied same word's phones can be included to the increased STNN's. The digit recogniton rate of connected words increased to 89%.

  • PDF

An Analysis on Strategies and Errors in Word Problems of Linear Equation for Middle School Students (중학생들의 일차 방정식에 관한 문장제 해결 전략 및 오류 분석)

  • 이정은;김원경
    • The Mathematical Education
    • /
    • v.38 no.1
    • /
    • pp.77-85
    • /
    • 1999
  • In this paper, we analyze strategies and error patterns in solving word problems of linear equation for middle school students. From a test conducted to the sampled 106 second grade middle school students, we obtain the following results: (1)The most difficult types of word problem are velosity and density related problems. The second one is length related problems and the easist one is number related problems. (2)Regardless of the types of word problem, the most familiar strategy is the constructing algebraic equations. However, the most successful strategy is the trial and error. (3)Most likely error patterns are the use of inadequate formulas and wrong trial and errors. Based on these results, a teaching program with various schema is developed and shown to be effective for mid level students in classroom.

  • PDF