• Title/Summary/Keyword: Global Dictionary

Person Re-identification using Sparse Representation with a Saliency-weighted Dictionary

  • Kim, Miri;Jang, Jinbeum;Paik, Joonki
    • IEIE Transactions on Smart Processing and Computing / v.6 no.4 / pp.262-268 / 2017
  • Intelligent video surveillance systems have been developed to monitor global areas and find specific target objects using a large-scale database. However, person re-identification presents challenges such as pose change and occlusion. To address these problems, this paper presents an improved person re-identification method using sparse representation and saliency-based dictionary construction. The proposed method consists of three parts: i) feature description based on salient colors and textures for dictionary elements, ii) orthogonal atom selection using cosine similarity to deal with pose and viewpoint change, and iii) measurement of reconstruction error to rank the gallery entries corresponding to a probe object. The proposed method performs well because robust descriptors, used as dictionary atoms, are generated by weighting salient features, and because dictionary atoms are selected so as to reduce the excessive redundancy that causes low accuracy. The proposed method can therefore be applied in a large-scale database surveillance system to search for a specific object.
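
Step ii) of the abstract above, atom selection by cosine similarity, can be illustrated with a minimal sketch. It assumes a greedy rule that drops any candidate whose absolute cosine similarity to an already kept atom reaches a threshold; the function names, the threshold value, and the toy vectors are illustrative assumptions, not details taken from the paper.

```python
import math


def cosine_similarity(u, v):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def select_atoms(candidates, max_similarity=0.9):
    """Greedily keep candidates that are not nearly parallel to any kept atom.

    max_similarity is a hypothetical threshold chosen for illustration.
    """
    selected = []
    for vec in candidates:
        if all(abs(cosine_similarity(vec, kept)) < max_similarity for kept in selected):
            selected.append(vec)
    return selected


# Toy feature vectors (made up for the example).
candidates = [
    [1.0, 0.0, 0.0],
    [0.99, 0.05, 0.0],  # nearly parallel to the first vector, so it is dropped
    [0.0, 1.0, 0.0],
    [0.0, 0.0, 2.0],
]
atoms = select_atoms(candidates)
```

A lower threshold gives a more nearly orthogonal dictionary with fewer atoms; a higher one keeps more redundancy.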

An Efficient Preprocessing System for Searching Similar Texts among Massive Document Repository (대용량 문서 집합에서 유사 문서 탐색을 위한 효과적인 전처리 시스템의 설계)

  • Park, Sun-Young;Kim, Ji-Hun;Kim, Seon-Yeong;Kim, Hyung-Joon;Cho, Hwan-Gue
    • Journal of KIISE:Computing Practices and Letters / v.16 no.5 / pp.626-630 / 2010
  • Since paper plagiarism has become an important social issue, it is necessary to develop a system for measuring the similarity between papers. The speed and accuracy of such a system are very important, and many researchers are studying them. In this paper, we propose a preprocessing method using a 'Global Dictionary' model to enhance the performance of the system. The global dictionary includes information on all words in the document repository. The system uses the model to find similar papers with low computing time. Our experiment showed that a set of more than 20,000 documents could be reduced to about 50 documents by our filtering techniques, which demonstrates the effectiveness of our system.
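
As a rough illustration of how a repository-wide word dictionary can shrink the candidate set before an expensive similarity comparison, the sketch below builds an inverted index and keeps only documents that share a minimum number of distinct words with the query. The abstract does not specify the filtering rule, so the `min_shared` criterion, the function names, and the sample documents are all assumptions.

```python
from collections import defaultdict


def build_global_dictionary(documents):
    """Map each word to the set of document ids that contain it."""
    dictionary = defaultdict(set)
    for doc_id, text in documents.items():
        for word in set(text.lower().split()):
            dictionary[word].add(doc_id)
    return dictionary


def candidate_documents(query_text, dictionary, min_shared=2):
    """Return ids of documents sharing at least min_shared distinct words with the query.

    min_shared is a hypothetical filtering criterion chosen for illustration.
    """
    shared = defaultdict(int)
    for word in set(query_text.lower().split()):
        for doc_id in dictionary.get(word, ()):
            shared[doc_id] += 1
    return sorted(doc_id for doc_id, count in shared.items() if count >= min_shared)


# Toy repository (made up for the example).
documents = {
    "d1": "sparse coding for image retrieval",
    "d2": "plagiarism detection in student papers",
    "d3": "fast plagiarism detection for large document sets",
}
global_dict = build_global_dictionary(documents)
candidates = candidate_documents(
    "detecting plagiarism in large document collections", global_dict
)
```

Only the surviving candidates would then be passed to a detailed pairwise similarity measure.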

A Study on the Academic vocabulary Education for Content-Based Korean Language Education: A Basic Study for Online Dictionary Development

  • Hwang, Shung-eun
    • Journal of the Korea Society of Computer and Information / v.25 no.2 / pp.67-74 / 2020
  • In this paper, we propose developing an online academic vocabulary dictionary as a way of teaching academic vocabulary in content-based Korean language education. Learners encounter a variety of academic language in the content-based Korean language teaching materials they use when studying at university. They cannot understand or produce academic text without knowing the academic vocabulary. Therefore, one of the tasks of Korean language education is to improve educational efficiency by preparing a method of academic vocabulary education that best suits learners' own circumstances. Prior to the development of the online academic vocabulary dictionary, a basic study was conducted on what content the online dictionary should contain. Online academic vocabulary dictionaries allow students to naturally connect their limited classroom learning with learning outside the classroom, thereby overcoming the limitations of vocabulary education in educational settings and maximizing educational effectiveness.

Symbolizing Numbers to Improve Neural Machine Translation (숫자 기호화를 통한 신경기계번역 성능 향상)

  • Kang, Cheongwoong;Ro, Youngheon;Kim, Jisu;Choi, Heeyoul
    • Journal of Digital Contents Society / v.19 no.6 / pp.1161-1167 / 2018
  • The development of machine learning has enabled machines to perform delicate tasks that only humans could do, and thus many companies have introduced machine-learning-based translators. Existing translators perform well, but they have problems translating numbers. They often mistranslate numbers when the input sentence includes a large number. Furthermore, the structure of the output sentence can change completely even if only one number in the input sentence changes. In this paper, we first optimized a neural machine translation model architecture that uses a bidirectional RNN, LSTM, and the attention mechanism through data cleansing and by changing the dictionary size. We then implemented a number-processing algorithm specialized in number translation and applied it to the neural machine translation model to solve the problems above. The paper includes the data cleansing method, an optimal dictionary size, and the number-processing algorithm, as well as experimental results for translation performance based on the BLEU score.
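
One common way to keep numbers out of a translation model's vocabulary is to replace them with placeholder symbols before translation and restore them afterwards. The sketch below shows that general idea only; the `<NUMk>` token format, the regular expression, and the function names are assumptions, and the paper's actual number-processing algorithm may differ.

```python
import re

# Matches integers and numbers with "," or "." group separators, e.g. 1,250,000 or 3.14.
NUMBER_PATTERN = re.compile(r"\d+(?:[.,]\d+)*")


def symbolize_numbers(sentence):
    """Replace each number with an indexed placeholder and return the removed numbers."""
    numbers = []

    def replace(match):
        numbers.append(match.group(0))
        return f"<NUM{len(numbers) - 1}>"

    return NUMBER_PATTERN.sub(replace, sentence), numbers


def restore_numbers(sentence, numbers):
    """Put the original numbers back in place of their placeholders."""
    return re.sub(r"<NUM(\d+)>", lambda m: numbers[int(m.group(1))], sentence)


source = "The company sold 1,250,000 units in 2018."
symbolized, numbers = symbolize_numbers(source)
# A translation model (not shown) would translate `symbolized` here.
restored = restore_numbers(symbolized, numbers)
```

Because the model sees only a small set of placeholder tokens, changing a number in the input no longer changes the token sequence it has to translate.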

A Study on the Language Independent Dictionary Creation Using International Phoneticizing Engine Technology (국제 음소 기술에 의한 언어에 독립적인 발음사전 생성에 관한 연구)

  • Shin, Chwa-Cheul;Woo, In-Sung;Kang, Heung-Soon;Hwang, In-Soo;Kim, Suk-Dong
    • The Journal of the Acoustical Society of Korea / v.26 no.1E / pp.1-7 / 2007
  • One result of the trend towards globalization is an increased number of projects that focus on natural language processing. Automatic speech recognition (ASR) technologies, for example, hold great promise in facilitating global communications and collaborations. Unfortunately, to date, most research projects focus on single widely spoken languages. Therefore, the cost to adapt a particular ASR tool for use with other languages is often prohibitive. This work takes a more general approach. We propose an International Phoneticizing Engine (IPE) that interprets input files supplied in our Phonetic Language Identity (PLI) format to build a dictionary. IPE is language independent and rule based. It operates by decomposing the dictionary creation process into a set of well-defined steps. These steps reduce rule conflicts, allow for rule creation by people without linguistics training, and optimize run-time efficiency. Dictionaries created by the IPE can be used with the Sphinx speech recognition system. IPE defines an easy-to-use systematic approach that can lead to internationalization of automatic speech recognition systems.

The Change of the Concept and Meaning of Bulgogi in Cookery Book & Dictionary (문헌에 나타난 불고기의 개념과 의미 변화)

  • Lee, Kyou-Jin;Cho, Mi-Sook
    • Journal of the Korean Society of Food Culture / v.25 no.5 / pp.508-515 / 2010
  • The purpose of this research was to investigate the transition of the concept and meaning of "bulgogi". "Bulgogi" is a representative Korean food and is also a global menu item. The first dictionary that presented the word "bulgogi" was the Keunsajeon (big dictionary). The results of an analysis of 17 dictionaries published in the last 60 years showed the immutable definition of "neobiani" as seasoned and broiled beef. In contrast, "bulgogi" has been defined differently, from "simply grilled meat of an animal" to the same meaning as that of "neobiani". Furthermore, to define the difference between common grilled meat in the modern era and at present, 26 cookery books, from Sieuijeanseo, written in the late 1800s, to The Taste of Korea, written in 1987, were selected and examined. To date, the first known appearance of the word "bulgogi" in a cookbook is in Practice in Higher Cuisine, written by Shin-young Bang in 1958. The book states that "bulgogi" is the second name or the vulgar designation of "neobiani".

The Politics of Global English

  • Damrosch, David
    • Journal of English Language & Literature / v.60 no.2 / pp.193-209 / 2014
  • Writers in England's colonies and former colonies have long struggled with the advantages and disadvantages of employing the language of the colonizer for their creative work, an issue that today reaches beyond the older imperial trade routes in the era of "global English." Creative writers in widely disparate locations are now using global English to their advantage, with what can be described as post-postcolonial strategies. This essay explores the politics of global English, beginning with a satiric dictionary of "Strine" (Australian English) from 1965, and then looking back at the mid-1960s debate at Makerere University between Ngugi wa Thiong'o and Chinua Achebe, in which Achebe famously asserted the importance of remaking English for his own purposes. The essay then discusses early linguistic experiments by Rudyard Kipling, who became the world's first truly global writer in the 1880s and 1890s and developed a range of strategies for conveying local experience to a global audience. The essay then turns to two contemporary examples: a comic pastiche of Kipling, and of Kiplingese, by the contemporary Tibetan writer Jamyang Norbu, who deploys "Babu English" and the legacy of British rule against Chinese encroachment in Tibet; and, finally, the Korean-American internet group Young-hae Chang Heavy Industries, who interweave African-American English with North Korean political rhetoric to hilariously subversive effect.

Developing the Customer Quality Satisfaction Index Using Online Reviews: Case Study of TV (리뷰를 활용한 고객 품질 만족도 지수 개발 : TV 사례연구)

  • Jiye, Shin;Heesoo, Kim;Jaiho, Lee;Hyoungwoo, Jeon;Jeongsik, Ahn;Sunghoon, Hwang
    • Journal of Korean Society for Quality Management / v.50 no.4 / pp.863-876 / 2022
  • Purpose: The purpose of this study is to propose a product quality satisfaction index based on multiple linear regression using customer reviews. Methods: The proposed framework is composed of four steps. First, we collect online reviews and divide them into insight phrases. The insight phrases are classified using a product attribute dictionary, and sentiment analysis is conducted. Second, the importance of each attribute is calculated from both its regression coefficient and its frequency. Third, the positive rate is calculated from the sentiment analysis result. Finally, the quality satisfaction index is measured as the weighted sum of importance and positive rate. Results: We conduct a case study using two years (2020 and 2021) of Samsung TV reviews to confirm the effectiveness of the proposed methodology. We found that Picture quality is the most crucial attribute in TV evaluation. The importance of Gaming and content has grown, and their positive rate has also increased. Accordingly, overall satisfaction with the TVs increased in 2021 compared with 2020. Conclusion: The result of this study shows that the proposed index reveals customer sentiment efficiently and can be explained by the importance and positive rate of each attribute. By using the proposed index, companies can improve their products and determine the priority of improvements.
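
The last step of the framework above, a weighted sum of attribute importance and positive rate, can be sketched as follows. The abstract does not say how importance is derived from the regression coefficient and frequency, so importance is taken as given here and normalized to sum to 1 (an assumption); the attribute names and all numeric values are invented for illustration.

```python
def quality_satisfaction_index(importance, positive_rate):
    """Weighted sum of positive rates, with importance weights normalized to sum to 1."""
    total = sum(importance.values())
    if total == 0:
        raise ValueError("importance weights must not all be zero")
    return sum(
        (weight / total) * positive_rate[attribute]
        for attribute, weight in importance.items()
    )


# Hypothetical attribute weights and positive rates (not from the paper).
importance = {"picture_quality": 0.5, "gaming": 0.3, "content": 0.2}
positive_rate = {"picture_quality": 0.8, "gaming": 0.6, "content": 0.5}

index = quality_satisfaction_index(importance, positive_rate)
# 0.5 * 0.8 + 0.3 * 0.6 + 0.2 * 0.5 = 0.68
```

With normalized weights the index stays in the same 0 to 1 range as the positive rates, which makes year-over-year comparison straightforward.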

Sparse Representation based Two-dimensional Bar Code Image Super-resolution

  • Shen, Yiling;Liu, Ningzhong;Sun, Han
    • KSII Transactions on Internet and Information Systems (TIIS) / v.11 no.4 / pp.2109-2123 / 2017
  • This paper presents a super-resolution reconstruction method based on sparse representation for two-dimensional bar code images. Considering the features of two-dimensional bar code images, Kirsch and LBP (local binary pattern) operators are used to extract the edge gradient and texture features. The extracted feature is built from these two features and two additional second-order derivatives. Through joint dictionary learning on low-resolution and high-resolution image patch pairs, corresponding patches share the same sparse representation. In addition, a global constraint is imposed on the initial estimate of the high-resolution image, which brings the reconstructed result closer to the real one. The experimental results demonstrate the effectiveness of the proposed algorithm for two-dimensional bar code images in comparison with other reconstruction algorithms.

Object Cataloging Using Heterogeneous Local Features for Image Retrieval

  • Islam, Mohammad Khairul;Jahan, Farah;Baek, Joong Hwan
    • KSII Transactions on Internet and Information Systems (TIIS) / v.9 no.11 / pp.4534-4555 / 2015
  • We propose a robust object cataloging method using multiple locally distinct heterogeneous features for aiding image retrieval. Due to challenges such as variations in object size, orientation, and illumination, object recognition is an extraordinarily challenging problem. In these circumstances, we adopt a local interest point detection method that locates prototypical local components in object images. In each local component, we exploit heterogeneous features such as a gradient-weighted orientation histogram, the sum of wavelet responses, and histograms in different color spaces, and combine these features to describe each component distinctively. A global signature is formed by adapting the bag-of-features model, which counts the frequencies of local components with respect to the words in a dictionary. The proposed method performs very well in classifying objects against various complex backgrounds. Our proposed local feature achieves a classification accuracy of 98%, while SURF, SIFT, BRISK, and FREAK achieve 81%, 88%, 84%, and 87%, respectively.
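
The bag-of-features step mentioned above can be sketched in a few lines: each local descriptor is assigned to its nearest dictionary word, and the normalized word counts form the global signature. The sketch assumes Euclidean nearest-neighbor assignment and uses made-up two-dimensional descriptors; the paper's heterogeneous descriptors and its dictionary construction are not reproduced here.

```python
def nearest_word(descriptor, dictionary):
    """Index of the dictionary word closest to the descriptor (squared Euclidean distance)."""

    def squared_distance(u, v):
        return sum((a - b) ** 2 for a, b in zip(u, v))

    return min(
        range(len(dictionary)),
        key=lambda i: squared_distance(descriptor, dictionary[i]),
    )


def bag_of_features(descriptors, dictionary):
    """Normalized histogram of word assignments, used as a global image signature."""
    histogram = [0.0] * len(dictionary)
    for descriptor in descriptors:
        histogram[nearest_word(descriptor, dictionary)] += 1.0
    total = sum(histogram)
    return [count / total for count in histogram] if total else histogram


# Toy dictionary words and local descriptors (made up for the example).
dictionary = [[0.0, 0.0], [1.0, 1.0], [5.0, 5.0]]
descriptors = [[0.1, 0.0], [0.9, 1.2], [1.1, 0.8], [4.0, 5.5]]

signature = bag_of_features(descriptors, dictionary)
```

The resulting fixed-length signature can be fed to any standard classifier regardless of how many local components an image contains.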