• Title/Summary/Keyword: data dictionary

An efficient Hardware Architecture of Lempel-Ziv Compressor for Real Time Data Compression (실시간 데이터 압축을 위한 Lempel-Ziv 압축기의 효과적인 구조의 제안)

  • 진용선;정정화
    • Journal of the Institute of Electronics Engineers of Korea TE / v.37 no.3 / pp.37-44 / 2000
  • In this paper, an efficient hardware architecture of a Lempel-Ziv compressor for real-time data compression is proposed. Accumulated shift operations are the major problem in the Lempel-Ziv algorithm, because many shifts are needed to prepare the dictionary buffer and the matching symbols. To process the algorithm quickly, the proposed architecture uses an optimization technique for the dictionary size, a new multi-symbol comparison method, and a rotational FIFO structure that makes shift operations easy to control. For functional verification, the architecture was modeled in the C programming language and its operation was verified by running it on a commercial DSP processor. The overall architecture was also designed in VHDL and synthesized on a commercial FPGA chip. Critical path analysis shows that the architecture runs well at an input bit rate of 256 kbps with a 33 MHz clock frequency.

Design of Big Data Preference Analysis System (빅데이터 선호도 분석 시스템 설계)

  • Son, Sung Il;Park, Chan Khon
    • Journal of Korea Multimedia Society / v.17 no.11 / pp.1286-1295 / 2014
  • This paper suggests a way to improve the reliability of preference estimates drawn from user feedback by adding weighting factors to sentiment analysis, and to efficiently analyze users' emotional perspectives in the big data generated en masse on Twitter. To address errors in earlier studies, it improves the recall and precision of sensibility determination by using a sensibility dictionary that subdivides sentiment polarity by level of sensibility, and it strengthens sensibility determination by populating the dictionary with slang, new words, emoticons and idiomatic expressions that are not in the system dictionary. It considers context through conjunctive adverbs, which suits Korean, a language with free word order. It also uses TF (Term Frequency), RT (Retweet) and Follower as weighting factors of preference, and increases the reliability of preference analysis by weighting 'a very emotional tweet', 'a tweet recognised by users' and 'an influential tweeter'.

On the statistics of Korean Phonetic Dictionary - Basic Survey to make corpus of Korean Speech DB - (발음사전 표제어중의 음소의 통계적 성질-음성 DB용 단어선정을 위하여-)

  • Lee, Y.J.;Kim, K.T.;Jo, C.W.;Rhee, T.W.
    • Proceedings of the KIEE Conference / 1987.07b / pp.1606-1609 / 1987
  • Statistical information about spoken Korean was obtained by analyzing the Korean phonetic dictionary. This is one of the basic surveys needed to build a phoneme-balanced corpus for the Korean Speech Data Base (KSDB).

A Design and Implementation of the Multilingual RDD Registry (다중언어 RDD 레지스트리의 설계 및 구현)

  • 정상원;오원근;윤기송
    • Journal of Broadcast Engineering / v.8 no.4 / pp.381-391 / 2003
  • This paper deals with the Multilingual Registry for the Rights Data Dictionary (RDD), which will be used for the semantic representation of rights on digital content in the MPEG-21 framework. Translations of RDD terms for different language populations often lack the desirable precision. The purpose of this paper is to demonstrate the Multilingual RDD Registry concept, which aims at more precise and interoperable translation of RDD terms among different DRM systems.

Approaches to Creating a Digital Encyclopedia of Korean Archaeology (한국고고학 디지털 사전 구축 방안 연구)

  • LEE Chorong
    • Korean Journal of Heritage: History & Science / v.56 no.2 / pp.28-45 / 2023
  • Although we have entered the era of digital transformation, there is currently no system that efficiently collects, manages, integrates, and services a large number of archaeological digital source materials produced as a result of cultural relics research, i.e., an intelligent integrated management and service platform for archaeological academic information. In this regard, the need to build a digital dictionary of Korean archaeology was confirmed by examining the problem of the Digital Encyclopedia of Korean Archaeology, which is currently available in PDF format on the web, the current status of the publication and use of the Dictionary of Korean Archaeology, and the cases of building digital platforms at home and abroad. Therefore, this paper aims to suggest a general direction for creating a digital encyclopedia of Korean archaeology based on the Dictionary of Korean Archaeology, which includes quality knowledge information, to reconsider the accessibility of archaeological data in conformity with data access limitations. The application of the series Dictionary of Korean Archaeology, published since 2001, and the necessity for digital transformation were examined, as well as the application of data from the archaeological data archiving platforms of Europe, the USA, Japan, and cases of establishing platforms corresponding to specialized encyclopedias from Korea. Based on these, a three-step implementation plan and detailed projects were suggested to create the Digital Encyclopedia of Korean Archaeology. Through this, we proposed the design of metadata for computerized records and the expansion to semantic (meaning-based) data that gives and shows the relationship information between the produced metadata as the implementation tasks to build the Digital Dictionary of Korean Archaeology. It is hoped that such research will help create an integrated intelligent management and service platform for archaeology, raise awareness, and provide a better understanding of Korean archaeology to the general public.

English-Korean Transfer Dictionary Extension Tool in English-Korean Machine Translation System (영한 기계번역 시스템의 영한 변환사전 확장 도구)

  • Kim, Sung-Dong
    • KIPS Transactions on Software and Data Engineering / v.2 no.1 / pp.35-42 / 2013
  • Developing an English-Korean machine translation system requires constructing information about both languages, and the amount of information in the English-Korean transfer dictionary is especially critical to translation quality. Newly created words are out-of-vocabulary words, so they appear untranslated in the output sentence, which lowers translation quality. Compound nouns also complicate lexical and syntactic analysis, and they are difficult to translate accurately when the transfer dictionary lacks information about them. To improve the quality of English-Korean machine translation, the transfer dictionary must be expanded continuously by collecting out-of-vocabulary words and frequently used compound nouns. This paper proposes a method for expanding the transfer dictionary, which consists of constructing a corpus from internet newspapers, extracting the words that are not in the existing dictionary and the frequently used compound nouns, attaching meanings to the extracted words, and integrating them into the transfer dictionary. We also develop a tool that supports this expansion. Expanding the dictionary is critical to improving the machine translation system but requires much human effort. The developed tool can be useful for continuously expanding the transfer dictionary, and so it is expected to contribute to better translation quality.
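
The expansion steps in this abstract (collect a corpus, extract out-of-vocabulary words and frequent compound nouns, attach meanings, integrate) can be sketched roughly as follows. This is only an illustrative outline, not the authors' tool: the function names `find_candidates` and `expand`, the toy corpus, and the placeholder meanings are all hypothetical, and an adjacent word pair is used as a crude stand-in for a compound noun.

```python
import re
from collections import Counter

def find_candidates(corpus, dictionary, min_count=2):
    """Return (out-of-vocabulary word counts, frequent adjacent word pairs)."""
    oov, pairs = Counter(), Counter()
    for sentence in corpus:
        words = re.findall(r"[a-z]+", sentence.lower())
        oov.update(w for w in words if w not in dictionary)
        # Assumption: an adjacent word pair stands in for a compound noun;
        # a real system would need part-of-speech information here.
        pairs.update(zip(words, words[1:]))
    compounds = {" ".join(p): n for p, n in pairs.items() if n >= min_count}
    return oov, compounds

def expand(dictionary, meanings):
    """Merge manually attached meanings into a copy of the dictionary."""
    merged = dict(dictionary)
    merged.update(meanings)
    return merged

# Hypothetical toy data; the meanings are placeholders, not real translations.
dictionary = {w: f"<meaning of {w}>" for w in
              ["the", "a", "startup", "raised", "funds", "hired", "staff"]}
corpus = ["The blockchain startup raised funds.",
          "A blockchain startup hired staff."]

oov, compounds = find_candidates(corpus, dictionary)
expanded = expand(dictionary, {
    "blockchain": "<meaning of blockchain>",
    "blockchain startup": "<meaning of blockchain startup>",
})
```

The meaning-attachment step stays manual in this sketch (the caller supplies `meanings`), which mirrors the abstract's remark that expansion still requires human effort.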

Performance Improvement of LZ77 Algorithm using a Strategy Table and a Genetic Algorithm (전략 테이블과 유전 알고리즘을 이용한 LZ77 알고리즘의 성능 개선)

  • Jung Soonchul;Seo Dong-Il;Moon Byung-Ro
    • Journal of KIISE:Software and Applications / v.31 no.12 / pp.1628-1636 / 2004
  • Data compression techniques have been studied for decades because they save space and time and thereby reduce costs. Lempel-Ziv 77 (LZ77) is a dictionary-based, lossless compression algorithm. The dictionary size of the LZ77 algorithm is fixed, and the performance of the algorithm depends heavily on that size. In this paper, we suggest a dynamic LZ77 algorithm that changes its dictionary size during compression, and we also suggest a genetic algorithm to evolve the dictionary-resizing strategies. The suggested algorithm outperformed the original version by up to about 16%.
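
To illustrate why LZ77 performance depends on the dictionary (window) size, here is a minimal greedy LZ77 sketch with a fixed window. It is not the paper's dynamic algorithm and contains no strategy table or genetic algorithm; the function names, the `(offset, length, next_byte)` token format, and the sample input are assumptions chosen for illustration.

```python
def lz77_compress(data: bytes, window_size: int = 32, max_len: int = 15):
    """Greedy LZ77: emit (offset, length, next_byte) triples."""
    i, out = 0, []
    while i < len(data):
        best_off, best_len = 0, 0
        # Only the last `window_size` bytes serve as the dictionary.
        for j in range(max(0, i - window_size), i):
            k = 0
            # Stop one byte early so a literal "next byte" always exists.
            while (k < max_len and i + k < len(data) - 1
                   and data[j + k] == data[i + k]):
                k += 1
            if k > best_len:
                best_off, best_len = i - j, k
        out.append((best_off, best_len, data[i + best_len]))
        i += best_len + 1
    return out

def lz77_decompress(tokens) -> bytes:
    buf = bytearray()
    for off, length, nxt in tokens:
        for _ in range(length):
            buf.append(buf[-off])  # byte-wise copy handles overlapping matches
        buf.append(nxt)
    return bytes(buf)

sample = b"abcdefgh" * 4
small = lz77_compress(sample, window_size=4)   # window shorter than the period
large = lz77_compress(sample, window_size=32)  # window covers earlier repeats
```

On this sample the 4-byte window never reaches back to the previous repetition, so it yields more tokens than the 32-byte window, while both token streams decompress to the original input.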

Construction of an Efficient Pre-analyzed Dictionary for Korean Morphological Analysis (한국어 형태소 분석을 위한 효율적 기분석 사전의 구성 방법)

  • Kwak, Sujeong;Kim, Bogyum;Lee, Jae Sung
    • KIPS Transactions on Software and Data Engineering / v.2 no.12 / pp.881-888 / 2013
  • A pre-analyzed dictionary is used to increase the speed and accuracy of morphological analyzers and to reduce over-generation. However, if the dictionary includes 'insufficiently-analyzed word-phrases', whose entries do not cover all possible analyses of the word-phrase, it may lower analysis accuracy. In this paper, we use the Sejong corpus to measure how accuracy changes with word-phrase frequency and with corpus size. The performance of the integrated system (SMA with the pre-analyzed dictionary) is highest when the sufficient-analysis rate of the dictionary is above 99.82%. At that point, the dictionary is constructed from word-phrases whose frequency is more than 32 (64) when the corpus size is 1,600,000 (6,300,000) word-phrases.

Efficient and Dynamic Authenticated Dictionary Design Using RSA One-way Accumulator (RSA 일방향 어큐뮬레이터를 이용한 효율적이고 동적인 인증 딕셔너리 설계)

  • Kim, Soon-Seok;Lee, Yong-Hee;Lee, Kang-Woo
    • Journal of the Korea Institute of Information and Communication Engineering / v.12 no.4 / pp.651-660 / 2008
  • Public networks such as the Internet are widely used to exchange sensitive data that require strong security, such as legally valid documents and business transactions. At the same time, public-key certificates are used in sensitive data interchange from the viewpoint of data integrity and authentication. However, public key infrastructure (PKI) environments have weaknesses in data transfer capacity and security. This paper uses the RSA one-way accumulator to realize an efficient and dynamic authenticated dictionary, in which untrusted directories provide cryptographically verifiable answers to membership queries on a set maintained by a trusted source.
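
The membership check behind an RSA one-way accumulator can be sketched as below. This is a toy illustration of the general technique, not the paper's design: the modulus `N`, the base `G`, the function names, and the choice of small primes as set elements are assumptions, and a modulus this small offers no security.

```python
from math import prod

# Toy parameters (assumption): a real system uses a modulus of 2048 bits or
# more whose factorization is unknown to the untrusted directory.
N = 61 * 53
G = 2

def accumulate(elements):
    """Trusted source: fold the whole set into one value G^(x1*x2*...) mod N."""
    return pow(G, prod(elements), N)

def witness(elements, x):
    """Directory: accumulate every element except x."""
    return pow(G, prod(e for e in elements if e != x), N)

def verify(acc, x, wit):
    """User: x is a member if raising the witness to x gives the accumulator."""
    return pow(wit, x, N) == acc

# Elements are represented as distinct primes (a common assumption for
# RSA accumulators).
members = [3, 5, 7, 11]
acc = accumulate(members)
w7 = witness(members, 7)
```

Here `verify(acc, 7, w7)` holds because the witness is missing exactly the exponent 7, whereas reusing the same witness for a non-member such as 13 fails. The user only needs the accumulator value from the trusted source, which is what lets an untrusted directory answer queries.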

Study on the social issue sentiment classification using text mining (텍스트마이닝을 이용한 사회 이슈 찬반 분류에 관한 연구)

  • Kang, Sun-A;Kim, Yoo Sin;Choi, Sang Hyun
    • Journal of the Korean Data and Information Science Society / v.26 no.5 / pp.1167-1173 / 2015
  • The development of information and communication technologies such as SNS, blogs, and bulletin boards has provided a variety of places where people can express their thoughts and comments, allowing Big Data to grow, and many people reveal their opinions on social issues on SNS such as Twitter. In this study, we pre-build a sentiment dictionary about social issues and conduct a sentiment analysis with the structured dictionary, in order to gather opinions on social issues posted on Twitter. The data used are tweets that include "bikini" and "nakkomsu". The analysis yields a precision of 61% and an F1-score of 74%. This study is expected to suggest a standard for dictionary construction that allows positive and negative opinions on specific social issues to be classified.
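
The abstract reports precision and F1 but not recall. Assuming the standard definition F1 = 2PR / (P + R), the recall implied by the two reported figures can be recovered as in the sketch below. The function name is hypothetical, and the result is only as accurate as the rounded percentages in the abstract.

```python
def recall_from_precision_f1(precision: float, f1: float) -> float:
    """Solve F1 = 2PR / (P + R) for R, giving R = F1 * P / (2P - F1)."""
    denom = 2 * precision - f1
    if denom <= 0:
        raise ValueError("no valid recall exists for these values")
    return f1 * precision / denom

# Reported figures from the abstract: precision 61%, F1-score 74%.
implied_recall = recall_from_precision_f1(0.61, 0.74)
```

Under that assumption the implied recall is roughly 0.94, which would mean the dictionary finds most relevant tweets but labels a fair share incorrectly.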