• Title/Summary/Keyword: Zipf Distribution

Search Result 21, Processing Time 0.017 seconds

On Regularity of Daily Distribution of Queries in Search Engine (검색엔진에서 일간질의 어분포의 정상성에 관한 연구)

  • Park, Sang-Gue;Lee, Chan-Kyu;Yoon, Kyung-Hyun;Kim, Seong-Hee;Lee, Jun-Ho
    • Journal of the Korean Society for information Management
    • /
    • v.24 no.4
    • /
    • pp.255-265
    • /
    • 2007
  • In this paper we analyzed regularity of daily patterns of distribution of Queries coming from internet search engine. And then, we proposed a Pareto distribution and Zipf law for identifying the query distribution and applied them to daily queries on the search engine during 2 week. We found that there is some evidence that Pareto and Zipf laws can be applied to evaluate the regularity of daily patterns of distribution of queries in search engine. Those results can be used to provide a better understanding of the social interests and trends using the query distribution patterns.

A Study of Zipfian Phenomena in Hangul Literaure (한글 문헌에 있어서 Zipfian 현상에 관한 연구)

  • 신강현;이두영
    • Journal of the Korean Society for information Management
    • /
    • v.5 no.2
    • /
    • pp.53-98
    • /
    • 1988
  • The purpose of this Study is to irwest~gate the Zipfian distribution in Har~gul literature. The result shows that the formulas derived from the liangul Ilterature are it1 accordance with the getlcra\ized Zipf's first law. The result also shows that the formulas derived from the Harlgul literature arc2 not in accordance with the Zlpf's second law and the penerali~ed Zipf's second law.

  • PDF

Article Data Prefetching Policy using User Access Patterns in News-On-demand System (주문형 전자신문 시스템에서 사용자 접근패턴을 이용한 기사 프리패칭 기법)

  • Kim, Yeong-Ju;Choe, Tae-Uk
    • The Transactions of the Korea Information Processing Society
    • /
    • v.6 no.5
    • /
    • pp.1189-1202
    • /
    • 1999
  • As compared with VOD data, NOD article data has the following characteristics: it is created at any time, has a short life cycle, is selected as not one article but several articles by a user, and has high access locality in time. Because of these intrinsic features, user access patterns of NOD article data are different from those of VOD. Thus, building NOD system using the existing techniques of VOD system leads to poor performance. In this paper, we analysis the log file of a currently running electronic newspaper, show that the popularity distribution of NOD articles is different from Zipf distribution of VOD data, and suggest a new popularity model of NOD article data MS-Zipf(Multi-Selection Zipf) distribution and its approximate solution. Also we present a life cycle model of NOD article data, which shows changes of popularity over time. Using this life cycle model, we develop LLBF (Largest Life-cycle Based Frequency) prefetching algorithm and analysis he performance by simulation. The developed LLBF algorithm supports the similar level in hit-ratio to the other prefetching algorithms such as LRU(Least Recently Used) etc, while decreasing the number of data replacement in article prefetching and reducing the overhead of the prefetching in system performance. Using the accurate user access patterns of NOD article data, we could analysis correctly the performance of NOD server system and develop the efficient policies in the implementation of NOD server system.

  • PDF

An Adaptive Cache Replacement Policy for Web Proxy Servers (웹 프락시 서버를 위한 적응형 캐시 교체 정책)

  • Choi, Seung-Lak;Kim, Mi-Young;Park, Chang-Sup;Cho, Dae-Hyun;Lee, Yoon-Joon
    • Journal of KIISE:Computer Systems and Theory
    • /
    • v.29 no.6
    • /
    • pp.346-353
    • /
    • 2002
  • The explosive increase of World Wide Web usage has incurred significant amount of network traffic and server load. To overcome these problems, web proxy caching replicates frequently requested documents in the web proxy closer to the users. Cache utilization depends on the replacement policy which tries to store frequently requested documents in near future. Temporal locality and Zipf frequency distribution, which are commonly observed in web proxy workloads, are considered as the important properties to predict the popularity of documents. In this paper, we propose a novel cache replacement policy, called Adaptive LFU (ALFU), which incorporates 1) Zipf frequency distribution by utilizing LFU and 2) temporal locality adaptively by measuring the amount of the popularity reduction of documents as time passed efficiently. We evaluate the performance of ALFU by comparing it to other policies via trace-driven simulation. Experimental results show that ALFU outperforms other policies.

Characterizing Memory References for Smartphone Applications and Its Implications

  • Lee, Soyoon;Bahn, Hyokyung
    • JSTS:Journal of Semiconductor Technology and Science
    • /
    • v.15 no.2
    • /
    • pp.223-231
    • /
    • 2015
  • As smartphones support a variety of applications and their memory demand keeps increasing, the design of an efficient memory management policy is becoming increasingly important. Meanwhile, as nonvolatile memory (NVM) technologies such as PCM and STT-MRAM have emerged as new memory media of smartphones, characterizing memory references for NVM-based smartphone memory systems is needed. For the deep understanding of memory access features in smartphones, this paper performs comprehensive analysis of memory references for various smartphone applications. We first analyze the temporal locality and frequency of memory reference behaviors to quantify the effects of the two properties with respect to the re-reference likelihood of pages. We also analyze the skewed popularity of memory references and model it as a Zipf-like distribution. We expect that the result of this study will be a good guidance to design an efficient memory management policy for future smartphones.

Method for Designing Adaptive UI Based on User's Context in the Environment Including Mobile Device and Public Display Device (모바일 장치와 공용 디스플레이 장치를 포함하는 환경에서 사용자의 특성에 기반한 Adaptive UI 설계 방안)

  • Kang, Seung-Soo;Ko, Hyun;Youn, Hee Yong
    • Journal of Information Technology Services
    • /
    • v.11 no.4
    • /
    • pp.181-194
    • /
    • 2012
  • The one of the most meaningful change in the recent ubiquitous environment is the omnipresence of public digital display device for providing ubiquitous information. It is the important issue to provide publicity as well as adaptive information to each user in the field of the public digital display device. This research proposes the idea ensuring fast response speed by the selection of user preference function. The preference function is selected by statistics using Zipf distribution in the system comprising mobile device and digital display device based on NFC (Near Field Communication). The idea is proved by CPM-GOMS model and the improvement of user response can be achieved.

User Centric Content Management System for Open IPTV Over SNS

  • Jeon, Seung Hyun;An, Sanghong;Yoon, Changwoo;Lee, Hyun-woo;Choi, Junkyun
    • Journal of Communications and Networks
    • /
    • v.17 no.3
    • /
    • pp.296-305
    • /
    • 2015
  • Coupled schemes between service-oriented architecture (SOA) and Web 2.0 have recently been researched. Web-based content providers and telecommunications company (Telecom) based Internet protocol television (IPTV) providers have struggled against each other to accommodate more three-screen service subscribers. Since the advent of Web 2.0, more abundant reproduced content can be circulated. However, because according to increasing device's resolution and content formats IPTV providers transcode content in advance, network bandwidth, storage and operation costs for content management systems (CMSs) are wasted. In this paper, we present a user centric CMS for open IPTV, which integrates SOA and Web 2.0. Considering content popularity based on a Zipf-like distribution to solve these problems, we analyze the performance between the user centric CMS and the conventional Web syndication system for normalized costs. Based on the user centric CMS, we implement a social Web TV with device-aware function, which can aggregate, transcode, and deploy content over social networking service independently.

Relationship Between Tweet Frequency and User Velocity on Twitter (트위터에서 트윗 주기와 사용자 속도 사이 관계)

  • Jeon, So-Young;Lee, Al-Chan;Seo, Go-Eun;Shin, Won-Yong
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.19 no.6
    • /
    • pp.1380-1386
    • /
    • 2015
  • Recently, the importance of users' geographic location information has been highlighted with a rapid increase of online social network services. In this paper, by utilizing geo-tagged tweets that provides high-precision location information of users, we first identify both Twitter users' exact location and the corresponding timestamp when the tweet was sent. Then, we analyze a relationship between the tweet frequency and the average user velocity. Specifically, we introduce a tweet-frequency computing algorithm, and show analysis results by country and by city. As a main result, it is shown that the tweet frequency according to user velocity follows a power-law distribution (i.e., Zipf' distribution or a Pareto distribution). In addition, by performing a comparison between the United States and Japan, one can see that the exponent of the distribution in Japan is smaller than that in the United States.

Hierarchically Encoded Multimedia-data Management System for Over The Top Service (OTT 서비스를 위한 계층적 부호화 기반 멀티미디어 데이터 관리 시스템)

  • Lee, Taehoon;Jung, Kidong
    • Journal of KIISE
    • /
    • v.42 no.6
    • /
    • pp.723-733
    • /
    • 2015
  • The OTT service that provides multimedia video has spread over the Internet for terminals with a variety of resolutions. The terminals are in communication via a networks such as 3G, LTE, VDSL, ADSL. The service of the network has been increased for a variety of terminals giving rise to the need for a new way of encoding multimedia is increasing. SVC is an encoding technique optimized for OTT services. We proposed an efficient multimedia management system for the SVC encoded multimedia data. The I/O trace was generated using a zipf distribution, and were comparatively evaluated for performance with the existing system.

A Study on the Behaviors of Complex System Revealed in the Sizes of Public Libraries in Korea (우리나라 공공도서관의 규모에 나타나는 복잡계 현상에 관한 연구)

  • Lee, Soo-Sang
    • Journal of Korean Library and Information Science Society
    • /
    • v.44 no.4
    • /
    • pp.399-419
    • /
    • 2013
  • This paper conducted the empirical analysis of the behaviors revealed in the eight size distributions of the public libraries in Korea. As a result, the behaviors of complex system appeared in all eight size factors. This means that the sizes of public libraries in Korea were highly polarized. Especially, the zipf's law were found in the size factors such as gross area, number of staffs, volume of books, total budget. And the highly uneven distributions were occurred in the size factors such as membership, number of users, number of borrowers, number of borrowed books. This research outcomes show that a new policy of public libraries is needed to resolve the polarization revealed in the sizes of public libraries in Korea.