• Title/Summary/Keyword: data locality

Search Result 237, Processing Time 0.028 seconds

QUISIS: A Query Index Method Using Interval Skip List (QUISIS: Interval Skip List를 활용한 질의 색인 기법)

  • Min, Jun-Ki
    • The KIPS Transactions:PartD
    • /
    • v.15D no.3
    • /
    • pp.297-304
    • /
    • 2008
  • Due to the proliferation of the Internet and intranet, new application domains such as stream data processing have emerged. Stream data is real-timely and continuously generated. In stream data environments, a lot of queries are registered, and then, the arrived data item is evaluated by registered queries. Thus, to accelerate the query performance, diverse continuous query index schemes have been proposed for stream data processing systems. In this paper, we focus on the query index technique for stream data. In general, a stream query contains the range condition. Thus, by using range conditions, the queries can be indexed. In this paper, we propose an efficient query index scheme, called QUISIS, using a modified Interval Skip Lists to accelerate search time. QUISIS utilizes a locality where a value which will arrive in near future is similar to the current value. Through the experimental study, we show the efficiency of our proposed method.

Data Replication Technique for Improving Data Locality of MapReduce (맵리듀스의 데이터 로컬리티 향상을 위한 데이터 복제기법)

  • Lee, Jung-Ha;Yu, Heon-Chang;Lee, Eun-Young
    • Proceedings of the Korean Information Science Society Conference
    • /
    • 2012.06a
    • /
    • pp.218-220
    • /
    • 2012
  • 인터넷 활용과 웹 어플리케이션의 개발이 증가함에 따라 처리해야하는 데이터의 양도 또한 증가하고 있다. 대량의 데이터를 효과적으로 처리하기 위한 방법 중 하나로 병렬처리 프로그래밍 모델인 맵리듀스가 있다. 하둡은 맵리듀스의 오픈소스 구현으로 대량의 데이터를 병렬로 처리하는 무료 자바 소프트웨어 프레임워크이다. 분산 파일 시스템을 사용하는 하둡에서는 처리하는 데이터가 다른 노드에 위치하는 데이터 로컬리티 문제가 전체 작업 수행시간의 증가를 야기하는 문제가 있다. 본 논문에서는 하둡에서의 데이터 로컬리티 문제를 해결하기 위한 데이터 복제기법을 제안한다. 제안하는 데이터 복제기법에서는 1) 라그랑지 보간법을 사용하여 과거 접근수를 이용한 미래 접근수를 예측하고, 2) 예측된 값을 Threshold값으로 설정하고, 3) 데이터 로컬리티 문제가 발생하였을 때, 복제사본을 생성할 것인지 캐시를 생성할 것인지를 결정하여 복제 사본의 수를 최적화 한다. 실험을 통해 단순히 복제사본 수를 증가시킴으로써 데이터 로컬리티를 향상을 이루어도 작업 완료시간이 감소하는 것이 아니라는 결과를 볼 수 있었고, 오버 런치로 인한 작업 완료시간 증가를 줄이기 위해 데이터 복제사본 수 최적화의 필요성을 확인할 수 있었다.

Regionalization of CN values at Imha Watershed with SCE-UA (최적화 기법을 이용한 임하호유역 대표 CN값 추정)

  • Jeon, Ji-Hong;Kim, Tae-Dong;Choi, Dong-Hyuk
    • Journal of The Korean Society of Agricultural Engineers
    • /
    • v.53 no.5
    • /
    • pp.9-16
    • /
    • 2011
  • Curve Numbers (CN) for the combination of land use and hydrologic soil group were regionalized at Imha Watershed using Long-term Hydrologic Impact Assessment (L-THIA) coupled with SCE-UA. The L-THIA was calibrated during 1991-2000 and validated during 2001-2007 using monthly observed direct runoff data. The Nash-Sutcliffe (NS) coefficients for calibration and validation were 0.91 and 0.93, respectively, and showed high model efficiency. Based on the criteria of model calibration, both calibration and validation represented 'very good' fit with observe data. The spatial distribution of direct surface runoff by L-THIA represented runoff from Thiessen pologen at Subi and Sukbo rain gage station much higher than other area due to the combination of poor hydrologic condition (hydrologic soil C and D group) and locality heavy rainfall. As a results of hydrologic condition and treatment for land use type based on calibrated CNs, forest is recommended to be hydrologically modelled dived into deciduous, coniferous, and mixed forest due to the hydrological difference. The CNs for forest and upland showed the poor hydrologic condition. The steep slope of forest and alpine agricultural field make high runoff rate which is the poor hydrologic condition because CN method can not consider field slope. L-THIA linded with SCE-UA could generated a regionalized CNs for land use type with minimized time and effort, and maximized model's accuracy.

Insect community dynamics in relation to climate change in Mongolia

  • Iderzorig, Badamnyambuu;Lkhagvadorj, Khureltsetseg;Kwon, Ohseok
    • Journal of Ecology and Environment
    • /
    • v.39 no.1
    • /
    • pp.111-118
    • /
    • 2016
  • In recent years, many research revealed plant-insect interactions are becoming unstable because of climate change, human activities and grazing effect. In this work, it is aimed to disclose that how climate of two different years is influencing on insect community at certain locality in Mongolia. The data on the insect community are collected, covering full flowering season from June to August in 2014 and 2015. In order to include all species of insects in Udleg Station, data was collected in three different sites. One of them was around edge of forest, another one was in ungrazed area or inside the fence that has been kept for more than 9 years and last one was in grazed area. Weather was a perceptible difference during two year's study. This climate differences significantly influenced on the insect community. In 2014, overall 305 insect species were recorded, in which 124 insect species in order of Diptera, 44 in Lepidoptera, 33 in Coleoptera, 31 in Hemiptera, and 73 in Hymenoptera were determined. But in 2015, these number of species noticeably decreased, and total 150 insect species were recorded, in which 58 in Diptera, 26 in Lepidoptera, 13 in Coleoptera, 12 in Hemiptera, and 41 in Hymenoptera were determined.

Duplication-Aware Garbage Collection for Flash Memory-Based Virtual Memory Systems (플래시 메모리 기반의 가상 메모리 시스템을 위한 중복성을 고려한 GC 기법)

  • Ji, Seung-Gu;Shin, Dong-Kun
    • Journal of KIISE:Computer Systems and Theory
    • /
    • v.37 no.3
    • /
    • pp.161-171
    • /
    • 2010
  • As embedded systems adopt monolithic kernels, NAND flash memory is used for swap space of virtual memory systems. While flash memory has the advantages of low-power consumption, shock-resistance and non-volatility, it requires garbage collections due to its erase-before-write characteristic. The efficiency of garbage collection scheme largely affects the performance of flash memory. This paper proposes a novel garbage collection technique which exploits data redundancy between the main memory and flash memory in flash memory-based virtual memory systems. The proposed scheme takes the locality of data into consideration to minimize the garbage collection overhead. Experimental results demonstrate that the proposed garbage collection scheme improves performance by 37% on average compared to previous schemes.

Analysis of Efficiency Gap of Local Governments' Aged Welfare Institutions in Gyeonggi-do Province (경기도 지역 지방자치단체 노인복지시설의 효율성 격차 분석)

  • Yoon, Eui-Young
    • Journal of the Korean association of regional geographers
    • /
    • v.15 no.6
    • /
    • pp.774-788
    • /
    • 2009
  • This research analyzed the relative efficiency of 27 local governments' aged welfare institutions and facilities in Gyeonggi-do Province by using Data Envelopment Analysis. The results show that 12 local governments' aged welfare institutions are relatively inefficient as compare to their reference groups, meaning that there is efficiency gap among localities. It also appears that the most influential input and output elements are the budget for the aged and the number of the aged welfare institutions and facilities, relatively. However, the influential elements differ from one locality to another.

  • PDF

Education, Industry 4.0 and Earnings: Evidence from Provincial-Level Data of Vietnam

  • TU, Anh Thuy;CHU, Phuong Thi Mai;PHAM, Truong Xuan;DO, Ngoc Minh
    • The Journal of Asian Finance, Economics and Business
    • /
    • v.8 no.2
    • /
    • pp.675-684
    • /
    • 2021
  • This paper aims to analyze factors influencing earnings of workers in Vietnam using provincial-level data from 2016 to 2018. We show the important determinants of earnings of workers of more than 15 years old including working hour, labor force, life expectancy, education, regulation measured by Provincial Competitiveness Index (PCI) and especially Industry 4.0, our major depart from literature proxies by government expenditure on science and technology, number of phone lines, and number of internet users. Working hours are a typical measurement of quantity of labor supplied. Labor force represents market size from the supply side. Life expectancy measures the health of laborers, a physical quality measure of workers. PCI stands for institutional status of the locality. Two most important factors of our interest are education, representing qualification of workers, and Industry 4.0, reflecting the new working environment of workers. By estimating a robust standard error fixed-effect model, we have evidence that all factors are significant in explaining earnings of Vietnamese workers. Education and IR4.0 play an important role in earnings of workers of Vietnam. Results also provide an estimation of Vietnam's labor supply in the context of Industry 4.0. In addition, findings contribute to explain the income discrepancy among Vietnamese provinces.

A Data-Centric Clustering Algorithm for Reducing Network Traffic in Wireless Sensor Networks (무선 센서 네트워크에서 네트워크 트래픽 감소를 위한 데이타 중심 클러스터링 알고리즘)

  • Yeo, Myung-Ho;Lee, Mi-Sook;Park, Jong-Guk;Lee, Seok-Jae;Yoo, Jae-Soo
    • Journal of KIISE:Information Networking
    • /
    • v.35 no.2
    • /
    • pp.139-148
    • /
    • 2008
  • Many types of sensor data exhibit strong correlation in both space and time. Suppression, both temporal and spatial, provides opportunities for reducing the energy cost of sensor data collection. Unfortunately, existing clustering algorithms are difficult to utilize the spatial or temporal opportunities, because they just organize clusters based on the distribution of sensor nodes or the network topology but not correlation of sensor data. In this paper, we propose a novel clustering algorithm with suppression techniques. To guarantee independent communication among clusters, we allocate multiple channels based on sensor data. Also, we propose a spatio-temporal suppression technique to reduce the network traffic. In order to show the superiority of our clustering algorithm, we compare it with the existing suppression algorithms in terms of the lifetime of the sensor network and the site of data which have been collected in the base-station. As a result, our experimental results show that the size of data was reduced by $4{\sim}40%$, and whole network lifetime was prolonged by $20{\sim}30%$.

A wear-leveling improving method by periodic exchanging of cold block areas and hot block areas (Cold 블록 영역과 hot 블록 영역의 주기적 교환을 통한 wear-leveling 향상 기법)

  • Jang, Si-Woong
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2008.05a
    • /
    • pp.175-178
    • /
    • 2008
  • While read operation on flash memory is fast and doesn't have any constraints, flash memory can not be overwritten on updating data, new data are updated in new area. If data are frequently updated, garbage collection, which is achieved by erasing blocks, should be performed to reclaim new area. Hence, because the number of erase operations is limited due to characteristics of flash memory, every block should be evenly written and erased. However, if data with access locality are processed by cost benefit algorithm with separation of hot block and cold block, though the performance of processing is high, wear-leveling is not even. In this paper, we propose CB-MG (Cost Benefit between Multi Group) algorithm in which hot data are allocated in one group and cold data in another group, and in which role of hot group and cold group is exchanged every period. Experimental results show that performance and wear-leveling of CB-MG provide better results than those of CB-S.

  • PDF

Calibration of Load and Resistance Factors in KCI Code Based on Domestic Data (국내 통계자료를 이용한 설계기준의 하중저항계수 검증)

  • Kim, Jee-Sang;Kim, Jong-Ho
    • Journal of the Korea Concrete Institute
    • /
    • v.23 no.4
    • /
    • pp.495-501
    • /
    • 2011
  • The load combinations in current KCI Design Code are determined with reference to those in ACI 318-05, which adopts the LRFD (load and resistance factor design) format. The load and resistance factors in LRFD format should be determined to meet the required levels of reliability index or probability of failure for various predetermined failure modes, which are also based on the statistical data reflecting locality and contemporary situation. However, the current KCI Design Code has been written utilizing foreign data, because of insufficiency in accrued data in Korea. This study considered the current safety levels of KCI Code based on published domestic data to evaluate appropriateness of the current KCI regulations. Based on the calibrated reliability index of the existing Code, the new resistance factors are suggested. The results presented in this paper can be considered as a basic research for establishment of unique design format for future Korean Codes.