Search | Korea Science

A MapReduce-based kNN Join Query Processing Algorithm for Analyzing Large-scale Data (대용량 데이터 분석을 위한 맵리듀스 기반 kNN join 질의처리 알고리즘)

Lee, HyunJo;Kim, TaeHoon;Chang, JaeWoo
- Journal of KIISE
- /
- v.42 no.4
- /
- pp.504-511
- /
- 2015
Recently, the amount of data is rapidly increasing with the popularity of the SNS and the development of mobile technology. So, it has been actively studied for the effective data analysis schemes of the large amounts of data. One of the typical schemes is a Voronoi diagram based on kNN join algorithm (VkNN-join) using MapReduce. For two datasets R and S, VkNN-join can reduce the time of the join query processing involving big data because it selects the corresponding subset Sj for each Ri and processes the query with them. However, VkNN-join requires a high computational cost for constructing the Voronoi diagram. Moreover, the computational overhead of the VkNN-join is high because the number of the candidate cells increases as the value of the k increases. In order to solve these problems, we propose a MapReduce-based kNN-join query processing algorithm for analyzing the large amounts of data. Using the seed-based dynamic partitioning, our algorithm can reduce the overhead for constructing the index structure. Also, it can reduce the computational overhead to find the candidate partitions by selecting corresponding partitions with the average distance between two seeds. We show that our algorithm has better performance than the existing scheme in terms of the query processing time.
https://doi.org/10.5626/JOK.2015.42.4.504 인용 KSCI

Generating Combined Query Plan for Content-Based Image Retrieval (내용 기반 이미지 검색을 위한 복합 질의문 계획 생성 기법)

Park, Mi-Hwa;Eom, Gi-Hyeon
- Journal of KIISE:Databases
- /
- v.27 no.4
- /
- pp.562-571
- /
- 2000
이미지 데이터는 텍스트 데이터와는 달리 다양한 색상과 모양, 질감과 같은 비정형적인 특징을 가진다. 따라서 이미지 데이터베이스는 텍스트 기반의 전통 데이터베이스와는 다른 모델링 방법과 질의, 검색 방법을 사용한. 특히, 내용 기반 이미지 검색에서의 검색 속도와 정확도를 향상시키기 위해서는 새로운 복합 질의문 계획 생성 기법이 필요하다. 본 논문에서는 이를 위해 먼저, 단일 조건을 갖는 시각 질의에 대한 처리 기법들을 토대로 여러 조건을 갖는 복합 질의를 처리하기 위한 복합 질의문 계획 생성기법인 SSCC(Similarity Search for Conjunction Combination Query) 알고리즘을 제안한다. SSCC는 이미지 데이터베이스 검색 시스템에서 복합 질의를 처리하기 위한 질의 최적화 과정에서 질의 수행 시간과 투플 I/O를 최소화하는 질의문 계획을 생성하기 위해 사용된다. SSCC 알고리즘은 복합질의를 단일 질의들로 준해하고 퍼지 집합 이론을 도입하여 단일 질의의 결과들을 통합한다. 논문에서 연구된 내용 기반 복합 질의문 계획 생성 기법은 특정 이미지 영역에 국한되지 않으며 다양한 종류의 시각 질의를 수행하기 위한 효율적인 질의문 계획 생성 기법으로 사용될 수 있다.
PDF

Efficient Temporal Query Processing using Materialized View (형성 뷰를 이용한 효율적인 시간지원 질의 처리 기법)

정경자
- Journal of the Korea Society of Computer and Information
- /
- v.3 no.4
- /
- pp.1-9
- /
- 1998
Temporal Databases store all of informations by time varying, so the temporal query processor has to process very large information. Therefore, we propose an efficient method of query processing by using the relevance checking algorithm of input query and view definition. The relevance checking algorithm of query investigates relevance between the input query of user about base relation and the execution tree of view definition stored in system catalog. And related input query with view definition have a process of the query translation to the execution tree of view. So temporal query processor is able to increase performance of query processor by reducing the number of tuple.
PDF

Cache-Answerability of XML Queries on the Web: Implementation and Performance Evaluation (웹에서 캐쉬를 이용한 XML 질의 처리: 구현 및 성능 평가)

박정기;강현철
- Proceedings of the Korean Information Science Society Conference
- /
- 2003.10b
- /
- pp.133-135
- /
- 2003
데이터베이스 기반의 웹 응용을 위한 캐슁 기법이 최근 많이 연구되고 있다. 자주 제기되는 질의의 결과를 캐쉬해두면 반복 질의를 위한 재사용은 물론 관련 질의의 처리에 이용될 수 있다. 웹 상에서 데이터 교환의 표준으로 XML이 등장한 이래 현재 웹 응용들은 네트워크 상의 원격 XML 소스로부터 데이터 검색을 수행하는 경우가 많아졌는데 이의 효율적인 지원을 위해 검색 결과를 캐쉬하는 것은 유용하다. 본 논문은 웹에서 XML 질의를 관련 XML 캐쉬를 이용하여 처리하는 기법의 구현 및 성능 평가에 관한 것이다. XML 질의로 XQuery, XPath, XQL 등과 같은 모든 XML 질의어의 핵심 요소인 경로 표현식을 대상으로 하였고, XML 캐쉬는 XML 실체뷰를 고려하였고, 캐쉬를 이용한 XML 질의 변환 알고리즘은 [12]에 제시된 것을 대상으로 하였다. [12]의 질의 변환 알고리즘을 지원하는 프로토타입 XML 저장 시스템이 관계 DBMS를 이용하여 구현되어 실제 웹에서의 성능 실험에 이용되었다. 성능 실험 결과 웹에서 캐쉬를 이용한 XML 질의 처리의 효율성을 확인하였다.
PDF

A Study on Cost Estimation of Spatial Query Processing for Multiple Spatial Query Optimization in GeoSensor Networks (지오센서 네트워크의 다중 공간질의 최적화를 위한 공간질의처리비용 예측 알고리즘 연구)

Kim, Min Soo;Jang, In Sung;Li, Ki Joune
- Spatial Information Research
- /
- v.21 no.2
- /
- pp.23-33
- /
- 2013
W ith the recent advancement of IoT (Internet of Things) technology, there has been much interest in the spatial query processing which energy-efficiently acquires sensor readings from sensor nodes inside specified geographical area of interests. Therefore, various kinds of spatial query processing algorithms and distributed spatial indexing methods have been proposed. They can minimize energy consumption of sensor nodes by reducing wireless communication among them using in-network spatial filtering technology. However, they cannot optimize multiple spatial queries which w ill be w idely used in IoT, because most of them have focused on a single spatial query optimization. Therefore, we propose a new multiple spatial query optimization algorithm which can energy-efficiently process multiple spatial queries in a sensor network. The algorithm uses a concept of 'query merging' that performs the merged set after merging multiple spatial queries located at adjacent area. Here, our algorithm makes a decision on which is better between the merged and the separate execution of queries. For such the decision making, we additionally propose the cost estimation method on the spatial query execution. Finally, we analyze and clarify our algorithm's distinguished features using the spatial indexing methods of GR-tree, SPIX, CPS.
https://doi.org/10.12672/ksis.2013.21.2.023 인용 PDF KSCI

A K-Nearest Neighbour Search Algorithm based on Hilbert Curve for Outsourced Spatial Database (아웃소싱된 공간 데이터베이스를 위한 힐버트 커브 기반 k-최근접점 질의처리 알고리즘)

Yoo, Hye-Kyeom;Chang, Jae-Woo
- Proceedings of the Korea Information Processing Society Conference
- /
- 2011.11a
- /
- pp.1199-1202
- /
- 2011
최근 클라우드 컴퓨팅에 대한 관심이 고조됨에 따라, 이를 활용한 데이터베이스 아웃소싱에 대한 연구가 활발히 진행되고 있다. 한편, 데이터 소유자가 자신이 가지고 있는 공간 데이터베이스를 그대로 아웃소싱 할 경우, 서비스 제공자는 이를 불법으로 취득하여 악용할 수 있고, 질의 요청자들의 통계 정보를 통해 개인정보를 획득할 수 있다. 따라서 아웃소싱 환경에서 개인정보 보호 및 공간 데이터베이스를 보호하기 위한 데이터 변환기법 및 변환된 데이터베이스 상에서 질의를 처리하는 연구가 필요하다. 따라서, 본 논문에서는 아웃소싱 환경에서 공간 네트워크를 고려한 가공 데이터 생성 기법 및 암호화 기법을 설계한다. 아울러, 인증된 사용자가 질의 요청 시, 서비스 제공자가 저장한 가공 데이터를 이용하여 효율적으로 k-최근접점 질의를 수행하기 위한 힐버트 커브 기반 k-최근접점 질의처리 알고리즘을 제안한다.
https://doi.org/10.3745/PKIPS.y2011m11a.1199 인용 PDF

An Efficient Range Query Processing Algorithm on Encrypted Databases (암호화 데이터베이스 상에서의 효율적인 영역 질의처리 알고리즘)

Choi, Mun-Chul;Kim, Hyeong-Il;Chang, Jae-Woo
- Proceedings of the Korea Information Processing Society Conference
- /
- 2015.10a
- /
- pp.1098-1101
- /
- 2015
최근 클라우드 컴퓨팅에 대한 관심이 고조됨에 따라, 아웃소싱된 암호화 데이터베이스 상에서의 영역 질의처리 연구가 활발히 진행되고 있다. 그러나 기존 연구 중 데이터 접근 패턴 보호를 지원하는 연구는 전무하다. 따라서 본 논문에서는 데이터 보호, 사용자 질의 보호 및 데이터 접근 패턴 보호를 모두 지원하는 암호화 데이터베이스 상에서의 영역 질의처리 알고리즘을 제안한다. 성능평가를 통해, 제안하는 기법이 정보보호를 지원하는 동시에 효율적인 처리 성능을 제공함을 보인다.
https://doi.org/10.3745/PKIPS.y2015m10a.1098 인용 PDF

The Multiple Continuous Query Fragmentation for the Efficient Sensor Network Management (효율적인 센서 네트워크 관리를 위한 다중 연속질의 분할)

Park Jung-Up;Jo Myung-Hyun;Son Jin-Hyun
- Proceedings of the Korea Information Processing Society Conference
- /
- 2006.05a
- /
- pp.43-46
- /
- 2006
최근 센서네트워크에 관련된 많은 연구가 진행되고 있다. 특히, 센서의 전력 보전을 위한 많은 기술들이 개발되고 있는데, 본 논문에서는 센서 네트워크의 불필요한 전력 소비를 줄이는 다중 연속질의 최적화에 관련된 방법을 제시한다. 우리는 센서 네트워크에서 전송되는 데이터의 횟수나 전송량의 원천적 문제가 되는 다중 연속 질의의 중복성 문제를 해결하는 분할 알고리즘을 제안한다. 분할 알고리즘은 새롭게 생성된 사용자 질의와 기존의 질의 들 사이에 질의 중첩 질의 영역을 제거하기 위해, QR-트리 기반의 질의 인덱스를 통해 하나의 질의를 둘 이상의 질의로 분할하는 알고리즘이다. QR-트리는 효율적인 질의 분할을 위해, $R^*$-트리를 본 논문의 구조에 맞게 개량한 것이다.
PDF

Development of New Retieval Performance Measures for Query Reformulation Algorithms (질의 재구성 알고리즘의 검색성능을 측정하기 위한 새로운 평가 방법의 개발)

Kim, Nam-Ho;French, James-C.;Brown, Donald-E.
- The Transactions of the Korea Information Processing Society
- /
- v.4 no.4
- /
- pp.963-972
- /
- 1997
In imformation retrival, query reformulation algorithms construct querise from a set of intial input and feedback documents, and retrieval performance cna be varied by different sets of input documents. In this study, we developed a criterion for measuring the performance sensitivity of query reformulation algorithms to unput sets. In addition, we also propose a way of mesuring the changes in retrived area, (CIRA) during qucry reformulation. We cimpared CIRAs of query refromulation algorithms (i.e., query tree, DNF method, and Dillon's method) using three test sets:the CACM, CISI, and Medlars. In the experiments, the query tree showed the highest decreasing CIRA during refirmulations, which means the fastest convergence rate to an output set. For sensitivity analysis, the query tree sored the highest sensitivity to different input sets even though its differences to the tther algorithms are very small.
PDF

A K-nearest Neighbor Query Processing Algorithm for a Query Region toward User Privacy Protection in Road Network (도로 네트워크에서 사용자 정보 보호를 지원하는 질의영역에 대한 k최근접점 질의 처리 알고리즘)

Kim, Hyeong-Il;Yoo, Hye-Kyeom;Chang, Jae-Woo
- Proceedings of the Korean Information Science Society Conference
- /
- 2011.06a
- /
- pp.65-68
- /
- 2011
최근 무선 통신 기술의 발달 및 모바일 기기의 발달로 인하여 위치 기반 서비스가 주목을 받고 있다. 그러나 사용자의 정확한 위치정보를 통해 LBS 서버에 질의를 요청하는 것은 심각한 개인 정보 누출의 위협이 될 수 있기 때문에, 사용자 정보 보호를 위해 도로 네트워크를 고려하여 질의영역을 생성하는 연구가 활발히 진행되어 왔다. 따라서 질의영역에 대한 효율적인 질의 처리 방법이 요구된다. 이를 위해, 본 논문에서는 도로 네트워크에서 사용자 정보 보호를 지원하는 질의영역에 대한 k최근접점 질의 처리 알고리즘을 제안한다. 제안하는 기법은 POI를 효율적으로 검색하기 위하여 Island 인덱스를 사용한다. 또한, 본 논문은 질의 처리 성능을 향상시키기 위해 적응적 Island 인덱스를 생성하는 방법을 제안한다. 마지막으로, 성능평가를 통해 제안하는 기법이 기존 기법들에 비해 네트워크 확장 비용 및 서비스 시간 측면에서 우수함을 보인다.

Search Result 416, Processing Time 0.028 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)