Search | Korea Science

Architecture of a scalable general-purpose associative processor and its applications (확장 가능한 범용 Associative Processor 구조 및 응용)

Yun, Jae-Bok;Kim, Ju-Young;Kim, Jin-Wook;Park, Tae-Geun
- Proceedings of the IEEK Conference
- /
- 2005.11a
- /
- pp.1141-1144
- /
- 2005
일반 컴퓨터에서 중앙처리장치와 메모리 사이의 병목 현상인 "Von Neumann Bottleneck"을 보이는데 본 논문에서는 이러한 문제점을 해소하고 검색위주의 응용분야에서 우수한 성능을 보이는 확장 가능한 범용 Associative Processor(AP) 구조를 제안하였다. 본 연구에서는 Associative computing을 효율적으로 수행할 수 있는 명령어 세트를 제안하였으며 다양하고 대용량 응용분야에도 적용할 수 있도록 구조를 확장 가능하게 설계함으로써 유연한 구조를 갖는다. 12 가지의 명령어가 정의되었으며 프로그램이 효율적으로 수행될 수 있도록 명령어 셋을 구성하고 연속된 명령어를 하나의 명령어로 구현함으로써 처리시간을 단축하였다. 제안된 프로세서는 bit-serial, word-parallel로 동작하며 대용량 병렬 SIMD 구조를 갖는 32 비트 범용 병렬 프로세서로 동작한다. 포괄적인 검증을 위하여 명령어 단위의 검증 뿐 아니라 최대/최소 검색, 이상/이하 검색, 병렬 덧셈 등의 기본적인 병렬 알고리즘을 검증하였으며 알고리즘은 처리 데이터의 개수와는 무관한 상수의 복잡도 O(k)를 갖으며 데이터의 비트 수만큼의 이터레이션을 갖는다.
PDF

Distributed Processing Environment for Outlier Removal to Analyze Big Data (대용량 데이터 분석을 위한 이상치 제거용 분산처리 환경)

Hong, Yejin;Na, Eunhee;Jung, Yonghwan;Kim, Yangwoo
- Proceedings of the Korean Society of Computer Information Conference
- /
- 2016.07a
- /
- pp.73-74
- /
- 2016
IoT 데이터는 비정형 데이터로 가공되고 분석하였을 때 비로소 가치를 갖기에 전 세계적으로 빅데이터 기술에 관심이 집중되고 있다. IoT 데이터 중 많은 부분을 차치하는 센서 데이터는 수집이 용이하고 활용범위가 넓기 때문에 여러 분야에서 사용되고 있다. 하지만 센서가 정상적으로 작동하지 못한 경우에는 실제와는 다른 값인 이상치를 포함하여 왜곡된 결과가 도출되어 활용할 수 없는 경우가 생긴다. 따라서 본 논문에서는 정확한 결과를 도출하기 위하여 수집된 원자료의 데이터를 분석하기 전에 이상치 탐지 및 제거를 하고자 한다. 또한 점점 늘어나고 있는 대용량 데이터를 신속하게 처리하기 위하여 메모리 접근방식인 스파크를 사용한 분산처리환경에서 이상치 탐지 및 제거하는 것을 제안한다. 맵리듀스 기반의 이상치 탐지 및 제거는 총 4단계로 나누어 구현하였으며 제안한 기법의 성능 평가를 위해 총 3가지 환경에서 비교하여 실험하였다. 실험을 통해 데이터의 용량이 커질수록 분산처리환경에서 스파크를 사용하여 처리하는 방식이 가장 빠를 것 이라는 결과를 얻었다.
PDF

Development of Efficient Data Distribution Storage Algorithm for High Speed Data Backup in DRAM based SSD (DRAM기반 SSD에서 고속백업을 위한 DRAM기반 SSD에서 고속백업을 위한 효율적인 데이터 분산저장 알고리즘 개발)

Song, Han-chun;Ahn, Hyuk-jong
- The Journal of the Institute of Internet, Broadcasting and Communication
- /
- v.15 no.6
- /
- pp.11-15
- /
- 2015
SSD(Solide State Disk) Memory is gradually replacing HDD(Hard Disk Drive) for high speed data input and output processing on Internet server system. In this paper, we proposed efficient data distribution storage algorithm in DRAM based SSD. And we did performance test for proposed algorithm and existing algorithm in the same server environment condition. The result of comparison of performance test is as fellows. Data backup processing time of proposed algorithm was reduced by half than that of existing algorithm.
https://doi.org/10.7236/JIIBC.2015.15.6.11 인용 PDF KSCI

A Study of File Replacement Policy in Data Grid Environments (데이터 그리드 환경에서 파일 교체 정책 연구)

Park, Hong-Jin
- The KIPS Transactions:PartA
- /
- v.13A no.6 s.103
- /
- pp.511-516
- /
- 2006
The data grid computing provides geographically distributed storage resources to solve computational problems with large-scale data. Unlike cache replacement policies in virtual memory or web-caching replacement, an optimal file replacement policy for data grids is the one of the important problems by the fact that file size is very large. The traditional file replacement policies such as LRU(Least Recently Used) LCB-K(Least Cost Beneficial based on K), EBR(Economic-based cache replacement), LVCT(Least Value-based on Caching Time) have the problem that they have to predict requests or need additional resources to file replacement. To solve theses problems, this paper propose SBR-k(Sized-based replacement-k) that replaces files based on file size. The results of the simulation show that the proposed policy performs better than traditional policies.
https://doi.org/10.3745/KIPSTA.2006.13A.6.511 인용 PDF KSCI

Design of Hybrid Storage Manager for Realtime Services in LBS Platform (LBS 플랫폼에서 실시간 서비스를 위한 하이브리드 저장 관리자 설계)

Park Jun-Yong;Kim Ho-Seok;Kim Myoung-Keun;Bae Hae-Young
- Proceedings of the Korea Information Processing Society Conference
- /
- 2004.11a
- /
- pp.153-156
- /
- 2004
위치기반 서비스 시스템에서 주 요구사항은 사용자의 실시간 위치 정보 검색과 질의에 대한 빠른 응답시간이다. 그러나 기존의 위치기반서비스 시스템에서 저장장치로 사용하는 디스크 기반 데이터베이스 시스템은 빈번한 디스크 I/O로 인해 발생하는 질의응답에 대한 시간지연으로 이러한 요구사항을 만족시키지 못한다. 또한 주기억장치 데이터베이스 시스템은 한번에 주기억장치에 적재할 수 있는 양이 제한되어 있기 때문에 대용량의 공간 데이터를 처리하는데 문제가 있다. 본 논문에서는 위치기반서비스를 위하여 위치 정보 데이터의 다단계 저장 관리를 통해 LBS 플랫폼에서 실시간 서비스를 지원하는 하이브리드 저장 관리자를 제안한다. 하이브리드 저장 관리 시스템은 위치기반 서비스 시스템에서 대용량의 이동체 데이터의 효과적인 저장 및 관리를 위해 주기억장치 데이터베이스 관리시스템과 디스크 기반의 공간 데이터베이스 관리시스템을 통합한 구조를 가진다. LBS 플랫폼에서 빠른 응답을 필요로 하고 자주 사용되는 데이터는 메모리 기반의 데이터베이스에 관리를 하고, 사용 빈도가 적은 데이터는 디스크 기반의 데이터베이스에 관리하는 하이브리드 저장 관리자를 제시한다.
PDF

An Effective Indexing Method for Trajectory Databases (대용량 궤적 데이터를 위한 효과적인 인덱싱 기법)

Cha, Chang-Il;Won, Jung-Im;Kim, Sang-Wook
- Proceedings of the Korea Information Processing Society Conference
- /
- 2008.05a
- /
- pp.227-230
- /
- 2008
본 연구에서는 대용량 궤적 데이터베이스에서 영역 질의를 효과적으로 처리하기 위한 인덱싱 기법에 대하여 논의한다. 먼저, 기존 인덱싱 기법의 문제점을 지적하고, 이러한 문제점을 해결하는 새로운 기법을 제안한다. 제안된 기법에서는 우선 시간 차원을 다수의 시간 구간으로 분할하고, 인덱싱의 대상이 되는 전체 라인 세그먼트들을 시간 구간별로 구분한다. 각 시간 구간에 속하는 라인 세그먼트들에 대하여 별도의 인덱스를 구축한다. 또한, 디스크에서 관리되는 과거 시간 구간에 대한 인덱스들과는 달리 최근 시간 구간에 대한 인덱스는 메인 메모리상에 관리함으로써 삽입과 검색의 성능을 크게 개선할 수 있다. 각 시간 구간에 속하는 라인 세그먼트들은 다음과 같은 방식으로 인덱스를 구축한다. 먼저, 2D-트리를 이용하여 전체 공간 차원을 유사한 수의 라인 세그먼트들이 배정되도록 다수의 셀들로 분할한다. 또한, 분할된 각 셀마다 시공간 차원 (x, y, t)에 대한 별도의 3차원 $R^*$-트리를 두어 보다 상세한 인덱싱을 지원한다. 실험 결과에 의하면, 기존 기법에 비하여 작은 인덱스 구조를 갖으면서도 검색 성능면에서 $300%{\sim}1000%$까지의 성능 향상 효과를 갖는 것으로 나타났다.
https://doi.org/10.3745/PKIPS.y2008m05a.227 인용 PDF

Massive 3D Point Cloud Visualization by Generating Artificial Center Points from Multi-Resolution Cube Grid Structure (다단계 정육면체 격자 기반의 가상점 생성을 통한 대용량 3D point cloud 가시화)

Yang, Seung-Chan;Han, Soo Hee;Heo, Joon
- Journal of the Korean Society of Surveying, Geodesy, Photogrammetry and Cartography
- /
- v.30 no.4
- /
- pp.335-342
- /
- 2012
3D point cloud is widely used in Architecture, Civil Engineering, Medical, Computer Graphics, and many other fields. Due to the improvement of 3D laser scanner, a massive 3D point cloud whose gigantic file size is bigger than computer's memory requires efficient preprocessing and visualization. We suggest a data structure to solve the problem; a 3D point cloud is gradually subdivided by arbitrary-sized cube grids structure and corresponding point cloud subsets generated by the center of each grid cell are achieved while preprocessing. A massive 3D point cloud file is tested through two algorithms: QSplat and ours. Our algorithm, grid-based, showed slower speed in preprocessing but performed faster rendering speed comparing to QSplat. Also our algorithm is further designed to editing or segmentation using the original coordinates of 3D point cloud.
https://doi.org/10.7848/ksgpc.2012.30.4.335 인용 PDF KSCI

Video Index Generation and Search using Trie Structure (Trie 구조를 이용한 비디오 인덱스 생성 및 검색)

현기호;김정엽;박상현
- Journal of KIISE:Software and Applications
- /
- v.30 no.7_8
- /
- pp.610-617
- /
- 2003
Similarity matching in video database is of growing importance in many new applications such as video clustering and digital video libraries. In order to provide efficient access to relevant data in large databases, there have been many research efforts in video indexing with diverse spatial and temporal features. however, most of the previous works relied on sequential matching methods or memory-based inverted file techniques, thus making them unsuitable for a large volume of video databases. In order to resolve this problem, this paper proposes an effective and scalable indexing technique using a trie, originally proposed for string matching, as an index structure. For building an index, we convert each frame into a symbol sequence using a window order heuristic and build a disk-resident trie from a set of symbol sequences. For query processing, we perform a depth-first search on the trie and execute a temporal segmentation. To verify the superiority of our approach, we perform several experiments with real and synthetic data sets. The results reveal that our approach consistently outperforms the sequential scan method, and the performance gain is maintained even with a large volume of video databases.
PDF KSCI

A method of Securing Mass Storage for SQL Server by Sharing Network Disks - on the Amazon EC2 Windows Environments - (네트워크 디스크를 공유하여 SQL 서버의 대용량 스토리지 확보 방법 - Amazon EC2 Windows 환경에서 -)

Kang, Sungwook;Choi, Jungsun;Choi, Jaeyoung
- Journal of Internet Computing and Services
- /
- v.17 no.2
- /
- pp.1-9
- /
- 2016
Users are provided infrastructure such as CPU, memory, network, and storage as IaaS (Infrastructure as a Service) service on cloud computing environments. However storage instances cannot support the maximum storage capacity that SQL servers can use, because the capacity of instances provided by service providers is usually limited. In this paper, we propose a method of securing mass storage capacity for SQL servers by sharing network disks with limited storage capacity. We confirmed through experiments that it is possible to secure mass storage capacity, which exceeds the maximum storage capacity provided by an instance with Amazon EBS on Amazon EC2 Windows environments, and it is possible to improve the overall performance of the SQL servers by increasing the disk capacity and performance.
https://doi.org/10.7472/jksii.2016.17.2.01 인용 PDF KSCI

RDBMS Based Efficient Method for Shortest Path Searching Over Large Graphs Using K-degree Index Table (대용량 그래프에서 k-차수 인덱스 테이블을 이용한 RDBMS 기반의 효율적인 최단 경로 탐색 기법)

Hong, Jihye;Han, Yongkoo;Lee, Young-Koo
- KIPS Transactions on Software and Data Engineering
- /
- v.3 no.5
- /
- pp.179-186
- /
- 2014
Current networks such as social network, web page link, traffic network are big data which have the large numbers of nodes and edges. Many applications such as social network services and navigation systems use these networks. Since big networks are not fit into the memory, existing in-memory based analysis techniques cannot provide high performance. Frontier-Expansion-Merge (FEM) framework for graph search operations using three corresponding operators in the relational database (RDB) context. FEM exploits an index table that stores pre-computed partial paths for efficient shortest path discovery. However, the index table of FEM has low hit ratio because the indices are determined by distances of indices rather than the possibility of containing a shortest path. In this paper, we propose an method that construct index table using high degree nodes having high hit ratio for efficient shortest path discovery. We experimentally verify that our index technique can support shortest path discovery efficiently in real-world datasets.
https://doi.org/10.3745/KTSDE.2014.3.5.179 인용 PDF KSCI

Search Result 373, Processing Time 0.03 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)