Search | Korea Science

Dataset Property - based Algebraic Operators for Data Mining Preprocessing (데이터집합 특성에 기반한 데이터 마이닝 전처리 대수 연산자)

Kim, Hyo-Sook;Lee, Won-Suk
- Proceedings of the Korea Information Processing Society Conference
- /
- 2002.11c
- /
- pp.1709-1712
- /
- 2002
지식 탐사 연구의 핵심이 되어온 데이터 마이닝은 축적 데이터로부터 쉽게 추출되지 않는 데이터 상호관계나 일정 패턴과 같은 유용한 내재 정보 추출을 주된 목적으로 수행된다. 그러나, 데이터 마이닝은 대용량의 데이터 처리로 인해 빈번한 메모리 공간 제약과 처리 속도 저하 등의 한계성을 드러낸다. 이를 극복하기 위해 많은 마이닝 알고리즘 개발과 기존 알고리즘 개선 방법이 제시되어 왔으나 여전히 궁극적인 해결방안은 대두되지 않고 있다. 따라서, 만약 데이터 전처리 과정을 통해 마이닝 목적에 적합한 부분 데이터집합 추출 및 가공이 선행된다면 보다 효율적인 데이터 마이닝 작업을 유도할 수 있을 것이다. 본 논문은 효과적 데이터 전처리를 위한 필수 기본 연산 기능들을 주어진 데이터집합의 트랜잭션 및 데이터 특성에 기초하여 관계형 대수 형태로 의미를 정립하고, 적용 사례에 의한 상세 설명 및 실제 구현된 온라인 데이터 전처리 시스템을 제안한다.
PDF

Design of Spatial Clustering Method for Data Mining of Various Spatial Objects (다양한 공간객체의 데이터 마이닝을 위한 공간 클러스터링 기법의 설계)

문상호;최진오;김진덕
- Journal of the Korea Institute of Information and Communication Engineering
- /
- v.8 no.4
- /
- pp.955-959
- /
- 2004
Existing Clustering Methods for spatial data mining process only Point objects, not spatial objects with polygonometry such as lines and areas. It is because that distance computation between objects with polygonometry for clustering is more complex than distance computation between point objects. To solve this problem, we design a clustering method based on regular grid cell structures. In details, it reduces cost and time for distance computation using cell relationships in grid cell structures.
PDF KSCI

Design and Implementation of Spatial Association Rule Discovery System for Spatial Data Analysis (공간 데이터 분석을 위한 공간 연관 규칙 탐사 시스템의 설계 및 구현)

Ahn, Chan-Min;Lee, Yun-Seok;Park, Sang-Ho;Lee, Ju-Hong
- Journal of the Korea Society of Computer and Information
- /
- v.11 no.1 s.39
- /
- pp.27-34
- /
- 2006
Recently, the study about the technology which effectively manage spatial information is actively conducted. For the effective knowledge inquiry, various extended data mining methods are applied in spatial data mining. However, former spatial association rule system appears the problem that does not reflect various non-spatial property along the inquiries because it searches the rule from the calculation among predicates. To resolve the problem, present study suggests the system that extends the inquiries using in spatial database, searches the association rule among non-spatial object property after setting the data based on space information. Especially, the model which is applicable to geographical information system is embodied. Embodied system with this method enables to search more useful spatial association rule in real life since it shows high migration property with extended spatial database and considers spatial property and various non-spatial property.
PDF

Real-time Data Mining application Model In Electronic Commerce (전자상거래 상에서의 실시간 데이터 마이닝 활용 모델)

Kim, Ko-Eun;Ok, Jee-Woong;Kim, Ung-Mo
- Proceedings of the Korean Information Science Society Conference
- /
- 2007.10c
- /
- pp.155-158
- /
- 2007
현재 전자상거래는 우리의 생활과 밀접히 연관되어 있다. 최근 인터넷을 기반으로 전자조달, 수출입 브로커 등과 같은 유형의 B2B 전자상거래가 활발히 이루어지고 있으며, 소비자를 대상으로 하는 전자상거래 또한 점차 확산되는 시장을 형성하고 있다. 국제적으로도 전자상거래 시장 규모가 급속도로 증가할 것이라는 전망은 자명한 사실이다. 전자상거래에 대한 의존도가 높아지면서 관리해야 하는 데이터의 양 또한 급속도로 증가하고 있다. 본 논문에서는 실시간으로 유입되는 데이터를 효율적으로 활용하기 위챈 실시간 데이터 마이닝 활용 모델을 제안한다. 이 실시간 데이터 마이닝 모델은 지속적으로 유입되는 데이터의 규칙화를 통해 저장 공간의 효율성을 극대화하고 중요도 분석을 통한 총체적인 접근 방법을 시도함으로써 전자상거래 상에서 유용하게 쓰일 수 있는 활용 모델이다. 이 실시간 데이터 마이닝 모델의 바탕은 데이터 마이닝의 기법인 SEMMA를 따르며, 그 특징에 따라 규칙 추출과 의사 결정 나무 기법을 이용하여 전자상거래 상에서 유용하게 사용될 수 있는 모델을 제시하고자 한다.
PDF

A Design of SOA-based Data Integration Framework for Effective Spatial Data Mining (효과적인 공간 데이터 마이닝을 위한 SOA 기반 데이터 통합 프레임워크 설계)

Moon, Il-Hwan;Hur, Hwan;Kim, Sam-Keun
- The KIPS Transactions:PartD
- /
- v.18D no.5
- /
- pp.385-392
- /
- 2011
Recently, the concern of IT-in-Agriculture convergence technology that combines information technology and agriculture is increasing rapidly. Especially, the crop cultivation related prediction services by spatial data mining (SDM) can play an important role in reducing the damage of natural disaster and enhancing crop productivity. However, the data conversion and integration procedure to acquire the learning dataset of SDM for the prediction service need a lot of effort and time, because of their heterogeneity between distributed data. In addition, calculating spatial neighborhood relationships between spatial and non-spatial data necessitates requires the complicated calculation procedure for large dataset. In this paper, we suggest a SOA-based data integration framework that can effectively integrate distributed heterogeneous data by treating each data source as a service unit and support to find the optimal prediction service by improving productivity of learning dataset for SDM. In our experiment, we confirmed that our framework can be effectively applied to find the optimal prediction service for the frost damage area, by considering the case of peach crop cultivation in Icheon in Korea.
https://doi.org/10.3745/KIPSTD.2011.18D.5.385 인용 PDF KSCI

Grid Cell Based Spatial Clustering Method (그리드 셀 기반 공간 클러스터링 방법)

이동규;정정수;문상호
- Proceedings of the Korean Information Science Society Conference
- /
- 2001.04b
- /
- pp.10-12
- /
- 2001
대용량의 공간 데이터베이스로부터 임시적이고 유용한 지식을 자동적으로 추출하는 공간데이터 마이닝은 데이터양의 급격히 증가하면서 필요성이 더욱 증대되고 있다. 공간데이타 마이닝에서 데이터를 분석하여 유사한 그룹으로 분류하는 것은 중요한 분야이며, 이를 위해서는 공간 클러스터링 과정이 먼저 수행되어야 한다. 이러한 공간 클러스터링에서 가장 중요한 점은 클러스터링에 드는 비용의 감소와 점 공간객체에 한정된 클러스터링이 아닌 선 및 다각형 객체들의 클러스터링도 가능해야 한다. 본 본문은 이를 위하여 공간지역성을 보장하는 대표적인 공간분할 방법인 그리드 셀을 이용한다. 기존의 클러스터링에서 사용되는 객체들 간의 거리 계산을 인접한 그리드 셀들 간의 관계 연산으로 대체시키는 것이 핵심아이디어이다. 이 방법은 기존 클러스터링에서 객체들 간의 거리 계산으로 인한 비용을 현저하게 줄일 수 있고, 선 및 다각형 객체들의 클러스터링도 가능하게 하는 장점이 있다.
PDF

SOM-based Spatio-Temporal Data Mining System (SOM 기반 시공간 데이터 마이닝 시스템)

Kang Juyoung;Lee Bongjae;Song Jaeju;Shin Jinho;Yong Hwanseung
- Proceedings of the Korea Information Processing Society Conference
- /
- 2004.11a
- /
- pp.105-108
- /
- 2004
데이터 양이 급증함에 따라 축적된 데이터로부터 의미있는 지식을 추출해 내고자 하는 데이터 마이닝에 대한 연구가 활발하게 진행되어 왔다. 특히 최근, 환경이 이동 분산화 되어감에 따라 감시${\cdot}$모니터링 시스템, 기상 관측 시스템, GPS 시스템과 같은 다양한 응용 시스템으로부터 방대한 양의 시공간 데이터가 발생하게 되었고, 이른 효율적으로 분석하고자 하는 시공간 데이터 마이닝 연구에 대한 관심이 더욱 높아지고 있다. 기존의 데이터 마이닝 기법의 경우 문자나 숫자 데이터를 대상으로 최적화 되어있기 때문에 시${\cdot}$공간 속성을 동시에 가지는 데이터를 분석하기에는 한계가 있는 것이 사실이다. 본 논문에서는 SOM(Self-Organizing Map)을 적용하여 시공간 클러스터링 모듈을 개발하고, 개발된 모듈의 성능 및 클러스터링 정확성을 다른 세 가지 군집분석 알고리즘과 비교, 분석하였다. 또한 가시화 모듈을 개발하여 입력 데이터의 특성과 결과를 더욱 정확하게 분석할 수 있도록 하였다.
PDF

Dynamic Subspace Clustering for Online Data Streams (온라인 데이터 스트림에서의 동적 부분 공간 클러스터링 기법)

Park, Nam Hun
- Journal of Digital Convergence
- /
- v.20 no.2
- /
- pp.217-223
- /
- 2022
Subspace clustering for online data streams requires a large amount of memory resources as all subsets of data dimensions must be examined. In order to track the continuous change of clusters for a data stream in a finite memory space, in this paper, we propose a grid-based subspace clustering algorithm that effectively uses memory resources. Given an n-dimensional data stream, the distribution information of data items in data space is monitored by a grid-cell list. When the frequency of data items in the grid-cell list of the first level is high and it becomes a unit grid-cell, the grid-cell list of the next level is created as a child node in order to find clusters of all possible subspaces from the grid-cell. In this way, a maximum n-level grid-cell subspace tree is constructed, and a k-dimensional subspace cluster can be found at the k^th level of the subspace grid-cell tree. Through experiments, it was confirmed that the proposed method uses computing resources more efficiently by expanding only the dense space while maintaining the same accuracy as the existing method.
https://doi.org/10.14400/JDC.2022.20.2.217 인용 PDF KSCI

Mining of Subspace Contrasting Sample Groups in Microarray Data (마이크로어레이 데이터의 부공간 대조 샘플집단 마이닝)

Lee, Kyung-Mi;Lee, Keon-Myung
- Journal of the Korean Institute of Intelligent Systems
- /
- v.21 no.5
- /
- pp.569-574
- /
- 2011
In this paper, we introduce the subspace contrasting group identification problem and propose an algorithm to solve the problem. In order to identify contrasting groups, the algorithm first determines two groups of which attribute values are in one of the contrasting ranges specified by the analyst, and searches for the contrasting groups while increasing the dimension of subspaces with an association rule mining strategy. Because the dimension of microarray data is likely to be tens of thousands, it is burdensome to find all contrasting groups over all possible subspaces by query generation. It is very useful in the sense that the proposed method allows to find those contrasting groups without analyst's involvement.
https://doi.org/10.5391/JKIIS.2011.21.5.569 인용 PDF KSCI

Location Generalization Method for Pattern Mining of Moving Object (이동 객체의 패턴 마이닝을 위한 위치 일반화 방법)

Ko, Hyun;Kim, Kwang-Jong;Lee, Yon-Sik
- Proceedings of the Korea Information Processing Society Conference
- /
- 2006.11a
- /
- pp.405-408
- /
- 2006
사용자들의 특성에 맞게 개인화되고 세분화된 위치 기반 서비스를 제공하기 위해서는 방대한 이동객체의 위치 이력 데이터로부터 유용한 패턴을 추출하기 위한 시간 패턴 탐사가 필요하다. 기존의 시간 패턴 탐사 기법들은 이동 객체의 시간에 따른 공간 속성들의 변화를 충분히 고려하지 못하거나, 시공간 속성을 동시에 고려한 패턴 탐사는 가능하나 제약을 가진 공간 정보를 포함하는 패턴 탐사 문제에는 적용하기 어렵다. 따라서 이동 객체의 위치 이력 데이터들에 대한 시공간적 속성들을 동시에 고려하여 다양한 이동 패턴들 중 공간 제약을 만족하는 패턴들을 추출하기 위한 새로운 이동 패턴 탐사 기법이 요구된다. 이러한 패턴 탐사 기법의 개발을 위해서는 상세 수준의 위치 이력 데이터들을 공간 영역 정보 형태로 변환하는 위치 일반화 접근법이 필요하다. 이에 본 논문에서는 객체의 위치값과 공간 영역간의 위상 관계를 고려하여 이동 객체의 위치 속성에 대한 공간영역으로의 일반화 방법을 제안한다. 이동 객체의 상세 수준의 위치 정보에서는 의미있는 패턴을 찾기가 어렵기 때문에 데이터 전처리 과정을 통해 일반화된 데이터 집합을 형성함으로써 효율적인 이동 객체의 시간 패턴 마이닝을 유도할 수 있다.
PDF

Search Result 177, Processing Time 0.028 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)