• 제목/요약/키워드: High-dimensional Data

검색결과 1,549건 처리시간 0.031초

비스플라인 부피에 기초한 유동 가시화 모델 (Flow Visualization Model Based on B-spline Volume)

  • 박상근;이건우
    • 한국CDE학회논문집
    • /
    • 제2권1호
    • /
    • pp.11-18
    • /
    • 1997
  • Scientific volume visualization addresses the representation, manipulation, and rendering of volumetric data sets, providing mechanisms for looking closely into structures and understanding their complexity and dynamics. In the past several years, a tremendous amount of research and development has been directed toward algorithms and data modeling methods for a scientific data visualization. But there has been very little work on developing a mathematical volume model that feeds this visualization. Especially, in flow visualization, the volume model has long been required as a guidance to display the very large amounts of data resulting from numerical simulations. In this paper, we focus on the mathematical representation of volumetric data sets and the method of extracting meaningful information from the derived volume model. For this purpose, a B-spline volume is extended to a high dimensional trivariate model which is called as a flow visualization model in this paper. Two three-dimensional examples are presented to demonstrate the capabilities of this model.

  • PDF

초고속 통신망상에서 3차원 동시 형상 설계 (3-Dimensional Concurrent Geometric Modeling on High Speed Network)

  • 정운용;한순흥
    • 한국전자거래학회지
    • /
    • 제1권1호
    • /
    • pp.141-157
    • /
    • 1996
  • Data sharing is a major challenge to implement CALS. STEP is the international standard for the product model data exchange among heterogeneous systems and plays a key role in CALS. Advances in computer networks are rapidly changing the product development processes. The network oriented modeling system premises to integrate design activities across the enterprise. To achieve goals of CALS 3-dimensional concurrent modeling that complies international standard is required since integrity and common perception of product data can be assured. This paper presents 3-dimensional concurrent geometric modeling on high speed network using STEP and methodologies.

  • PDF

An Incremental Similarity Computation Method in Agglomerative Hierarchical Clustering

  • Jung, Sung-young;Kim, Taek-soo
    • 한국지능시스템학회논문지
    • /
    • 제11권7호
    • /
    • pp.579-583
    • /
    • 2001
  • In the area of data clustering in high dimensional space, one of the difficulties is the time-consuming process for computing vector similarities. It becomes worse in the case of the agglomerative algorithm with the group-average link and mean centroid method, because the cluster similarity must be recomputed whenever the cluster center moves after the merging step. As a solution of this problem, we present an incremental method of similarity computation, which substitutes the scalar calculation for the time-consuming calculation of vector similarity with several measures such as the squared distance, inner product, cosine, and minimum variance. Experimental results show that it makes clustering speed significantly fast for very high dimensional data.

  • PDF

Comprehensive review on Clustering Techniques and its application on High Dimensional Data

  • Alam, Afroj;Muqeem, Mohd;Ahmad, Sultan
    • International Journal of Computer Science & Network Security
    • /
    • 제21권6호
    • /
    • pp.237-244
    • /
    • 2021
  • Clustering is a most powerful un-supervised machine learning techniques for division of instances into homogenous group, which is called cluster. This Clustering is mainly used for generating a good quality of cluster through which we can discover hidden patterns and knowledge from the large datasets. It has huge application in different field like in medicine field, healthcare, gene-expression, image processing, agriculture, fraud detection, profitability analysis etc. The goal of this paper is to explore both hierarchical as well as partitioning clustering and understanding their problem with various approaches for their solution. Among different clustering K-means is better than other clustering due to its linear time complexity. Further this paper also focused on data mining that dealing with high-dimensional datasets with their problems and their existing approaches for their relevancy

고차원 데이터의 효율적인 최근접 객체 검색 기법 (Efficient Searching Technique for Nearest Neighbor Object in High-Dimensional Data)

  • 김진호;박영배
    • 정보처리학회논문지D
    • /
    • 제11D권2호
    • /
    • pp.269-280
    • /
    • 2004
  • 피라미드 기법은 n-차원 공간 데이터를 1차원 데이터로 변환하여 B+-트리로 표현하며, n-차원 데이터 공간에서 하이퍼큐브 영역질의 처리로 발생하는 “차원의 저주현상”에 영향을 받지 않게 검색 시간 문제를 해결하고 있다. 또 구형 피라미드 기법(SPY-TEC)은 피라미드 기법의 공간 분할 전략을 응용하여 유사도 검색에 적합한 구 영역질의 방법을 사용하고 검색 성능을 개선하고 있다. 하지만 유사도 검색의 응용에서 영역질의는 범위를 지정하는데 어려움이 있어 최근접 질의가 더 효율적이며, 기존의 제안된 인덱스 기법들은 특정 분포의 데이터에 대해서만 우수한 성능을 보이는 단점이 있다. 따라서 이 논문에서는 멀티미디어 데이터와 같은 고차원 데이터의 검색 성능을 향상시키기 위해 제안되었던 PdR-트리를 이용하여 최근접 객체 검색 기법을 제안한다. 다양한 분포의 모의 데이터와 실제 데이터를 이용하여 실험한 결과, PdR-트리가 피라미드 기법과 구형 피라미드 기법보다 검색 성능이 향상되었음을 보이고 있다.

Nonlinear PLS Monitoring Applied to An Wastewater Treatment Process

  • Bang, Yoon-Ho;Yoo, Chang-Kyoo;Park, Sang-Wook;Lee, In-Beum
    • 제어로봇시스템학회:학술대회논문집
    • /
    • 제어로봇시스템학회 2001년도 ICCAS
    • /
    • pp.102.1-102
    • /
    • 2001
  • In this work, extensions to partial least squares (PLS) for wastewater treatment (WWT) process monitoring are discussed. Conventional data gathered by monitoring WWT systems are usually time varying, high dimensional, correlated and nonlinear, PLS has been shown to be an efficient approach in modeling and monitoring high dimensional and correlated data. To represent dynamic and nonlinear features of the data several kinds of dynamic nonlinear PLS (DNLPLS) models have been proposed. However, the complexity and ambiguity of the models make them unsuitable for WWT monitoring, Recently, dynamic fuzzy PLS (DFPLS) was proposed ...

  • PDF

k-NN Join Based on LSH in Big Data Environment

  • Ji, Jiaqi;Chung, Yeongjee
    • Journal of information and communication convergence engineering
    • /
    • 제16권2호
    • /
    • pp.99-105
    • /
    • 2018
  • k-Nearest neighbor join (k-NN Join) is a computationally intensive algorithm that is designed to find k-nearest neighbors from a dataset S for every object in another dataset R. Most related studies on k-NN Join are based on single-computer operations. As the data dimensions and data volume increase, running the k-NN Join algorithm on a single computer cannot generate results quickly. To solve this scalability problem, we introduce the locality-sensitive hashing (LSH) k-NN Join algorithm implemented in Spark, an approach for high-dimensional big data. LSH is used to map similar data onto the same bucket, which can reduce the data search scope. In order to achieve parallel implementation of the algorithm on multiple computers, the Spark framework is used to accelerate the computation of distances between objects in a cluster. Results show that our proposed approach is fast and accurate for high-dimensional and big data.

Adaptive data hiding scheme based on magic matrix of flexible dimension

  • Wu, Hua;Horng, Ji-Hwei;Chang, Chin-Chen
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • 제15권9호
    • /
    • pp.3348-3364
    • /
    • 2021
  • Magic matrix-based data hiding schemes are applied to transmit secret information through open communication channels safely. With the development of various magic matrices, some higher dimensional magic matrices are proposed for improving the security level. However, with the limitation of computing resource and the requirement of real time processing, these higher dimensional magic matrix-based methods are not advantageous. Hence, a kind of data hiding scheme based on a single or a group of multi-dimensional flexible magic matrices is proposed in this paper, whose magic matrix can be expanded to higher dimensional ones with less computing resource. Furthermore, an adaptive mechanism is proposed to reduce the embedding distortion. Adapting to the secret data, the magic matrix with least distortion is chosen to embed the data and a marker bit is exploited to record the choice. Experimental results confirm that the proposed scheme hides data with high security and a better visual quality.

고차원 데이터를 부분차원 클러스터링하는 효과적인 알고리즘 (An Effective Algorithm for Subdimensional Clustering of High Dimensional Data)

  • 박종수;김도형
    • 정보처리학회논문지D
    • /
    • 제10D권3호
    • /
    • pp.417-426
    • /
    • 2003
  • 고차원 데이터에서 클러스터를 찾아내는 문제는 그 중요성으로 인해 데이터 마이닝 분야에서 잘 알려져 있다. 클러스터 분석은 패턴 인식, 데이터 분석, 시장 분석 등의 여러 응용 분야에 광범위하게 사용되어지고 있다. 최근에 이 문제를 풀 수 있는 투영된 클러스터링이라는 새로운 방법론이 제기되었다. 이것은 먼저 각 후보 클러스터의 부분차원들을 선택하고 이를 근거로 한 거리 함수에 따라 가장 가까운 클러스터에 점이 배정된다. 우리는 고차원 데이터를 부분차원 클러스터링하는 새로운 알고리즘을 제안한다. 알고리즘의 주요한 세 부분은, $\circled1$적절한 개수의 점들을 갖는 여러 개의 후보 클러스터로 입력 점들을 분할하고, $\circled2$다음 단계에서 유용하지 않은 클러스터들을 제외하고, 그리고 $\circled3$선택된 클러스터들은 밀접도 함수를 사용하여 미리 정해진 개수의 클러스터들로 병합한다. 다른 클러스터링 알고리즘과 비교하여 제안된 알고리즘의 좋은 성능을 보여주기 위하여 많은 실험을 수행하였다.

2차원 바코드를 위한 데이터 부호화 알고리즘 설계 (Design of Data Encoding Algorithm for a Two Dimensional Bar Code)

  • 전성구;김일환
    • 산업기술연구
    • /
    • 제25권B호
    • /
    • pp.171-174
    • /
    • 2005
  • In this paper, we propose a new data encoding algorithm for a two-dimensional bar code system. In general, the one-dimensional bar code is just a key which can access detailed information to the host computer database. But the two-dimensional bar code is a new technology which can obtain high density information without access to the host computer database. We implemented the encoding algorithm for Data Matrix bar code which is the most widely used among the many kinds of two-dimensional bar codes in the field of marking using Digital Signal Processor (TMS320C31). The performance of the proposed algorithm is verified by comparing the imprinted symbols on the steel surfaces with the codes which are decoded by a bar code reader.

  • PDF