• 제목/요약/키워드: Multiple Data Sets

검색결과 348건 처리시간 0.026초

다중 지구과학자료를 이용한 GIS 기반 공간통합과 통계량 분석 : 광물 부존 예상도 작성을 위한 사례 연구 (GIS-based Spatial Integration and Statistical Analysis using Multiple Geoscience Data Sets : A Case Study for Mineral Potential Mapping)

  • 이기원;박노욱;권병두;지광훈
    • 대한원격탐사학회지
    • /
    • 제15권2호
    • /
    • pp.91-105
    • /
    • 1999
  • 최근 다중 지질정보의 통합적 해석은 GIS의 중요한 응용 분야중 하나로 인식되고 있다. 공간통합을 위하여 지구통계학적 방법들이 개발되어 있지만, 통합결과와 입력 주제도들 사이의 관계에 대한 통계적, 정량적 분석방법론의 개발은 아직까지 체계적으로 정립되어 있지 못한 상황이다. 본 연구에서는 지질도, 지화학자료, 항공지구물리자료, 지형자료 및 원격탐사 영상등 다양한 지질정보등이 보고된 옥동지역을 대상으로 하여 광물 부존 예상도 작성 사례연구를 수행하여 기존에 이용되고 있는 여러 공간 통합 방법중 확실인자 (Certainty Factor: CF) 추정방법과 다변량 통계 분석방법중 하나인 주성분분석을 시험적인 통합방법으로 우선적으로 적용한 뒤, 입력 자료와 통합결과에 대한 정량적인 통계량 정보를 추출하고자 하였다. 입력 주제도와 통합 결과사이의 관계 규명에는 통계 분할표를 이용한 통계처리를 편의 분석에는 잭나이프 방법을 적용하였다. 통합정보에 대한 통계량 분석을 통하여, 통합 결과와 입력자료 사이의 정량적 관계를 추출할 수 있었으며, 부가적으로 입력자료의 상태수준에 대한 판단정보를 얻을 수 있었다. 이러한 결과는 GIS 관점에서 통합결과 해석에 중요한 결정보조자료로 활용될 수 있으며, 복잡한 다중정보를 다루는데 공간 통합문제에서도 입력정보 검증을 위한 일반적일 처리과정으로도 발전할 수 있을 것으로 생각된다.

다중 3차원 거리정보 데이타의 자동 정합 방법 (Automatic Registration Method for Multiple 3D Range Data Sets)

  • 김상훈;조청운;홍현기
    • 한국정보과학회논문지:소프트웨어및응용
    • /
    • 제30권12호
    • /
    • pp.1239-1246
    • /
    • 2003
  • 대상 물체의 3차원 모델을 구축하기 위해서는 여러 시점에서 측정된 거리정보 데이타들을 하나의 좌표계로 통합하는 정합(registration) 과정이 필수적이다. 3차원 데이타의 정합을 위해 가장 널리 사용되는 ICP(Iterative Closest Point) 알고리즘은 거리정보 데이타 간에 겹치는 영역 또는 일치점 등에 대한 사전 정보가 필요하다. 본 논문에서는 임의의 시점에서 측정된 데이타를 반복적인 방법에 의해 자동으로 정합하는 개선된 ICP 방법이 제안된다. 3차원 데이타가 거리정보 영상으로 맺히는 관계를 나타내는 센서 사영조건(projection constraint), 데이타의 공분산(covariance) 행렬, 교차(cross) 사영 등을 이용하여 정합과정을 자동화하였으며, 유저의 개입이나 3차원 기계 보조 장치 등을 사용하는 별도의 초기값 측정 없이 3차원 모델을 정확하게 구성할 수 있다. 다양한 거리정보 데이타에 대한 실험을 통해 제안된 방법의 우수한 성능을 확인하였다.

Application of SOLAS to the Multiple Imputation for Missing Data

  • Moon, Sung-Ho;Kim, Hyun-Jeong;Shin, Jae-Kyoung
    • Journal of the Korean Data and Information Science Society
    • /
    • 제14권3호
    • /
    • pp.579-590
    • /
    • 2003
  • When we analyze incomplete data, i.e., data with missing values, we need treatment for the missing values. A common way to deal with this problem is to delete the cases with missing values. Various other methods have been developed. Among them are EM algorithm and regression algorithm which can estimate missing values and impute the missing elements with the estimated values. In this paper, we introduce multiple imputation software SOLAS which generates multiple data sets and imputes with them.

  • PDF

다중선형회귀분석에 의한 계절별 저수지 유입량 예측 (Forecasting of Seasonal Inflow to Reservoir Using Multiple Linear Regression)

  • 강재원
    • 한국환경과학회지
    • /
    • 제22권8호
    • /
    • pp.953-963
    • /
    • 2013
  • Reliable long-term streamflow forecasting is invaluable for water resource planning and management which allocates water supply according to the demand of water users. Forecasting of seasonal inflow to Andong dam is performed and assessed using statistical methods based on hydrometeorological data. Predictors which is used to forecast seasonal inflow to Andong dam are selected from southern oscillation index, sea surface temperature, and 500 hPa geopotential height data in northern hemisphere. Predictors are selected by the following procedure. Primary predictors sets are obtained, and then final predictors are determined from the sets. The primary predictor sets for each season are identified using cross correlation and mutual information. The final predictors are identified using partial cross correlation and partial mutual information. In each season, there are three selected predictors. The values are determined using bootstrapping technique considering a specific significance level for predictor selection. Seasonal inflow forecasting is performed by multiple linear regression analysis using the selected predictors for each season, and the results of forecast using cross validation are assessed. Multiple linear regression analysis is performed using SAS. The results of multiple linear regression analysis are assessed by mean squared error and mean absolute error. And contingency table is established and assessed by Heidke skill score. The assessment reveals that the forecasts by multiple linear regression analysis are better than the reference forecasts.

한국 지방공사 의료원의 생산성 평가와 비교 (Productivity Evaluation and Comparision of Korean Provincial Hospitals)

  • 안태식;박정식
    • 한국병원경영학회지
    • /
    • 제2권1호
    • /
    • pp.22-47
    • /
    • 1997
  • This paper evaluated the relative efficiency of 33 provincial medical centers using Data Envelopment Analysis(DEA) and compared the DEA efficiency results with those of the current method conducted by the management evaluation team. DEA Was selected as an alternative efficiency evaluation method since it could handle multiple inputs and multiple outputs simultaneously and identify the sources of inefficiency. To analyze the sensitivity of productivity values to the variable sets, four different sets of input and output variables were identified. Results showed that most of the medical centers are operating far away from the efficiency frontier supporting the previous results. Some centers showed 100% efficiency regardless of the selected variable sets. DEA results are compared with current management evaluation results. Some inconsistencies were found for some DMUs between the results of two methods showing the existence of methodology bias. DEA results and ratio analyses results mostly agree for 1992 data.

  • PDF

다 모델 방식과 모델보상을 통한 잡음환경 음성인식 (A Multi-Model Based Noisy Speech Recognition Using the Model Compensation Method)

  • 정용주;곽성우
    • 대한음성학회지:말소리
    • /
    • 제62호
    • /
    • pp.97-112
    • /
    • 2007
  • The speech recognizer in general operates in noisy acoustical environments. Many research works have been done to cope with the acoustical variations. Among them, the multiple-HMM model approach seems to be quite effective compared with the conventional methods. In this paper, we consider a multiple-model approach combined with the model compensation method and investigate the necessary number of the HMM model sets through noisy speech recognition experiments. By using the data-driven Jacobian adaptation for the model compensation, the multiple-model approach with only a few model sets for each noise type could achieve comparable results with the re-training method.

  • PDF

ERS-1 AND CCRS C-SAR Data Integration For Look Direction Bias Correction Using Wavelet Transform

  • Won, J.S.;Moon, Woo-Il M.;Singhroy, Vern;Lowman, Paul-D.Jr.
    • 대한원격탐사학회지
    • /
    • 제10권2호
    • /
    • pp.49-62
    • /
    • 1994
  • Look direction bias in a single look SAR image can often be misinterpreted in the geological application of radar data. This paper investigates digital processing techniques for SAR image data integration and compensation of the SAR data look direction bias. The two important approaches for reducing look direction bias and integration of multiple SAR data sets are (1) principal component analysis (PCA), and (2) wavelet transform(WT) integration techniques. These two methods were investigated and tested with the ERS-1 (VV-polarization) and CCRS*s airborne (HH-polarization) C-SAR image data sets recorded over the Sudbury test site, Canada. The PCA technique has been very effective for integration of more than two layers of digital image data. When there only two sets of SAR data are available, the PCA thchnique requires at least one more set of auxiliary data for proper rendition of the fine surface features. The WT processing approach of SAR data integration utilizes the property which decomposes images into approximated image ( low frequencies) characterizing the spatially large and relatively distinct structures, and detailed image (high frequencies) in which the information on detailed fine structures are preserved. The test results with the ERS-1and CCRS*s C-SAR data indicate that the new WT approach is more efficient and robust in enhancibng the fine details of the multiple SAR images than the PCA approach.

A Technique to Improve the Fit of Linear Regression Models for Successive Sets of Data

  • Park, Sung H.
    • Journal of the Korean Statistical Society
    • /
    • 제5권1호
    • /
    • pp.19-28
    • /
    • 1976
  • In empirical study for fitting a multiple linear regression model for successive cross-sections data observed on the same set of independent variables over several time periods, one often faces the problem of poor $R^2$, the multiple coefficient of determination, which provides a standard measure of how good a specified regression line fits the sample data.

  • PDF

Deformable image registration in radiation therapy

  • Oh, Seungjong;Kim, Siyong
    • Radiation Oncology Journal
    • /
    • 제35권2호
    • /
    • pp.101-111
    • /
    • 2017
  • The number of imaging data sets has significantly increased during radiation treatment after introducing a diverse range of advanced techniques into the field of radiation oncology. As a consequence, there have been many studies proposing meaningful applications of imaging data set use. These applications commonly require a method to align the data sets at a reference. Deformable image registration (DIR) is a process which satisfies this requirement by locally registering image data sets into a reference image set. DIR identifies the spatial correspondence in order to minimize the differences between two or among multiple sets of images. This article describes clinical applications, validation, and algorithms of DIR techniques. Applications of DIR in radiation treatment include dose accumulation, mathematical modeling, automatic segmentation, and functional imaging. Validation methods discussed are based on anatomical landmarks, physical phantoms, digital phantoms, and per application purpose. DIR algorithms are also briefly reviewed with respect to two algorithmic components: similarity index and deformation models.

Cross platform classification of microarrays by rank comparison

  • Lee, Sunho
    • Journal of the Korean Data and Information Science Society
    • /
    • 제26권2호
    • /
    • pp.475-486
    • /
    • 2015
  • Mining the microarray data accumulated in the public data repositories can save experimental cost and time and provide valuable biomedical information. Big data analysis pooling multiple data sets increases statistical power, improves the reliability of the results, and reduces the specific bias of the individual study. However, integrating several data sets from different studies is needed to deal with many problems. In this study, I limited the focus to the cross platform classification that the platform of a testing sample is different from the platform of a training set, and suggested a simple classification method based on rank. This method is compared with the diagonal linear discriminant analysis, k nearest neighbor method and support vector machine using the cross platform real example data sets of two cancers.