• 제목/요약/키워드: Validation Set

검색결과 679건 처리시간 0.023초

Discrimination Analysis of Gallstones by Near Infrared Spectrometry Using a Soft Independent Modeling of Class Analogy

  • Lee, Sang-Hak;Son, Bum-Mok;Park, Ju-Eun;Choi, Sang-Seob;Nam, Jae-Jak
    • 한국근적외분광분석학회:학술대회논문집
    • /
    • 한국근적외분광분석학회 2001년도 NIR-2001
    • /
    • pp.4106-4106
    • /
    • 2001
  • A method to discriminate human gallstones by nea. infrared(NIR) spectrometry using a soft independent modeling of class analogy (SIMCA) has been studied. The fifty NIR spectra of gallstones in the wavenumber range from 4500 to 10,000 cm$\^$-1/ were measured. The forty samples were classified to three classes, cholesterol stone, calcium bilirubinate stone and calcium carbonate stone according to the contents of major components in each gallstone. The training set which contained objects of the different known class was constructed using forty NIR spectra and the test set was made with ten different gallstone spectra. The number of important principal components(PCs) to describe the class was determined by cross validation in order to improve the decision criterion of the SIMCA for the training set. The score plots of the class training set whose objects belong to the other classes were inspected. The critical distance of each class was computed using both the Euclidean distance and the Mahalanobis distance at a proper level of significance(${\alpha}$). Two methods were compared with respect to classification and their robustness towards the number of PCs selected to describe different classes.

  • PDF

QSO Selections Using Time Variability and Machine Learning

  • 김대원;;변용익
    • 천문학회보
    • /
    • 제36권2호
    • /
    • pp.64-64
    • /
    • 2011
  • We present a new quasi-stellar object (QSO) selection algorithm using a Support Vector Machine, a supervised classification method, on a set of extracted time series features including period, amplitude, color, and autocorrelation value. We train a model that separates QSOs from variable stars, non-variable stars, and microlensing events using 58 known QSOs, 1629 variable stars, and 4288 non-variables in the MAssive Compact Halo Object (MACHO) database as a training set. To estimate the efficiency and the accuracy of the model, we perform a cross-validation test using the training set. The test shows that the model correctly identifies ~80% of known QSOs with a 25% false-positive rate. The majority of the false positives are Be stars. We applied the trained model to the MACHO Large Magellanic Cloud (LMC) data set, which consists of 40 million lightcurves, and found 1620 QSO candidates. During the selection, none of the 33,242 known MACHO variables were misclassified as QSO candidates. In order to estimate the true false-positive rate, we crossmatched the candidates with astronomical catalogs including the Spitzer Surveying the Agents of a Galaxy's Evolution (SAGE) LMC catalog and a few X-ray catalogs. The results further suggest that the majority of the candidates, more than 70%, are QSOs.

  • PDF

암호모듈 검증을 위한 UML 2.0 상태도 기반의 유한상태모델 명세 및 분석 (UML 2.0 Statechart based Modeling and Analysis of Finite State Model for Cryptographic Module Validation)

  • 이강수;정재구;고갑승
    • 정보보호학회논문지
    • /
    • 제19권4호
    • /
    • pp.91-103
    • /
    • 2009
  • 암호알고리즘 및 암호함수를 하드웨어적 또는 소프트웨어적으로 구현한 암호모듈을 암호모듈검증체계 (Cryptographic Module Validation Program, CMVP) 내에서 시험 (또는 인증, 검증)을 받기 위해서는 암호모듈에 대한 유한상태모델(Finite State Model, FSM) 이 개발되고 제공되어야한다. 그러나 FSM을 체계적으로 모델링하고 분석하는 지침은 개발자와 시험자의 경험이므로 잘 알려져 있지 않다. 본 연구에서는 CMVP내에서 암호모듈의 검증을 위해 요구되는 FSM의 모델링, 분석지침, 천이시험경로 생성알고리즘을 제시하고 모델링도구인 CM-Statecharter를 개발하였다. FSM은 UML 2.0의 상태도를 이용해 모델링한다. 상태도는 FSM의 부족한 점 을 보완하고 암호모듈의 FSM을 정형적이고 쉽게 명세할 수 있는 모델이다.

Accuracy of genotype imputation based on reference population size and marker density in Hanwoo cattle

  • Lee, DooHo;Kim, Yeongkuk;Chung, Yoonji;Lee, Dongjae;Seo, Dongwon;Choi, Tae Jeong;Lim, Dajeong;Yoon, Duhak;Lee, Seung Hwan
    • Journal of Animal Science and Technology
    • /
    • 제63권6호
    • /
    • pp.1232-1246
    • /
    • 2021
  • Recently, the cattle genome sequence has been completed, followed by developing a commercial single nucleotide polymorphism (SNP) chip panel in the animal genome industry. In order to increase statistical power for detecting quantitative trait locus (QTL), a number of animals should be genotyped. However, a high-density chip for many animals would be increasing the genotyping cost. Therefore, statistical inference of genotype imputation (low-density chip to high-density) will be useful in the animal industry. The purpose of this study is to investigate the effect of the reference population size and marker density on the imputation accuracy and to suggest the appropriate number of reference population sets for the imputation in Hanwoo cattle. A total of 3,821 Hanwoo cattle were divided into reference and validation populations. The reference sets consisted of 50k (38,916) marker data and different population sizes (500, 1,000, 1,500, 2,000, and 3,600). The validation sets consisted of four validation sets (Total 889) and the different marker density (5k [5,000], 10k [10,000], and 15k [15,000]). The accuracy of imputation was calculated by direct comparison of the true genotype and the imputed genotype. In conclusion, when the lowest marker density (5k) was used in the validation set, according to the reference population size, the imputation accuracy was 0.793 to 0.929. On the other hand, when the highest marker density (15k), according to the reference population size, the imputation accuracy was 0.904 to 0.967. Moreover, the reference population size should be more than 1,000 to obtain at least 88% imputation accuracy in Hanwoo cattle.

행정정보 데이터세트 보존포맷으로서 SIARD 검증에 관한 연구 (A Study on SIARD Verification as a Preservation Format for Data Set Records)

  • 윤성호;이정은;양동민
    • 한국기록관리학회지
    • /
    • 제21권3호
    • /
    • pp.99-118
    • /
    • 2021
  • 4차 산업혁명의 도래로 데이터의 중요성이 커지는 상황에 따라, 해외 각국은 데이터 장기보존 기술 연구를 추진하고 있다. 반면 우리나라는 행정정보 데이터세트가 기록관리 영역으로 법제화됐으나, 구체적인 장기보존 방안이 부재한 상황이다. 이에 본 연구는 여러 선행연구에서 행정정보 데이터세트 보존포맷으로 제안된 SIARD(Software Independent Archiving of Relational Database)에 대한 기초, 교차 검증 시험을 수행했다. 먼저 기초 검증 시험은 SIARD 포맷이 보존할 수 있는 데이터세트의 데이터, 구조, 기능 등을 도출하는데 방점을 두었다. 두 번째 교차 검증 시험은 DBMS 종류에 구애받지 않는 SIARD의 상호호환성 검증에 목적을 두었다. 2차례 검증 시험 결과, SIARD 포맷으로 JSON, UROWID 데이터 타입, FK(Foreign Key), 함수 계열 요소를 보존할 수 없으며, SIARD 2.0 표준에 명시된 기능과 실제 SIARD Suite이 제공하는 기능에 차이가 있음을 확인하였다. 본 연구는 실증적 검증 시험을 진행했으며, SIARD Suite의 기능을 보완하는 개발 방안과 SIARD Suite을 국내 환경에 맞춰 효율적으로 개발할 수 있는 방향성을 제시했다는 점에서 의의가 있다.

A Study on Bandwith Selection Based on ASE for Nonparametric Regression Estimator

  • Kim, Tae-Yoon
    • Journal of the Korean Statistical Society
    • /
    • 제30권1호
    • /
    • pp.21-30
    • /
    • 2001
  • Suppose we observe a set of data (X$_1$,Y$_1$(, …, (X$_{n}$,Y$_{n}$) and use the Nadaraya-Watson regression estimator to estimate m(x)=E(Y│X=x). in this article bandwidth selection problem for the Nadaraya-Watson regression estimator is investigated. In particular cross validation method based on average square error(ASE) is considered. Theoretical results here include a central limit theorem that quantifies convergence rates of the bandwidth selector.tor.

  • PDF

정보기기온칩을 위한 HW/SW 혼합 설계 및 검증 환경 개발 (Developing of HW/SW Co-Design and Verification Environment for Information-App1iance-On-a-Chip)

  • 장준영;신진아;배영환
    • 대한전자공학회:학술대회논문집
    • /
    • 대한전자공학회 2001년도 하계종합학술대회 논문집(2)
    • /
    • pp.117-120
    • /
    • 2001
  • This paper presents a HW/SW co-design environments and its validation for development of virtual component on the 32-bit RISC core which is used in the design of Information-Appliance-On-a-Chip. For the experimental environment, we developed the cycle-accurate instruction set simulator based on SE3208 RISC core of ADChips. To verify the function of RISC core at the cycle level, we implemented the verification environment by grafting this simulator on the Seamless CVE which is a commercial co-verification environment.

  • PDF

Detecting Influential Observations on the Smoothing Parameter in Nonparametric Regression

  • Kim, Choong-Rak;Jeon, Jong-Woo
    • Journal of the Korean Statistical Society
    • /
    • 제24권2호
    • /
    • pp.495-506
    • /
    • 1995
  • We present formula for detecting influential observations on the smoothing parameter in smoothing spline. Further, we express them as functions of basic building blocks such as residuals and leverage, and compare it with the local influence approach by Thomas (1991). An example based on a real data set is given.

  • PDF

모델의 타당성 평가에 기초한 로바스트 동정에 관한 연구 (A Study on Robust Identification Based on the Validation Evaluation of Model)

  • 이동철
    • 동력기계공학회지
    • /
    • 제4권3호
    • /
    • pp.72-80
    • /
    • 2000
  • In order to design a stable robust controller, nominal model, and the upper bound about the uncertainty which is the error of the model are needed. The problem to estimate the nominal model of controlled system and the upper bound of uncertainty at the same time is called robust identification. When the nominal model of controlled system and the upper bound of uncertainty in relation to robust identification are given, the evaluation of the validity of the model and the upper bound makes it possible to distinguish whether there is a model which explains observation data including disturbance among the model set. This paper suggests a method to identity the uncertainty which removes disturbance and expounds observation data by giving a probable postulation and plural data set to disturbance. It also examines the suggested method through a numerical computation simulation and validates its effectiveness.

  • PDF

TV 세트의 스피커에 의한 소음 대책 설계 (Noise Reduction of Mono Type TV Sets Induced by Speaker)

  • 김종연;이중근;김재환;박상덕;최진성;박종성
    • 소음진동
    • /
    • 제9권4호
    • /
    • pp.730-737
    • /
    • 1999
  • This paper illustrates the sound vibration phenomenon of mono type TV set produced by spearker and suggests guidelines for reducing the noise induced by the sound vibrations. In order to illustrate the sound vibration phenomenon, the structural acoustic coupled analysis for the grill and cavity of speaker and structural analysis for main frame are performed. To veify the structural analysis results, experimental modal test is carried out. It is found that the acoustic excitation in the cavity is negligible and main sound vibrations occur near the bottom of TV set. An improved model is found by doing structural modifications based on structural analysis and sound vibration tests are performed to verify the validation of the improved model. The obtained results are applied to similar models and design guide lines for noise reduction are suggested.

  • PDF