• Title/Summary/Keyword: Validation data set

Search Result 379, Processing Time 0.028 seconds

Cancer Registration in Basrah-Southern Iraq: Validation by Household Survey

  • Hussain, Riyadh Abdul-Ameer;Habib, Omran S
    • Asian Pacific Journal of Cancer Prevention
    • /
    • 제17권sup3호
    • /
    • pp.197-200
    • /
    • 2016
  • On an international scale, the burden of cancer in absolute numbers continues to increase, mainly due to aging of population in many countries, the overall growth of the world population, changing lifestyle with increasing cancer-causing behavior, like cigarette smoking, changing dietary habits and sedentary life. Cancer is the second-leading cause of death and disability in the world, after only heart disease. Recently, increasing incidence and mortality of cancer have also become evident in the developing world. In Iraq and particularly in Basrah in the southern part of the country, the burden has definitely increased and deserves extensive research. The present paper is part of an extensive household survey carried out in Basrah in 2013. Among the objectives was to validate official cancer registration in the governorate. The cross-sectional survey had a retrospective component to inquire about the incidence of cancer and cancer-related deaths during the three years preceding the date of inquiry (2010-2012). A convenient sample of 6,999 households with 40,688 inhabitants using multistage cluster sampling was surveyed involving all urban and rural areas of Basrah. The official cancer registration activities in Basrah seemed to have attained a high level of registration coverage (70-80%) but the gap, represented by missed cases, is still high enough to criticize the system. Most of the missing cases were either not notified by treating facilities or they were diagnosed and treated outside Basrah. Using a set of parameters, the pattern of cancer was consistent based on data of the household survey and data of the cancer registry but a gap still existed in the coverage of incident cancer and mortality by cancer registration. Integrated serious steps are required to contain the risk of cancer and its burden on the patient through improving the registration process, improving early detection, diagnostic and management capabilities and encouraging scientific research to explore the hidden risk factors and possible causes of low registration coverage. Periodic household surveys seemed feasible and essential to support routine registration.

Hardware 유역의 수문매개변수 보정을 위한 SWAT-CUP 프로그램의 적용성 평가 (Evaluation of Applicability of SWAT-CUP Program for Hydrologic Parameter Calibration in Hardware Watershed)

  • 김상민
    • 한국농공학회논문집
    • /
    • 제59권3호
    • /
    • pp.63-70
    • /
    • 2017
  • The purpose of this study was to calibrate the hydrologic parameters of SWAT model and analyze the daily runoff for the study watershed using SWAT-CUP. The Hardware watershed is located in Virginia, USA. The watershed area is $356.15km^2$, and the land use accounts for 73.4 % of forest and 23.2 % of pasture. Input data for the SWAT model were obtained from the digital elevation map, landuse map, soil map and others. Water flow data from 1990 to 1994 was used for calibration and from 1997 to 2005 was for validation. The SUFI-2 module of the SWAT-CUP program was used to calibrate the hydrologic parameters. The parameters were calibrated for the highly sensitive parameters presented in previous studies. The P-factor, R-factor, $R^2$, Nash-Sutcliffe efficiency (NS), and average flow were used for the goodness-of-fit measures. The applicability of the model was evaluated by sequentially increasing the number of applied parameters from 4 to 11. In this study, 10-parameter set was accepted for calibration in consideration of goodness-of-fit measures. For the calibration period, P-factor was 0.85, R-factor was 1.76, $R^2$ was 0.51 and NS was 0.49. The model was validated using the adjusted ranges of selected parameters. For the validation period, P-factor was 0.78, R-factor was 1.60, $R^2$ was 0.60 and NS was 0.57.

개선된 배깅 앙상블을 활용한 기업부도예측 (Bankruptcy prediction using an improved bagging ensemble)

  • 민성환
    • 지능정보연구
    • /
    • 제20권4호
    • /
    • pp.121-139
    • /
    • 2014
  • 기업의 부도 예측은 재무 및 회계 분야에서 매우 중요한 연구 주제이다. 기업의 부도로 인해 발생하는 비용이 매우 크기 때문에 부도 예측의 정확성은 금융기관으로서는 매우 중요한 일이다. 최근에는 여러 개의 모형을 결합하는 앙상블 모형을 부도 예측에 적용해 보려는 연구가 큰 관심을 끌고 있다. 앙상블 모형은 개별 모형보다 더 좋은 성과를 내기 위해 여러 개의 분류기를 결합하는 것이다. 이와 같은 앙상블 분류기는 분류기의 일반화 성능을 개선하는 데 매우 유용한 것으로 알려져 있다. 본 논문은 부도 예측 모형의 성과 개선에 관한 연구이다. 이를 위해 사례 선택(Instance Selection)을 활용한 배깅(Bagging) 모형을 제안하였다. 사례 선택은 원 데이터에서 가장 대표성 있고 관련성 높은 데이터를 선택하고 예측 모형에 악영향을 줄 수 있는 불필요한 데이터를 제거하는 것으로 이를 통해 예측 성과 개선도 기대할 수 있다. 배깅은 학습데이터에 변화를 줌으로써 기저 분류기들을 다양화시키는 앙상블 기법으로 단순하면서도 성과가 매우 좋은 것으로 알려져 있다. 사례 선택과 배깅은 각각 모형의 성과를 개선시킬 수 있는 잠재력이 있지만 이들 두 기법의 결합에 관한 연구는 아직까지 없는 것이 현실이다. 본 연구에서는 부도 예측 모형의 성과를 개선하기 위해 사례 선택과 배깅을 연결하는 새로운 모형을 제안하였다. 최적의 사례 선택을 위해 유전자 알고리즘이 사용되었으며, 이를 통해 최적의 사례 선택 조합을 찾고 이 결과를 배깅 앙상블 모형에 전달하여 새로운 형태의 배깅 앙상블 모형을 구성하게 된다. 본 연구에서 제안한 새로운 앙상블 모형의 성과를 검증하기 위해 ROC 커브, AUC, 예측정확도 등과 같은 성과지표를 사용해 다양한 모형과 비교 분석해 보았다. 실제 기업데이터를 사용해 실험한 결과 본 논문에서 제안한 새로운 형태의 모형이 가장 좋은 성과를 보임을 알 수 있었다.

TRADE-OFFS BETWEEN FUEL ECONOMY AND NOX EMISSIONS USING FUZZY LOGIC CONTROL WITH A HYBRID CVT CONFIGURATION

  • Rousseau, A.;Saglini, S.;Jakov, M.;Gray, D.;Hardy, K.
    • International Journal of Automotive Technology
    • /
    • 제4권1호
    • /
    • pp.47-55
    • /
    • 2003
  • The Center for Transportation Research at the Argonne National Laboratory (ANL) supports the DOE by evaluating advanced automotive technologies in a systems context. ha has developed a unique set of compatible simulation tools and test equipment to perform an integrated systems analysis project from modeling through hardware testing and validation. This project utilized these capabilities to demonstrate the trade-off in fuel economy and Oxides of Nitrogen (NOx) emissions in a so-called ‘pre-transmission’ parallel hybrid powertrain. The powertrain configuration (in simulation and on the dynamometer) consists of a Compression Ignition Direct Ignition (CIDI) engine, a Continuously Variable Transmission (CVT) and an electric drive motor coupled to the CVT input shaft. The trade-off is studied in a simulated environment using PSAT with different controllers (fuzzy logic and rule based) and engine models (neural network and steady state models developed from ANL data).

A Test for Equality Form of Covariance Matrices of Multivariate Normal Populations

  • Kim, Hea-Jung
    • Journal of the Korean Statistical Society
    • /
    • 제20권2호
    • /
    • pp.191-201
    • /
    • 1991
  • Given a set of data pxN$_{i}$, matrices X$_{i}$ observed from p-variate normal populations $\prod$$_{i}$~N($\mu$$_{I}$, $\Sigma$$_{i}$) for i=1, …, K, the test for equality form of the covariance matrices is to choose a hypothetical model which best explains the homogeneity/heterogeneity structure across the covariance matrices among the hypothesized class of models. This paper describes a test procedure for selecting the best model. The procedure is based on a synthesis of Bayesian and a cross-validation or sample reuse methodology that makes use of a one-at-a-time schema of observational omissions. Advantages of the test are argued on two grounds, and illustrative examples and simulation results are given.are given.

  • PDF

Online Social Media Review Mining for Living Items with Probabilistic Approach: A Case Study

  • Li, Shuai;Hao, Fei;Kim, Hee-Cheol
    • 스마트미디어저널
    • /
    • 제2권2호
    • /
    • pp.20-27
    • /
    • 2013
  • The concept of social media is top of the agenda for many business executives and decision makers, as well as consultants try to identify ways where companies can make profitable use of applications such as Netflix, Flixster. The social media is playing an increasingly important role as the information sources for customers making product choices etc. With the flourish of Web 2.0 technology, customer reviews are becoming more and more useful and important information resources for people to save their time and energy on purchasing products that they want. This paper proposes the Bayesian Probabilistic Classification algorithm to mine the social media review, and evaluates it by different splits and cross validation mechanism from the real data set. The explored study experimental results show the robustness and effectiveness of proposed approach for mining the social media review.

  • PDF

기록치 오차와 유역모형의 검정(II) - 모니터링 검정방법 - (Errors in Recorded Information and Calibration of a Catchment Modelling System(II) - Monitoring Calibration Approach -)

  • Choi, Kyung Sook
    • 한국농공학회지
    • /
    • 제45권5호
    • /
    • pp.117-125
    • /
    • 2003
  • Since the recorded information used for operation of a catchment modelling system contain errors that influence the calibration of catchment modelling system control parameter values, the accurate estimation of these parameters is difficult. Despite these influences, existing traditional calibration approaches focus only on achieving the best "curve fitting" between simulated and recorded data, and not on generic evaluation of control parameter values. This paper introduces an Early Stopping Technique which is aimed at avoiding the procedure of curve-fitting through monitoring improvements in the objective function used for assessing the optimal parameter set. Application of this approach to the calibration of SWMM (Storm Water Management Model) on the Centennial Park catchment in Sydney, Australia is outlined. outlined.

Prediction of Thermal Decomposition Temperature of Polymers Using QSPR Methods

  • Ajloo, Davood;Sharifian, Ali;Behniafar, Hossein
    • Bulletin of the Korean Chemical Society
    • /
    • 제29권10호
    • /
    • pp.2009-2016
    • /
    • 2008
  • The relationship between thermal decomposition temperature and structure of a new data set of eighty monomers of different polymers were studied by multiple linear regression (MLR). The stepwise method was used in order to variable selection. The best descriptors were selected from over 1400 descriptors including; topological, geometrical, electronic and hybrid descriptors. The effect of number of descriptors on the correlation coefficient (R) and F-ratio were considered. Two models were suggested, one model having four descriptors ($R^2$ = 0.894, $Q^2_{cv}$ = 0.900, F = 172.1) and other model involving 13 descriptors ($R^2$ = 0.956, $Q^2_{cv}$ = 0.956, F = 125.4).

신경회로망을 응용한 현가장치의 폐회로 시스템 규명 (Empirical Closed Loop Modeling of a Suspension System Using Neural Network)

  • Kim, I.Y.;Chong, K.T.;Hong, D.P.
    • 한국정밀공학회지
    • /
    • 제14권7호
    • /
    • pp.29-38
    • /
    • 1997
  • A closed-loop system modeling of an active/semiactive suspension system has been accomplished through an artificial neural network. A 7DOF full model as a system's equation of motion has been derived and an output feedback linear quadratic regulator has been designed for control purpose. A training set of a sample data has been obtained through a computer simulation. A 7DOF full model with LQR controller simulated under several road conditions such as sinusoidal bumps and rectangular bumps. A general multilayer perceptron neural network is used for dynamic modeling and target outputs are fedback to the a layer. A backpropagation method is used as a training algorithm. Model validation of new dataset have been shown through computer simulations.

  • PDF

Text-driven Speech Animation with Emotion Control

  • Chae, Wonseok;Kim, Yejin
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • 제14권8호
    • /
    • pp.3473-3487
    • /
    • 2020
  • In this paper, we present a new approach to creating speech animation with emotional expressions using a small set of example models. To generate realistic facial animation, two example models called key visemes and expressions are used for lip-synchronization and facial expressions, respectively. The key visemes represent lip shapes of phonemes such as vowels and consonants while the key expressions represent basic emotions of a face. Our approach utilizes a text-to-speech (TTS) system to create a phonetic transcript for the speech animation. Based on a phonetic transcript, a sequence of speech animation is synthesized by interpolating the corresponding sequence of key visemes. Using an input parameter vector, the key expressions are blended by a method of scattered data interpolation. During the synthesizing process, an importance-based scheme is introduced to combine both lip-synchronization and facial expressions into one animation sequence in real time (over 120Hz). The proposed approach can be applied to diverse types of digital content and applications that use facial animation with high accuracy (over 90%) in speech recognition.