• Title/Summary/Keyword: parsimonious model

A Study on Rainfall Runoff Models for Forecast of Long-Term Runoff in Miho Basin (미호천 유역의 장기유출 예측을 위한 개념적 강우유출모형의 적용)

  • Ahn, Sang-Eok;Lee, Hyo-Sang;Jeon, Min-Woo
    • Proceedings of the Korea Water Resources Association Conference / 2009.05a / pp.991-995 / 2009
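  • Recently, owing to climate change and other factors, the number of rainy days in Korea has decreased while the frequency of heavy rainfall events has increased. Because damage from extreme events such as droughts and floods is likely to increase, accurate analysis of the long-term rainfall-runoff process is essential for protecting life and property from such disasters and for using water resources efficiently. This study applied a conceptual rainfall-runoff model to simulate long-term runoff in the Miho Stream basin. The model used is the PDM (Probability Distributed Model), a lumped model that treats the basin as a single unit; it is simpler (more parsimonious) than distributed models and is widely used in the United Kingdom for water resources and flood management. The model was calibrated with the MC (Monte Carlo) method and the SCE-UA (Shuffled Complex Evolution-University of Arizona) method, and its performance was assessed with the NSE (Nash-Sutcliffe Efficiency) objective function. Both the MC and SCE-UA methods yielded NSE values of 0.9 or higher, indicating satisfactory simulation performance. The PDM, which requires less hydrological data and fewer calibration parameters than distributed models, was applied to a small-to-medium basin where hydrological data are difficult to obtain, and it performed well in both calibration and discharge estimation. The PDM should therefore be examined for a variety of basins across Korea, and it is judged to be applicable to flood estimation and water resources management in Korea in the future.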
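The NSE objective function named in the abstract above can be sketched as follows. This is a minimal illustration using the standard Nash-Sutcliffe definition (one minus the ratio of squared simulation error to the variance of the observations); the function name and the sample values are assumptions for illustration, not the authors' code or data.

```python
from typing import Sequence


def nash_sutcliffe_efficiency(observed: Sequence[float],
                              simulated: Sequence[float]) -> float:
    """Return NSE = 1 - sum((obs - sim)^2) / sum((obs - mean(obs))^2)."""
    if len(observed) != len(simulated) or len(observed) == 0:
        raise ValueError("observed and simulated must be non-empty and equal in length")
    mean_obs = sum(observed) / len(observed)
    # Squared error of the simulation against the observations.
    residual = sum((o - s) ** 2 for o, s in zip(observed, simulated))
    # Spread of the observations around their own mean.
    spread = sum((o - mean_obs) ** 2 for o in observed)
    if spread == 0:
        raise ValueError("NSE is undefined when all observations are equal")
    return 1.0 - residual / spread


# Hypothetical discharge values, for illustration only.
obs = [1.0, 2.0, 3.0]
perfect = nash_sutcliffe_efficiency(obs, [1.0, 2.0, 3.0])     # exact match
mean_only = nash_sutcliffe_efficiency(obs, [2.0, 2.0, 2.0])   # predicts the mean
```

An NSE of 1 indicates a perfect match, and 0 indicates a simulation no better than predicting the observed mean, which gives some context for the reported values of 0.9 or higher.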

Secondary Structure and Phylogenetic Implications of ITS2 in the Genus Tricholoma

  • Suh, Seok-Jong;Kim, Jong-Guk
    • Journal of Microbiology and Biotechnology / v.12 no.1 / pp.130-136 / 2002
  • The internal transcribed spacer (ITS) region in the genus Tricholoma was analyzed with respect to its primary nucleotide sequence and secondary structure. The secondary structures of the ITS2 region in the genus Tricholoma were identified for use in bioinformatic processes to study molecular evolution and compare secondary structures. Ten newly sequenced ITS regions were added to the analysis and submitted to the GenBank database. The structure obtained from a minimum energy algorithm indicated the four-domain model, as previously suggested by others. The conserved secondary structure of the ITS2 sequences of the genus Tricholoma exhibited certain unique features, including pyrimidine tracts in the loops of domain A and a complete structure containing four domains, with motifs identified in other ITS2 secondary structures. A phylogenetic tree was derived from a sequence alignment based on the secondary structures. From the resulting most parsimonious tree, it was found that the species in the genus Tricholoma had evolved monophyletically and were composed of four groups, as supported by the bootstrap values and pileus color.

Analysis of SEER Adenosquamous Carcinoma Data to Identify Cause Specific Survival Predictors and Socioeconomic Disparities

  • Cheung, Rex
    • Asian Pacific Journal of Cancer Prevention / v.17 no.1 / pp.347-352 / 2016
  • Background: This study used receiver operating characteristic (ROC) curves to analyze Surveillance, Epidemiology and End Results (SEER) adenosquamous carcinoma data to identify predictive models and potential disparities in outcome. Materials and Methods: This study analyzed socio-economic, staging and treatment factors available in the SEER database for adenosquamous carcinoma. For the risk modeling, each factor was fitted by a generalized linear model to predict cause specific survival. The area under the ROC curve was computed. Similar strata were combined to construct the most parsimonious models. Results: A total of 20,712 patients diagnosed from 1973 to 2009 were included in this study. The mean follow up time (S.D.) was 54.2 (78.4) months. Some 2/3 of the patients were female. The mean (S.D.) age was 63 (13.8) years. SEER stage was the most predictive factor of outcome (ROC area of 0.71). Some 13.9% of the patients were un-staged and had a risk of cause specific death of 61.3%, which was higher than the 45.3% risk for regional disease and lower than the 70.3% risk for metastatic disease. Sex, site, radiotherapy, and surgery had ROC areas of about 0.55-0.65. Rural residence and race contributed to socioeconomic disparity in treatment outcome. Radiotherapy was underused even in localized and regional stages when the intent was curative. This underuse was most pronounced in older patients. Conclusions: Anatomic stage was predictive and useful in treatment selection. Under-staging may have contributed to poor outcome.
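The ROC area used to rank predictors in the abstract above can be illustrated with the rank-based (Mann-Whitney) definition: the probability that a randomly chosen positive case receives a higher score than a randomly chosen negative case, counting ties as one half. This is a minimal sketch under that assumption; the abstract does not state how the area was computed, and the function name and the scores below are hypothetical.

```python
from typing import Sequence


def roc_area(scores: Sequence[float], labels: Sequence[int]) -> float:
    """Area under the ROC curve via pairwise comparison of positives and negatives."""
    positives = [s for s, y in zip(scores, labels) if y == 1]
    negatives = [s for s, y in zip(scores, labels) if y == 0]
    if not positives or not negatives:
        raise ValueError("need at least one positive and one negative case")
    # Each positive/negative pair counts 1 if ranked correctly, 0.5 if tied.
    wins = sum(
        1.0 if p > n else 0.5 if p == n else 0.0
        for p in positives
        for n in negatives
    )
    return wins / (len(positives) * len(negatives))


# Hypothetical predicted risks and outcomes (1 = cause specific death).
area = roc_area([0.1, 0.4, 0.35, 0.8], [0, 0, 1, 1])
```

An area of 0.5 corresponds to a factor with no discriminating power, which is why ROC areas of about 0.55-0.65 indicate weak predictors and 0.71 a moderately informative one.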

Analysis of SEER Glassy Cell Carcinoma Data: Underuse of Radiotherapy and Predictors of Cause Specific Survival

  • Cheung, Rex
    • Asian Pacific Journal of Cancer Prevention / v.17 no.1 / pp.353-356 / 2016
  • Background: This study used receiver operating characteristic (ROC) curves to analyze Surveillance, Epidemiology and End Results (SEER) glassy cell carcinoma data to identify predictive models and potential disparities in outcome. Materials and Methods: This study analyzed socio-economic, staging and treatment factors. For risk modeling, each factor was fitted by a generalized linear model to predict cause specific survival. Areas under the ROC curves were computed. Similar strata were combined to construct the most parsimonious models. A random sampling algorithm was used to estimate modeling errors. Risk of glassy cell carcinoma death was computed for the predictors for comparison. Results: There were 79 patients included in this study. The mean follow up time (S.D.) was 37 (32.8) months. Female patients outnumbered males 4:1. The mean (S.D.) age was 54.4 (19.8) years. SEER stage was the most predictive factor of outcome (ROC area of 0.69). The risks of cause specific death were, respectively, 9.4% for localized, 16.7% for regional, 35% for the un-staged/others category, and 60% for distant disease. After optimization, the separation between the regional and un-staged/others categories was removed, giving a higher ROC area of 0.72. Several socio-economic factors had small but measurable effects on outcome. Radiotherapy had not been used in 90% of patients with regional disease. Conclusions: Optimized SEER stage was predictive and useful in treatment selection. Underuse of radiotherapy may have contributed to poor outcome.

Forcing a Closer Fit in the Lower Tails of a Distribution for Better Estimating Extremely Small Percentiles of Strengths

  • Guess, Frank-M.;Leon, Ramon-V.;Chen, Weiwei;Young, Timothy-M.
    • International Journal of Reliability and Applications / v.5 no.4 / pp.129-145 / 2004
  • We use a novel forced censoring technique that more closely fits the lower tails of strength distributions to better estimate extremely small percentiles for measuring progress in continuous improvement initiatives. These percentiles are of great interest to companies, government oversight organizations, and consumers concerned with safety and preventing accidents for many products in general, but specifically for medium density fiberboard (MDF). The international industrial standard for measuring the highest quality of MDF is internal bond (IB, also called tensile strength), and its smaller percentiles are crucial, especially the first percentile and lower ones. We induce censoring at a value just above the median to weight lower observations more. Using this approach, we obtain better fits in the lower tails of the distribution, where these smaller percentiles are impacted most. Finally, bootstrap estimates of the small percentiles are used to demonstrate improved intervals from our forced censoring approach and the fitted model. There was evidence from the study to suggest that MDF has potentially different failure modes for early failures. Overall, our approach is parsimonious and is suitable for real time manufacturing settings. The approach works for either strength distributions or lifetime distributions.

Symmetric and Asymmetric Approaches to Money Demand Determination in Indonesia: Is Divisia Money Relevant?

  • LEONG, Choi-Meng;PUAH, Chin-Hong;TANG, Maggie May-Jean
    • The Journal of Asian Finance, Economics and Business / v.8 no.7 / pp.393-402 / 2021
  • This study aims to examine whether symmetric or asymmetric effects of exchange rates exist in determining money demand in Indonesia. Simple-sum money and Divisia money were included in different models for comparison because of the financial developments in Indonesia. This study uses time-series data from 1996Q1 to 2019Q4 for the estimation. The nonlinear autoregressive distributed lag (NARDL) model is utilized to verify the asymmetric effects of exchange rates on money demand. The Augmented Dickey-Fuller and Phillips-Perron unit root tests were performed to verify the order of integration of the variables. The findings of this study revealed that the exchange rate is one of the most important determinants of money demand in Indonesia and that the effect is asymmetric. The findings further indicated that the money demand function that incorporates the Divisia monetary aggregate is parsimonious. Monetary targets such as money supply and interest rates are critical for the conduct of monetary policy to achieve the inflation levels set by the government. As the adoption of an inflation targeting framework needs to be in keeping with the flexible exchange rate system, the asymmetric effect of exchange rate changes can be used in the conduct of exchange rate policy to achieve financial system and price stability.

Predicting Administrative Issue Designation in KOSDAQ Market Using Machine Learning Techniques (머신러닝을 활용한 코스닥 관리종목지정 예측)

  • Chae, Seung-Il;Lee, Dong-Joo
    • Asia-Pacific Journal of Business / v.13 no.2 / pp.107-122 / 2022
  • Purpose - This study aims to develop machine learning models to predict administrative issue designation in the KOSDAQ Market using financial data. Design/methodology/approach - Applying four classification techniques (logistic regression, support vector machine, random forest, and gradient boosting) to a matched sample of five hundred and thirty-six firms over an eight-year period, the authors develop prediction models and explore the practicality of the models. Findings - The resulting four binary selection models show overall satisfactory classification performance in terms of various measures including AUC (area under the receiver operating characteristic curve), accuracy, F1-score, and top quartile lift, while the ensemble models (random forest and gradient boosting) outperform the others on most measures. Research implications or Originality - Although the assessment of firms' potential for administrative issue designation is critical information for investors and financial institutions, detailed empirical investigation has lagged behind. The current research fills this gap in the literature by proposing parsimonious prediction models based on a few financial variables and validating the applicability of the models.

Surveying and Optimizing the Predictors for Ependymoma Specific Survival using SEER Data

  • Cheung, Min Rex
    • Asian Pacific Journal of Cancer Prevention / v.15 no.2 / pp.867-870 / 2014
  • Purpose: This study used receiver operating characteristic (ROC) curves to analyze Surveillance, Epidemiology and End Results (SEER) ependymoma data to identify predictive models and potential disparity in outcome. Materials and Methods: This study analyzed socio-economic, staging and treatment factors available in the SEER database for ependymoma. For the risk modeling, each factor was fitted by a Generalized Linear Model to predict the outcome ('brain and other nervous systems' specific death, yes/no). The area under the ROC curve was computed. Similar strata were combined to construct the most parsimonious models. A random sampling algorithm was used to estimate the modeling errors. Risk of ependymoma death was computed for the predictors for comparison. Results: A total of 3,500 patients diagnosed from 1973 to 2009 were included in this study. The mean follow up time (S.D.) was 79.8 (82.3) months. Some 46% of the patients were female. The mean (S.D.) age was 34.4 (22.8) years. Age was the most predictive factor of outcome. Unknown grade demonstrated a 15% risk of cause specific death compared to 9% for grades I and II, and 36% for grades III and IV. A 5-tiered grade model (with a ROC area of 0.48) was optimized to a 3-tiered model (with a ROC area of 0.53). This ROC area tied for second with that for surgery. African-American patients had a 21.5% risk of death compared with 16.6% for the others. Some 72.7% of patients who did not get RT had cerebellar or spinal ependymoma. Patients undergoing surgery had a 16.3% risk of death, as compared to 23.7% among those who did not have surgery. Conclusion: Grading ependymoma may dramatically improve modeling of data. RT is underused for cerebellum and spinal cord ependymoma, and it may be a potential way to improve outcome.

Multivariate design estimations under copulas constructions. Stage-1: Parametrical density constructions for defining flood marginals for the Kelantan River basin, Malaysia

  • Latif, Shahid;Mustafa, Firuza
    • Ocean Systems Engineering / v.9 no.3 / pp.287-328 / 2019
  • A comprehensive understanding of flood risk via frequency analysis often demands multivariate designs under different notions of return period. A flood is a trivariate random event, which points to the unreliability of the univariate return period and calls for constructing the joint dependency among its intercorrelated flood vectors, i.e., flood peak, volume, and duration. Selecting the most parsimonious probability functions to represent the univariate flood marginal distributions is often a mandatory pre-processing step before the joint dependency is established, especially under the copula methodology, which allows the practitioner to model the univariate marginals separately from their joint construction. Parametric density approximations assume that the random samples follow some specific, predefined probability density function, and different choices usually yield different estimates, especially in the tails of the distributions. The upper tail is often of particular interest in flood modelling, and no evidence favours any fixed distribution, so the choice is usually made by trial and error based on goodness-of-fit measures. On the other hand, evaluating model performance and selecting the best-fitted distributions demand careful investigation that compares how well each candidate reproduces the sample; otherwise, inconsistencies may introduce uncertainty. In addition, the strengths and weaknesses of different fit statistics vary, and they differ in how well they reveal gaps and discrepancies among the fitted distributions. In this study, marginal distributions of the flood variables are selected by applying an interactive set of parametric functions to event-based (block annual maxima) samples drawn from 50 years of continuous streamflow records for the Kelantan River basin at Guillemard Bridge, Malaysia. Model fitness criteria are examined based on the degree of agreement between the cumulative empirical and theoretical probabilities. Both analytical and graphical inspections are undertaken to strengthen the evidence in favour of the best-fitted probability density.

A Structural Model for Health Promotion and Life Satisfaction of Life in College Students in Korea (대학생들의 건강증진행위와 삶의 만족도에 대한 구조모형)

  • Hong, Youn-Lan;Yi, Ga-Eon;Park, Hyun-Sook
    • Research in Community and Public Health Nursing / v.11 no.2 / pp.333-346 / 2000
  • This study was designed to develop and test a structural model that explains health-promoting behaviors among college students in Korea. The hypothetical model was constructed on the basis of Pender's Health Promotion Model (1996), with the inclusion of some influential factors for life satisfaction. The conceptual framework was built around eight constructs. The exogenous variables included in the model were self-esteem, perceived health status, self-efficacy, internal locus of control, chance locus of control, and powerful-others locus of control. The endogenous variables were health-promoting behaviors and life satisfaction. The results are as follows: 1. The overall fit of the hypothetical model to the data was moderate <$\chi^2$=4.18 (df=11, p=0.041), GFI=0.99, AGFI=0.76, RMR=0.019, CFI=0.99, CN=248.50>. 2. The paths and variables of the model were modified by considering both theoretical implications and the statistical significance of the parameter estimates. Compared to the hypothetical model, the revised model became more parsimonious and had a better fit to the data in terms of the chi-square value <$\chi^2$=8.43 (df=16, p=0.21), GFI=0.99, AGFI=0.92, RMR=0.024, CFI=0.99, CN=312.01>. 3. Some of the predictive factors, especially self-efficacy, self-esteem, powerful-others locus of control, and perceived health status, had direct effects on health-promoting behaviors. Of these variables, self-efficacy was the most significant factor. These predictive variables of health-promoting behaviors explained 59% of the total variance in the model. 4. Health-promoting behaviors, self-esteem, and perceived health status had direct effects on life satisfaction. Self-efficacy was identified as an important variable that contributed indirectly to improved life satisfaction by enhancing health-promoting behaviors. These predictive variables of life satisfaction explained 42% of the total variance in the model. In conclusion, the derived model is considered appropriate for explaining and predicting health-promoting behaviors and life satisfaction among college students in Korea, and it could be used effectively as a reference model for further studies by suggesting a direction for health-promoting nursing practice.
