• Title/Summary/Keyword: Regression testing

Search Result 707, Processing Time 0.028 seconds

Multiple Group Testing Procedures for Analysis of High-Dimensional Genomic Data

  • Ko, Hyoseok;Kim, Kipoong;Sun, Hokeun
    • Genomics & Informatics
    • /
    • v.14 no.4
    • /
    • pp.187-195
    • /
    • 2016
  • In genetic association studies with high-dimensional genomic data, multiple group testing procedures are often required in order to identify disease/trait-related genes or genetic regions, where multiple genetic sites or variants are located within the same gene or genetic region. However, statistical testing procedures based on an individual test suffer from multiple testing issues such as the control of family-wise error rate and dependent tests. Moreover, detecting only a few of genes associated with a phenotype outcome among tens of thousands of genes is of main interest in genetic association studies. In this reason regularization procedures, where a phenotype outcome regresses on all genomic markers and then regression coefficients are estimated based on a penalized likelihood, have been considered as a good alternative approach to analysis of high-dimensional genomic data. But, selection performance of regularization procedures has been rarely compared with that of statistical group testing procedures. In this article, we performed extensive simulation studies where commonly used group testing procedures such as principal component analysis, Hotelling's $T^2$ test, and permutation test are compared with group lasso (least absolute selection and shrinkage operator) in terms of true positive selection. Also, we applied all methods considered in simulation studies to identify genes associated with ovarian cancer from over 20,000 genetic sites generated from Illumina Infinium HumanMethylation27K Beadchip. We found a big discrepancy of selected genes between multiple group testing procedures and group lasso.

Implementation and Analysis of the Agent based Object-Oriented Software Test Tool, TAS (에이전트 기반의 객체지향 소프트웨어 테스트 도구인 TAS의 구현 및 분석)

  • Choi, Jeon-Geun;Choi, Byoungju
    • Journal of KIISE:Software and Applications
    • /
    • v.28 no.10
    • /
    • pp.732-742
    • /
    • 2001
  • The concept of an agent has become important in computer science and has been applied to the number of application domains such electronic commerce and information retrieval. But, no one has proposed yet in software test. The test agent system applied the concept of an agent to software test is new test tool. It consists of the User Interface Agent. the Test Case Selection & Testing Agent and the Regression Test Agent. Each of these agents, with their intelligent rules, carry out the tests autonomously by empolying the object-oriented test processes. This system has 2 advantages. Firstly since the tests are carried our autonomously, it minimizes tester interference and secondly, since redundant-free and consistent effective test cases are intellectually selected, the testing time is reduced while the fault detection effectiveness improves. In this paper, by actually showing the testing process being carried out autonomously by the 3 agents that form the TAS, we show that the TAS minimizes tester interference. By also carrying out the 4 different types of experiments on the RE-Rule, CTS-Rule, overall TAS experiment, and the fault-detection effectiveness experiment on the RE-Rule, we show the cut-down on the testing time and improvement in the fault detection effectivity.

  • PDF

Application of Logit Model in Qualitative Dependent Variables (로짓모형을 이용한 질적 종속변수의 분석)

  • Lee, Kil-Soon;Yu, Wann
    • Journal of Families and Better Life
    • /
    • v.10 no.1 s.19
    • /
    • pp.131-138
    • /
    • 1992
  • Regression analysis has become a standard statistical tool in the behavioral science. Because of its widespread popularity. regression has been often misused. Such is the case when the dependent variable is a qualitative measure rather than a continuous, interval measure. Regression estimates with a qualitative dependent variable does not meet the assumptions underlying regression. It can lead to serious errors in the standard statistical inference. Logit model is recommended as alternatives to the regression model for qualitative dependent variables. Researchers can employ this model to measure the relationship between independent variables and qualitative dependent variables without assuming that logit model was derived from probabilistic choice theory. Coefficients in logit model are typically estimated by the method of Maximum Likelihood Estimation in contrast to ordinary regression model which estimated by the method of Least Squares Estimation. Goodness of fit in logit model is based on the likelihood ratio statistics and the t-statistics is used for testing the null hypothesis.

  • PDF

ROBUST TEST BASED ON NONLINEAR REGRESSION QUANTILE ESTIMATORS

  • CHOI, SEUNG-HOE;KIM, KYUNG-JOONG;LEE, MYUNG-SOOK
    • Communications of the Korean Mathematical Society
    • /
    • v.20 no.1
    • /
    • pp.145-159
    • /
    • 2005
  • In this paper we consider the problem of testing statistical hypotheses for unknown parameters in nonlinear regression models and propose three asymptotically equivalent tests based on regression quantiles estimators, which are Wald test, Lagrange Multiplier test and Likelihood Ratio test. We also derive the asymptotic distributions of the three test statistics both under the null hypotheses and under a sequence of local alternatives and verify that the asymptotic relative efficiency of the proposed test statistics with classical test based on least squares depends on the error distributions of the regression models. We give some examples to illustrate that the test based on the regression quantiles estimators performs better than the test based on the least squares estimators of the least absolute deviation estimators when the disturbance has asymmetric and heavy-tailed distribution.

A Study on the Development of Fuzzy Linear Regression I

  • Kim, Hakyun
    • The Journal of Information Systems
    • /
    • v.4
    • /
    • pp.27-39
    • /
    • 1995
  • This study tests the fuzzy linear regression model to see if there is a performance difference between it and the classical linear regression model. These results show that FLR was better as f forecasting technique when compared with CLR. Another important find in the test of the two different regression methods is that they generate two different predicted P/E ratios from expected value test, variance test and error test of two different regressions, though we can not see a significant difference between two regression models doing test in error measurements (GMRAE, MAPE, MSE, MAD). So, in this financial setting we can conclude that FLR is not superior to CLR, comparing and testing between the t재 different regression models. However, FLR is better than CLR in the error measurements.

  • PDF

A Dynamic Approach to Estimate Change Impact using Type of Change Propagation

  • Gupta, Chetna;Singh, Yogesh;Chauhan, Durg Singh
    • Journal of Information Processing Systems
    • /
    • v.6 no.4
    • /
    • pp.597-608
    • /
    • 2010
  • Software evolution is an ongoing process carried out with the aim of extending base applications either for adding new functionalities or for adapting software to changing environments. This brings about the need for estimating and determining the overall impact of changes to a software system. In the last few decades many such change/impact analysis techniques have been developed to identify consequences of making changes to software systems. In this paper we propose a new approach of estimating change/impact analysis by classifying change based on type of change classification e.g. (a) nature and (b) extent of change propagation. The impact set produced consists of two dimensions of information: (a) statements affected by change propagation and (b) percentage i.e. statements affected in each category and involving the overall system. We also propose an algorithm for classifying the type of change. To establish confidence in effectiveness and efficiency we illustrate this technique with the help of an example. Results of our analysis are promising towards achieving the aim of the proposed endeavor to enhance change classification. The proposed dynamic technique for estimating impact sets and their percentage of impact will help software maintainers in performing selective regression testing by analyzing impact sets regarding the nature of change and change dependency.

Epidemiological application of the cycle threshold value of RT-PCR for estimating infection period in cases of SARS-CoV-2

  • Soonjong Bae;Jong-Myon Bae
    • Journal of Medicine and Life Science
    • /
    • v.20 no.3
    • /
    • pp.107-114
    • /
    • 2023
  • Epidemiological control of coronavirus disease 2019 (COVID-19) is needed to estimate the infection period of confirmed cases and identify potential cases. The present study, targeting confirmed cases for which the time of COVID-19 symptom onset was disclosed, aimed to investigate the relationship between intervals (day) from symptom onset to testing the cycle threshold (CT) values of real-time reverse transcription-polymerase chain reaction. Of the COVID-19 confirmed cases, those for which the date of suspected symptom onset in the epidemiological investigation was specifically disclosed were included in this study. Interval was defined as the number of days from symptom onset (as disclosed by the patient) to specimen collection for testing. A locally weighted regression smoothing (LOWESS) curve was applied, with intervals as explanatory variables and CT values (CTR for RdRp gene and CTE for E gene) as outcome variables. After finding its non-linear relationship, a polynomial regression model was applied to estimate the 95% confidence interval values of CTR and CTE by interval. The application of LOWESS in 331 patients identified a U-shaped curve relationship between the CTR and CTE values according to the number of interval days, and both CTR and CTE satisfied the quadratic model for interval days. Active application of these results to epidemiological investigations would minimize the chance of failing to identify individuals who are in contact with COVID-19 confirmed cases, thereby reducing the potential transmission of the virus to local communities.

Development of Monitoring System Using Residual Gas Analyzer (RGA) and Artificial Intelligence Modeling (잔류가스 분석기(RGA)와 인공지능 모델링을 이용한 모니터링 시스템 개발)

  • Ji Soo Lee;Song Hun Kim;Gyeong Su Kim;Hyo Jong Song;Sang-Hoon Park;Deuk-Hoon Goh;Bong-Jae Lee
    • Journal of the Semiconductor & Display Technology
    • /
    • v.23 no.2
    • /
    • pp.129-134
    • /
    • 2024
  • This study aims to talk about the necessity of solving the PFC gas emission problem raised by the recent development of the semiconductor industry and the remote plasma source method monitoring system used in the semiconductor industry. The 'monitoring system' means that the researchers applied machine learning to the existing monitoring technology and modeled it. In the process of this study, Residual Gas Analyzer monitoring technology and linear regression model were used. Through this model, the researchers identified emissions of at least 12700mg CO2 to 75800mg CO2 with values ranging from ion current 0.6A to 1.7A, and expect that the 'monitoring system' will contribute to the effective calculation of greenhouse gas emissions in the semiconductor industry in the future.

  • PDF

Estimate of Compressive Strength for Concrete using Ultrasonics by Multiple Regression Analysis Method (초음파를 이용한 중회귀분석법에 의한 콘크리트의 압축강도추정)

  • Park, I.G.;Han, E.K.;Kim, W.K.
    • Journal of the Korean Society for Nondestructive Testing
    • /
    • v.11 no.2
    • /
    • pp.22-31
    • /
    • 1991
  • Various types of ultrasonic techniques have been used for the estimation of compressive strength of concrete structures. However, conventional ultrasonic velocity method using only longitudial wave cannot be determined the compressive strength of concrete structures with accuracy. In this paper, by using the introduction of multiple parameter, e. g. velocity of shear wave, velocity of longitudinal wave, attenuation coefficient of shear wave, attenuation coefficient of longitudinal wave, combination condition, age and preservation method, multiple regression analysis method was applied to the determination of compressive strength of concrete structures. The experimental results show that velocity of shear wave can be estimated compressive strength of concrete with more accuracy compared with the velocity of longitudinal wave, accuracy of estimated error range of compressive strength of concrete structures can be enhanced within the range of ${\pm}$10% approximately.

  • PDF

Test in Unbalanced Panel Regression Model with Nuisance Parameter (장애모수가 존재하는 불균형 패널회귀모형에서의 검정법)

  • 이재원;정병철;송석헌
    • The Korean Journal of Applied Statistics
    • /
    • v.17 no.3
    • /
    • pp.547-556
    • /
    • 2004
  • This paper consider the testing problem of variance component for the unbalanced two-way error component model with nuisance parameter. We derive the one-sided LM test statistic for testing zero individual(time) effects assuming that the other time-specific(individual) effects are present. Using the Monte Carlo experiments, the computational more demanding LR test slightly underestimates the nominal size and has the low powers relative to LM test statistic.