• Title/Summary/Keyword: Log-linear model

Search Result 229, Processing Time 0.023 seconds

Penalized maximum likelihood estimation with symmetric log-concave errors and LASSO penalty

  • Seo-Young, Park;Sunyul, Kim;Byungtae, Seo
    • Communications for Statistical Applications and Methods
    • /
    • v.29 no.6
    • /
    • pp.641-653
    • /
    • 2022
  • Penalized least squares methods are important tools to simultaneously select variables and estimate parameters in linear regression. The penalized maximum likelihood can also be used for the same purpose assuming that the error distribution falls in a certain parametric family of distributions. However, the use of a certain parametric family can suffer a misspecification problem which undermines the estimation accuracy. To give sufficient flexibility to the error distribution, we propose to use the symmetric log-concave error distribution with LASSO penalty. A feasible algorithm to estimate both nonparametric and parametric components in the proposed model is provided. Some numerical studies are also presented showing that the proposed method produces more efficient estimators than some existing methods with similar variable selection performance.

Polaron Conductivity of Rutile Doped with MgO (MgO 도프된 Rutile의 Polaron 전도도)

  • Kim, Keu-Hong;Kim, Hyung-Tack;Choi, Jae-Shi
    • Journal of the Korean Chemical Society
    • /
    • v.31 no.3
    • /
    • pp.215-224
    • /
    • 1987
  • The electrical conductuctivity measurements have been made on polycrystalline samples of various compositions in the $MgO-TiO_2$ system from 600 to $1100^{\circ}C$ under $Po_2$'s of $10^{-8}\;to\;10^{-1}$atm. Plots of log ${\sigma}$ vs. 1/T at constant $Po_2$ are found to be linear with the inflections, and the activation energies are 1.94eV for the intrinsic range and 0.48eV for the extrinsic range, respectively. The log ${\sigma}$ vs. log $Po_2$ curves are found to be linear at constant temperature, and the conductivity dependences of $Po_2$ are closely approximated by ${\sigma}\;{\alpha}\;Po_2^{-1/6}$ for the extrinsic and ${\sigma}\;{\alpha}\;Po_2^{-1/4}$ for the intrinsic range, respectively. The dominant defects in this system are believed to be oxygen vacancy for the extrinsic and $Ti^{3-}$ interstitial for the intrinsic range. The conduction mechanisms in both the extrinsic and the intrinsic ranges are proposed by the results of the electrical conductivity dependence on the oxygen partial pressure. Polaron model was suggested in the extrinsic region by the conductivity dependences of temperature and $Po_2$.

  • PDF

Log-density Ratio with Two Predictors in a Logistic Regression Model (로지스틱 회귀모형에서 이변량 정규분포에 근거한 로그-밀도비)

  • Kahng, Myung Wook;Yoon, Jae Eun
    • The Korean Journal of Applied Statistics
    • /
    • v.26 no.1
    • /
    • pp.141-149
    • /
    • 2013
  • We present methods for studying the log-density ratio that enables the selection of the predictors and the form to be included in the logistic regression model. Under bivariate normal distributional assumptions, we investigate the form of the log-density ratio as a function of two predictors. If two covariance matrices are equal, then the crossproduct and quadratic terms are not needed. If the variables are uncorrelated, we do not need the crossproduct terms, but we still need the linear and quadratic terms. We also explore other conditions in which the crossproduct and quadratic terms are not needed in the logistic regression model.

Base Flow Estimation in Uppermost Nakdong River Watersheds Using Chemical Hydrological Curve Separation Technique (화학적 수문곡선 분리기법을 이용한 낙동강 최상류 유역 기저유출량 산정)

  • Kim, Ryoungeun;Lee, Okjeong;Choi, Jeonghyeon;Won, Jeongeun;Kim, Sangdan
    • Journal of Korean Society on Water Environment
    • /
    • v.36 no.6
    • /
    • pp.489-499
    • /
    • 2020
  • Effective science-based management of the basin water resources requires an understanding of the characteristics of the streams, such as the baseflow discharge. In this study, the base flow was estimated in the two watersheds with the least artificial factors among the Nakdong River watersheds, as determined using the chemical hydrograph separation technique. The 16-year (2004-2019) discontinuous observed stream flow and electrical conductivity data in the Total Maximum Daily Load (TMDL) monitoring network were extended to continuous daily data using the TANK model and the 7-parameter log-linear model combined with the minimum variance unbiased estimator. The annual base flows at the upper Namgang Dam basin and the upper Nakdong River basin were both analyzed to be about 56% of the total annual flow. The monthly base flow ratio showed a high monthly deviation, as it was found to be higher than 0.9 in the dry season and about 0.46 in the rainy season. This is in line with the prevailing common sense notion that in winter, most of the stream flow is base flow, due to the characteristics of the dry season winter in Korea. It is expected that the chemical-based hydrological separation technique involving TANK and the 7-parameter log-linear models used in this study can help quantify the base flow required for systematic watershed water environment management.

Comparison study on kernel type estimators of discontinuous log-variance (불연속 로그분산함수의 커널추정량들의 비교 연구)

  • Huh, Jib
    • Journal of the Korean Data and Information Science Society
    • /
    • v.25 no.1
    • /
    • pp.87-95
    • /
    • 2014
  • In the regression model, Kang and Huh (2006) studied the estimation of the discontinuous variance function using the Nadaraya-Watson estimator with the squared residuals. The local linear estimator of the log-variance function, which may have the whole real number, was proposed by Huh (2013) based on the kernel weighted local-likelihood of the ${\chi}^2$-distribution. Chen et al. (2009) estimated the continuous variance function using the local linear fit with the log-squared residuals. In this paper, the estimator of the discontinuous log-variance function itself or its derivative using Chen et al. (2009)'s estimator. Numerical works investigate the performances of the estimators with simulated examples.

Analysis of Temperature Effects on Microbial Growth Parameters and Estimation of Food Shelf Life with Confidence Band

  • Park, Jin-Pyo;Lee, Dong-Sun
    • Preventive Nutrition and Food Science
    • /
    • v.13 no.2
    • /
    • pp.104-111
    • /
    • 2008
  • As a way to account for the variability of the primary model parameters in the secondary modeling of microbial growth, three different regression approaches were compared in determining the confidence interval of the temperature-dependent primary model parameters and the estimated microbial growth during storage: bootstrapped regression with all the individual primary model parameter values; bootstrapped regression with average values at each temperature; and simple regression with regression lines of 2.5% and 97.5% percentile values. Temperature dependences of converted parameters (log $q_o$, ${\mu}_{max}^{1/2}$, log $N_{max}$) of hypothetical initial physiological state, maximum specific growth rate, and maximum cell density in Baranyi's model were subjected to the regression by quadratic, linear, and linear function, respectively. With an advantage of extracting the primary model parameters instantaneously at any temperature by using mathematical functions, regression lines of 2.5% and 97.5% percentile values were capable of accounting for variation in experimental data of microbial growth under constant and fluctuating temperature conditions.

Collapsibility and Suppression for Cumulative Logistic Model

  • Hong, Chong-Sun;Kim, Kil-Tae
    • Communications for Statistical Applications and Methods
    • /
    • v.12 no.2
    • /
    • pp.313-322
    • /
    • 2005
  • In this paper, we discuss suppression for logistic regression model. Suppression for linear regression model was defined as the relationship among sums of squared for regression as well as correlation coefficients of. variables. Since it is not common to obtain simple correlation coefficient for binary response variable of logistic model, we consider cumulative logistic models with multinomial and ordinal response variables rather than usual logistic model. As number of category of a response variable for the cumulative logistic model gets collapsed into binary, it is found that suppressions for these logistic models are changed. These suppression results for cumulative logistic models are discussed and compared with those of linear model.

A Longitudinal Study on Farm Householder's Perception of Economic Problem (전업${\cdot}$겸업 농가의 가정 내 경제문제 인지의 변화)

  • Cho, Young-Sook;Lee, Duk-Jae;Hwang, Dae-Yong;Park, Eun-Shik;Ko, Jeong-Sook
    • Journal of Agricultural Extension & Community Development
    • /
    • v.13 no.2
    • /
    • pp.265-276
    • /
    • 2006
  • This study focused on the trends in recognising an economic concern as the most difficult problem in the farm household that was categorised as a full-time farm household or a part-time household according to income structure from agricultural activity. Log-linear models were suggested to compare the change of recognising an economic problem between year 2003 and 2005. As a result, there was no significant difference in recognising an economic problem between full-time farm household and part-time farm household both in 2003 ($x^2$=0.04;p=0.84) and in 2005 ($x^2$=1.08; p=0.30). It was revealed, however, that full-time farm household had increasingly recognised an economic problem over years. Log-linear models showed that full-time farm households tended to recognise the economic problem 1.11 times as much as part-time farm households in 2005, compared to 0.97 times in 2003.

  • PDF

Developing the Index of Foodborne Disease Occurrence (식중독 발생지수 개발)

  • Choi, Kook-Yeol;Kim, Byung-Soo;Bae, Wha-Soo;Jung, Woo-Seok;Cho, Young-Joon
    • The Korean Journal of Applied Statistics
    • /
    • v.21 no.4
    • /
    • pp.649-658
    • /
    • 2008
  • As the Eating Out Businesses are making rapid progress and most of the schools and the firms serve the meals, the foodborne disease has occurred increasingly and lots of researches and the policies are studied to prevent it. In Korea, the foodborne disease index for prevention is developed by using bacterial growth rate on the temperature to give the information about the danger level of the foodborne disease, but the gap between real status of the occurrences and the predicted danger level has been pointed out. This study aims at developing the index of the foodborne occurrence based on the log linear model using the data of the foodborne disease occurrence and the meteorological data for the last three years($2004{\sim}2006$). Comparison between the new index and the existing index showed that the new index is better in explaining the foodborne disease occurrence.

A Study on the Factors Affecting the Arson (방화 발생에 영향을 미치는 요인에 관한 연구)

  • Kim, Young-Chul;Bak, Woo-Sung;Lee, Su-Kyung
    • Fire Science and Engineering
    • /
    • v.28 no.2
    • /
    • pp.69-75
    • /
    • 2014
  • This study derives the factors which affect the occurrence of arson from statistical data (population, economic, and social factors) by multiple regression analysis. Multiple regression analysis applies to 4 forms of functions, linear functions, semi-log functions, inverse log functions, and dual log functions. Also analysis respectively functions by using the stepwise progress which considered selection and deletion of the independent variable factors by each steps. In order to solve a problem of multiple regression analysis, autocorrelation and multicollinearity, Variance Inflation Factor (VIF) and the Durbin-Watson coefficient were considered. Through the analysis, the optimal model was determined by adjusted Rsquared which means statistical significance used determination, Adjusted R-squared of linear function is scored 0.935 (93.5%), the highest of the 4 forms of function, and so linear function is the optimal model in this study. Then interpretation to the optimal model is conducted. As a result of the analysis, the factors affecting the arson were resulted in lines, the incidence of crime (0.829), the general divorce rate (0.151), the financial autonomy rate (0.149), and the consumer price index (0.099).