• 제목/요약/키워드: Log-linear models

검색결과 104건 처리시간 0.034초

Empirical Comparisons of Disparity Measures for Partial Association Models in Three Dimensional Contingency Tables

  • Jeong, D.B.;Hong, C.S.;Yoon, S.H.
    • Communications for Statistical Applications and Methods
    • /
    • 제10권1호
    • /
    • pp.135-144
    • /
    • 2003
  • This work is concerned with comparison of the recently developed disparity measures for the partial association model in three dimensional categorical data. Data are generated by using simulation on each term in the log-linear model equation based on the partial association model, which is a proposed method in this paper. This alternative Monte Carlo methods are explored to study the behavior of disparity measures such as the power divergence statistic I(λ), the Pearson chi-square statistic X$^2$, the likelihood ratio statistic G$^2$, the blended weight chi-square statistic BWCS(λ), the blended weight Hellinger distance statistic BWHD(λ), and the negative exponential disparity statistic NED(λ) for moderate sample sizes. We find that the power divergence statistic I(2/3) and the blended weight Hellinger distance family BWHD(1/9) are the best tests with respect to size and power.

단일방정식과 관련방정식체계를 적용한 소비지출 함수의 모델 적합성 비교 (A Comparison of the Goodness-of-Fit between Two Models of Expenditure Function: a Single-Equation Model versus a Complete- System-of-Demand-Equation Model)

  • 황덕순;김숙향
    • 가정과삶의질연구
    • /
    • 제20권1호
    • /
    • pp.45-56
    • /
    • 2002
  • The main purposes of this article are to introduce the theoretical backgrounds and empirical application methods of two different Models for the function of expenditure, and to compare the goodness-o(-fit of the two models: a single-equation model and a complete-system-of-demand-equation model. For the empirical analysis of the single-equation model, a linear formula and a double-leg formula were employed. In order to test the complete-system-of-demand-equation model empirically, the \"Linear Approximation/Almost Ideal Demand System (LA/AIDS)" was used. The independent variables were the total living expense and expenditure categories Price index. The data used in this study were obtained from the quarterly statistics of "The Annual Report on the Urban Family Income and Expenditure Survey (Dosigagyeyonbo)" and "The Annual Report on the Consumer Price Index (Sobijamulgajaryo)," for the years 1994 to 1997. The goodness-of-fit (R-square) was higher with the complete-system-of-demand-equation model than with the single-equation model for the budget share on food (excluding eating-out expenses) and for the share on cultural and recreational activities. However, there was no difference between the two models in terms of the proportion of the expenditure on automobile fuel.fuel.

육류 신선도 판별을 위한 휴대용 전자코 시스템 설계 및 성능 평가 II - 돈육의 미생물 총균수 예측을 통한 전자코 시스템 성능 검증 (Design and performance evaluation of portable electronic nose systems for freshness evaluation of meats II - Performance analysis of electronic nose systems by prediction of total bacteria count of pork meats)

  • 김재곤;조병관
    • 농업과학연구
    • /
    • 제38권4호
    • /
    • pp.761-767
    • /
    • 2011
  • The objective of this study was to predict total bacteria count of pork meats by using the portable electronic nose systems developed throughout two stages of the prototypes. Total bacteria counts were measured for pork meats stored at $4^{\circ}C$ for 21days and compared with the signals of the electronic nose systems. PLS(Partial least square), PCR (Principal component regression), MLR (Multiple linear regression) models were developed for the prediction of total bacteria count of pork meats. The coefficient of determination ($R_p{^2}$) and root mean square error of prediction (RMSEP) for the models were 0.789 and 0.784 log CFU/g with the 1st system for the pork loin, 0.796 and 0.597 log CFU/g with the 2nd system for the pork belly, and 0.661 and 0.576 log CFU/g with the 2nd system for the pork loin respectively. The results show that the developed electronic system has potential to predict total bacteria count of pork meats.

우도거리에 의한 결정계수 $R^2$에의한 통합적 접근 (Unified Approach to Coefficient of Determination $R^2$ Using Likelihood Distancd)

  • 허명회;이종한;정진환
    • 응용통계연구
    • /
    • 제4권2호
    • /
    • pp.117-127
    • /
    • 1991
  • 결정계수 $R^2$은 회귀분석에서 실제적으로는 매우 이용도가 높은 기술 측도라고 하겠으나, 회귀모형이 절편향을 포함하는 표준적인 선형회귀모형 이외인 경우에는 결정계수의 정의에 관하여 여러 논란이 있어 왔다. 절편항이 없는 선형회귀모형에서와 가중선형회귀모형, 로버스트 선형회귀모형에서의 결정계수의 적절한 정의와 용법이 대표적인 문제라고 하겠다. 기존의 여러 연구, 예를 들어 Kvalseth(1985) 나 Willet and Singer(1988)에서는 이러한 각 경우에 각기 적용될 수 있는 결정계수의 여러 변형들을 제안 $\cdot$ 이런 기존의 연구들이 일반적인 원칙이 없이 경우별로 단편적으로 대응하고 있을뿐더러 약간의 오류를 포함하고 있어 오히려 통계전문가가 아닌 통계 이용자들에게 혼란을 불러 일으킬 염려가 있다. 따라서 결정계수의 일반적 정의를 제안한 본 연구는 현재와 같은 결정계수의 여러변종의 범람으로 인한 혼란을 없애는 데 기여하리라고 생각된다. 이 통합결정계수는 尤度거리(likelihood distance)를 이용하여 정의되는데, 선형회귀모형 이외에도 비선형 회귀모형과 일반화 선형모형에 일관되게 적용 가능하다는 장점을 갖는다.

  • PDF

화학적 수문곡선 분리기법을 이용한 낙동강 최상류 유역 기저유출량 산정 (Base Flow Estimation in Uppermost Nakdong River Watersheds Using Chemical Hydrological Curve Separation Technique)

  • 김령은;이옥정;최정현;원정은;김상단
    • 한국물환경학회지
    • /
    • 제36권6호
    • /
    • pp.489-499
    • /
    • 2020
  • Effective science-based management of the basin water resources requires an understanding of the characteristics of the streams, such as the baseflow discharge. In this study, the base flow was estimated in the two watersheds with the least artificial factors among the Nakdong River watersheds, as determined using the chemical hydrograph separation technique. The 16-year (2004-2019) discontinuous observed stream flow and electrical conductivity data in the Total Maximum Daily Load (TMDL) monitoring network were extended to continuous daily data using the TANK model and the 7-parameter log-linear model combined with the minimum variance unbiased estimator. The annual base flows at the upper Namgang Dam basin and the upper Nakdong River basin were both analyzed to be about 56% of the total annual flow. The monthly base flow ratio showed a high monthly deviation, as it was found to be higher than 0.9 in the dry season and about 0.46 in the rainy season. This is in line with the prevailing common sense notion that in winter, most of the stream flow is base flow, due to the characteristics of the dry season winter in Korea. It is expected that the chemical-based hydrological separation technique involving TANK and the 7-parameter log-linear models used in this study can help quantify the base flow required for systematic watershed water environment management.

상수도 주철 배수관로의 파손자료 유형에 따른 파손율 모형화와 수정된 시간척도를 이용한 최적교체시기의 산정 (Modeling of Rate-of-Occurrence-of-Failure According to the Failure Data Type of Water Distribution Cast Iron Pipes and Estimation of Optimal Replacement Time Using the Modified Time Scale)

  • 박수완;전환돈;김정욱
    • 한국수자원학회논문집
    • /
    • 제40권1호
    • /
    • pp.39-50
    • /
    • 2007
  • 본 논문에서는 대수-선형 파손율 모형(log-linear ROCOF)과 와이블 파솔율 모형(Weibull ROCOF)을 이용하여 상수도 주철 배수관로의 파손율을 모형화하고, '수정된 시간 척도'를 이용하여 최적교체시기를 산정할 수 있는 방법이 개발되었다. 두 ROCOF의 모형화를 위하여 개별 관로의 파손시간을 기록한 '파손 시간자료(failure-time data)'와 일정 시간간격 사이에서 발생하는 파손횟수를 기록한 '파손 횟수자료(failure-number data)'를 이용하였고, 최대로그우도 추정값을 이용하여 두 ROCOF의 각 파손자료 유형에 대한 모형화 수행 능력을 검증하였다. 또한 두 ROCOF를 이용한 관로의 최적교체시기 방정식은 ROCOF의 매개변수 추정에 있어서 수렴성을 보장하기 위하여 '수정된 시간 척도'를 적용하여 유도하였다. 연구대상 주철 배수 관로들의 '파손 시간자료'와 '파손 횟수자료'에 두 파손율 모형을 적용시켜 본 결과 파손 시간자료를 이용할 경우 대수-선형 ROCOF가 와이블 ROCOF 보다 적합한 모형인 것으로 나타났다. 또한 두 모형 모두 '파손 시간자료'를 이용하는 것이 '파손 횟수자료'를 이용하는 것보다 모형화 수행 능력이 높아지는 것으로 나타나서, 분석에 사용된 관로의 파손율 모형화와 최적교체시기 산정을 위해서는 일정 시간간격 동안의 관로 파손횟수를 기록하는 것보다 관로의 파손시간을 기록하는 것이 더욱 우수한 모형화 결과를 낳는 것으로 나타났다.

QSPR Models for Chromatographic Retention of Some Azoles with Physicochemical Properties

  • Polyakova, Yulia;Jin, Long Mei;Row, Kyung-Ho
    • Bulletin of the Korean Chemical Society
    • /
    • 제27권2호
    • /
    • pp.211-218
    • /
    • 2006
  • This work deals with 24 substances composed of nitrogen-containing heterocycles. The relationships between the chromatographic retention factor (k) and those physicochemical properties which are relevant in quantitative structure-properties relationship (QSPR) studies, such as the polarizability $(\alpha)$, molar refractivity (MR), lipophilicity (logP), dipole moment $(\mu)$, total energy $(E_{tot})$, heat of formation $(\Delta H_f)$, molecular surface area $(S_M)$, and binding energy $(E_b)$, were investigated. The accuracy of the simple linear regressions between the chromatographic retention and the descriptors for all of the compounds was satisfactory (correlation coefficient, $0.8 \leq r \leq 1.0$). The QSPR models of these nitrogen-containing heterocyclic compounds could be predicted with a multiple linear regression equation having the statistical index, r = 1.000. This work demonstrated the successful application of the multiple linear approaches through the development of accurate predictive equations for retention factors in liquid chromatography.

A DFT and QSAR Study of Several Sulfonamide Derivatives in Gas and Solvent

  • Abadi, Robabeh Sayyadi kord;Alizadehdakhel, Asghar;Paskiabei, Soghra Tajadodi
    • 대한화학회지
    • /
    • 제60권4호
    • /
    • pp.225-234
    • /
    • 2016
  • The activity of 34 sulfonamide derivatives has been estimated by means of multiple linear regression (MLR), artificial neural network (ANN), simulated annealing (SA) and genetic algorithm (GA) techniques. These models were also utilized to select the most efficient subsets of descriptors in a cross-validation procedure for non-linear -log (IC50) prediction. The results obtained using GA-ANN were compared with MLR-MLR, MLR-ANN, SA-ANN and GA-ANN approaches. A high predictive ability was observed for the MLR-MLR, MLR-ANN, SA-ANN and MLR-GA models, with root mean sum square errors (RMSE) of 0.3958, 0.1006, 0.0359, 0.0326 and 0.0282 in gas phase and 0.2871, 0.0475, 0.0268, 0.0376 and 0.0097 in solvent, respectively (N=34). The results obtained using the GA-ANN method indicated that the activity of derivatives of sulfonamides depends on different parameters including DP03, BID, AAC, RDF035v, JGI9, TIE, R7e+, BELM6 descriptors in gas phase and Mor 32u, ESpm03d, RDF070v, ATS8m, MATS2e and R4p, L1u and R3m in solvent. In conclusion, the comparison of the quality of the ANN with different MLR models showed that ANN has a better predictive ability.

비편향 회귀분석모형을 이용한 낙동강 본류 부유사량 산정방법의 신뢰도 향상 (Improvement of Suspended Solid Loads Estimation in Nakdong River Using Minimum Variance Unbiased Estimator)

  • 한수희;강두기;신현석;유재정;김상단
    • 한국물환경학회지
    • /
    • 제23권2호
    • /
    • pp.251-259
    • /
    • 2007
  • In this study three log-transformed linear regression models are compared with the focus of bias correction problem. The models are the traditional simple linear regression estimator (SL), the quasi maximum likelihood estimator (QMLE) and the minimum variance unbiased estimator (MVUE). Using such models, suspended solid loads can be estimated using the discharge - suspended solid data set that has been measured by NIER Nakdong River Water Environment Laboratory. As a result, SL shows negative bias for most values of the measured discharge range. QMLE is nearly unbiased for moderate values of the measured discharge range, but shows increasingly positive bias for either large or small value of the measured discharge range. MVUE is unbiased. It is also analyzed how the estimated regression coefficient and exponent are distributed along Nakdong river main stream.

직업적으로 납에 노출된 근로자들의 혈액중 납과 ZPP농도와의 관계 (Model Between Lead and ZPP Concentration of Workers Exposed to Lead)

  • 박동욱;백남원;최병순;김태균;이광용;오세민;안규동
    • 한국산업보건학회지
    • /
    • 제6권1호
    • /
    • pp.88-96
    • /
    • 1996
  • This study was conducted to establish model between lead and ZPP concentration in blood of workers exposed to lead. Workers employed in secondary smelting manufacturing industry showed $85.1{\mu}g/dl$ of blood lead level, exceeding $60{\mu}g/dl$, the Criteria for Removal defined by Occupational Safety and Health Act of Korea. Average blood lead level of workers in the battery manufacturing industry was $51.3{\mu}g/dl$, locating between $40{\mu}g/dl$ and $60{\mu}g/dl$, the Criteria for Requiring Medical Removal. Blood lead level of in the litharge and radiator manufacturing industry was below $40{\mu}g/dl$, the Criteria Requiring Temporary Medical Removal. Blood lead levels of workers by industry were Significantly different(p<0.05). 50(21 %) showed blood lead levels above $60{\mu}g/dl$, the Criteria for Removal and 66(27.7 %) showed blood lead levels between the Criteria for Requiring Medical Removal, $40-60{\mu}g/dl$. Thus, approximately 50 percent of workers indicated blood lead levels above $40{\mu}g/dl$, the Criteria Requiring Temporary Medical Removal and should receive medical examination and consultation including biological monitoring. Average ZPP level of workers employed in the secondary smelting industry was $186.2{\mu}g/dl$, exceeding above $150{\mu}g/dl$, the Criteria for Removal. Seventy seven of all workers(32.3 %) showed ZPP level above $100-150{\mu}g/dl$, the Criteria for Requiring Medical Removal. The most appropriate model for predicting ZPP in blood was log-linear regression model. Log linear regression models between lead and ZPP concentrations in blood was Log ZPP(${\mu}g/dl$) = -0.2340 + 1.2270 Log Pb-B(${\mu}g/dl$)(standard error of estimate: 0,089, ${\gamma}^2=0.4456$, n=238, P=0.0001), Blood-in-lead explained 44.56 % of the variance in log(ZPP in blood).

  • PDF