• Title/Abstract/Keywords: local linear regression

Search results: 175 items (processing time 0.029 s)

Artificial Intelligence-based Leak Prediction using Pipeline Data

  • 이호현;홍성택
    • 한국정보통신학회논문지
    • /
    • Vol. 26, No. 7
    • /
    • pp.963-971
    • /
    • 2022
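  • Water supply pipe networks are a major component of national waterworks, but because most of them are buried underground, pipe aging and leaks are hard to assess, which makes maintenance very difficult. Assuming various combinations of sensors installed in the pipe network, this study examined whether pipe leaks can be identified for each data combination by predicting flow rate and pressure with artificial intelligence algorithms such as linear regression analysis and neuro-fuzzy models, and derived the optimal algorithm. For leak identification based on supply pressure prediction, the neuro-fuzzy algorithm outperformed linear regression analysis. For leak flow prediction, flow prediction with the neuro-fuzzy algorithm should be considered first. When the flow rate is difficult to simulate, however, supply pressure prediction with a linear algorithm appears to be the appropriate choice.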

Estimation of error variance in nonparametric regression under a finite sample using ridge regression

  • Park, Chun-Gun
    • Journal of the Korean Data and Information Science Society
    • /
    • Vol. 22, No. 6
    • /
    • pp.1223-1232
    • /
    • 2011
  • Tong and Wang's estimator (2005) is a new approach to estimating the error variance by least squares, in which a simple linear regression is derived asymptotically from Rice's lag- estimator (1984). Their estimator depends heavily on the choice of regressor and weights when the sample size is small. In this article, we propose a new approach that uses a local quadratic approximation to set the regressors in the small-sample case. Because the regressors suffer from multicollinearity, we estimate the error variance as the intercept of a ridge regression. In a small simulation study, our approach performs better than some existing methods in small samples and comparably in large samples. More research is required on unequally spaced points.
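
The regression idea in the first sentence of this abstract can be illustrated with a minimal sketch. It is not the ridge-based method proposed in the paper. It assumes equally spaced design points, lag-k difference estimators of the form s_k = Σ (y_i - y_{i-k})² / (2(n-k)), the regressor d_k = (k/n)², and unweighted ordinary least squares; the function names and the simulated sine curve are hypothetical choices made for illustration.

```python
import math
import random

def lag_k_estimate(y, k):
    """Lag-k difference estimator: sum of (y[i] - y[i-k])^2 over i, divided by 2(n-k)."""
    n = len(y)
    return sum((y[i] - y[i - k]) ** 2 for i in range(k, n)) / (2 * (n - k))

def variance_by_intercept(y, max_lag):
    """Regress s_k on d_k = (k/n)^2 by OLS and return the intercept (assumed setup)."""
    n = len(y)
    lags = range(1, max_lag + 1)
    d = [(k / n) ** 2 for k in lags]
    s = [lag_k_estimate(y, k) for k in lags]
    d_bar = sum(d) / len(d)
    s_bar = sum(s) / len(s)
    sxy = sum((di - d_bar) * (si - s_bar) for di, si in zip(d, s))
    sxx = sum((di - d_bar) ** 2 for di in d)
    slope = sxy / sxx
    return s_bar - slope * d_bar  # the intercept estimates the error variance

# Simulated example: smooth mean function plus Gaussian noise with variance 0.25.
rng = random.Random(0)
n, sigma = 1000, 0.5
x = [(i + 1) / n for i in range(n)]
y = [math.sin(2 * math.pi * xi) + rng.gauss(0.0, sigma) for xi in x]
sigma2_hat = variance_by_intercept(y, max_lag=int(math.sqrt(n)))
print(f"true variance = {sigma ** 2:.3f}, estimate = {sigma2_hat:.3f}")
```

The slope absorbs the bias that the smooth mean function contributes to each s_k, which is why the intercept, rather than any single s_k, serves as the variance estimate.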

Effects of Local Food Value Perception on Purchasing and Experience

  • 원미경;박영희;이연정
    • 한국식생활문화학회지
    • /
    • Vol. 30, No. 1
    • /
    • pp.54-63
    • /
    • 2015
  • This study examined the effects of consumers' local food value perception on purchasing and experience. The $\chi^2$ test, ANOVA, and linear regression analysis were conducted. The findings are summarized as follows. The most common place for buying agricultural products was 'hypermarkets' (41.7%), and the most important factor in purchasing local food was 'local government-certified products' (23.7%). The highest-rated value recognition item for local food was 'I think that local food is a high-quality agricultural product' (3.74 points), followed by 'I think that local food has a value of respect for customers' (3.61 points) and 'I have faith in local food' (3.61 points). The main tourism experience activity was 'food experience' (49.0%), and the main information source for local food experience tourism was 'mass media (TV, newspapers, etc.)' (37.3%). Experience of local food increased with age. The value recognition item with the greatest effect on purchasing local food was 'I think that local food has a value of respect for customers'. The item with the greatest effect on increasing intake experience of local food was 'I think that local food is a high-quality agricultural product'.

Development of a Model to Predict the Number of Visitors to Local Festivals Using Machine Learning

  • 이인지;윤현식
    • 한국정보시스템학회지:정보시스템연구
    • /
    • Vol. 29, No. 3
    • /
    • pp.35-52
    • /
    • 2020
  • Purpose: Local governments actively hold local festivals to promote their regions and revitalize the local economy. Existing studies of local festivals have been conducted mainly in tourism and related academic fields, and most are either empirical studies of the effects of latent variables on local festivals or analyses of the regional economic impact of festivals. Despite the practical need, few studies have tried to predict the number of visitors, one of the criteria for evaluating festival performance. This study therefore developed a model that predicts the number of visitors from various observed variables using machine learning algorithms and derived its implications. Design/methodology/approach: For a total of 593 festivals held in 2018, the training data comprised 6 region-related variables reflecting population size, administrative division, and accessibility, and 15 festival-related variables such as the degree of publicity and word of mouth, invited singers, weather, and budget. Because the number of visitors is continuous numerical data, the machine learning algorithms capable of regression were used: random forest, AdaBoost, and linear regression. Findings: This study confirmed that the number of visitors to local festivals can be predicted with a machine learning algorithm, and it showed the potential of machine learning for research in tourism and related academic fields, including the study of local festivals. From a practical point of view, the model can be used to predict the number of visitors to a future festival, so that the festival can be evaluated in advance and the demand for related facilities can be anticipated. In addition, the RReliefF ranking results can be used to improve existing local festivals or to inform the planning of a new festival.

Estimation and variable selection in censored regression model with smoothly clipped absolute deviation penalty

  • Shim, Jooyong;Bae, Jongsig;Seok, Kyungha
    • Journal of the Korean Data and Information Science Society
    • /
    • Vol. 27, No. 6
    • /
    • pp.1653-1660
    • /
    • 2016
  • The smoothly clipped absolute deviation (SCAD) penalty is known to satisfy the desirable properties of a penalty function, such as unbiasedness, sparsity, and continuity. In this paper, we deal with regression function estimation and variable selection based on a SCAD-penalized censored regression model. We use the local linear approximation and the iteratively reweighted least squares algorithm to maximize the SCAD-penalized log-likelihood function. The proposed method provides an efficient way to perform variable selection and regression function estimation. A generalized cross-validation function is presented for model selection. Applications of the proposed method are illustrated with simulated and real examples.
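
As a rough illustration of the penalty named in this abstract, the sketch below writes out the standard SCAD penalty and its derivative with the conventional default a = 3.7. In a local linear approximation step, the derivative evaluated at the current coefficients is commonly used as a per-coefficient L1 weight. The function names are hypothetical, and the censored-regression likelihood and the iteratively reweighted least squares loop of the paper are not reproduced here.

```python
def scad_penalty(theta, lam, a=3.7):
    """SCAD penalty p_lam(|theta|): linear near zero, quadratic blend, then constant."""
    t = abs(theta)
    if t <= lam:
        return lam * t
    if t <= a * lam:
        return (2 * a * lam * t - t * t - lam * lam) / (2 * (a - 1))
    return lam * lam * (a + 1) / 2

def scad_derivative(theta, lam, a=3.7):
    """Derivative of the SCAD penalty with respect to |theta|."""
    t = abs(theta)
    if t <= lam:
        return lam
    return max(a * lam - t, 0.0) / (a - 1)

def lla_weights(beta, lam, a=3.7):
    """Weights for one local linear approximation step (assumed usage):
    the penalty is replaced by sum_j w_j * |beta_j| with w_j = p'(|beta_j|)."""
    return [scad_derivative(b, lam, a) for b in beta]

current_beta = [0.1, 2.0, 5.0]
print(lla_weights(current_beta, lam=1.0))
```

Small coefficients keep the full L1 weight, which encourages sparsity, while coefficients larger than a·λ receive zero weight and are therefore left unpenalized.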

Design of Incremental Model by Linear Regression and Local RBFNs

  • 이명원;곽근창
    • 한국정보처리학회:학술대회논문집
    • /
    • 한국정보처리학회 2010 Fall Conference
    • /
    • pp.471-473
    • /
    • 2010
  • This paper concerns the design of an incremental model that combines linear regression (LR) with local radial basis function networks (RBFNs). Unlike typical RBFN modeling, the proposed method rests on two steps. First, linear regression serves as a global model that captures the linear part of the data. Next, the remaining modeling error is compensated by RBFNs in the local regions where the error occurs. To design the RBFNs from the error distribution, information granules are constructed through context-based fuzzy clustering (CFC). Experiments on automobile MPG fuel consumption prediction and real estate price prediction demonstrate the superiority of the proposed method.
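
The two-step principle described in this abstract can be sketched as follows. This is only an illustration under stated assumptions: the RBF centers are placed on a fixed grid instead of being obtained by context-based fuzzy clustering, the data are one-dimensional and synthetic, and all names, widths, and the small ridge term are hypothetical.

```python
import math

def fit_line(xs, ys):
    """Step 1: global linear model y = a + b*x by ordinary least squares."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxx = sum((x - mx) ** 2 for x in xs)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    b = sxy / sxx
    return my - b * mx, b

def rbf_features(x, centers, width):
    """Gaussian basis functions evaluated at x."""
    return [math.exp(-((x - c) ** 2) / (2 * width ** 2)) for c in centers]

def solve(a, b):
    """Solve the linear system a w = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]
    for col in range(n):
        piv = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[piv] = m[piv], m[col]
        for r in range(col + 1, n):
            f = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= f * m[col][c]
    w = [0.0] * n
    for i in reversed(range(n)):
        w[i] = (m[i][n] - sum(m[i][j] * w[j] for j in range(i + 1, n))) / m[i][i]
    return w

def fit_rbf(xs, residuals, centers, width, ridge=1e-8):
    """Step 2: least-squares RBF weights fitted to the residuals of the linear model."""
    phi = [rbf_features(x, centers, width) for x in xs]
    k = len(centers)
    gram = [[sum(p[i] * p[j] for p in phi) + (ridge if i == j else 0.0)
             for j in range(k)] for i in range(k)]
    rhs = [sum(p[i] * r for p, r in zip(phi, residuals)) for i in range(k)]
    return solve(gram, rhs)

# Synthetic data: a linear trend plus a nonlinear component.
xs = [-3 + 6 * i / 99 for i in range(100)]
ys = [0.5 * x + math.sin(2 * x) for x in xs]

a, b = fit_line(xs, ys)
residuals = [y - (a + b * x) for x, y in zip(xs, ys)]
centers = [-3.0, -2.0, -1.0, 0.0, 1.0, 2.0, 3.0]  # fixed grid, not CFC
width = 0.7
w = fit_rbf(xs, residuals, centers, width)

def predict(x):
    """Incremental model: global linear part plus local RBF correction."""
    correction = sum(wi * f for wi, f in zip(w, rbf_features(x, centers, width)))
    return a + b * x + correction

mse_linear = sum(r ** 2 for r in residuals) / len(xs)
mse_combined = sum((y - predict(x)) ** 2 for x, y in zip(xs, ys)) / len(xs)
print(f"linear MSE = {mse_linear:.4f}, linear + RBF MSE = {mse_combined:.4f}")
```

Because the RBF weights are fitted to the residuals, the combined model cannot have a larger training error than the linear model alone; how well it generalizes depends on the choice of centers and widths.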

Design and Assessment of an Ozone Potential Forecasting Model using Multi-regression Equations in Ulsan Metropolitan Area

  • 김유근;이소영;임윤규;송상근
    • 한국대기환경학회지
    • /
    • Vol. 23, No. 1
    • /
    • pp.14-28
    • /
    • 2007
  • This study selected ozone ($O_3$) potential factors and designed and assessed an $O_3$ potential prediction model based on multiple linear regression equations for the Ulsan area during the springtime (April to June) of 2000 to 2004. The $O_3$ potential factors were selected by analyzing the relationship between meteorological parameters and surface $O_3$ concentrations. In addition, cluster analysis (e.g., average linkage and K-means clustering techniques) was performed to identify three major synoptic patterns (P1 to P3) for the $O_3$ potential prediction model. P1 is characterized by a low-pressure system over northeastern Korea, with Ulsan influenced by a northwesterly synoptic flow that retards sea breeze development. P2 is characterized by a weakening high-pressure system over Korea, and P3 is clearly associated with a migratory anticyclone. Stepwise linear regression was performed to develop models for predicting the highest 1-h $O_3$ concentration in Ulsan. The results of the models were rather satisfactory: the high $O_3$ simulation accuracy for synoptic patterns P1 to P3 was 79, 85, and 95%, respectively (2000 to 2004). Using the predicted meteorological data for 2005, the $O_3$ potential prediction model for P1 to P3 showed good high $O_3$ prediction performance of 78, 75, and 70%, respectively. The regression models can therefore be a useful tool for forecasting local $O_3$ concentrations.

Local joint flexibility equations for Y-T and K-type tubular joints

  • Asgarian, Behrouz;Mokarram, Vahid;Alanjari, Pejman
    • Ocean Systems Engineering
    • /
    • Vol. 4, No. 2
    • /
    • pp.151-167
    • /
    • 2014
  • Analyses of offshore platforms are commonly carried out under the assumption of rigid tubular joints. However, many studies have concluded that the local joint flexibility (LJF) of tubular joints should be taken into account. Moreover, advanced analysis of old offshore platforms that considers local joint flexibility leads to more accurate results. This paper presents an extensive finite-element (FE) based study of the flexibility of uniplanar multi-brace tubular Y-T and K-joints commonly found in offshore platforms. A wide range of geometric parameters of Y-T and K-joints used in offshore practice is covered to generate reliable parametric equations for the flexibility matrices. The formulas are obtained by nonlinear regression analyses of the database. The proposed equations are verified against existing analytical and experimental formulations. The equations can be used reliably in global analyses of offshore structures to account for LJF effects on the overall behavior of the structure.

COMPOUNDED METHOD FOR LAND COVERING CLASSIFICATION BASED ON MULTI-RESOLUTION SATELLITE DATA

  • HE WENJU;QIN HUA;SUN WEIDONG
    • 대한원격탐사학회:학술대회논문집
    • /
    • 대한원격탐사학회 2005, Proceedings of ISRS 2005
    • /
    • pp.116-119
    • /
    • 2005
  • For the combined estimation of land cover parameters, or the compounded land cover classification, from multi-resolution satellite data, earlier research mainly adopted linear or nonlinear regression models to describe how land cover parameters change with the degradation of spatial resolution, in order to improve the retrieval accuracy of global land cover parameters from lower resolution satellite data. However, these methods cannot faithfully represent, at the arithmetic level, the complementary characteristics of the spatial resolutions of different satellite data. To resolve this problem, a new compounded land cover classification method at the arithmetic level for multi-resolution satellite data is proposed in this paper. First, on the basis of unsupervised clustering analysis of the higher resolution satellite data, the likelihood distribution scatterplot of each cover type is obtained from the multiple-to-single spatial correspondence between the higher and lower resolution satellite data in some local test regions. Then the Parzen window approach is adopted to derive the real likelihood functions from the scatterplots. Finally, the likelihood functions are extended from the local test regions to the full coverage area of the lower resolution satellite data, and the global coverage area of the lower resolution satellite data is classified under the maximum likelihood rule. Experimental results indicate that the proposed compounded method can improve the classification accuracy of large-scale lower resolution satellite data with the support of some local-area higher resolution satellite data.

Nonparametric estimation of conditional quantile with censored data

  • 김은영;최혜미
    • Journal of the Korean Data and Information Science Society
    • /
    • Vol. 24, No. 2
    • /
    • pp.211-222
    • /
    • 2013
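  • This paper addresses nonparametric estimation of the conditional quantile function in the presence of censored data. Two inversion-based methods, the double-kernel estimator proposed by Yu and Jones (1998) and the local logistic estimator of Lee et al. (2006), are modified to handle censored data and proposed as new estimators. They are compared through simulation with existing kernel-smoothed estimators based on the check function of Koenker and Bassett (1978). The simulations show that the inversion-based estimators generally estimate conditional quantiles better in models with a symmetric conditional distribution, whereas the check-function estimators do better when the distribution is skewed.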
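
The check function mentioned in this abstract can be written out in a few lines. The sketch below is a simplified, unconditional illustration with no kernel smoothing and no censoring adjustment: it defines the check loss ρ_τ(u) = u(τ - 1{u < 0}) and shows that minimizing the total check loss over a constant recovers a sample quantile. The function names and the toy data are hypothetical.

```python
def check_loss(u, tau):
    """Check function: rho_tau(u) = u * (tau - 1{u < 0})."""
    return u * (tau - (1.0 if u < 0 else 0.0))

def constant_quantile_fit(values, tau):
    """Return the data point q that minimizes the summed check loss of (v - q)."""
    return min(values, key=lambda q: sum(check_loss(v - q, tau) for v in values))

data = [1, 2, 3, 4, 5, 6, 7, 8, 9]
median_fit = constant_quantile_fit(data, 0.5)           # minimizer for tau = 0.5
upper_quartile_fit = constant_quantile_fit(data, 0.75)  # minimizer for tau = 0.75
print(median_fit, upper_quartile_fit)
```

Check-function estimators of a conditional quantile apply the same loss with kernel weights centered at the covariate value of interest, which is the family of estimators the abstract compares against.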