• 제목/요약/키워드: Prediction Accuracy

검색결과 3,705건 처리시간 0.028초

협력적 필터링 추천 시스템의 정확도 향상 (Accuracy improvement of a collaborative filtering recommender system)

  • 이석환;박승현
    • 대한안전경영과학회지
    • /
    • 제12권1호
    • /
    • pp.127-136
    • /
    • 2010
  • In this paper, the author proposed following two methods to improve the accuracy of the recommender system. First, in order to classify the users more accurately, the author used a EMC(Expanded Moving Center) heuristic algorithm which improved clustering accuracy. Second, the author proposed the Neighborhood-oriented preference prediction method that improved the conventional preference prediction methods, so the accuracy of the recommender system is improved. The test result of the recommender system which adapted the above two methods suggested in this paper was improved the accuracy than the conventional recommendation methods.

Balanced Accuracy and Confidence Probability of Interval Estimates

  • Liu, Yi-Hsin;Stan Lipovetsky;Betty L. Hickman
    • International Journal of Reliability and Applications
    • /
    • 제3권1호
    • /
    • pp.37-50
    • /
    • 2002
  • Simultaneous estimation of accuracy and probability corresponding to a prediction interval is considered in this study. Traditional application of confidence interval forecasting consists in evaluation of interval limits for a given significance level. The wider is this interval, the higher is probability and the lower is the forecast precision. In this paper a measure of stochastic forecast accuracy is introduced, and a procedure for balanced estimation of both the predicting accuracy and confidence probability is elaborated. Solution can be obtained in an optimizing approach. Suggested method is applied to constructing confidence intervals for parameters estimated by normal and t distributions

  • PDF

Evaluation of accuracies of genomic predictions for body conformation traits in Korean Holstein

  • Md Azizul Haque;Mohammad Zahangir Alam;Asif Iqbal;Yun Mi Lee;Chang Gwon Dang;Jong Joo Kim
    • Animal Bioscience
    • /
    • 제37권4호
    • /
    • pp.555-566
    • /
    • 2024
  • Objective: This study aimed to assess the genetic parameters and accuracy of genomic predictions for twenty-four linear body conformation traits and overall conformation scores in Korean Holstein dairy cows. Methods: A dataset of 2,206 Korean Holsteins was collected, and genotyping was performed using the Illumina Bovine 50K single nucleotide polymorphism (SNP) chip. The traits investigated included body traits (stature, height at front end, chest width, body depth, angularity, body condition score, and locomotion), rump traits (rump angle, rump width, and loin strength), feet and leg traits (rear leg set, rear leg rear view, foot angle, heel depth, and bone quality), udder traits (udder depth, udder texture, udder support, fore udder attachment, front teat placement, front teat length, rear udder height, rear udder width, and rear teat placement), and overall conformation score. Accuracy of genomic predictions was assessed using the single-trait animal model genomic best linear unbiased prediction method implemented in the ASReml-SA v4.2 software. Results: Heritability estimates ranged from 0.10 to 0.50 for body traits, 0.21 to 0.35 for rump traits, 0.13 to 0.29 for feet and leg traits, and 0.05 to 0.46 for udder traits. Rump traits exhibited the highest average heritability (0.29), while feet and leg traits had the lowest estimates (0.21). Accuracy of genomic predictions varied among the twenty-four linear body conformation traits, ranging from 0.26 to 0.49. The heritability and prediction accuracy of genomic estimated breeding value (GEBV) for the overall conformation score were 0.45 and 0.46, respectively. The GEBVs for body conformation traits in Korean Holstein cows had low accuracy, falling below the 50% threshold. Conclusion: The limited response to selection for body conformation traits in Korean Holsteins may be attributed to both the low heritability of these traits and the lower accuracy estimates for GEBVs. Further research is needed to enhance the accuracy of GEBVs and improve the selection response for these traits.

예측성향을 고려한 비대칭 서포트벡터 회귀의 적용 (Application of Asymmetric Support Vector Regression Considering Predictive Propensity)

  • 이동주
    • 산업경영시스템학회지
    • /
    • 제45권1호
    • /
    • pp.71-82
    • /
    • 2022
  • Most of the predictions using machine learning are neutral predictions considering the symmetrical situation where the predicted value is not smaller or larger than the actual value. However, in some situations, asymmetric prediction such as over-prediction or under-prediction may be better than neutral prediction, and it can induce better judgment by providing various predictions to decision makers. A method called Asymmetric Twin Support Vector Regression (ATSVR) using TSVR(Twin Support Vector Regression), which has a fast calculation time, was proposed by controlling the asymmetry of the upper and lower widths of the ε-tube and the asymmetry of the penalty with two parameters. In addition, by applying the existing GSVQR and the proposed ATSVR, prediction using the prediction propensities of over-prediction, under-prediction, and neutral prediction was performed. When two parameters were used for both GSVQR and ATSVR, it was possible to predict according to the prediction propensity, and ATSVR was found to be more than twice as fast in terms of calculation time. On the other hand, in terms of accuracy, there was no significant difference between ATSVR and GSVQR, but it was found that GSVQR reflected the prediction propensity better than ATSVR when checking the figures. The accuracy of under-prediction or over-prediction was lower than that of neutral prediction. It seems that using both parameters rather than using one of the two parameters (p_1,p_2) increases the change in the prediction tendency. However, depending on the situation, it may be better to use only one of the two parameters.

기계학습을 이용한 수출신용보증 사고예측 (The Prediction of Export Credit Guarantee Accident using Machine Learning)

  • 조재영;주지환;한인구
    • 지능정보연구
    • /
    • 제27권1호
    • /
    • pp.83-102
    • /
    • 2021
  • 2020년 8월 정부는 한국판 뉴딜을 뒷받침하기 위한 공공기관의 역할 강화방안으로서 각 공공기관별 역량을 바탕으로 5대 분야에 걸쳐 총 20가지 과제를 선정하였다. 빅데이터(Big Data), 인공지능 등을 활용하여 대국민 서비스를 제고하고 공공기관이 보유한 양질의 데이터를 개방하는 등의 다양한 정책을 통해 한국판 뉴딜(New Deal)의 성과를 조기에 창출하고 이를 극대화하기 위한 다양한 노력을 기울이고 있다. 그중에서 한국무역보험공사(KSURE)는 정책금융 공공기관으로 국내 수출기업들을 지원하기 위해 여러 제도를 운영하고 있는데 아직까지는 본 기관이 가지고 있는 빅데이터를 적극적으로 활용하지 못하고 있는 실정이다. 본 연구는 한국무역보험공사의 수출신용보증 사고 발생을 사전에 예측하고자 공사가 보유한 내부 데이터에 기계학습 모형을 적용하였고 해당 모형 간에 예측성과를 비교하였다. 예측 모형으로는 로지스틱(Logit) 회귀모형, 랜덤 포레스트(Random Forest), XGBoost, LightGBM, 심층신경망을 사용하였고, 평가 기준으로는 전체 표본의 예측 정확도 이외에도 표본별 사고 확률을 구간으로 나누어 높은 확률로 예측된 표본과 낮은 확률로 예측된 경우의 정확도를 서로 비교하였다. 각 모형별 전체 표본의 예측 정확도는 70% 내외로 나타났고 개별 표본을 사고 확률 구간별로 세부 분석한 결과 양 극단의 확률구간(0~20%, 80~100%)에서 90~100%의 예측 정확도를 보여 모형의 현실적 활용 가능성을 보여주었다. 제2종 오류의 중요성 및 전체적 예측 정확도를 종합적으로 고려할 경우, XGBoost와 심층신경망이 가장 우수한 모형으로 평가되었다. 랜덤포레스트와 LightGBM은 그 다음으로 우수하며, 로지스틱 회귀모형은 가장 낮은 성과를 보였다. 본 연구는 한국무역보험공사의 빅데이터를 기계학습모형으로 분석해 업무의 효율성을 높이는 사례로서 향후 기계학습 등을 활용하여 실무 현장에서 빅데이터 분석 및 활용이 활발해지기를 기대한다.

Mobility Prediction Algorithms Using User Traces in Wireless Networks

  • Luong, Chuyen;Do, Son;Park, Hyukro;Choi, Deokjai
    • 한국멀티미디어학회논문지
    • /
    • 제17권8호
    • /
    • pp.946-952
    • /
    • 2014
  • Mobility prediction is one of hot topics using location history information. It is useful for not only user-level applications such as people finder and recommendation sharing service but also for system-level applications such as hand-off management, resource allocation, and quality of service of wireless services. Most of current prediction techniques often use a set of significant locations without taking into account possible location information changes for prediction. Markov-based, LZ-based and Prediction by Pattern Matching techniques consider interesting locations to enhance the prediction accuracy, but they do not consider interesting location changes. In our paper, we propose an algorithm which integrates the changing or emerging new location information. This approach is based on Active LeZi algorithm, but both of new location and all possible location contexts will be updated in the tree with the fixed depth. Furthermore, the tree will also be updated even when there is no new location detected but the expected route is changed. We find that our algorithm is adaptive to predict next location. We evaluate our proposed system on a part of Dartmouth dataset consisting of 1026 users. An accuracy rate of more than 84% is achieved.

TIME SERIES PREDICTION USING INCREMENTAL REGRESSION

  • Kim, Sung-Hyun;Lee, Yong-Mi;Jin, Long;Chai, Duck-Jin;Ryu, Keun-Ho
    • 대한원격탐사학회:학술대회논문집
    • /
    • 대한원격탐사학회 2006년도 Proceedings of ISRS 2006 PORSEC Volume II
    • /
    • pp.635-638
    • /
    • 2006
  • Regression of conventional prediction techniques in data mining uses the model which is generated from the training step. This model is applied to new input data without any change. If this model is applied directly to time series, the rate of prediction accuracy will be decreased. This paper proposes an incremental regression for time series prediction like typhoon track prediction. This technique considers the characteristic of time series which may be changed over time. It is composed of two steps. The first step executes a fractional process for applying input data to the regression model. The second step updates the model by using its information as new data. Additionally, the model is maintained by only recent data in a queue. This approach has the following two advantages. It maintains the minimum information of the model by using a matrix, so space complexity is reduced. Moreover, it prevents the increment of error rate by updating the model over time. Accuracy rate of the proposed method is measured by RME(Relative Mean Error) and RMSE(Root Mean Square Error). The results of typhoon track prediction experiment are performed by the proposed technique IMLR(Incremental Multiple Linear Regression) is more efficient than those of MLR(Multiple Linear Regression) and SVR(Support Vector Regression).

  • PDF

셀 레벨에서의 OPTICS 기반 특질 추출을 이용한 칩 품질 예측 (A Prediction of Chip Quality using OPTICS (Ordering Points to Identify the Clustering Structure)-based Feature Extraction at the Cell Level)

  • 김기현;백준걸
    • 대한산업공학회지
    • /
    • 제40권3호
    • /
    • pp.257-266
    • /
    • 2014
  • The semiconductor manufacturing industry is managed by a number of parameters from the FAB which is the initial step of production to package test which is the final step of production. Various methods for prediction for the quality and yield are required to reduce the production costs caused by a complicated manufacturing process. In order to increase the accuracy of quality prediction, we have to extract the significant features from the large amount of data. In this study, we propose the method for extracting feature from the cell level data of probe test process using OPTICS which is one of the density-based clustering to improve the prediction accuracy of the quality of the assembled chips that will be placed in a package test. Two features extracted by using OPTICS are used as input variables of quality prediction model because of having position information of the cell defect. The package test progress for chips classified to the correct quality grade by performing the improved prediction method is expected to bring the effect of reducing production costs.

Prediction of 305 Days Milk Production from Early Records in Dairy Cattle Using an Empirical Bayes Method

  • Pereira, J.A.C.;Suzuki, M.;Hagiya, K.
    • Asian-Australasian Journal of Animal Sciences
    • /
    • 제14권11호
    • /
    • pp.1511-1515
    • /
    • 2001
  • A prediction of 305 d milk production from early records using an empirical Bayes method (EBM) was performed. The EBM was compared with the best predicted estimation (BPE), test interval method (TIM), and the linearized Wood's model (LWM). Daily milk yields were obtained from 606 first lactation Japanese Holstein cows in three herds. From each file of 305 daily records, 10 random test day records with an interval of approximately one month were taken. The accuracies of these methods were compared using the absolute difference (AD) and the standard deviation (SD) of the differences between the actual and the estimated 305 d milk production. The results showed that in the early stage of the lactation, EBM was superior in obtaining the prediction with high accuracy. When all the herds were analyzed jointly, the AD during the first 5 test day records were on average 373, 590, 917 and 1,042 kg for EBM, BPE, TIM, and LWM, respectively. Corresponding SD for EBM, BPE, TIM, and LWM were on average 488, 733, 747 and 1,605 kg. When the herds were analyzed separately, the EBM predictions retained high accuracy. When more information on the actual lactation was added to the prediction, TIM and LWM gradually achieved better accuracies. Finally, in the last period of the lactation, the accuracy of both of the methods exceeded EBM and BPM. The AD for the last 2 samples analyzing all the herds jointly were on average 141, 142, 164, and 214 kg for LWM, TIM, EBM, and BPE, respectively. In the current practices of collecting monthly records, early prediction of future milk production may be more accurate using EBM. Alternatively, if enough information of the actual lactation is accumulated, TIM may obtain better accuracy in the latter stage of lactation.

시계열 데이터의 성격과 예측 모델의 예측력에 관한 연구 (Relationships Between the Characteristics of the Business Data Set and Forecasting Accuracy of Prediction models)

  • 이원하;최종욱
    • 지능정보연구
    • /
    • 제4권1호
    • /
    • pp.133-147
    • /
    • 1998
  • Recently, many researchers have been involved in finding deterministic equations which can accurately predict future event, based on chaotic theory, or fractal theory. The theory says that some events which seem very random but internally deterministic can be accurately predicted by fractal equations. In contrast to the conventional methods, such as AR model, MA, model, or ARIMA model, the fractal equation attempts to discover a deterministic order inherent in time series data set. In discovering deterministic order, researchers have found that neural networks are much more effective than the conventional statistical models. Even though prediction accuracy of the network can be different depending on the topological structure and modification of the algorithms, many researchers asserted that the neural network systems outperforms other systems, because of non-linear behaviour of the network models, mechanisms of massive parallel processing, generalization capability based on adaptive learning. However, recent survey shows that prediction accuracy of the forecasting models can be determined by the model structure and data structures. In the experiments based on actual economic data sets, it was found that the prediction accuracy of the neural network model is similar to the performance level of the conventional forecasting model. Especially, for the data set which is deterministically chaotic, the AR model, a conventional statistical model, was not significantly different from the MLP model, a neural network model. This result shows that the forecasting model. This result shows that the forecasting model a, pp.opriate to a prediction task should be selected based on characteristics of the time series data set. Analysis of the characteristics of the data set was performed by fractal analysis, measurement of Hurst index, and measurement of Lyapunov exponents. As a conclusion, a significant difference was not found in forecasting future events for the time series data which is deterministically chaotic, between a conventional forecasting model and a typical neural network model.

  • PDF