• Title/Summary/Keyword: Prediction Performance

Search Result 5,497, Processing Time 0.039 seconds

A study on the use of a Business Intelligence system : the role of explanations (비즈니스 인텔리전스 시스템의 활용 방안에 관한 연구: 설명 기능을 중심으로)

  • Kwon, YoungOk
    • Journal of Intelligence and Information Systems
    • /
    • v.20 no.4
    • /
    • pp.155-169
    • /
    • 2014
  • With the rapid advances in technologies, organizations are more likely to depend on information systems in their decision-making processes. Business Intelligence (BI) systems, in particular, have become a mainstay in dealing with complex problems in an organization, partly because a variety of advanced computational methods from statistics, machine learning, and artificial intelligence can be applied to solve business problems such as demand forecasting. In addition to the ability to analyze past and present trends, these predictive analytics capabilities provide huge value to an organization's ability to respond to change in markets, business risks, and customer trends. While the performance effects of BI system use in organization settings have been studied, it has been little discussed on the use of predictive analytics technologies embedded in BI systems for forecasting tasks. Thus, this study aims to find important factors that can help to take advantage of the benefits of advanced technologies of a BI system. More generally, a BI system can be viewed as an advisor, defined as the one that formulates judgments or recommends alternatives and communicates these to the person in the role of the judge, and the information generated by the BI system as advice that a decision maker (judge) can follow. Thus, we refer to the findings from the advice-giving and advice-taking literature, focusing on the role of explanations of the system in users' advice taking. It has been shown that advice discounting could occur when an advisor's reasoning or evidence justifying the advisor's decision is not available. However, the majority of current BI systems merely provide a number, which may influence decision makers in accepting the advice and inferring the quality of advice. We in this study explore the following key factors that can influence users' advice taking within the setting of a BI system: explanations on how the box-office grosses are predicted, types of advisor, i.e., system (data mining technique) or human-based business advice mechanisms such as prediction markets (aggregated human advice) and human advisors (individual human expert advice), users' evaluations of the provided advice, and individual differences in decision-makers. Each subject performs the following four tasks, by going through a series of display screens on the computer. First, given the information of the given movie such as director and genre, the subjects are asked to predict the opening weekend box office of the movie. Second, in light of the information generated by an advisor, the subjects are asked to adjust their original predictions, if they desire to do so. Third, they are asked to evaluate the value of the given information (e.g., perceived usefulness, trust, satisfaction). Lastly, a short survey is conducted to identify individual differences that may affect advice-taking. The results from the experiment show that subjects are more likely to follow system-generated advice than human advice when the advice is provided with an explanation. When the subjects as system users think the information provided by the system is useful, they are also more likely to take the advice. In addition, individual differences affect advice-taking. The subjects with more expertise on advisors or that tend to agree with others adjust their predictions, following the advice. On the other hand, the subjects with more knowledge on movies are less affected by the advice and their final decisions are close to their original predictions. The advances in predictive analytics of a BI system demonstrate a great potential to support increasingly complex business decisions. This study shows how the designs of a BI system can play a role in influencing users' acceptance of the system-generated advice, and the findings provide valuable insights on how to leverage the advanced predictive analytics of the BI system in an organization's forecasting practices.

Carbon Monoxide Dispersion in an Urban Area Simulated by a CFD Model Coupled to the WRF-Chem Model (WRF-Chem 모델과 결합된 CFD 모델을 활용한 도시 지역의 일산화탄소 확산 연구)

  • Kwon, A-Rum;Park, Soo-Jin;Kang, Geon;Kim, Jae-Jin
    • Korean Journal of Remote Sensing
    • /
    • v.36 no.5_1
    • /
    • pp.679-692
    • /
    • 2020
  • We coupled a CFD model to the WRF-Chem model (WRF-CFD model) and investigated the characteristics of flows and carbon monoxide (CO) distributions in a building-congested district. We validated the simulated results against the measured wind speeds, wind directions, and CO concentrations. The WRF-Chem model simulated the winds from southwesterly to southeasterly, overestimating the measured wind speeds. The statistical validation showed that the WRF-CFD model simulated the measured wind speeds more realistically than the WRF-Chem model. The WRF-Chem model significantly underestimated the measured CO concentrations, and the WRF-CFD model improved the CO concentration prediction. Based on the statistical validation results, the WRF-CFD model improved the performance in predicting the CO concentrations by taking complicatedly distributed buildings and mobiles sources of CO into account. At 04 KST on May 22, there was a downdraft around the AQMS, and airflow with a relatively low CO concentration was advected from the upper layer. Resultantly, the CO concentration was lower at the AQMS than the surrounding area. At 15 KST on May 22, there was an updraft around the AQMS. This resulted in a slightly higher CO concentration than the surroundings. The WRF-CFD model transported CO emitted from the mobile sources to the AQMS measurement altitude, well reproducing the measured CO concentration. At 18 KST on May 22, the WRF-CFD model simulated high CO concentrations because of high CO emission, broad updraft area, and an increase in turbulent diffusion cause by wind-shear increase near the ground.

Improvements for Atmospheric Motion Vectors Algorithm Using First Guess by Optical Flow Method (옵티컬 플로우 방법으로 계산된 초기 바람 추정치에 따른 대기운동벡터 알고리즘 개선 연구)

  • Oh, Yurim;Park, Hyungmin;Kim, Jae Hwan;Kim, Somyoung
    • Korean Journal of Remote Sensing
    • /
    • v.36 no.5_1
    • /
    • pp.763-774
    • /
    • 2020
  • Wind data forecasted from the numerical weather prediction (NWP) model is generally used as the first-guess of the target tracking process to obtain the atmospheric motion vectors(AMVs) because it increases tracking accuracy and reduce computational time. However, there is a contradiction that the NWP model used as the first-guess is used again as the reference in the AMVs verification process. To overcome this problem, model-independent first guesses are required. In this study, we propose the AMVs derivation from Lucas and Kanade optical flow method and then using it as the first guess. To retrieve AMVs, Himawari-8/AHI geostationary satellite level-1B data were used at 00, 06, 12, and 18 UTC from August 19 to September 5, 2015. To evaluate the impact of applying the optical flow method on the AMV derivation, cross-validation has been conducted in three ways as follows. (1) Without the first-guess, (2) NWP (KMA/UM) forecasted wind as the first-guess, and (3) Optical flow method based wind as the first-guess. As the results of verification using ECMWF ERA-Interim reanalysis data, the highest precision (RMSVD: 5.296-5.804 ms-1) was obtained using optical flow based winds as the first-guess. In addition, the computation speed for AMVs derivation was the slowest without the first-guess test, but the other two had similar performance. Thus, applying the optical flow method in the target tracking process of AMVs algorithm, this study showed that the optical flow method is very effective as a first guess for model-independent AMVs derivation.

The Impact of Milk Production Level on Profit Traits of Holstein Dairy Cattle in Korea (국내 Holstein종 젖소의 생산수준이 젖소의 수익형질에 미치는 효과)

  • Do, Changhee;Park, Suhun;Cho, Kwang-Hyun;Choi, Yunho;Choi, Taejeong;Park, Byungho;Yun, Hobaek;Lee, Donghee
    • Journal of Animal Science and Technology
    • /
    • v.55 no.5
    • /
    • pp.343-349
    • /
    • 2013
  • Data including 1,372,050 milk records pertaining to 438,019 cows from 1983 to 2011 collected during performance tests conducted by the National Livestock Cooperative Dairy Improvement Center were used to calculate milk income and profit of individuals and investigate the effects of production levels of early lactation (parity 1 and 2, respectively). Individuals with a moderate level of early lactation stayed longer in herds. Among parity 1, the 9,000 kg or higher group had a lower mean number of lactations than the overall mean of 3.13. The 7,000 kg or lower and 10,000 kg or higher groups had lower mean life time milking days than the overall mean of 1,076.8 days. Standard deviations of lifetime traits tended to decrease as production levels increased. For parity 2, the 11,000 kg or higher group had a lower mean number of lactation than the overall mean of 3.43. The lifetime milking days was highest in the 12,000 kg group (1,212.0 days), and generally smaller in the lower groups. Profit increased as the production level of groups increased for both parity 1 and 2. In groups with low production levels, profit of parity 1 was higher than that of parity 2, while the reverse was true in groups with high production levels. These results suggest that individuals in the low production groups had a greater likelihood to be culled due to reproductive or other problems. Furthermore, the accuracy of the prediction of lifetime profit of individuals with a milk yield of 305 days seems to be higher for parity 2 than parity 1; therefore, it is desirable to predict lifetime profit using the 305d milk yield of parity 2. In conclusion, breeding goals are based on many factors in functions for the estimation of profit; however, production levels during early lactation (parity 1 and 2) can be used as indicators of profit to extend profitability.

Optimization of Support Vector Machines for Financial Forecasting (재무예측을 위한 Support Vector Machine의 최적화)

  • Kim, Kyoung-Jae;Ahn, Hyun-Chul
    • Journal of Intelligence and Information Systems
    • /
    • v.17 no.4
    • /
    • pp.241-254
    • /
    • 2011
  • Financial time-series forecasting is one of the most important issues because it is essential for the risk management of financial institutions. Therefore, researchers have tried to forecast financial time-series using various data mining techniques such as regression, artificial neural networks, decision trees, k-nearest neighbor etc. Recently, support vector machines (SVMs) are popularly applied to this research area because they have advantages that they don't require huge training data and have low possibility of overfitting. However, a user must determine several design factors by heuristics in order to use SVM. For example, the selection of appropriate kernel function and its parameters and proper feature subset selection are major design factors of SVM. Other than these factors, the proper selection of instance subset may also improve the forecasting performance of SVM by eliminating irrelevant and distorting training instances. Nonetheless, there have been few studies that have applied instance selection to SVM, especially in the domain of stock market prediction. Instance selection tries to choose proper instance subsets from original training data. It may be considered as a method of knowledge refinement and it maintains the instance-base. This study proposes the novel instance selection algorithm for SVMs. The proposed technique in this study uses genetic algorithm (GA) to optimize instance selection process with parameter optimization simultaneously. We call the model as ISVM (SVM with Instance selection) in this study. Experiments on stock market data are implemented using ISVM. In this study, the GA searches for optimal or near-optimal values of kernel parameters and relevant instances for SVMs. This study needs two sets of parameters in chromosomes in GA setting : The codes for kernel parameters and for instance selection. For the controlling parameters of the GA search, the population size is set at 50 organisms and the value of the crossover rate is set at 0.7 while the mutation rate is 0.1. As the stopping condition, 50 generations are permitted. The application data used in this study consists of technical indicators and the direction of change in the daily Korea stock price index (KOSPI). The total number of samples is 2218 trading days. We separate the whole data into three subsets as training, test, hold-out data set. The number of data in each subset is 1056, 581, 581 respectively. This study compares ISVM to several comparative models including logistic regression (logit), backpropagation neural networks (ANN), nearest neighbor (1-NN), conventional SVM (SVM) and SVM with the optimized parameters (PSVM). In especial, PSVM uses optimized kernel parameters by the genetic algorithm. The experimental results show that ISVM outperforms 1-NN by 15.32%, ANN by 6.89%, Logit and SVM by 5.34%, and PSVM by 4.82% for the holdout data. For ISVM, only 556 data from 1056 original training data are used to produce the result. In addition, the two-sample test for proportions is used to examine whether ISVM significantly outperforms other comparative models. The results indicate that ISVM outperforms ANN and 1-NN at the 1% statistical significance level. In addition, ISVM performs better than Logit, SVM and PSVM at the 5% statistical significance level.

Quantification of Protein and Amylose Contents by Near Infrared Reflectance Spectroscopy in Aroma Rice (근적외선 분광분석법을 이용한 향미벼의 아밀로스 및 단백질 정량분석)

  • Kim, Jeong-Soon;Song, Mi-Hee;Choi, Jae-Eul;Lee, Hee-Bong;Ahn, Sang-Nag
    • Korean Journal of Food Science and Technology
    • /
    • v.40 no.6
    • /
    • pp.603-610
    • /
    • 2008
  • The principal objective of current study was to evaluate the potential of near infrared reflectance spectroscopy (NIRS) as a non-destructive method for the prediction of the amylose and protein contents of un-hulled and brown rice in broad-based calibration models. The average amylose and protein content of 75 rice accessions were 20.3% and 7.1%, respectively. Additionally, the range of amylose and protein content were 16.6-24.5% and 3.8-9.3%, respectively. In total, 79 rice germplasms representing a wide range of chemical characteristics, variable physical properties, and origins were scanned via NIRS for calibration and validation equations. The un-hulled and brown rice samples evidenced distinctly different patterns in a wavelength range from 1,440 nm to 2,400 nm in the original NIR spectra. The optimal performance calibration model could be obtained by MPLS (modified partial least squares) using the first derivative method (1:4:4:1) for un-hulled rice and the second derivative method (2:4:4:1) for brown rice. The correlation coefficients $(r^2)$ and standard error of calibration (SEC) of protein and amylose contents for the un-hulled rice were 0.86, 2.48, and 0.84, 1.13, respectively. The $r^2$ and SEC of protein and amylose content for brown rice were 0.95, 1.09 and 0.94, 0.42, respectively. The results of this study suggest that the NIRS technique could be utilized as a routine procedure for the quantification of protein and amylose contents in large accessions of un-hulled rice germplasms.

Simulation and model validation of Biomass Fast Pyrolysis in a fluidized bed reactor using CFD (전산유체역학(CFD)을 이용한 유동층반응기 내부의 목질계 바이오매스 급속 열분해 모델 비교 및 검증)

  • Ju, Young Min;Euh, Seung Hee;Oh, Kwang cheol;Lee, Kang Yol;Lee, Beom Goo;Kim, Dae Hyun
    • Journal of Energy Engineering
    • /
    • v.24 no.4
    • /
    • pp.200-210
    • /
    • 2015
  • The modeling for fast pyrolysis of biomass in fluidized bed reactor has been developed for accurate prediction of bio-oil and gas products and for yield improvement. The purpose of this study is to analyze and to compare the CFD(Computational Fluid Dynamics) simulation results with the experimental data from the CFD simulation results with the experimental data from the reference(Mellin et al., 2014) for gas products generated during fast pyrolysis of biomass in fluidized bed reactor. CFD(ANSYS FLUENT v.15.0) was used for the simulation. Complex pyrolysis reaction scheme of biomass subcomponents was applied for the simulation of pyrolysis reaction. This pyrolysis reaction scheme was included reaction of cellulose, hemicellulose, lignin in detail, gas products obtained from pyrolysis were mainly $CO_2$, CO, $CH_4$, $H_2$, $C_2H_4$. The deviation between the simulation results from this study and experimental data from the reference was calculated about 3.7%p, 4.6%p, 3.9%p for $CH_4$, $H_2$, $C_2H_4$ respectively, whereas 9.6%p and 6.7%p for $CO_2$ and CO which are relatively high. Through this study, it is possible to predict gas products accurately by using CFD simulation approach. Moreover, this modeling approach should be developed to predict fluidized bed reactor performance and other gas product yields.

Bias Correction for GCM Long-term Prediction using Nonstationary Quantile Mapping (비정상성 분위사상법을 이용한 GCM 장기예측 편차보정)

  • Moon, Soojin;Kim, Jungjoong;Kang, Boosik
    • Journal of Korea Water Resources Association
    • /
    • v.46 no.8
    • /
    • pp.833-842
    • /
    • 2013
  • The quantile mapping is utilized to reproduce reliable GCM(Global Climate Model) data by correct systematic biases included in the original data set. This scheme, in general, projects the Cumulative Distribution Function (CDF) of the underlying data set into the target CDF assuming that parameters of target distribution function is stationary. Therefore, the application of stationary quantile mapping for nonstationary long-term time series data of future precipitation scenario computed by GCM can show biased projection. In this research the Nonstationary Quantile Mapping (NSQM) scheme was suggested for bias correction of nonstationary long-term time series data. The proposed scheme uses the statistical parameters with nonstationary long-term trends. The Gamma distribution was assumed for the object and target probability distribution. As the climate change scenario, the 20C3M(baseline scenario) and SRES A2 scenario (projection scenario) of CGCM3.1/T63 model from CCCma (Canadian Centre for Climate modeling and analysis) were utilized. The precipitation data were collected from 10 rain gauge stations in the Han-river basin. In order to consider seasonal characteristics, the study was performed separately for the flood (June~October) and nonflood (November~May) seasons. The periods for baseline and projection scenario were set as 1973~2000 and 2011~2100, respectively. This study evaluated the performance of NSQM by experimenting various ways of setting parameters of target distribution. The projection scenarios were shown for 3 different periods of FF scenario (Foreseeable Future Scenario, 2011~2040 yr), MF scenario (Mid-term Future Scenario, 2041~2070 yr), LF scenario (Long-term Future Scenario, 2071~2100 yr). The trend test for the annual precipitation projection using NSQM shows 330.1 mm (25.2%), 564.5 mm (43.1%), and 634.3 mm (48.5%) increase for FF, MF, and LF scenarios, respectively. The application of stationary scheme shows overestimated projection for FF scenario and underestimated projection for LF scenario. This problem could be improved by applying nonstationary quantile mapping.

Development of KD- Propeller Series using a New Blade Section (새로운 날개단면을 이용한 KD-프로펠러 씨리즈 개발)

  • J.T. Lee;M.C. Kim;J.W. Ahn;H.C. Kim
    • Journal of the Society of Naval Architects of Korea
    • /
    • v.28 no.2
    • /
    • pp.52-68
    • /
    • 1991
  • A new propeller series is developed using the newly developed blade section(KH18 section) which behaves better cavitation characteristics and higher lift-drag ratio at wide range of angle-of-attack. The pitch and camber distributions are disigned in order to have the same radial and chordwise loading distribution with the selected circumferentially averaged wake input. Since the geometries of the series propeller, such as chord length, thickness, skew and rate distribations, are selected by regression of the recent full scale propeller geometric data, the performance prediction of a propeller at preliminary design stage can be mure realistic. Number of blades of the series propellers is 4 and the expanded blade area ratios are 0.3, 0.45, 0.6 and 0.75. Mean pitch ratios are selected as 0.5, 0.65, 0.8, 0.75 and 1.1 for each expanded area ratio. The new propeller series is composed of 20 propellers and is named as KD(KRISO-DAEWOO) propeller series. Propeller open water tests are performed at the experimental towing tank, and the cavitation observation tests and fluctuating pressure measurements are carried out at the cavitation tunnel of KRISO. $B_{P}-\delta$ curves, which can be used to select the optimum propeller diameter at the preliminary design stage, are derived from a regression analysis of the propeller often water test results. The KD-cavitation chart is derived from the cavitation observation test results by choosing the local maximum lift coefficient and the local cavitation number as parameters. The caviy extent of a propeller can be predicted more accurately by using the KD-cavitation chart at a preliminary design stage, since it is derived from the results of the cavitation observation tests in the selected ship's wake, whereas the existing cavitation charts, such as the Burrill's cavitation chart, are derived from the test results in uniform flow.

  • PDF

The NCAM Land-Atmosphere Modeling Package (LAMP) Version 1: Implementation and Evaluation (국가농림기상센터 지면대기모델링패키지(NCAM-LAMP) 버전 1: 구축 및 평가)

  • Lee, Seung-Jae;Song, Jiae;Kim, Yu-Jung
    • Korean Journal of Agricultural and Forest Meteorology
    • /
    • v.18 no.4
    • /
    • pp.307-319
    • /
    • 2016
  • A Land-Atmosphere Modeling Package (LAMP) for supporting agricultural and forest management was developed at the National Center for AgroMeteorology (NCAM). The package is comprised of two components; one is the Weather Research and Forecasting modeling system (WRF) coupled with Noah-Multiparameterization options (Noah-MP) Land Surface Model (LSM) and the other is an offline one-dimensional LSM. The objective of this paper is to briefly describe the two components of the NCAM-LAMP and to evaluate their initial performance. The coupled WRF/Noah-MP system is configured with a parent domain over East Asia and three nested domains with a finest horizontal grid size of 810 m. The innermost domain covers two Gwangneung deciduous and coniferous KoFlux sites (GDK and GCK). The model is integrated for about 8 days with the initial and boundary conditions taken from the National Centers for Environmental Prediction (NCEP) Final Analysis (FNL) data. The verification variables are 2-m air temperature, 10-m wind, 2-m humidity, and surface precipitation for the WRF/Noah-MP coupled system. Skill scores are calculated for each domain and two dynamic vegetation options using the difference between the observed data from the Korea Meteorological Administration (KMA) and the simulated data from the WRF/Noah-MP coupled system. The accuracy of precipitation simulation is examined using a contingency table that is made up of the Probability of Detection (POD) and the Equitable Threat Score (ETS). The standalone LSM simulation is conducted for one year with the original settings and is compared with the KoFlux site observation for net radiation, sensible heat flux, latent heat flux, and soil moisture variables. According to results, the innermost domain (810 m resolution) among all domains showed the minimum root mean square error for 2-m air temperature, 10-m wind, and 2-m humidity. Turning on the dynamic vegetation had a tendency of reducing 10-m wind simulation errors in all domains. The first nested domain (7,290 m resolution) showed the highest precipitation score, but showed little advantage compared with using the dynamic vegetation. On the other hand, the offline one-dimensional Noah-MP LSM simulation captured the site observed pattern and magnitude of radiative fluxes and soil moisture, and it left room for further improvement through supplementing the model input of leaf area index and finding a proper combination of model physics.