• Title/Summary/Keyword: demand prediction

An Integrated Model based on Genetic Algorithms for Implementing Cost-Effective Intelligent Intrusion Detection Systems (비용효율적 지능형 침입탐지시스템 구현을 위한 유전자 알고리즘 기반 통합 모형)

  • Lee, Hyeon-Uk;Kim, Ji-Hun;Ahn, Hyun-Chul
    • Journal of Intelligence and Information Systems / v.18 no.1 / pp.125-141 / 2012
  • Malicious attacks on networked systems are increasing dramatically, and their patterns are changing rapidly. Handling these attacks appropriately has therefore become more important, and there is considerable interest in and demand for effective network security systems such as intrusion detection systems. Intrusion detection systems are network security systems that detect, identify, and respond appropriately to unauthorized or abnormal activities. Conventional intrusion detection systems have generally been designed using experts' implicit knowledge of network intrusions or hackers' abnormal behaviors. Although they perform very well under normal conditions, they cannot handle new or unknown attack patterns. As a result, recent studies on intrusion detection systems use artificial intelligence techniques, which can respond proactively to unknown threats. Researchers have long adopted and tested various artificial intelligence techniques, such as artificial neural networks, decision trees, and support vector machines, to detect network intrusions. However, most studies have applied these techniques individually, even though combining them may lead to better detection. For this reason, we propose a new integrated model for intrusion detection. Our model combines the prediction results of four binary classification models that may complement one another: logistic regression (LOGIT), decision trees (DT), artificial neural networks (ANN), and support vector machines (SVM). Genetic algorithms (GA) are used to find the optimal combining weights. The proposed model is built in two steps. In the first step, the integration model with the lowest prediction error (i.e., misclassification rate) is generated. In the second step, the model searches for the classification threshold for determining intrusions that minimizes the total misclassification cost. Calculating the total misclassification cost of an intrusion detection system requires an understanding of its asymmetric error cost scheme. There are two common types of error in intrusion detection. The first is the false-positive error (FPE), in which normal activity is misjudged as an intrusion and may trigger unnecessary corrective action. The second is the false-negative error (FNE), in which malware is misjudged as normal. Compared with FPE, FNE is more fatal, so the total misclassification cost is affected more by FNE than by FPE. To validate the practical applicability of our model, we applied it to a real-world dataset for network intrusion detection. The dataset was collected from the IDS sensor of an official institution in Korea from January to June 2010. We collected 15,000 log records in total and selected 10,000 samples from them by random sampling. We also compared the results of our model with those of the single techniques to confirm the superiority of the proposed model. LOGIT and DT were run using PASW Statistics v18.0, and ANN was run using Neuroshell R4.0. For SVM, LIBSVM v2.90, a free tool for training SVM classifiers, was used. Empirical results showed that the proposed GA-based model outperformed all the comparative models in detecting network intrusions in terms of accuracy. It also outperformed all the comparative models in terms of total misclassification cost. Consequently, we expect that our study may contribute to building cost-effective intelligent intrusion detection systems.
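The second step described in this abstract can be illustrated with a minimal Python sketch. It assumes the combining weights have already been found (the GA search of the first step is not shown) and uses a plain grid search over thresholds, which may differ from the authors' procedure. The function names, the weights, the probabilities, and the 1:5 cost ratio are all hypothetical and do not come from the paper.

```python
def ensemble_score(probs, weights):
    """Weighted average of the per-model intrusion probabilities for one sample."""
    return sum(w * p for w, p in zip(weights, probs)) / sum(weights)


def total_cost(scores, labels, threshold, cost_fp=1.0, cost_fn=5.0):
    """Total misclassification cost under an asymmetric cost scheme.

    cost_fp and cost_fn are hypothetical unit costs; cost_fn is larger
    because a missed intrusion (FNE) is treated as more serious than a false alarm.
    """
    cost = 0.0
    for score, label in zip(scores, labels):
        predicted = 1 if score >= threshold else 0
        if predicted == 1 and label == 0:
            cost += cost_fp  # false positive: normal traffic flagged as intrusion
        elif predicted == 0 and label == 1:
            cost += cost_fn  # false negative: intrusion judged as normal
    return cost


def best_threshold(scores, labels, cost_fp=1.0, cost_fn=5.0, steps=100):
    """Grid-search the threshold in [0, 1] that minimizes the total cost."""
    candidates = [i / steps for i in range(steps + 1)]
    return min(candidates,
               key=lambda t: total_cost(scores, labels, t, cost_fp, cost_fn))


# Hypothetical probabilities from LOGIT, DT, ANN, and SVM for four samples.
model_probs = [
    [0.90, 0.80, 0.95, 0.90],
    [0.10, 0.30, 0.20, 0.10],
    [0.60, 0.70, 0.50, 0.60],
    [0.40, 0.50, 0.30, 0.40],
]
labels = [1, 0, 0, 1]              # 1 = intrusion, 0 = normal
weights = [0.1, 0.2, 0.3, 0.4]     # assumed output of the omitted GA step

scores = [ensemble_score(p, weights) for p in model_probs]
threshold = best_threshold(scores, labels)
print(threshold, total_cost(scores, labels, threshold))
```

Because a false negative costs more than a false positive in this sketch, the selected threshold tends to fall below 0.5, which accepts more false alarms in exchange for fewer missed intrusions.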

A study on the use of a Business Intelligence system : the role of explanations (비즈니스 인텔리전스 시스템의 활용 방안에 관한 연구: 설명 기능을 중심으로)

  • Kwon, YoungOk
    • Journal of Intelligence and Information Systems / v.20 no.4 / pp.155-169 / 2014
  • With rapid advances in technology, organizations increasingly depend on information systems in their decision-making processes. Business Intelligence (BI) systems in particular have become a mainstay for dealing with complex organizational problems, partly because a variety of advanced computational methods from statistics, machine learning, and artificial intelligence can be applied to business problems such as demand forecasting. In addition to the ability to analyze past and present trends, these predictive analytics capabilities add great value to an organization's ability to respond to changes in markets, business risks, and customer trends. While the performance effects of BI system use in organizational settings have been studied, the use of predictive analytics technologies embedded in BI systems for forecasting tasks has received little attention. This study therefore aims to identify important factors that help organizations take advantage of the advanced technologies of a BI system. More generally, a BI system can be viewed as an advisor, defined as one that formulates judgments or recommends alternatives and communicates them to the person in the role of judge, and the information generated by the BI system can be viewed as advice that a decision maker (judge) can follow. We therefore draw on findings from the advice-giving and advice-taking literature, focusing on the role of system explanations in users' advice taking. It has been shown that advice discounting can occur when the reasoning or evidence justifying an advisor's decision is not available. However, the majority of current BI systems merely provide a number, which may affect whether decision makers accept the advice and how they infer its quality. In this study we explore the following key factors that can influence users' advice taking in a BI system setting: explanations of how box-office grosses are predicted; the type of advisor, i.e., system (data mining technique) or human-based business advice mechanisms such as prediction markets (aggregated human advice) and human advisors (individual human expert advice); users' evaluations of the provided advice; and individual differences among decision makers. Each subject performs four tasks by going through a series of display screens on a computer. First, given information about a movie such as its director and genre, subjects are asked to predict the movie's opening weekend box office. Second, in light of the information generated by an advisor, subjects are asked to adjust their original predictions if they wish. Third, they are asked to evaluate the value of the given information (e.g., perceived usefulness, trust, satisfaction). Lastly, a short survey is conducted to identify individual differences that may affect advice taking. The experimental results show that subjects are more likely to follow system-generated advice than human advice when the advice is provided with an explanation. Subjects who consider the information provided by the system useful are also more likely to take the advice. In addition, individual differences affect advice taking. Subjects with more expertise on advisors, or who tend to agree with others, adjust their predictions to follow the advice. In contrast, subjects with more knowledge of movies are less affected by the advice, and their final decisions remain close to their original predictions. Advances in the predictive analytics of BI systems show great potential to support increasingly complex business decisions. This study shows how the design of a BI system can influence users' acceptance of system-generated advice, and the findings provide valuable insights into how to leverage the advanced predictive analytics of a BI system in an organization's forecasting practices.

Health Care Utilization Pattern and Its Related Factors of Low-income Population with Abnormal Results through Health Examination (저소득층 건강검진 유소견자의 의료이용 양상 및 관련요인)

  • Kwon, Bog-Soon;Kam, Sin;Han, Chang-Hyun
    • Journal of agricultural medicine and community health / v.28 no.2 / pp.87-105 / 2003
  • Objectives: The purpose of this study was to examine the health care utilization pattern, and its related factors, of a low-income population with abnormal health examination results. Methods: Data were collected through a questionnaire survey of 263 persons aged 30 years or over who had abnormal results in a health examination at a Health Center. The survey was conducted in March 2003. The study employed Andersen's prediction model, the best-known medical demand model, and the data were analysed using the $\chi^2$-test and multiple logistic regression analysis. Results: The proportion of study subjects who utilized medical services for a thorough examination or treatment was 51.0%. In the multiple logistic regression analysis with medical utilization as the dependent variable, the variables affecting medical utilization were 'feeling about abnormal result (anxiety versus no anxiety: odds ratio 2.25, 95% confidence interval 1.07-4.75)', 'type of health security (medicaid type I versus health insurance: odds ratio 2.82, 95% confidence interval 1.04-7.66; medicaid type II versus health insurance: odds ratio 3.22, 95% confidence interval 1.37-7.53)', 'experience of health examination during the past 2 years (odds ratio 2.39, 95% confidence interval 1.09-5.21)', and 'family member's response to abnormal result (recommendation for medical utilization versus no response: odds ratio 4.90, 95% confidence interval 1.75-13.75; family member recommended utilizing medical facilities together with him/her versus no response: odds ratio 19.47, 95% confidence interval 5.01-75.73)'. The time of medical utilization was, in descending order of frequency, 8-15 days after receiving the result (29.9%), 16-30 days after receiving the result (27.6%), and 2-7 days after receiving the result (20.9%). The most common reason for not utilizing medical services was that the result seemed insignificant to them (32.4%). Conclusions: To promote medical utilization among the low-income population, health education about abnormal results and their management is needed for family members as well as for the persons with abnormal results. A follow-up management program, such as home-visit health care, is also needed for persons with abnormal health examination results.
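The odds ratios and confidence intervals reported above can be related to logistic regression coefficients with a short Python sketch. It assumes the intervals are Wald intervals, symmetric on the log-odds scale, which the abstract does not state. The function names are hypothetical.

```python
import math

Z_95 = 1.959963984540054  # standard normal quantile for a two-sided 95% interval


def coef_from_odds_ratio(odds_ratio, ci_low, ci_high):
    """Recover the logistic coefficient and its standard error from a reported
    odds ratio and 95% CI, assuming a Wald interval on the log-odds scale."""
    beta = math.log(odds_ratio)
    se = (math.log(ci_high) - math.log(ci_low)) / (2 * Z_95)
    return beta, se


def odds_ratio_with_ci(beta, se):
    """Convert a coefficient and standard error back to an odds ratio and 95% CI."""
    return (math.exp(beta),
            math.exp(beta - Z_95 * se),
            math.exp(beta + Z_95 * se))


# 'Anxiety versus no anxiety' from the abstract: OR 2.25, 95% CI 1.07-4.75.
beta, se = coef_from_odds_ratio(2.25, 1.07, 4.75)
print(odds_ratio_with_ci(beta, se))
```

The round trip reproduces the reported interval to within rounding, so the reported figures are at least consistent with a Wald interval.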

Summer Environmental Evaluation of Water and Sediment Quality in the South Sea and East China Sea (남해 및 동중국해의 하계 수질 및 저질 환경평가)

  • Lee, Dae-In;Cho, Hyeon-Seo;Yoon, Yang-Ho;Choi, Young-Chan;Lee, Jeong-Hoon
    • Journal of the Korean Society for Marine Environment & Energy / v.8 no.2 / pp.83-99 / 2005
  • To evaluate the environmental characteristics of the South Sea and East China Sea in summer, water and sediment quality were measured in June 2001-2003. The surface layer was affected by warm water originating from the high-temperature, high-salinity Tsushima Warm Current, whereas Yellow Sea Cold Water spread into the bottom layer southwest of Jeju Island, and salinity at stations near the Yangtze River fell below 29 psu because of enormous freshwater discharge. The thermocline formed at a depth of about 10 m, and the chlorophyll maximum layer occurred in and below the thermocline. COD (Chemical Oxygen Demand), TN (Total Nitrogen), and TP (Total Phosphorus) concentrations corresponded to seawater quality grade II in the surface layer over most of the area, but concentrations of COD, Chl. a, TSS (Total Suspended Solids), and nutrients increased greatly in the area affected by Yangtze River discharge. Dissolved inorganic nitrogen and Chl. a were strongly negatively correlated with salinity, whereas inorganic phosphorus, COD, and Chl. a were positively correlated with one another, which indicates that phytoplankton biomass and phosphorus are important factors in organic matter distribution and algal growth, respectively, in the study area. The ignition loss, COD, and $H_2S$ of the surface sediment were in the ranges of 2.61-8.81%, 0.64-11.86 mg$O_2$/g-dry, and ND-0.25 mgS/g-dry, respectively, with relatively high concentrations in the eastern part of the study area. Therefore, for the effective and sustainable use and management of this area, continuous monitoring, countermeasures for the major input sources to the water and sediment, and prediction of responses to environmental variation are necessary.

Macroeconomic Consequences of Pay-as-you-go Public Pension System (부과방식 공적연금의 거시경제적 영향)

  • Park, Chang-Gyun;Hur, Seok-Kyun
    • KDI Journal of Economic Policy / v.30 no.2 / pp.225-270 / 2008
  • We analyze the macroeconomic consequences of a pay-as-you-go (PAYGO) public pension system with a simple overlapping generations model. In contrast to the large body of existing literature offering quantitative results based on simulation studies, we adopt a highly simplified framework in search of qualitatively tractable analytical results. The main contribution of our results lies in providing a sound theoretical foundation for interpreting the various quantitative results offered by simulation studies of large-scale general equilibrium models. We present a simple overlapping generations model with a defined benefit (DB) PAYGO public pension system as a benchmark case and derive an analytical equilibrium solution with the aid of graphical illustration. We also discuss the modifications of the benchmark model required to incorporate a defined contribution (DC) public pension system into the basic framework. Comparative statics analysis yields three important implications. First, the introduction and expansion of a PAYGO public pension, whether DB or DC, results in a lower level of capital accumulation and a higher expected rate of return on the risky asset. Second, population aging is accompanied by a lower capital stock due to a decrease in both the demand for and the supply of the risky asset. Moreover, the risk premium on the risky asset increases (decreases) as population aging accelerates (decelerates), so the possibility of a so-called "great meltdown" of the asset market cannot be excluded, although the odds are not high. Third, a switch from DB PAYGO to DC PAYGO would most likely result in a lower capital stock and a higher expected return on the risky asset, mainly because the young generation regards the DC PAYGO pension as another risky asset competing against the risky asset traded in the market. This theoretical prediction coincides with one of the firmly established propositions in the empirical literature: that the currently dominant form of public pension system tends to crowd out private capital accumulation.

Analysis of the Elderly Travel Characteristics and Travel Behavior with Daily Activity Schedules (the Case of Seoul, Korea) (활동 스케줄 분석을 통한 고령자의 통행특성과 통행행태에 관한 연구)

  • Seo, Sang-Eon;Jeong, Jin-Hyeok;Kim, Sun-Gwan
    • Journal of Korean Society of Transportation / v.24 no.5 s.91 / pp.89-108 / 2006
  • Korea has been entering an ageing society since 2000, with people aged over 65 accounting for more than 7% of the population. An ageing society needs transportation facilities that take elderly people's travel behavior into account. This study aims to understand elderly people's travel behavior using recent data from Korea. The activity schedule approach starts from the premise that travel outcomes are part of an activity scheduling decision. Under this approach, we used discrete choice models (in particular, the nested logit model) to address the basic modeling problem of capturing decision interactions among the many choice dimensions of the immense activity schedule choice set. The day activity schedule is viewed as a set of tours and at-home activity episodes tied together by an overarching day activity pattern, and is modeled using data from the Seoul Metropolitan Area Transportation Survey, which was conducted in June 2002. Decisions about a specific tour in the schedule are conditioned by the choice of day activity pattern. The day activity scheduling model estimated in this study consists of tours interrelated in a day activity pattern. The day activity pattern model represents the basic decision of activity participation and priorities, and places each activity in a configuration of tours and at-home episodes. Each pattern alternative is defined by the primary activity of the day, whether the primary activity occurs at home or away, and the type of tour for the primary activity. In the travel mode choice of the elderly and non-workers in particular, travel cost was found to be important in understanding interpersonal variations in mode choice behavior, whereas travel time was found to be a less important factor. In addition, although the elderly were generally likely to choose transit modes, private modes were preferred by those over 75 years old because weakened physical health makes activities such as going up and down stairs difficult. Therefore, as Korea enters the ageing society, transportation facility planning should invest heavily in transit modes to improve transportation services for the elderly. Although the model has not yet been validated in before-and-after prediction studies, this study gives strong evidence of its behavioral soundness, current practicality, and potential to improve the reliability of transportation projects beyond that of the best existing systems in Korea.

Recent Changes in Bloom Dates of Robinia pseudoacacia and Bloom Date Predictions Using a Process-Based Model in South Korea (최근 12년간 아까시나무 만개일의 변화와 과정기반모형을 활용한 지역별 만개일 예측)

  • Kim, Sukyung;Kim, Tae Kyung;Yoon, Sukhee;Jang, Keunchang;Lim, Hyemin;Lee, Wi Young;Won, Myoungsoo;Lim, Jong-Hwan;Kim, Hyun Seok
    • Journal of Korean Society of Forest Science / v.110 no.3 / pp.322-340 / 2021
  • Due to climate change and the consequent rise in spring temperatures, the flowering time of Robinia pseudoacacia has advanced, and simultaneous blooming has occurred across different regions of South Korea. These changes in flowering time have become a major crisis for the domestic beekeeping industry, and the demand for accurate prediction of the flowering time of R. pseudoacacia is increasing. In this study, we developed four models for predicting the flowering time of R. pseudoacacia across the entire country and compared their performance: a Single Model for the country (SM), a Modified Single Model (MSM) using correction factors derived from SM, a Group Model (GM) estimating parameters for each region, and a Local Model (LM) estimating parameters for each site. For this purpose, we used bloom date data observed at 26 sites across the country over 12 years (2006-2017), together with daily temperature data. Bloom dates in the north central region, where the spring temperature increase was more than twice that of the southern regions, have advanced, and the difference from the southwest region decreased by 0.7098 days per year (p-value=0.0417). Model comparisons showed that MSM and LM performed better than the other models, with RMSE 24% and 15% lower than that of SM, respectively. Furthermore, validation with 16 additional sites over 4 years revealed that co-kriging of LM performed better than expansion of MSM to the entire nation (RMSE: p-value=0.0118, Bias: p-value=0.0471). This study improved predictions of bloom dates for R. pseudoacacia and proposed methods for reliable expansion to the entire nation.
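The RMSE comparison reported in this abstract ("24% and 15% lower RMSE than SM") can be illustrated with a minimal Python sketch. The bloom dates below are hypothetical day-of-year values, not data from the study, and the function names are assumptions made for illustration.

```python
import math


def rmse(observed, predicted):
    """Root mean squared error between observed and predicted bloom dates."""
    squared = [(o - p) ** 2 for o, p in zip(observed, predicted)]
    return math.sqrt(sum(squared) / len(squared))


def relative_reduction(baseline_rmse, candidate_rmse):
    """Fraction by which a candidate model lowers RMSE relative to a baseline."""
    return (baseline_rmse - candidate_rmse) / baseline_rmse


# Hypothetical bloom dates (day of year) at four sites.
observed = [130, 135, 140, 145]
baseline_pred = [132, 133, 142, 143]   # stand-in for a country-wide single model
candidate_pred = [131, 134, 141, 144]  # stand-in for a locally calibrated model

base = rmse(observed, baseline_pred)
cand = rmse(observed, candidate_pred)
print(base, cand, relative_reduction(base, cand))
```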

Prediction of Energy Requirements for Maintenance and Growth of Female Korean Black Goats (번식용 교잡 흑염소의 유지와 성장을 위한 대사에너지 요구량 추정)

  • Lee, Jinwook;Kim, Kwan Woo;Lee, Sung Soo;Ko, Yeoung Gyu;Lee, Yong Jae;Kim, Sung Woo;Jeon, Da Yeon;Roh, Hee Jong;Yun, Yeong Sik;Kim, Do Hyung
    • Journal of The Korean Society of Grassland and Forage Science / v.39 no.1 / pp.1-8 / 2019
  • This study was conducted to predict the energy requirements for maintenance and growth of female Korean black goats during their growth and pregnancy phases. Fifty female goats ($18.7 \pm 0.27$ kg) in their growth phase, with an average age of 5 months, were stratified by weight and randomly assigned to 5 groups. They were fed 5 diets varying in metabolizable energy (ME) [2.32 (G1), 2.49 (G2), 2.74 (G3), 2.99 (G4), and 3.24 (G5) Mcal/kg] until they were 9 months old. After natural breeding, 50 female goats ($30.7 \pm 0.59$ kg) were stratified by weight and randomly assigned to 5 groups. They were fed 5 diets varying in ME [2.32 (P1), 2.43 (P2), 2.55 (P3), 2.66 (P4), and 2.78 (P5) Mcal/kg]. The average feed intake ranged between 1.5 and 2.0% of body weight (BW), and there was no significant difference between the treatment groups in either the growth or the pregnancy phase. Average daily gain (ADG) during the growth phase increased with increasing dietary ME density and ranged from 46 to 69 g/d (p<0.01). Feed conversion ratio (FCR) improved with ME density during the growth phase (p<0.01). The intercept of the regression equation between ME intake and ADG indicated that the energy requirement for maintenance of goats during the growth and pregnancy phases was 103.53 kcal/$BW^{0.75}$ and 102.7 kcal/$BW^{0.75}$, respectively. These results may serve as a basis for establishing goat feeding standards in Korea. Further studies using various methods are required to assess the nutrient requirements of goats more accurately.
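The intercept-based estimate described in this abstract can be sketched in Python as an ordinary least-squares fit of ME intake on ADG, where the intercept is the ME intake at zero gain, i.e. the maintenance requirement. The data points and the function name are hypothetical, chosen only so the arithmetic is easy to follow. They are not the study's measurements, and the authors' exact regression specification is not given in the abstract.

```python
def fit_line(x, y):
    """Ordinary least-squares fit of y = intercept + slope * x."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


# Hypothetical group means: ADG in g/d and ME intake in kcal per kg BW^0.75.
adg = [40, 50, 60, 70]
me_intake = [180, 200, 220, 240]

slope, intercept = fit_line(adg, me_intake)
# The intercept is the estimated ME intake at ADG = 0 (maintenance requirement);
# the slope is the additional ME needed per unit of daily gain.
print(slope, intercept)
```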

Study on Standardization of the Environmental Impact Evaluation Method of Extremely Low Frequency Magnetic Fields near High Voltage Overhead Transmission Lines (고압 가공송전선로의 극저주파자기장 환경영향평가 방법 표준화에 관한 연구)

  • Park, Sung-Ae;Jung, Joonsig;Choi, Taebong;Jeong, Minjoo;Kim, Bu-Kyung;Lee, Jongchun
    • Journal of Environmental Impact Assessment / v.27 no.6 / pp.658-673 / 2018
  • Social conflicts over exposure to extremely low frequency magnetic fields (ELF-MF) are expected to intensify due to the continued increase in electric power demand and the construction of high voltage transmission lines (HVTL). However, the current environmental impact assessment (EIA) act does not include specific guidelines for the EIA of ELF-MF. This study therefore carried out a standardization study of the EIA method through case analysis, field measurement, and expert consultation on the EIA of ELF-MF near HVTL, the main source of exposure. The current status of the EIA of ELF-MF and the problems to be improved are identified, and an EIA method that can resolve them is suggested. The main finding of the study is that the physical characteristics of ELF-MF, which are affected by distance and power load, should be considered at all stages of the EIA (survey of the current situation - prediction of the impacts - preparation of a mitigation plan - post-EIA planning). Based on this study, we also propose the 'Measurement method for extremely low frequency magnetic field on transmission line' and the 'Table for extremely low frequency magnetic field measurement record on transmission line'. At a time when the hazard to humans of long-term exposure to ELF-MF remains unclear, the results of this study can be applied to EIAs that minimize the damage and conflict caused by transmission line construction and that derive rational measures.

A Machine Learning-based Total Production Time Prediction Method for Customized-Manufacturing Companies (주문생산 기업을 위한 기계학습 기반 총생산시간 예측 기법)

  • Park, Do-Myung;Choi, HyungRim;Park, Byung-Kwon
    • Journal of Intelligence and Information Systems / v.27 no.1 / pp.177-190 / 2021
  • With the development of fourth industrial revolution technologies, efforts are being made to improve areas that humans cannot handle by using artificial intelligence techniques such as machine learning. Make-to-order companies also want to reduce corporate risks such as delivery delays by predicting the total production time of orders, but they have difficulty doing so because the total production time differs for every order. The Theory of Constraints (TOC) was developed to find the least efficient areas in order to increase order throughput and reduce total order cost, but it does not provide a forecast of total production time. Because order production varies from order to order according to customer needs, the total production time of an individual order can be measured after the fact but is difficult to predict in advance. The measured total production times of existing orders also differ from one another, so they cannot be used as standard times. As a result, experienced managers rely on intuition rather than on the system, while inexperienced managers use simple management indicators (e.g., 60 days total production time for raw materials, 90 days total production time for steel plates, etc.). Work instructions issued too early on the basis of such imperfect judgments or indicators cause congestion, which degrades productivity, and instructions issued too late lead to increased production costs from emergency processing or to missed delivery dates. Failure to meet the deadline results in compensation for the delay or adversely affects business and collections. To address these problems, this study seeks a machine learning model that estimates the total production time of new orders for an entity that operates an order production system. Order, production, and process performance data are used as the material for machine learning. We compared and analyzed the OLS, GLM Gamma, Extra Trees, and Random Forest algorithms to identify the best algorithm for estimating total production time, and we present the results.