• 제목/요약/키워드: time series prediction

검색결과 889건 처리시간 0.028초

다분류 SVM을 이용한 DEA기반 벤처기업 효율성등급 예측모형 (The Prediction of DEA based Efficiency Rating for Venture Business Using Multi-class SVM)

  • 박지영;홍태호
    • Asia pacific journal of information systems
    • /
    • 제19권2호
    • /
    • pp.139-155
    • /
    • 2009
  • For the last few decades, many studies have tried to explore and unveil venture companies' success factors and unique features in order to identify the sources of such companies' competitive advantages over their rivals. Such venture companies have shown tendency to give high returns for investors generally making the best use of information technology. For this reason, many venture companies are keen on attracting avid investors' attention. Investors generally make their investment decisions by carefully examining the evaluation criteria of the alternatives. To them, credit rating information provided by international rating agencies, such as Standard and Poor's, Moody's and Fitch is crucial source as to such pivotal concerns as companies stability, growth, and risk status. But these types of information are generated only for the companies issuing corporate bonds, not venture companies. Therefore, this study proposes a method for evaluating venture businesses by presenting our recent empirical results using financial data of Korean venture companies listed on KOSDAQ in Korea exchange. In addition, this paper used multi-class SVM for the prediction of DEA-based efficiency rating for venture businesses, which was derived from our proposed method. Our approach sheds light on ways to locate efficient companies generating high level of profits. Above all, in determining effective ways to evaluate a venture firm's efficiency, it is important to understand the major contributing factors of such efficiency. Therefore, this paper is constructed on the basis of following two ideas to classify which companies are more efficient venture companies: i) making DEA based multi-class rating for sample companies and ii) developing multi-class SVM-based efficiency prediction model for classifying all companies. First, the Data Envelopment Analysis(DEA) is a non-parametric multiple input-output efficiency technique that measures the relative efficiency of decision making units(DMUs) using a linear programming based model. It is non-parametric because it requires no assumption on the shape or parameters of the underlying production function. DEA has been already widely applied for evaluating the relative efficiency of DMUs. Recently, a number of DEA based studies have evaluated the efficiency of various types of companies, such as internet companies and venture companies. It has been also applied to corporate credit ratings. In this study we utilized DEA for sorting venture companies by efficiency based ratings. The Support Vector Machine(SVM), on the other hand, is a popular technique for solving data classification problems. In this paper, we employed SVM to classify the efficiency ratings in IT venture companies according to the results of DEA. The SVM method was first developed by Vapnik (1995). As one of many machine learning techniques, SVM is based on a statistical theory. Thus far, the method has shown good performances especially in generalizing capacity in classification tasks, resulting in numerous applications in many areas of business, SVM is basically the algorithm that finds the maximum margin hyperplane, which is the maximum separation between classes. According to this method, support vectors are the closest to the maximum margin hyperplane. If it is impossible to classify, we can use the kernel function. In the case of nonlinear class boundaries, we can transform the inputs into a high-dimensional feature space, This is the original input space and is mapped into a high-dimensional dot-product space. Many studies applied SVM to the prediction of bankruptcy, the forecast a financial time series, and the problem of estimating credit rating, In this study we employed SVM for developing data mining-based efficiency prediction model. We used the Gaussian radial function as a kernel function of SVM. In multi-class SVM, we adopted one-against-one approach between binary classification method and two all-together methods, proposed by Weston and Watkins(1999) and Crammer and Singer(2000), respectively. In this research, we used corporate information of 154 companies listed on KOSDAQ market in Korea exchange. We obtained companies' financial information of 2005 from the KIS(Korea Information Service, Inc.). Using this data, we made multi-class rating with DEA efficiency and built multi-class prediction model based data mining. Among three manners of multi-classification, the hit ratio of the Weston and Watkins method is the best in the test data set. In multi classification problems as efficiency ratings of venture business, it is very useful for investors to know the class with errors, one class difference, when it is difficult to find out the accurate class in the actual market. So we presented accuracy results within 1-class errors, and the Weston and Watkins method showed 85.7% accuracy in our test samples. We conclude that the DEA based multi-class approach in venture business generates more information than the binary classification problem, notwithstanding its efficiency level. We believe this model can help investors in decision making as it provides a reliably tool to evaluate venture companies in the financial domain. For the future research, we perceive the need to enhance such areas as the variable selection process, the parameter selection of kernel function, the generalization, and the sample size of multi-class.

높은 체감온도가 서울의 여름철 질병 사망자 증가에 미치는 영향, 1991-2000 (The Impact of High Apparent Temperature on the Increase of Summertime Disease-related Mortality in Seoul: 1991-2000)

  • 최광용;최종남;권호장
    • Journal of Preventive Medicine and Public Health
    • /
    • 제38권3호
    • /
    • pp.283-290
    • /
    • 2005
  • Objectives : The aim of this paper was to examine the relationship between the summertime (June to August) heat index, which quantifies the bioclimatic apparent temperature in sultry weather, and the daily disease-related mortality in Seoul for the period from 1991 to 2000. Methods : The daily maximum (or minimum) summertime heat indices, which show synergetic apparent temperatures, were calculated from the six hourly temperatures and real time humidity data for Seoul from 1991 to 2000. The disease-related daily mortality was extracted with respect to types of disease, age and sex, etc. and compared with the time series of the daily heat indices. Results : The summertime mortality in 1994 exceeded the normal by 626 persons. Specifically, blood circulation-related and cancer-related mortalities increased in 1994 by 29.7% (224 persons) and 15.4% (107 persons), respectively, compared with those in 1993. Elderly persons, those above 65 years, were shown to be highly susceptible to strong heat waves, whereas the other age and sex-based groups showed no significant difference in mortality. In particular, a heat wave episode on the 22nd of July 2004 ($>45^{\circ}C$ daily heat index) resulted in double the normal number of mortalities after a lag time of 3 days. Specifically, blood circulation-related mortalities, such as cerebral infraction, were predominant causes. Overall, a critical mortality threshold was reached when the heat index exceeded approximately $37^{\circ}C$, which corresponds to human body temperature. A linear regression model based on the heat indices above $37^{\circ}C$, with a 3 day lag time, accounted for 63% of the abnormally increased mortality (${\geq}+2$ standard deviations). Conclusions : This study revealed that elderly persons, those over 65 years old, are more vulnerable to mortality due to abnormal heat waves in Seoul, Korea. When the daily maximum heat index exceeds approximately $37^{\circ}C$, blood circulation-related mortality significantly increases. A linear regression model, with respect to lag-time, showed that the heat index based on a human model is a more dependable indicator for the prediction of hot weather-related mortality than the ambient air temperature.

도심지 토사재해 고위험지역 극치강우 시간분포 시나리오 분석 (Analysis of Extreme Rainfall Distribution Scenarios over the Landslide High Risk Zones in Urban Areas)

  • 윤선권;장상민;이진영
    • 한국농공학회논문집
    • /
    • 제58권3호
    • /
    • pp.57-69
    • /
    • 2016
  • In this study, we analyzed the extreme rainfall distribution scenarios based on probable rainfall calculation and applying various time distribution models over the landslide high risk zones in urban areas. We used observed rainfall data form total 71 ASOS (Automated Synoptic Observing System) station and AWS (Automatic Weather Station) in KMA (Korea Meteorological Administration), and we analyzed the linear trends for 1-hr and 24-hr annual maximum rainfall series using simple linear regression method, which are identified their increasing trends with slopes of 0.035 and 0.660 during 1961-2014, respectively. The Gumbel distribution was applied to obtain the return period and probability precipitation for each duration. The IDF (Intensity-Duration-Frequency) curves for landslide high risk zones were derived by applying integrated probability precipitation intensity equation. Results from IDF analysis indicate that the probability precipitation varies from 31.4~38.3 % for 1 hr duration, and 33.0~47.9 % for 24 hr duration. It also showed different results for each area. The $Huff-4^{th}$ Quartile method as well as Mononobe distribution were selected as the rainfall distribution scenarios of landslide high risk zones. The results of this study can be used to provide boundary conditions for slope collapse analysis, to analyze sediment disaster risk, and to use as input data for risk prediction of debris flow.

Liver Cancer Mortality Characteristics and Trends in China from 1991 to 2012

  • Fang, Jia-Ying;Wu, Ku-Sheng;Zeng, Yang;Tang, Wen-Rui;Du, Pei-Ling;Xu, Zhen-Xi;Xu, Xiao-Ling;Luo, Jia-Yi;Lin, Kun
    • Asian Pacific Journal of Cancer Prevention
    • /
    • 제16권5호
    • /
    • pp.1959-1964
    • /
    • 2015
  • Purpose: To investigate the distribution of liver cancer mortality as well as its developing trend from 1991 to 2012, forecast the future five-year trend, and provide a basis for the comprehensive prevention and management. Materials and Methods: Mortality data for liver cancer in China from 1991 to 2012 were used to describe characteristics and distribution of liver cancer mortality. Trend surface analysis was used to study the geographical distribution of liver cancer mortality. Curve estimation, time series modeling, gray modeling (GM) and joinpoint regression were used to predict and forecast future trends. Results: The mortality rate of liver cancer has constantly increased in China since 1991. Rates in rural areas are higher than in urban areas, and in males are higher than in females. In addition, our data predicted that the trend will continue to increase in the next 5 years. The age-specific mortality of liver cancer increases with age and peaks in the group of 80-84 years old. Geographical analysis showed the liver mortality rate was higher in the southeast provinces, such as Jiangsu, Zhejiang and Guangdong, and southwest regions like Guangxi Province. Conclusions: The standardized mortality rate of liver cancer in China has consistently increased from 1991 to 2012, and the upward trend is predicted to continue in the future. Much better prevention and management of liver cancer is needed in high mortality areas (the southwestern and southeastern parts of China) and high mortality age groups (80- to 84-year-olds), especially in rural areas.

약액주입 사질고결토의 크리프 예측 (Creep Prediction of Chemical Grouted Sands)

  • 강희복;김종렬;강권수;김태훈;황성원
    • 한국구조물진단유지관리공학회 논문집
    • /
    • 제8권2호
    • /
    • pp.195-204
    • /
    • 2004
  • 본 연구에서는 약액주입 사질고결토에 대해 일정재하크리프시험과 반복재하크리프시험을 실시하여 점 탄소성 거동 규명과 크리프예측을 수행하였다. 일정재하크리프 시험결과 총 변형률은 탄성, 소성 그리고 점탄성변형률로 구분되었으며 이러한 변형률은 응력의 증가에 비례하여 증가하였고 회복된 변형률은 제하시간에 무관함을 알았다. 일정재하크리프시험 예측결과 일반화된 모델과 지수함수모델은 시험결과와 잘 일치하였다. 반복재하크리프시험에서 순간회복변형률은 반복횟수에 무관하였고 누적소성 변형률은 반복횟수에 따라 증가하였으며 응력레벨에 비례함을 알 수 있었다. 반복재하크리프시험의 예측결과 첫 사이클에서는 잘 일치하였으나 반복횟수가 증가함에 따라 약간의 오차가 발생되었다.

하이브리드 VLSI 신경망 프로세서에서의 양자화에 따른 영향 분석 (Analysis of the Effect on the Quantization of the Network's Outputs in the Neural Processor by the Implementation of Hybrid VLSI)

  • 권오준;김성우;이종민
    • 정보처리학회논문지B
    • /
    • 제9B권4호
    • /
    • pp.429-436
    • /
    • 2002
  • 인공 신경망을 실제적인 응용 분야에 적용하기 위하여 하드웨어 시스템으로 구현하는 것이 필요하다. 하드웨어로 구현하는 방법에는 현재 하이브리드 VLSI 신경망 칩으로 구현하는 것이 가장 유망하다. 이미 학습된 신경망을 하이브리드 신경망 칩을 사용하여 구현하는 경우 뉴런 출력과 가중치 값의 양자화 과정이 필수적이다. 이러한 과정은 신경망의 출력층 뉴런의 이미 학습된 출력에 비해 왜곡을 야기한다. 본 논문에서는 이러한 신경망의 출력 왜곡에 대한 통계적 특성을 자세하게 분석하였다. 분석 결과는 신경망의 출력 왜곡을 줄이기 위해서는 입력 벡터의 정규화와 가중치 값들이 작아야 한다는 사실을 보여 주었다. 시계열 데이터에 대한 실험 결과는 분석 결과를 고려하여 학습된 신경망들의 경우 실제로 뉴런 출력 및 가중치 값의 양자화로 인한 출력층 뉴런의 출력 왜곡이 상당히 줄어들 수 있음을 명확히 보여 주었다.

시계열 자료의 분할에 관한 사례 연구 (A Study on the Disaggregation Method of Time Series Data)

  • 문승호;이정형
    • 디지털융복합연구
    • /
    • 제12권6호
    • /
    • pp.155-160
    • /
    • 2014
  • 마케팅 자료를 입수하는 경우, 시장조사에 시간이 많이 소요되는 등의 이유로, 월간으로 입수하지 못하고, 2개월 간격으로 합산되거나 분기별로 합산된 자료만 입수할 수 있는 경우가 있다. 이러한 자료를 활용하여 월간으로 시장을 평가 혹은 예측하거나 마케팅 전략을 수립하여야 하는 경우, 격월 혹은 분기별로 합산된 자료를 월간 자료로 변환하여야 한다. 본 논문에서는 두 달 간격으로 합산되어 집계되는 자료를 월별 자료로 변환하는 여러 가지 방법을 소개한다. 이런 변환 방법에는 2개월간의 자료를 단순히 2로 나누는 단순평균법, 2개월간의 자료의 증감률을 월별 자료의 증감률에 그대로 적용하여 월별 자료로 변환하는 방법, 전문가의 판단에 따른 가중치를 적용하는 방법, 단순회귀모형 등의 모형을 정의하고 그 모형에 의해 월별 자료로 분해하는 방법 등이 있다. 본 논문에서는 유럽의 특정 국가의 가전제품 판매 사례를 활용하여, 두 달 간격으로 합산된 시장 자료를 월별 자료로 변환하는 모형을 활용한 방법을 소개하고자 한다. 나아가 이 모형을 활용하여 향후의 자료를 예측하는 방법도 소개한다.

VAR모형을 이용한 수출상품 수요예측에 관한 연구: 소형 승용차 모델별 분기별 대미수출을 중심으로 (A Study on Demand Forecasting of Export Goods Based on Vector Autoregressive Model : Subject to Each Small Passenger Vehicles Quarterly Exported to USA)

  • 조중형
    • 통상정보연구
    • /
    • 제16권3호
    • /
    • pp.73-96
    • /
    • 2014
  • 본 연구는 우리나라 수출 상위 5개 품목 중 하나인 자동차 수출을 대상으로, 승용차 브랜드별 단기 수출수요에 영향을 미치는 이론적 잠재요인을 발굴 및 설계하여 이론적 수출수요예측모델을 개발하고, 다변량시계열분석 기반의 VAR(Vector Auto Regressive)모형을 이용한 실증분석을 통해 개별상품과 시장특성이 반영된 단기수출수요예측모델을 검정하고자 하였다. 따라서 미국에 수출되고 있는 우리나라 소형 승용차 2개 브랜드(엑센트, 아반떼)에 대해 VAR모형을 이용한 분기단위 단기수요예측모델을 개발하고, 브랜드별 예측모델을 통해 산출된 t+1분기 시점의 예측값과 실제 판매된 판매대수를 대상기간을 1분기씩 달리하여 비교평가 하였다. 그 결과 엑센트와 아반떼의 RMSE %는 각각 4.3%와 20.0%로 났으며, 일평균 판매량을 기준으로 보았을 때 엑센트는 3.9일에 해당하고 아반떼는 18.4일에 해당하는 물량임을 알 수 있었다. 따라서 본 연구의 단기수출수요예측모델은 예측력과 검정시점별 일관성 측면에서 활용성이 높은 것으로 평가할 수 있었다.

  • PDF

태풍 내습 시 지상 최대풍 추정을 위한 WRF 수치모의 사례 연구 : 태풍 RUSA와 MAEMI를 대상으로 (A Case Study of WRF Simulation for Surface Maximum Wind Speed Estimation When the Typhoon Attack : Typhoons RUSA and MAEMI)

  • 정우식;박종길;김은별;이보람
    • 한국환경과학회지
    • /
    • 제21권4호
    • /
    • pp.517-533
    • /
    • 2012
  • This study calculated wind speed at the height of 10 m using a disaster prediction model(Florida Public Hurricane Loss Model, FPHLM) that was developed and used in the United States. Using its distributions, a usable information of surface wind was produced for the purpose of disaster prevention when the typhoon attack. The advanced research version of the WRF (Weather Research and Forecasting) was used in this study, and two domains focusing on South Korea were determined through two-way nesting. A horizontal time series and vertical profile analysis were carried out to examine whether the model provided a resonable simulation, and the meteorological factors, including potential temperature, generally showed the similar distribution with observational data. We determined through comparison of observations that data taken at 700 hPa and used as input data to calculate wind speed at the height of 10 m for the actual terrain was suitable for the simulation. Using these results, the wind speed at the height of 10 m for the actual terrain was calculated and its distributions were shown. Thus, a stronger wind occurred in coastal areas compared to inland areas showing that coastal areas are more vulnerable to strong winds.

추계학적 기법을 통한 공주지점 유출예측 연구 (Study of Stochastic Techniques for Runoff Forecasting Accuracy in Gongju basin)

  • 안정민;허영택;황만하;천근호
    • 대한토목학회논문집
    • /
    • 제31권1B호
    • /
    • pp.21-27
    • /
    • 2011
  • 유출예측량을 모의할 때 과거와 현재의 수문자료를 이용한다는 측면에서 미래 예측결과의 불확실성을 완전히 제거할 수는 없겠지만, 다양한 기법별 분석에 의하여 불확실성을 감소시킬 수 있다. 본 연구에서는 유출예측의 정확성 향상을 위해 다양한 유출예측 기법을 적용 및 평가하였으며 확률론적 예측을 가능하게 하는 예측기법인 ESP와 관측 시계열 자료를 이용한 통계기법으로 공주지점의 유출예측을 수행하였다. 각 기법에 따른 유출예측 결과의 신뢰성 평가는 MAE(Mean Absolute Error), RMSE(Root Mean Squared Error), RRMSE(Relative Root Mean Squared Error), Mean Absolute Percentage Error (MAPE), TIC(Theil Inequality Coefficient)를 이용하였다. ESP 확률을 이용하여 예측한 유출결과와 통계적 시계열 분석에 의해 예측된 유출결과를 MAE, RMSE, RRMSE, MAPE, TIC를 이용하여 비교 분석하였으며 유출예측의 개선효과를 확인해본 결과, ESP 확률을 이용한 예측이 MAE(10.6), RMSE(15.14), RRMSE(0.244), MAPE(22.74%), TIC(0.13)으로 평가되었으며 MAE(23.2), RMSE(37.13), RRMSE(0.596), MAPE(26.69%), TIC(0.30)으로 평가된 ARMA와 MAE(26.4), RMSE(34.44), RRMSE(0.563), MAPE(47.38%), TIC(0.25)으로 평가된 Winters 에 비해 신뢰성이 높게 나타났다.