• Title/Summary/Keyword: 통계학분류

Search Result 73, Processing Time 0.021 seconds

한국증권시장(韓國證券市場)에서 다변량검증(多變量檢證)에 근거한 CAPM과 APM의 실증적(實證的) 검증(檢證)

  • Gu, Bon-Yeol
    • The Korean Journal of Financial Studies
    • /
    • v.5 no.1
    • /
    • pp.135-164
    • /
    • 1999
  • 본(本) 연구(硏究)는 Jobson(1982)의 주식(株式)의 수익율(收益率)이 정규분포(正規分布)를 할 경우에 다변량(多變量)의 통계학(統計學)을 이용하여 CAPM과 APM을 검증(檢證)하는 방법(方法)을 유도하였다. 이에따라 회귀분석(回歸分析)에 의한 검증방법(檢證方法)과 다변량(多變量)의 검증방법(檢證方法)을 제시하고 현실적으로 CAPM과 APM이 한국증권시장(韓國證券市場)에서 적용가능(適用可能)한가에 대한 실증적(實證的) 검증(檢證)을 실시하였다. 실증적(實證的) 검증(檢證)을 위하여 먼저 우리나라의 주식수익율자료(株式收益率資料)를 1980년 1월부터 1997년 6월까지의 월별자료(月別資料)에 의하여 11개 산업별(産業別) 분류작업을 통하여 산업별(産業別)포트폴리오를 구성하였다. 특히 APM의 경우에는 요인(要因)의 증가에 따라 APM이 한국증권시장에서 적용가능한가를 검증(檢證)하기 위하여 요인(要因)을 2개, 6개 그리고 10개까지 증가시켜 모형(模型)의 적합성(適合性)을 검증(檢證)하였다. 검증결과(檢證結果), CAPM과 APM모두 한국증권시장(韓國證券市場)에서 적용가능(適用可能)한 것으로 나타났다. 특히 APM의 경우에는 요인(要因)이 2개, 6개와 10개로 증가시 어떤 경우에도 적용가능한 것으로 나타났다. 이는 기대수익율(期待收益率)의 설명력을 높이기 위하여 몇 개의 가격화(價格化) 요인(要因)이 APM에 영향을 미치는 가를 연구하는 전통적인 검증방법(檢證方法)은 큰 의미가 없는 것으로 나타났다.

  • PDF

An Explorator Spatial Analysis of Shigellosis (세균성 이질의 탐색적 공간분석)

  • 박기호
    • Journal of the Korean Geographical Society
    • /
    • v.34 no.5
    • /
    • pp.473-491
    • /
    • 1999
  • 세균성 이질은 국내 제1종 법정 전염병으로 분류되어 관리되고 있는 질환으로서 1998년 이후 그 발병 사례가 급속히 증가하고 있다. 본 연구는 1999년 3월 부산시 사상구에서 집단 발병한 세균성 이질을 대상으로 하여, 각 환자들의 발병 시점과 장소의 분포패턴에 대한 지리학적 고찰을 목적으로 한다. 환자분포의 특징적 공간패턴과 그들의 시계열적 확산 양상 등을 탐색하기 위한 방법론은 보건지리학과 지도학 및 공간통계학에 기반을 둔 공간분석기법을 중심으로 설정하였다. 분석자료는 해당 지역의 수치지형도, 지적도, 인구 센서스 자료를 포함한 GIS 데이터베이스로 구축되었다. 인구분포를 감안한 밀도구분도를 바탕으로 개별환자의 위치자료와 동 단위로 집계된 자료를 자료의 형태에 따라 분석기법을 달리하였으며, 환자 발생 밀도, 상대적 위험지수 등을 지도화하여 역학자료의 시각적 통계적 분석을 수행하였다. 환자분포의 공간적 중심위치와 분산의 변화 등 기술적 통계분석과 함께 제1차 공간속성을 커널추정법으로 찾아보았다. 이와 더불어 ‘공간적 의존성’과 관련된 제2차 공간속성은 K-함수와 시뮬레이션을 통해 분석하여 군집성 등이 통계적으로 확인되었다. 본 연구를 통해 역학조사시 GIS의 활용사례가 제시되었으며, 모집단 인구를 고려한 확률지도 작성 기법과 다양한 데이터 가시화 방법, 그리고 시계열별 발생 환자들의 지리적 변이를 분석 하는데 따르는 문제들이 논의되었다.

  • PDF

Value Weighted Regularized Logistic Regression Model (속성값 기반의 정규화된 로지스틱 회귀분석 모델)

  • Lee, Chang-Hwan;Jung, Mina
    • Journal of KIISE
    • /
    • v.43 no.11
    • /
    • pp.1270-1274
    • /
    • 2016
  • Logistic regression is widely used for predicting and estimating the relationship among variables. We propose a new logistic regression model, the value weighted logistic regression, which comprises of a fine-grained weighting method, and assigns adapted weights to each feature value. This gradient approach obtains the optimal weights of feature values. Experiments were conducted on several data sets from the UCI machine learning repository, and the results revealed that the proposed method achieves meaningful improvement in the prediction accuracy.

Forecasting Electric Power Demand Using Census Information and Electric Power Load (센서스 정보 및 전력 부하를 활용한 전력 수요 예측)

  • Lee, Heon Gyu;Shin, Yong Ho
    • Journal of Korea Society of Industrial Information Systems
    • /
    • v.18 no.3
    • /
    • pp.35-46
    • /
    • 2013
  • In order to develop an accurate analytical model for domestic electricity demand forecasting, we propose a prediction method of the electric power demand pattern by combining SMO classification techniques and a dimension reduction conceptualized subspace clustering techniques suitable for high-dimensional data cluster analysis. In terms of electricity demand pattern prediction, hourly electricity load patterns and the demographic and geographic characteristics can be analyzed by integrating the wireless load monitoring data as well as sub-regional unit of census information. There are composed of a total of 18 characteristics clusters in the prediction result for the sub-regional demand pattern by using census information and power load of Seoul metropolitan area. The power demand pattern prediction accuracy was approximately 85%.

Review for time-dependent ROC analysis under diverse survival models (생존 분석 자료에서 적용되는 시간 가변 ROC 분석에 대한 리뷰)

  • Kim, Yang-Jin
    • The Korean Journal of Applied Statistics
    • /
    • v.35 no.1
    • /
    • pp.35-47
    • /
    • 2022
  • The receiver operating characteristic (ROC) curve was developed to quantify the classification ability of marker values (covariates) on the response variable and has been extended to survival data with diverse missing data structure. When survival data is understood as binary data (status of being alive or dead) at each time point, the ROC curve expressed at every time point results in time-dependent ROC curve and time-dependent area under curve (AUC). In particular, a follow-up study brings the change of cohort and incomplete data structures such as censoring and competing risk. In this paper, we review time-dependent ROC estimators under several contexts and perform simulation to check the performance of each estimators. We analyzed a dementia dataset to compare the prognostic power of markers.

Entropy and its Relation with the Property of Molecule, Phase and Component (엔트로피와 분자 특성, 상 및 성분의 관계)

  • Jaeeon Chang
    • Korean Chemical Engineering Research
    • /
    • v.61 no.1
    • /
    • pp.116-122
    • /
    • 2023
  • We study the relationship of entropy with the properties of molecules and also with the macroscopic specifications of the system, i.e., component and phase. Understanding different viewpoints of classical mechanics and quantum mechanics for the indistinguishability of molecules belonging to the same component, we discuss a few thermodynamic systems in which the properties of molecules are to be consistent with the component as a macroscopic term of classifying the molecules. With a clear definition of thermodynamic microstate, the drawback of the Boltzmann statistics caused by the distinguishability of molecules is avoided, and the Gibbs paradox of entropy consequently disappears. Corresponding to the characteristics of fluid and solid phases, we investigated the effects of the indistinguishability and the symmetry number of molecules and the number of microstates realized in time on the partition function and the entropy. In particular, we show that crystalline solid can be regarded as a system which does not satisfy the ergodic hypothesis.

Research on Driving Pattern Analysis Techniques Using Contrastive Learning Methods (대조학습 방법을 이용한 주행패턴 분석 기법 연구)

  • Hoe Jun Jeong;Seung Ha Kim;Joon Hee Kim;Jang Woo Kwon
    • The Journal of The Korea Institute of Intelligent Transport Systems
    • /
    • v.23 no.1
    • /
    • pp.182-196
    • /
    • 2024
  • This study introduces driving pattern analysis and change detection methods using smartphone sensors, based on contrastive learning. These methods characterize driving patterns without labeled data, allowing accurate classification with minimal labeling. In addition, they are robust to domain changes, such as different vehicle types. The study also examined the applicability of these methods to smartphones by comparing them with six lightweight deep-learning models. This comparison supported the development of smartphone-based driving pattern analysis and assistance systems, utilizing smartphone sensors and contrastive learning to enhance driving safety and efficiency while reducing the need for extensive labeled data. This research offers a promising avenue for addressing contemporary transportation challenges and advancing intelligent transportation systems.

Prediction of the direction of stock prices by machine learning techniques (기계학습을 활용한 주식 가격의 이동 방향 예측)

  • Kim, Yonghwan;Song, Seongjoo
    • The Korean Journal of Applied Statistics
    • /
    • v.34 no.5
    • /
    • pp.745-760
    • /
    • 2021
  • Prediction of a stock price has been a subject of interest for a long time in financial markets, and thus, many studies have been conducted in various directions. As the efficient market hypothesis introduced in the 1970s acquired supports, it came to be the majority opinion that it was impossible to predict stock prices. However, recent advances in predictive models have led to new attempts to predict the future prices. Here, we summarize past studies on the price prediction by evaluation measures, and predict the direction of stock prices of Samsung Electronics, LG Chem, and NAVER by applying various machine learning models. In addition to widely used technical indicator variables, accounting indicators such as Price Earning Ratio and Price Book-value Ratio and outputs of the hidden Markov Model are used as predictors. From the results of our analysis, we conclude that no models show significantly better accuracy and it is not possible to predict the direction of stock prices with models used. Considering that the models with extra predictors show relatively high test accuracy, we may expect the possibility of a meaningful improvement in prediction accuracy if proper variables that reflect the opinions and sentiments of investors would be utilized.

Development of Subsurface Spatial Information Model System using Clustering and Geostatistics Approach (클러스터링과 지구통계학 기법을 이용한 지하공간정보 모델 생성시스템 개발)

  • Lee, Sang-Hoon
    • Journal of the Korean Association of Geographic Information Studies
    • /
    • v.11 no.4
    • /
    • pp.64-75
    • /
    • 2008
  • Since the current database systems for managing geotechnical investigation results were limited by being described boring test result in point feature, it has been trouble for using other GIS data. Although there are some studies for spatial characteristics of subsurface modeling, it is rather lack of being interoperable with GIS, considering geotechnical engineering facts. This is reason for difficulty of practical uses. In this study, we has developed subsurface spatial information model through extracting needed geotechnical engineering data from geotechnical information DB. The developed geotechnical information clustering program(GEOCL) has made a cluster of boring formation(and formation ratio), classification of layer, and strength characteristics of subsurface. The interpolation of boring data has been achieved through zonal kriging method in the consideration of spatial distribution of created cluster. Finally, we make a subsurface spatial information model to integrate with digital elevation model, and visualize 3-dimensional model by subsurface spatial information viewing program(SSIVIEW). We expect to strengthen application capacity of developed model in subsurface interpretation and foundation design of construction works.

  • PDF

Optimal number of dimensions in linear discriminant analysis for sparse data (희박한 데이터에 대한 선형판별분석에서 최적의 차원 수 결정)

  • Shin, Ga In;Kim, Jaejik
    • The Korean Journal of Applied Statistics
    • /
    • v.30 no.6
    • /
    • pp.867-876
    • /
    • 2017
  • Datasets with small n and large p are often found in various fields and the analysis of the datasets is still a challenge in statistics. Discriminant analysis models for such datasets were recently developed in classification problems. One approach of those models tries to detect dimensions that distinguish between groups well and the number of the detected dimensions is typically smaller than p. In such models, the number of dimensions is important because the prediction and visualization of data and can be usually determined by the K-fold cross-validation (CV). However, in sparse data scenarios, the CV is not reliable for determining the optimal number of dimensions since there can be only a few observations for each fold. Thus, we propose a method to determine the number of dimensions using a measure based on the standardized distance between the mean values of each group in the reduced dimensions. The proposed method is verified through simulations.