• Title/Summary/Keyword: 전통적인 통계

Search Result 553, Processing Time 0.023 seconds

Morpheme Segmentation and Part-Of-Speech Tagging Using Restricted Resources (제한된 자원을 사용한 한국어 형태소 분석)

  • Kang, Sangwoo;Yang, Jaechul;Kim, Harksoo;Seo, Jungyun
    • Annual Conference on Human and Language Technology
    • /
    • 2012.10a
    • /
    • pp.212-214
    • /
    • 2012
  • 한국어 형태소 분석 및 품사 부착에 대한 연구는 지속적으로 이루어져 왔으며 규칙 기반 방법, 통계 기반 방법 등을 중심으로 연구되었다. 본 논문에서는 최근 활용도가 높아지고 있는 모바일 기기에 적합한 한국어 형태소 분석 및 품사 부착 방법을 제안한다. 모바일 기기는 계산 처리 능력과 사용 가능한 메모리가 제한되기 때문에 전통적인 방법을 사용하여 형태소 분석 및 품사 부착을 수행하기에는 한계가 있다. 본 논문에서는 기존의 규칙 기반 형태소 분석 방법인 좌최장일치법을 변형하여 형태소 분석을 수행 하고, 통계적인 방법인 hidden Markov model 을 축소하여 형태소 품사 부착을 수행한다. 제안하는 방법은 기존의 hidden Markov model을 사용한 시스템과 유사한 성능을 보여주며 모바일 기기에 적합하도록 소량의 메모리 사용과 월등히 빠른 속도로 형태소 분석 및 품사 부칙을 수행할 수 있다.

  • PDF

A Learning Using GA Optimized Neural Networks (유전자 알고리즘 최적화 신경망을 이용한 학습)

  • YeoChang Yoon
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2008.11a
    • /
    • pp.27-29
    • /
    • 2008
  • 시스템 분석에 주로 사용하는 자료 중에는 비선형 자료와 시계열 등이 있다. 이들 자료는 그 함축적인 관계가 매우 복잡하여 전통적인 통계분석 도구로 분석하는데 어려움이 많다. 본 연구에서는 현실 세계에서 다양하게 나타나는 복잡성을 다루기 위하여 하이브리드 진화 신경망 모델링 접근 방법으로 자료를 모형화 하고 이를 통한 학습의 적합도를 살펴본다. 비선형 자료 등을 모형화하기 위한 학습은 역전파 신경망 기법을 이용한다. 학습의 효율을 높이기 의해서 격자감소 학습 알고리즘과 함께 이용하는 유전자 알고리즘은 네트워크 구조를 최적화 시킬 수 있는 초기가중값을 이용한 전역 최소값을 찾는데 이용한다. 학습 결과를 통해 제안된 하이브리드형 접근방법의 학습이 보다 효율적임을 살펴보기 위하여 유전자 알고리즘으로 최적화된 신경망 학습 알고리즘을 비선형 모의자료의 학습에 적용하여 보았다.

Penalized quantile regression tree (벌점화 분위수 회귀나무모형에 대한 연구)

  • Kim, Jaeoh;Cho, HyungJun;Bang, Sungwan
    • The Korean Journal of Applied Statistics
    • /
    • v.29 no.7
    • /
    • pp.1361-1371
    • /
    • 2016
  • Quantile regression provides a variety of useful statistical information to examine how covariates influence the conditional quantile functions of a response variable. However, traditional quantile regression (which assume a linear model) is not appropriate when the relationship between the response and the covariates is a nonlinear. It is also necessary to conduct variable selection for high dimensional data or strongly correlated covariates. In this paper, we propose a penalized quantile regression tree model. The split rule of the proposed method is based on residual analysis, which has a negligible bias to select a split variable and reasonable computational cost. A simulation study and real data analysis are presented to demonstrate the satisfactory performance and usefulness of the proposed method.

Course Probability of Yut according to Starting Order (출발순서에 따른 윷말의 코스 경유 확률)

  • Cho, Daehyeon
    • The Korean Journal of Applied Statistics
    • /
    • v.28 no.3
    • /
    • pp.443-455
    • /
    • 2015
  • The Korean game of yut is a traditional games that everyone can enjoy regardless of gender or ages. Yut consists of four sticks with a Head and Tail. We are interested in the course probabilities in the game of yut that are different according to the starting order of the four pieces of yut. So we consider the probabilities of five results of yut which we toss according to the probability of Head. We calculate probabilities according to 4 courses where one piece of yut can go through in a yutpan according to the starting order of each piece of yut.

Machine Learning vs. Statistical Model for Prediction Modelling: Application in Medical Imaging Research (예측모형의 머신러닝 방법론과 통계학적 방법론의 비교: 영상의학 연구에서의 적용)

  • Leeha Ryu;Kyunghwa Han
    • Journal of the Korean Society of Radiology
    • /
    • v.83 no.6
    • /
    • pp.1219-1228
    • /
    • 2022
  • Clinical prediction models has been increasingly published in radiology research. In particular, as a radiomics research is being actively conducted, the prediction model is developed based on the traditional statistical model, as well as machine learning, to account for the high-dimensional data. In this review, we investigated the statistical and machine learning methods used in clinical prediction model research, and briefly summarized each analytical method for statistical model, machine learning, and statistical learning. Finally, we discussed several considerations for choosing the prediction modeling method.

The Effects of Activity-Centered Instruction on Understanding of Natural Selection Concept (자연선택개념의 이해를 위한 활동중심수업의 효과)

  • Park, Jong-Boon;Lee, Mi-Sook;Lee, Kil-Jae
    • Journal of The Korean Association For Science Education
    • /
    • v.23 no.5
    • /
    • pp.505-516
    • /
    • 2003
  • The concept of evolution is one of the most important concepts in the learning biology. However lots of students have difficulties in understanding its mechanism because their preexisted alternative concepts interrupt in gaining the correct idea of evolution. Students usually have the Larmarkian or teleological ideas of evolution. The purpose of this study is to investigate the effectiveness of an activity-centered instruction on the learner's conceptual change from misconception into the scientific concept, Darwinian one, and achievement. For the study, 162 students were sampled from a high school: 81 students for the activity-centered instruction and 81 students for traditional instruction. The result is as follows; 1) The activity-centered instruction is more effective than the traditional one in understanding the concept of Darwinian natural selection(p<.05) and in changing the students' various misconceptions of evolution into Darwinian one. 2) The activity-centered instruction concerning natural selection is more effective in their achievement(p<.01). 3) However, after both of the instructions, some students still kept the Lamarkian thoughts.

An Analysis on the Asymmetric Time Varying Spillover Effect between Capesize and Panamax Markets (케이프사이즈와 파나막스 시장간의 비대칭 시간가변 파급효과에 관한 분석)

  • Chung, Sang-Kuck
    • Journal of Korea Port Economic Association
    • /
    • v.27 no.3
    • /
    • pp.41-64
    • /
    • 2011
  • This article investigates the interrelationships in daily returns using fractionally integrated error correction term and volatilities using constant conditional correlation and dynamic conditional correlation GARCH with asymmetries between Capesize and Panamax markets. Our findings are as follows. First, for the fractionally cointegrated error correction model, there is a unidirectional relationship in returns from the Panamax market to the Capesize market, but a bidirectional causal relationship prevails for the traditional error correction models. Second, the coefficients for the error correction term are all statistically significant. Of particular interest are the signs of the estimates for the error correction term, which are all negative for the Capesize return equation and all positive for the Panamax return. Third, there are bidirectional volatility spillovers between both markets and the direction of the information flow seems to be stronger from Panamax to Capesize. Fourth, the coefficients for the asymmetric term are all significantly positive in the Capesize market, but the Panamax market does not have a significant effect. However, the coefficients for the asymmetric term are all significant, implying that the leverage effect does exist in the Capesize and Panamax markets.

EMD based hybrid models to forecast the KOSPI (코스피 예측을 위한 EMD를 이용한 혼합 모형)

  • Kim, Hyowon;Seong, Byeongchan
    • The Korean Journal of Applied Statistics
    • /
    • v.29 no.3
    • /
    • pp.525-537
    • /
    • 2016
  • The paper considers a hybrid model to analyze and forecast time series data based on an empirical mode decomposition (EMD) that accommodates complex characteristics of time series such as nonstationarity and nonlinearity. We aggregate IMFs using the concept of cumulative energy to improve the interpretability of intrinsic mode functions (IMFs) from EMD. We forecast aggregated IMFs and residue with a hybrid model that combines the ARIMA model and an exponential smoothing method (ETS). The proposed method is applied to forecast KOSPI time series and is compared to traditional forecast models. Aggregated IMFs and residue provide a convenience to interpret the short, medium and long term dynamics of the KOSPI. It is also observed that the hybrid model with ARIMA and ETS is superior to traditional and other types of hybrid models.

Production Efficiency Analysis of Offshore and Coastal Fisheries Considering Greenhouse Gas (온실가스를 고려한 연근해어업의 생산효율성 분석)

  • Jeon, Yonghan;Nam, Jongoh
    • Environmental and Resource Economics Review
    • /
    • v.30 no.1
    • /
    • pp.79-105
    • /
    • 2021
  • In the circumstance of standing out the climate change issue, the purpose of this study is to compare the efficiency of offshore and coastal fisheries according to whether or not greenhouse gas (GHG) emissions are considered, and then to present policy alternatives based on the analysis results. For analysis, the traditional data envelopment analysis (DEA), the slacks-based measure (SBM) and the SBM-undesirable models were used, and robust analysis of variance (ANOVA) and Wilcoxon Signed-rank tests were performed. As a result, the study showed that the average efficiency of fisheries decreased as the traditional DEA extended to the SBM model considering the slack and the SBM-undesirable model including the GHG emissions. Specifically, the average efficiency of the traditional DEA model, SBM model, and SBM-undesirable model was analyzed as 0.7350, 0.5820 and 0.4976 respectively. In addition, the results of the robust ANOVA and Wilcoxon Signed-rank tests all showed that there are statistically significant differences in efficiency between offshore and coastal fisheries as well as among traditional DEA, SBM and SBM-undesirable models. As a policy alternative to the analysis, it was suggested that to improve the efficiency of coastal and offshore fisheries, it is necessary to actively implement the new fishing vessel project and develop smart and electric hybrid fishing vessels.

Spatial Analysis of Flood Rainfall Based on Kriging Technique in Nakdong River Basin (크리깅 기법을 이용한 낙동강 유역 홍수강우의 공간해석 연구)

  • Yoon, Kang-Hoon;Seo, Bong-Chul;Shin, Hyun-Suk
    • Journal of Korea Water Resources Association
    • /
    • v.37 no.3
    • /
    • pp.233-240
    • /
    • 2004
  • Most of hydrological analyses in the field of water resources are launched by gathering and analyzing rainfall data. Several methods have been developed to estimate areal rainfall from point rainfall data and to fill missing or ungaged data. Thiessen and Reciprocal Distance Squared(RDS) methods whose parameters are only dependent on inter-station distance are classical work in hydrology, but these techniques do not provide a continuous representation of the hydrologic process involved. In this study, kriging technique was applied to rainfall analysis in Nakdong river basin in order to complement the defects of these classical methods and to reflect spatial characteristics of regional rainfall. After spatial correlation and semi-variogram analyses were performed to perceive regional rainfall property, kriging analysis was performed to interpolate rainfall data for each grid Thus, these procedures were enable to estimate average rainfall of subbasins. In addition, poor region of rainfall observation was analyzed by spatial interpolation error for each grid and mean error for each subbasin.