• Title/Summary/Keyword: vector data

Search Result 3,298, Processing Time 0.03 seconds

Validation on Residual Variation and Covariance Matrix of USSTRATCOM Two Line Element

  • Yim, Hyeon-Jeong;Chung, Dae-Won
    • Journal of Astronomy and Space Sciences
    • /
    • v.29 no.3
    • /
    • pp.287-293
    • /
    • 2012
  • Satellite operating agencies are constantly monitoring conjunctions between satellites and space objects. Two line element (TLE) data, published by the Joint Space Operations Center of the United States Strategic Command, are available as raw data for a preliminary analysis of initial conjunction with a space object without any orbital information. However, there exist several sorts of uncertainties in the TLE data. In this paper, we suggest and analyze a method for estimating the uncertainties in the TLE data through mean, standard deviation of state vector residuals and covariance matrix. Also the estimation results are compared with actual results of orbit determination to validate the estimation method. Characteristics of the state vector residuals depending on the orbital elements are examined by applying the analysis to several satellites in various orbits. Main source of difference between the covariance matrices are also analyzed by comparing the matrices. Particularly, for the Korea Multi-Purpose Satellite-2, we examine the characteristics of the residual variation of state vector and covariance matrix depending on the orbital elements. It is confirmed that a realistic consideration on the space situation of space objects is possible using information from the analysis of mean, standard deviation of the state vector residuals of TLE and covariance matrix.

Smoothing Kaplan-Meier estimate using monotone support vector regression (단조 서포트벡터기계를 이용한 카플란-마이어 생존함수의 평활)

  • Hwang, Changha;Shim, Jooyong
    • Journal of the Korean Data and Information Science Society
    • /
    • v.23 no.6
    • /
    • pp.1045-1054
    • /
    • 2012
  • Support vector machine is known to be the very useful statistical method in classification and nonlinear function estimation. In this paper we propose a monotone support vector regression (SVR) for the estimation of monotonically decreasing function. The proposed monotone SVR is applied to smooth the Kaplan-Meier estimate of survival function. Experimental results are then presented which indicate the performance of the proposed monotone SVR using survival functions obtained by exponential distribution.

Analysis of market share attraction data using LS-SVM (최소제곱 서포트벡터기계를 이용한 시장점유율 자료 분석)

  • Park, Hye-Jung
    • Journal of the Korean Data and Information Science Society
    • /
    • v.20 no.5
    • /
    • pp.879-886
    • /
    • 2009
  • The purpose of this article is to present the application of Least Squares Support Vector Machine in analyzing the existing structure of brand. We estimate the parameters of the Market Share Attraction Model using a non-parametric technique for function estimation called Least Squares Support Vector Machine, which allows us to perform even nonlinear regression by constructing a linear regression function in a high dimensional feature space. Estimation by Least Squares Support Vector Machine technique makes it a good candidate for solving the Market Share Attraction Model. To illustrate the performance of the proposed method, we use the car sales data in South Korea's car market.

  • PDF

Multi-Dimensional Vector Approximation Tree with Dynamic Bit Allocation (동적 비트 할당을 통한 다차원 벡터 근사 트리)

  • 복경수;허정필;유재수
    • The Journal of the Korea Contents Association
    • /
    • v.4 no.3
    • /
    • pp.81-90
    • /
    • 2004
  • Recently, It has been increased to use a multi-dimensional data in various applications with a rapid growth of the computing environment. In this paper, we propose the vector approximate tree for content-based retrieval of multi-dimensional data. The proposed index structure reduces the depth of tree by storing the many region information in a node because of representing region information using space partition based method and vector approximation method. Also it efficiently handles 'dimensionality curse' that causes a problem of multi-dimensional index structure by assigning the multi-dimensional data space to dynamic bit. And it provides the more correct regions by representing the child region information as the parent region information relatively. We show that our index structure outperforms the existing index structure by various experimental evaluations.

  • PDF

Comparison between Word Embedding Techniques in Traditional Korean Medicine for Data Analysis: Implementation of a Natural Language Processing Method (한의학 고문헌 데이터 분석을 위한 단어 임베딩 기법 비교: 자연어처리 방법을 적용하여)

  • Oh, Junho
    • Journal of Korean Medical classics
    • /
    • v.32 no.1
    • /
    • pp.61-74
    • /
    • 2019
  • Objectives : The purpose of this study is to help select an appropriate word embedding method when analyzing East Asian traditional medicine texts as data. Methods : Based on prescription data that imply traditional methods in traditional East Asian medicine, we have examined 4 count-based word embedding and 2 prediction-based word embedding methods. In order to intuitively compare these word embedding methods, we proposed a "prescription generating game" and compared its results with those from the application of the 6 methods. Results : When the adjacent vectors are extracted, the count-based word embedding method derives the main herbs that are frequently used in conjunction with each other. On the other hand, in the prediction-based word embedding method, the synonyms of the herbs were derived. Conclusions : Counting based word embedding methods seems to be more effective than prediction-based word embedding methods in analyzing the use of domesticated herbs. Among count-based word embedding methods, the TF-vector method tends to exaggerate the frequency effect, and hence the TF-IDF vector or co-word vector may be a more reasonable choice. Also, the t-score vector may be recommended in search for unusual information that could not be found in frequency. On the other hand, prediction-based embedding seems to be effective when deriving the bases of similar meanings in context.

Estimating Hydrodynamic Coefficients of Real Ships Using AIS Data and Support Vector Regression

  • Hoang Thien Vu;Jongyeol Park;Hyeon Kyu Yoon
    • Journal of Ocean Engineering and Technology
    • /
    • v.37 no.5
    • /
    • pp.198-204
    • /
    • 2023
  • In response to the complexity and time demands of conventional methods for estimating the hydrodynamic coefficients, this study aims to revolutionize ship maneuvering analysis by utilizing automatic identification system (AIS) data and the Support Vector Regression (SVR) algorithm. The AIS data were collected and processed to remove outliers and impute missing values. The rate of turn (ROT), speed over ground (SOG), course over ground (COG) and heading (HDG) in AIS data were used to calculate the rudder angle and ship velocity components, which were then used as training data for a regression model. The accuracy and efficiency of the algorithm were validated by comparing SVR-based estimated hydrodynamic coefficients and the original hydrodynamic coefficients of the Mariner class vessel. The validated SVR algorithm was then applied to estimate the hydrodynamic coefficients for real ships using AIS data. The turning circle test wassimulated from calculated hydrodynamic coefficients and compared with the AIS data. The research results demonstrate the effectiveness of the SVR model in accurately estimating the hydrodynamic coefficients from the AIS data. In conclusion, this study proposes the viability of employing SVR model and AIS data for accurately estimating the hydrodynamic coefficients. It offers a practical approach to ship maneuvering prediction and control in the maritime industry.

Performance Comparison of Clustering Validity Indices with Business Applications (경영사례를 이용한 군집화 유효성 지수의 성능비교)

  • Lee, Soo-Hyun;Jeong, Youngseon;Kim, Jae-Yun
    • Journal of the Korean Operations Research and Management Science Society
    • /
    • v.41 no.2
    • /
    • pp.17-33
    • /
    • 2016
  • Clustering is one of the leading methods to analyze big data and is used in many different fields. This study deals with Clustering Validity Index (CVI) to verify the effectiveness of clustering results. We compare the performance of CVIs with business applications of various field. In this study, the used CVIs for comparing performance are DU, CH, DB, SVDU, SVCH, and SVDB. The first three CVIs are well-known ones in the existing research and the last three CVIs are based on support vector data description. It has been verified with outstanding performance and qualified as the application ability of CVIs based on support vector data description.

Soil moisture prediction using a support vector regression

  • Lee, Danhyang;Kim, Gwangseob;Lee, Kyeong Eun
    • Journal of the Korean Data and Information Science Society
    • /
    • v.24 no.2
    • /
    • pp.401-408
    • /
    • 2013
  • Soil moisture is a very important variable in various area of hydrological processes. We predict the soil moisture using a support vector regression. The model is trained and tested using the soil moisture data observed in five sites in the Yongdam dam basin. With respect to soil moisture data of of four sites-Jucheon, Bugui, Sangieon and Ahncheon which are used to train the model, the correlation coefficient between the esimtates and the observed values is about 0.976. As the result of the application to Cheoncheon2 for validating the model, the correlation coefficient between the estimates and the observed values of soil moisture is about 0.835. We compare those results with those of artificial neural network models.

Support vector quantile regression ensemble with bagging

  • Shim, Jooyong;Hwang, Changha
    • Journal of the Korean Data and Information Science Society
    • /
    • v.25 no.3
    • /
    • pp.677-684
    • /
    • 2014
  • Support vector quantile regression (SVQR) is capable of providing more complete description of the linear and nonlinear relationships among random variables. To improve the estimation performance of SVQR we propose to use SVQR ensemble with bagging (bootstrap aggregating), in which SVQRs are trained independently using the training data sets sampled randomly via a bootstrap method. Then, they are aggregated to obtain the estimator of the quantile regression function using the penalized objective function composed of check functions. Experimental results are then presented, which illustrate the performance of SVQR ensemble with bagging.

How to improve oil consumption forecast using google trends from online big data?: the structured regularization methods for large vector autoregressive model

  • Choi, Ji-Eun;Shin, Dong Wan
    • Communications for Statistical Applications and Methods
    • /
    • v.29 no.1
    • /
    • pp.41-51
    • /
    • 2022
  • We forecast the US oil consumption level taking advantage of google trends. The google trends are the search volumes of the specific search terms that people search on google. We focus on whether proper selection of google trend terms leads to an improvement in forecast performance for oil consumption. As the forecast models, we consider the least absolute shrinkage and selection operator (LASSO) regression and the structured regularization method for large vector autoregressive (VAR-L) model of Nicholson et al. (2017), which select automatically the google trend terms and the lags of the predictors. An out-of-sample forecast comparison reveals that reducing the high dimensional google trend data set to a low-dimensional data set by the LASSO and the VAR-L models produces better forecast performance for oil consumption compared to the frequently-used forecast models such as the autoregressive model, the autoregressive distributed lag model and the vector error correction model.