• Title/Summary/Keyword: correlation feature analysis

Search Result 245, Processing Time 0.034 seconds

Construction of Scientific Impact Evaluation Model Based on Altmetrics

  • Li, Jiapei;Shin, Seong Yoon;Lee, Hyun Chang
    • Journal of information and communication convergence engineering
    • /
    • v.15 no.3
    • /
    • pp.165-169
    • /
    • 2017
  • Altmetrics is an emergent research area whereby social media is applied as a source of metrics to evaluate scientific impact. Recently, the interest in altmetrics has been growing. Traditional scientific impact evaluation indictors are based on the number of publications, citation counts and peer reviews of a researcher. As research publications were increasingly placed online, usage metrics as well as webometrics appeared. This paper explores the potential benefits of altmetrics and the deep relationship between each metrics. Firstly, we found a weak-to-medium correlation among the 11 altmetrics and visualized such correlation. Secondly, we conducted principal component analysis and exploratory factor analysis on altmetrics of social media, divided the 11 altmetrics into four feature sets, confirming the dispersion and relative concentration of altmetrics groups and developed the altmetrics evaluation model. We can use this model to evaluate the scientific impact of articles on social media.

A Novel Network Anomaly Detection Method based on Data Balancing and Recursive Feature Addition

  • Liu, Xinqian;Ren, Jiadong;He, Haitao;Wang, Qian;Sun, Shengting
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.14 no.7
    • /
    • pp.3093-3115
    • /
    • 2020
  • Network anomaly detection system plays an essential role in detecting network anomaly and ensuring network security. Anomaly detection system based machine learning has become an increasingly popular solution. However, due to the unbalance and high-dimension characteristics of network traffic, the existing methods unable to achieve the excellent performance of high accuracy and low false alarm rate. To address this problem, a new network anomaly detection method based on data balancing and recursive feature addition is proposed. Firstly, data balancing algorithm based on improved KNN outlier detection is designed to select part respective data on each category. Combination optimization about parameters of improved KNN outlier detection is implemented by genetic algorithm. Next, recursive feature addition algorithm based on correlation analysis is proposed to select effective features, in which a cross contingency test is utilized to analyze correlation and obtain a features subset with a strong correlation. Then, random forests model is as the classification model to detection anomaly. Finally, the proposed algorithm is evaluated on benchmark datasets KDD Cup 1999 and UNSW_NB15. The result illustrates the proposed strategies enhance accuracy and recall, and decrease the false alarm rate. Compared with other algorithms, this algorithm still achieves significant effects, especially recall in the small category.

Development of Correlation Based Feature Selection Method by Predicting the Markov Blanket for Gene Selection Analysis

  • Adi, Made;Yun, Zhen;Keong, Kwoh-Chee
    • Proceedings of the Korean Society for Bioinformatics Conference
    • /
    • 2005.09a
    • /
    • pp.183-187
    • /
    • 2005
  • In this paper, we propose a heuristic method to select features using a Two-Phase Markov Blanket-based (TPMB) algorithm. The first phase, filtering phase, of TPMB algorithm works by filtering the obviously redundant features. A non-linear correlation method based on Information theory is used as a metric to measure the redundancy of a feature [1]. In second phase, approximating phase, the Markov Blanket (MB) of a system is estimated by employing the concept of cross entropy to identify the MB. We perform experiments on microarray data and report two popular dataset, AML-ALL [3] and colon tumor [4], in this paper. The experimental results show that the TPMB algorithm can significantly reduce the number of features while maintaining the accuracy of the classifiers.

  • PDF

A study on applying multivariate statistical method for making casual structure in management information (경영정보의 인과구조 구축을 위한 다변량통계기법 적용에 관한 연구)

  • 조성훈;김태성
    • Proceedings of the Korean Operations and Management Science Society Conference
    • /
    • 1996.10a
    • /
    • pp.117-120
    • /
    • 1996
  • The objective of this study is to suggest modified Covariance Structure Analysis that combine with existing Multivariate Statistical Method which is used Casual Analysis Method in Management Information. For this purpose, we'll consider special feature and limitation about Correlation Analysis, Regression Analysis, Path Analysis and connect Covariance Structure Analysis with Statistical Factor Analysis so that theoretical casual model compare with variables structure in collecting data. A example is also presented to show the practical applicability of this approach.

  • PDF

Health Risk Management using Feature Extraction and Cluster Analysis considering Time Flow (시간흐름을 고려한 특징 추출과 군집 분석을 이용한 헬스 리스크 관리)

  • Kang, Ji-Soo;Chung, Kyungyong;Jung, Hoill
    • Journal of the Korea Convergence Society
    • /
    • v.12 no.1
    • /
    • pp.99-104
    • /
    • 2021
  • In this paper, we propose health risk management using feature extraction and cluster analysis considering time flow. The proposed method proceeds in three steps. The first is the pre-processing and feature extraction step. It collects user's lifelog using a wearable device, removes incomplete data, errors, noise, and contradictory data, and processes missing values. Then, for feature extraction, important variables are selected through principal component analysis, and data similar to the relationship between the data are classified through correlation coefficient and covariance. In order to analyze the features extracted from the lifelog, dynamic clustering is performed through the K-means algorithm in consideration of the passage of time. The new data is clustered through the similarity distance measurement method based on the increment of the sum of squared errors. Next is to extract information about the cluster by considering the passage of time. Therefore, using the health decision-making system through feature clusters, risks able to managed through factors such as physical characteristics, lifestyle habits, disease status, health care event occurrence risk, and predictability. The performance evaluation compares the proposed method using Precision, Recall, and F-measure with the fuzzy and kernel-based clustering. As a result of the evaluation, the proposed method is excellently evaluated. Therefore, through the proposed method, it is possible to accurately predict and appropriately manage the user's potential health risk by using the similarity with the patient.

Optimal Facial Emotion Feature Analysis Method based on ASM-LK Optical Flow (ASM-LK Optical Flow 기반 최적 얼굴정서 특징분석 기법)

  • Ko, Kwang-Eun;Park, Seung-Min;Park, Jun-Heong;Sim, Kwee-Bo
    • Journal of the Korean Institute of Intelligent Systems
    • /
    • v.21 no.4
    • /
    • pp.512-517
    • /
    • 2011
  • In this paper, we propose an Active Shape Model (ASM) and Lucas-Kanade (LK) optical flow-based feature extraction and analysis method for analyzing the emotional features from facial images. Considering the facial emotion feature regions are described by Facial Action Coding System, we construct the feature-related shape models based on the combination of landmarks and extract the LK optical flow vectors at each landmarks based on the centre pixels of motion vector window. The facial emotion features are modelled by the combination of the optical flow vectors and the emotional states of facial image can be estimated by the probabilistic estimation technique, such as Bayesian classifier. Also, we extract the optimal emotional features that are considered the high correlation between feature points and emotional states by using common spatial pattern (CSP) analysis in order to improvise the operational efficiency and accuracy of emotional feature extraction process.

FAFS: A Fuzzy Association Feature Selection Method for Network Malicious Traffic Detection

  • Feng, Yongxin;Kang, Yingyun;Zhang, Hao;Zhang, Wenbo
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.14 no.1
    • /
    • pp.240-259
    • /
    • 2020
  • Analyzing network traffic is the basis of dealing with network security issues. Most of the network security systems depend on the feature selection of network traffic data and the detection ability of malicious traffic in network can be improved by the correct method of feature selection. An FAFS method, which is short for Fuzzy Association Feature Selection method, is proposed in this paper for network malicious traffic detection. Association rules, which can reflect the relationship among different characteristic attributes of network traffic data, are mined by association analysis. The membership value of association rules are obtained by the calculation of fuzzy reasoning. The data features with the highest correlation intensity in network data sets are calculated by comparing the membership values in association rules. The dimension of data features are reduced and the detection ability of malicious traffic detection algorithm in network is improved by FAFS method. To verify the effect of malicious traffic feature selection by FAFS method, FAFS method is used to select data features of different dataset in this paper. Then, K-Nearest Neighbor algorithm, C4.5 Decision Tree algorithm and Naïve Bayes algorithm are used to test on the dataset above. Moreover, FAFS method is also compared with classical feature selection methods. The analysis of experimental results show that the precision and recall rate of malicious traffic detection in the network can be significantly improved by FAFS method, which provides a valuable reference for the establishment of network security system.

Moving Window Principal Component Analysis for Detecting Positional Fluctuation of Spectral Changes

  • Ryu, Soo-Ryeon;Noda, Isao;Jung, Young-Mee
    • Bulletin of the Korean Chemical Society
    • /
    • v.32 no.7
    • /
    • pp.2332-2338
    • /
    • 2011
  • In this study, we proposed a new promising idea of utilizing moving window principal component analysis (MWPCA) as a sensitive diagnostic tool to detect the presence of peak position shift. In this approach, the moving window is constructed from a small data segment along the wavenumber axis. For each window bound by a narrow wavenumber region, separate PCA analysis was applied. Simulated spectra with complex spectral feature variations were analyzed to explore the possibility of MWPCA technique. This MWPCA-based detection of the peak shift, potentially coupled with 2D correlation analysis to provide additional verification, may offer an attractive solution.

Analysis of Ground Height from Automatic Correlation Matching Result Considering Density Measure of Tree (수목차폐율을 고려한 자동상관매칭 수치고도 결과 분석)

  • Eo, Yang-Dam
    • Spatial Information Research
    • /
    • v.15 no.2
    • /
    • pp.181-187
    • /
    • 2007
  • To make digital terrain data, automatic correlation matching by stereo airborne/satellite images has been researched. The result of automatic correlation matching has a limit on extracting exact ground height because of angle of sensor, tree of height. Therefore, the amount of editing works depend on the distribution of spatial feature in images as well as image quality. This paper shows that the automatic correlation matching result was affected by density and height of tree.

  • PDF

Evaluation of Chaotic evaluation of degradation signals of AISI 304 steel using the Attractor Analysis (어트랙터 해석을 이용한 AISI 304강 열화 신호의 카오스의 평가)

  • 오상균
    • Journal of the Korean Society of Manufacturing Technology Engineers
    • /
    • v.9 no.2
    • /
    • pp.45-51
    • /
    • 2000
  • This study proposes that analysis and evaluation method of time series ultrasonic signal using the chaotic feature extrac-tion for degradation extent. Features extracted from time series data using the chaotic time series signal analyze quantitatively material degradation extent. For this purpose analysis objective in this study if fractal dimension lyapunov exponent and strange attractor on hyperspace. The lyapunov exponent is a measure of the rate at which nearby trajectories in phase space diverge. Chaotic trajectories have at least one positive lyapunov exponent. The fractal dimension appears as a metric space such as the phase space trajectory of a dynamical syste, In experiment fractal(correlation) dimensions and lyapunov experiments showed values of mean 3.837-4.211 and 0.054-0.078 in case of degradation material The proposed chaotic feature extraction in this study can enhances ultrasonic pattern recognition results from degrada-tion signals.

  • PDF