• Title/Summary/Keyword: function-level features

Search Result 179, Processing Time 0.021 seconds

Cross-architecture Binary Function Similarity Detection based on Composite Feature Model

  • Xiaonan Li;Guimin Zhang;Qingbao Li;Ping Zhang;Zhifeng Chen;Jinjin Liu;Shudan Yue
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.17 no.8
    • /
    • pp.2101-2123
    • /
    • 2023
  • Recent studies have shown that the neural network-based binary code similarity detection technology performs well in vulnerability mining, plagiarism detection, and malicious code analysis. However, existing cross-architecture methods still suffer from insufficient feature characterization and low discrimination accuracy. To address these issues, this paper proposes a cross-architecture binary function similarity detection method based on composite feature model (SDCFM). Firstly, the binary function is converted into vector representation according to the proposed composite feature model, which is composed of instruction statistical features, control flow graph structural features, and application program interface calling behavioral features. Then, the composite features are embedded by the proposed hierarchical embedding network based on a graph neural network. In which, the block-level features and the function-level features are processed separately and finally fused into the embedding. In addition, to make the trained model more accurate and stable, our method utilizes the embeddings of predecessor nodes to modify the node embedding in the iterative updating process of the graph neural network. To assess the effectiveness of composite feature model, we contrast SDCFM with the state of art method on benchmark datasets. The experimental results show that SDCFM has good performance both on the area under the curve in the binary function similarity detection task and the vulnerable candidate function ranking in vulnerability search task.

Detection of an Open-Source Software Module based on Function-level Features (함수 수준 특징정보 기반의 오픈소스 소프트웨어 모듈 탐지)

  • Kim, Dongjin;Cho, Seong-je
    • Journal of KIISE
    • /
    • v.42 no.6
    • /
    • pp.713-722
    • /
    • 2015
  • As open-source software (OSS) becomes more widely used, many users breach the terms in the license agreement of OSS, or reuse a vulnerable OSS module. Therefore, a technique needs to be developed for investigating if a binary program includes an OSS module. In this paper, we propose an efficient technique to detect a particular OSS module in an executable program using its function-level features. The conventional methods are inappropriate for determining whether a module is contained in a specific program because they usually measure the similarity between whole programs. Our technique determines whether an executable program contains a certain OSS module by extracting features such as its function-level instructions, control flow graph, and the structural attributes of a function from both the program and the module, and comparing the similarity of features. In order to demonstrate the efficiency of the proposed technique, we evaluate it in terms of the size of features, detection accuracy, execution overhead, and resilience to compiler optimizations.

Relation Based Bayesian Network for NBNN

  • Sun, Mingyang;Lee, YoonSeok;Yoon, Sung-eui
    • Journal of Computing Science and Engineering
    • /
    • v.9 no.4
    • /
    • pp.204-213
    • /
    • 2015
  • Under the conditional independence assumption among local features, the Naive Bayes Nearest Neighbor (NBNN) classifier has been recently proposed and performs classification without any training or quantization phases. While the original NBNN shows high classification accuracy without adopting an explicit training phase, the conditional independence among local features is against the compositionality of objects indicating that different, but related parts of an object appear together. As a result, the assumption of the conditional independence weakens the accuracy of classification techniques based on NBNN. In this work, we look into this issue, and propose a novel Bayesian network for an NBNN based classification to consider the conditional dependence among features. To achieve our goal, we extract a high-level feature and its corresponding, multiple low-level features for each image patch. We then represent them based on a simple, two-level layered Bayesian network, and design its classification function considering our Bayesian network. To achieve low memory requirement and fast query-time performance, we further optimize our representation and classification function, named relation-based Bayesian network, by considering and representing the relationship between a high-level feature and its low-level features into a compact relation vector, whose dimensionality is the same as the number of low-level features, e.g., four elements in our tests. We have demonstrated the benefits of our method over the original NBNN and its recent improvement, and local NBNN in two different benchmarks. Our method shows improved accuracy, up to 27% against the tested methods. This high accuracy is mainly due to consideration of the conditional dependences between high-level and its corresponding low-level features.

Detection of Microcalcification Using the Wavelet Based Adaptive Sigmoid Function and Neural Network

  • Kumar, Sanjeev;Chandra, Mahesh
    • Journal of Information Processing Systems
    • /
    • v.13 no.4
    • /
    • pp.703-715
    • /
    • 2017
  • Mammogram images are sensitive in nature and even a minor change in the environment affects the quality of the images. Due to the lack of expert radiologists, it is difficult to interpret the mammogram images. In this paper an algorithm is proposed for a computer-aided diagnosis system, which is based on the wavelet based adaptive sigmoid function. The cascade feed-forward back propagation technique has been used for training and testing purposes. Due to the poor contrast in digital mammogram images it is difficult to process the images directly. Thus, the images were first processed using the wavelet based adaptive sigmoid function and then the suspicious regions were selected to extract the features. A combination of texture features and gray-level co-occurrence matrix features were extracted and used for training and testing purposes. The system was trained with 150 images, while a total 100 mammogram images were used for testing. A classification accuracy of more than 95% was obtained with our proposed method.

Evaluation of Resident Satisfaction Level on the Environment Friendly Consolidation Canals (친환경 정비수로에 대한 주민 만족도 평가)

  • Kim, Sun-Joo;An, Min-Woo;Ko, Jae-Sun
    • Journal of Korean Society of Rural Planning
    • /
    • v.9 no.3 s.20
    • /
    • pp.47-52
    • /
    • 2003
  • The purpose of this study is to evaluate the satisfaction level and to analyze the factors in the environment friendly consolidation canals. In this study, the survey consists of four parts, such as the actual status of environment friendly canals, satisfaction level for each items, overall satisfaction level and the personal features of the users. Total 128 samples out of 140 respondents were used for the final analysis. Analysis of the satisfaction level were fulfilled to survey results such as the basic statistics and the correlations of variables, in addition, dispersion analysis for two user groups were carried out. Function of canal, water quality, landscape, and convenience facilities were chosen as independent variables in the model. Residents in Yeoju area were satisfied with water quality, water quantity and the function of canal. And residents of Yeonggwang area were satisfied with the convenience facilities and the function of the newly consolidated canal.

Lexical Sophistication Features to Distinguish the English Proficiency Level Using a Discriminant Function Analysis (판별분석을 통해 살펴본 영어 능력 수준을 구별하는 어휘의 정교화 특성)

  • Lee, Young-Ju
    • The Journal of the Convergence on Culture Technology
    • /
    • v.8 no.5
    • /
    • pp.691-696
    • /
    • 2022
  • This study explored the lexical sophistication features to distinguish the group membership of English proficiency, using the automatic analysis program of lexical sophistication. A total of 600 essays written by 300 Korean college students were extracted from the ICNALE (International Corpus Network of Asian Learners of English) corpus and a discriminant function analysis was performed using SPSS program. Results showed that the lexical features to distinguish three groups of English proficiency are SUBTLEXUS frequency content words, age of acquisition content words, lexical decision mean reaction time function words, and hypernymy verbs. High-level Korean students used frequent content words from SUBTLEXUS corpus to a lesser degree and produced more sophisticated words that can be learned at a later age and take longer reaction time in lexical decision task, and more concrete verbs.

A Study on the Adaptive Method for Extracting Optimum Features of Speech Signal (음성신호의 최적특징을 적응적으로 추출하는 방법에 관한 연구)

  • 장승관;차태호;최웅세;김창석
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.19 no.2
    • /
    • pp.373-380
    • /
    • 1994
  • In this paper, we proposed a method of extracting optimum features of speech signal to adjust signal level. For extracting features of speech signal we used FRLS(Fast Recursive Least Square) algorithm, we adjusted each frames of equal to constant level, and extracted optimum features of speech signal by using equalized autocorrelation function proposed in this paper.

  • PDF

Music Emotion Classification Based On Three-Level Structure (3 레벨 구조 기반의 음악 무드분류)

  • Kim, Hyoung-Gook;Jeong, Jin-Guk
    • The Journal of the Acoustical Society of Korea
    • /
    • v.26 no.2E
    • /
    • pp.56-62
    • /
    • 2007
  • This paper presents the automatic music emotion classification on acoustic data. A three-level structure is developed. The low-level extracts the timbre and rhythm features. The middle-level estimates the indication functions that represent the emotion probability of a single analysis unit. The high-level predicts the emotion result based on the indication function values. Experiments are carried out on 695 homogeneous music pieces labeled with four emotions, including pleasant, calm, sad, and excited. Three machine learning methods, GMM, MLP, and SVM, are compared on the high-level. The best result of 90.16% is obtained by MLP method.

Mechanical Fault Classification of an Induction Motor using Texture Analysis (질감 분석을 이용한 유도 전동기의 기계적 결함 분류)

  • Jang, Won-Chul;Park, Yong-Hoon;Kang, Myeong-Su;Kim, Jong-Myon
    • Journal of the Korea Society of Computer and Information
    • /
    • v.18 no.12
    • /
    • pp.11-19
    • /
    • 2013
  • This paper proposes an algorithm using vibration signals and texture analysis for mechanical fault diagnosis of an induction motor. We analyze characteristics of contrast and pattern of an image converted from vibration signal and extract three texture features using gray-level co-occurrence model(GLCM). Then, the extracted features are used as inputs of a multi-level support vector machine(MLSVM) which utilizes the radial basis function(RBF) kernel function to classify each fault type. In addition, we evaluate the classification performance with varying the parameter from 0.3 to 1.0 for the RBF kernel function of MLSVM, and the proposed algorithm achieved 100% classification accuracy with the parameter of the RBF from 0.3 to 1.0. Moreover, the proposed algorithm achieved about 98% classification accuracy with 15dB and 20dB noise inserted vibration signals.

Preference of Women Cosmetics Consumption Value on SNS Features of Cosmetics Brands (여성의 화장품 소비가치에 따른 화장품 브랜드의 SNS 특성 선호도)

  • Kim, Cho-Rong;Kwak, Tai-Gi
    • Journal of the Korea Fashion and Costume Design Association
    • /
    • v.18 no.3
    • /
    • pp.99-111
    • /
    • 2016
  • In order to use cosmetics brands SNS effectively and establish strategy, the purpose of this study is to provide consumer date. According to cosmetics consumption value groups, this study examined each cosmetics consumption value group's level of concern of three cosmetics brands SNS features, informativity, enjoyment and interactivity. For the data, questionnaire was collected by 198 women, and the data were measured by ANOVA, factor analysis, cluster analysis and Ducan test. According to cluster analysis cosmetics consumption value groups were divided into four groups, unconcern group, hedonic value pursuit group, function of brand value pursuit group, high concern group. The results of the study are as follows: First, high concern group thought highly of all cosmetics brands SNS features, including informativity, enjoyment and interactivity. In addition all consumer groups were concerned informativity rather then enjoyment and interactivity. Second, comparing with high concern group, other groups which include unconcern group, hedonic value pursuit group and function of brand value pursuit group were not concerned about cosmetics brands SNS features' enjoyment and interactivity. Except informativity, hedonic value pursuit group and unconcern group were more concerned interactivity than enjoyment. While, high concern group and function of brand value pursuit group were more concerned enjoyment than interactivity.

  • PDF