• Title/Summary/Keyword: Similarity Measurement

Word Sense Similarity Clustering Based on Vector Space Model and HAL (벡터 공간 모델과 HAL에 기초한 단어 의미 유사성 군집)

  • Kim, Dong-Sung
    • Korean Journal of Cognitive Science / v.23 no.3 / pp.295-322 / 2012
  • In this paper, we cluster similar word senses using the vector space model and HAL (Hyperspace Analog to Language). HAL measures correlation among words within a context window of a fixed size (Lund and Burgess 1996). Similarity between a word pair is measured as cosine similarity in the vector space model, which reduces the distortion of the space between high-frequency and low-frequency words (Salton et al. 1975, Widdows 2004). We use PCA (Principal Component Analysis) and SVD (Singular Value Decomposition) to reduce the large number of dimensions of the similarity matrix. For sense similarity clustering, we adopt both supervised and unsupervised learning methods. For the unsupervised method, we use clustering. For the supervised methods, we use SVM (Support Vector Machine), a Naive Bayes classifier, and the Maximum Entropy method.
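The combination of HAL-style context vectors and cosine similarity described in this abstract can be illustrated with a minimal sketch. It is not the paper's implementation: the function names `hal_vectors` and `cosine`, the window size, the linear distance weighting, and the symmetric treatment of left and right context are all assumptions made for illustration.

```python
import math
from collections import defaultdict

def hal_vectors(tokens, window=2):
    """Build HAL-style co-occurrence vectors (hypothetical helper).

    Each pair of words at most `window` positions apart adds a weight
    that shrinks with distance (window - distance + 1). Original HAL
    keeps left and right context separate; this sketch merges them.
    """
    vectors = defaultdict(lambda: defaultdict(float))
    for i, word in enumerate(tokens):
        for d in range(1, window + 1):
            if i - d < 0:
                break
            neighbor = tokens[i - d]
            weight = float(window - d + 1)
            vectors[word][neighbor] += weight
            vectors[neighbor][word] += weight  # symmetric simplification
    return vectors

def cosine(u, v):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(u[k] * v.get(k, 0.0) for k in u)
    norm_u = math.sqrt(sum(x * x for x in u.values()))
    norm_v = math.sqrt(sum(x * x for x in v.values()))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0  # no context observed for one of the words
    return dot / (norm_u * norm_v)

# Example: compare the contexts of two words in a toy corpus.
corpus = "the cat sat on the mat the dog sat on the rug".split()
vecs = hal_vectors(corpus, window=2)
similarity = cosine(vecs["cat"], vecs["dog"])
```

Because cosine similarity depends only on the direction of a vector, a word whose counts are ten times larger than another's but distributed in the same proportions still scores 1, which is one way to read the abstract's remark about high-frequency and low-frequency words.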

Air Similarity Performance Test of Turbopump Turbine (터보펌프용 터빈 공기상사 성능시험)

  • Lim Byeung-Jun;Hong Chang-Uk;Kim Jin-Han
    • Journal of the Korean Society of Propulsion Engineers / v.10 no.2 / pp.39-45 / 2006
  • In a liquid rocket engine turbopump, it is difficult to evaluate turbine performance under the actual high-pressure, high-temperature conditions. Turbine tests are therefore often performed with air at similarity conditions so that the turbine can be tested at lower risk. This paper describes an air similarity test program for a liquid rocket engine turbopump turbine. A test facility has been built to evaluate the aerodynamic performance of turbines. It consists of a high-pressure air supply system, a nozzle for measuring mass flow rate, a test section, a hydraulic brake, an exit orifice for pressure control, and an instrumentation and control system. This paper also presents how the similarity conditions of the turbine test are determined and how the test conditions are controlled. The relative standard deviation of the measured parameters was less than 1%, and the measured turbine efficiency agreed with the analysis results to within 2%.

Micro-seismic monitoring in mines based on cross wavelet transform

  • Huang, Linqi;Hao, Hong;Li, Xibing;Li, Jun
    • Earthquakes and Structures / v.11 no.6 / pp.1143-1164 / 2016
  • Time Delay of Arrival (TDOA) estimation methods based on correlation function analysis play an important role in micro-seismic event monitoring. They make full use of the similarity between recorded signals that come from the same source. However, these methods are sensitive to noise, particularly when the global similarity of the signals is low. This paper proposes a new approach to micro-seismic monitoring based on the cross wavelet transform. The cross wavelet transform is used to analyse the signals measured during micro-seismic events, and the cross wavelet power spectrum is used to measure the similarity of two signals across multiple scales and then to identify the TDOA. The time offset associated with the maximum cross wavelet spectrum power is taken as the TDOA, from which the location of the micro-seismic event can be identified. Individual and statistical identification tests are performed with measurement data from an in-field mine. Experimental studies demonstrate that the proposed approach significantly improves the robustness and accuracy of micro-seismic source location in mines compared with several existing methods, such as the cross-correlation, multi-correlation, STA/LTA and kurtosis methods.
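For orientation, the conventional cross-correlation baseline that this abstract compares against can be sketched as follows. This is not the cross wavelet method the paper proposes; it only shows how a delay is read off the peak of a correlation function. The function name `tdoa_cross_correlation`, the sampling rate, and the synthetic signals are assumptions for illustration.

```python
import numpy as np

def tdoa_cross_correlation(x, y, fs):
    """Estimate the delay of y relative to x, in seconds.

    The delay is the lag at which the cross-correlation of the two
    mean-removed signals is largest. A positive result means that y
    arrives later than x.
    """
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    x = x - x.mean()
    y = y - y.mean()
    # corr[i] corresponds to lag lags[i]; lags run from -(len(x)-1) to len(y)-1
    corr = np.correlate(y, x, mode="full")
    lags = np.arange(-(len(x) - 1), len(y))
    return lags[np.argmax(corr)] / fs

# Synthetic example: y is x delayed by 20 samples at an assumed 1 kHz rate.
rng = np.random.default_rng(0)
fs = 1000.0
x = rng.standard_normal(500)
y = np.concatenate([np.zeros(20), x])[:500]
delay = tdoa_cross_correlation(x, y, fs)
```

With clean signals the peak is unambiguous. The abstract's point is that this single global peak becomes unreliable once noise lowers the overall similarity of the two recordings.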

B-Corr Model for Bot Group Activity Detection Based on Network Flows Traffic Analysis

  • Hostiadi, Dandy Pramana;Wibisono, Waskitho;Ahmad, Tohari
    • KSII Transactions on Internet and Information Systems (TIIS) / v.14 no.10 / pp.4176-4197 / 2020
  • A botnet is a dangerous type of malware. A botnet attack in which a collection of bots attacks a similar target with a similar activity pattern is called bot group activity. Existing intrusion detection models can detect single bot activities but cannot detect the behavioral relations between bots in a bot group attack. Detecting bot group activities could help network administrators isolate the activity or access of a bot group attack and determine the relations between bots by measuring their correlation. This paper proposes a new model, called the B-Corr model, that measures the similarity between bot activities using an intersection-probability concept to define bot group activities. The B-Corr model consists of several stages: feature extraction from bot activity flows, measurement of the intersections between bots, and production of similarity values. The model groups similar bots that share a similar target in order to identify bot group activities. To give a more comprehensive view, it visualizes the similarity values between bots as a similar-bot graph. Extensive experiments on real botnet datasets show high detection accuracy in various scenarios.
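The abstract does not give the B-Corr formula, so the sketch below swaps in a plain Jaccard overlap as a stand-in, only to show the general shape of an intersection-based similarity between bots. The names `jaccard` and `similar_bot_pairs`, the threshold, and the representation of each bot as a set of contacted targets are all hypothetical.

```python
from itertools import combinations

def jaccard(a, b):
    """Share of targets common to both sets (stand-in, not the B-Corr formula)."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0

def similar_bot_pairs(targets_by_bot, threshold=0.5):
    """Return the pairs of bots whose target sets overlap by at least `threshold`.

    `targets_by_bot` maps a bot identifier to the set of targets seen in
    its flows. The result can be read as the edge list of a similarity graph.
    """
    pairs = {}
    for bot1, bot2 in combinations(sorted(targets_by_bot), 2):
        score = jaccard(targets_by_bot[bot1], targets_by_bot[bot2])
        if score >= threshold:
            pairs[(bot1, bot2)] = score
    return pairs

# Hypothetical flows: two bots share most targets, a third does not.
flows = {
    "bot_a": {"10.0.0.1:80", "10.0.0.2:80"},
    "bot_b": {"10.0.0.1:80", "10.0.0.2:80", "10.0.0.3:80"},
    "bot_c": {"10.0.0.9:22"},
}
edges = similar_bot_pairs(flows)
```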

Plagiarism Detection among Source Codes using Adaptive Methods

  • Lee, Yun-Jung;Lim, Jin-Su;Ji, Jeong-Hoon;Cho, Hwaun-Gue;Woo, Gyun
    • KSII Transactions on Internet and Information Systems (TIIS) / v.6 no.6 / pp.1627-1648 / 2012
  • We propose an adaptive method for detecting plagiarized pairs in a large set of source code. The method is adaptive in that it uses an adaptive algorithm and provides an adaptive threshold for determining plagiarism. Conventional algorithms are based on greedy string tiling or on local alignments of two code strings. However, most of them are not adaptive: they do not consider the characteristics of the program set, which causes problems for a program set in which all the programs are inherently similar. We propose adaptive local alignment, a variant of local alignment that uses an adaptive similarity matrix. Each entry of this matrix is the logarithm of the probabilities of the keywords, based on their frequency in a given program set. We also propose an adaptive threshold based on the local outlier factor (LOF), which represents the likelihood of an entity being an outlier. Experimental results indicate that our method is more sensitive than JPlag, which uses greedy string tiling, in detecting plagiarism-suspected code pairs. Further, the adaptive threshold based on the LOF is shown to be effective: the detection performance shows high sensitivity with negligible loss of specificity compared with a fixed threshold.
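The idea of a frequency-dependent similarity matrix can be sketched with a small Smith-Waterman-style local alignment in which a matching token scores the negative log of its relative frequency in the program set, so rare tokens count for more than ubiquitous ones. This is an illustration under stated assumptions, not the paper's algorithm: the function names, the match score, and the mismatch and gap penalties are all hypothetical.

```python
import math
from collections import Counter

def token_weights(programs):
    """Weight each token by -log of its relative frequency in the program set."""
    counts = Counter(tok for prog in programs for tok in prog)
    total = sum(counts.values())
    return {tok: -math.log(c / total) for tok, c in counts.items()}

def local_alignment_score(a, b, weights, mismatch=-1.0, gap=-1.0):
    """Best local alignment score of token lists a and b (Smith-Waterman style).

    A matching token adds its weight; mismatches and gaps subtract
    fixed penalties (assumed values). Scores never drop below zero,
    so only the best-matching local region contributes.
    """
    prev = [0.0] * (len(b) + 1)
    best = 0.0
    for x in a:
        cur = [0.0]
        for j, y in enumerate(b, start=1):
            step = weights.get(x, 0.0) if x == y else mismatch
            score = max(0.0, prev[j - 1] + step, prev[j] + gap, cur[j - 1] + gap)
            cur.append(score)
            best = max(best, score)
        prev = cur
    return best

# Toy program set: "for" appears everywhere, "x" only once.
programs = [["for", "i", "x"], ["for", "i", "y"], ["for", "j", "z"]]
w = token_weights(programs)
common = local_alignment_score(["for"], ["for"], w)
rare = local_alignment_score(["x"], ["x"], w)
```

In this toy set, sharing the rare token `x` yields a higher score than sharing `for`, which is the behaviour one would want when all programs in a set are inherently similar.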

The Relationship between Other Customer Perception and Experience with Role of Interpersonal Mindfulness in Brand Distribution

  • Linh Thi Dieu NGUYEN;Anh Thuy TRINH
    • Journal of Distribution Science / v.21 no.6 / pp.69-81 / 2023
  • Purpose: This study investigates the moderating effect of interpersonal mindfulness (IM) on the link between brand experience (BE) and three key factors of other customer perception (OCP), namely perceived similarity (OPS), physical appearance (OPA), and suitable behavior (OSB), in the context of brand distribution. Research design, data, and methodology: Data were collected from 612 consumers at shopping malls. SmartPLS 3.3.9 software was used to assess the measurement model and the structural model. Results: According to the findings, IM negatively moderates the relationships between BE and OPS, OPA, and OSB. The findings also show how OCP can affect a person's brand experience. Conclusions: OCP and its interaction with IM have a significant influence on brand experience in brand distribution. The results show that IM functions as a moderator between brand experience and perceived similarity, physical appearance, and behavior regarded as suitable by other consumers, and thereby affects brand distribution. The findings provide a foundation for further IM research and add to existing brand distribution theory. They also have managerial implications for brand distribution.

Similarity Measurement Between Titles and Abstracts Using Bijection Mapping and Phi-Correlation Coefficient

  • John N. Mlyahilu;Jong-Nam Kim
    • Journal of the Institute of Convergence Signal Processing / v.23 no.3 / pp.143-149 / 2022
  • This paper presents a quantitative measure of the relationship between a research title and its abstract, using journal articles indexed in the Korean Citation Index (KCI) database. We propose a machine learning-based similarity metric that does not assume normality of the dataset and that addresses the imbalanced dataset problem and the zero-variance problem, both of which affect most rule-based algorithms. The algorithm avoids the limitations of the Pearson correlation coefficient (r). A total of 107 journal articles collected from the database were used to build a corpus containing the authors, year of publication, title, and abstract of each article. In the experiments, the proposed algorithm achieved higher correlation coefficient values than cosine similarity, Euclidean distance, and the Pearson correlation coefficient, reaching a maximum correlation of 1, whereas the other measures returned not-a-number (NaN) values in some experiments. From these results, we conclude that an effective title must have a high correlation coefficient with its abstract.
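As background for the phi-correlation coefficient named in the title, the standard textbook definition for two binary vectors can be sketched as below. How the paper builds its vectors through bijection mapping and how it avoids the zero-variance case are not described in the abstract, so neither is reproduced here; the function name `phi_coefficient` and the word-presence example are assumptions.

```python
import math

def phi_coefficient(x, y):
    """Phi coefficient of two equal-length binary sequences.

    Uses the 2x2 contingency counts: n11 (both 1), n00 (both 0),
    n10 (x=1, y=0), and n01 (x=0, y=1). Returns NaN when either sequence
    is constant, which is the zero-variance case the abstract mentions.
    """
    if len(x) != len(y):
        raise ValueError("sequences must have the same length")
    n11 = sum(1 for a, b in zip(x, y) if a == 1 and b == 1)
    n00 = sum(1 for a, b in zip(x, y) if a == 0 and b == 0)
    n10 = sum(1 for a, b in zip(x, y) if a == 1 and b == 0)
    n01 = sum(1 for a, b in zip(x, y) if a == 0 and b == 1)
    denom = math.sqrt((n11 + n10) * (n01 + n00) * (n11 + n01) * (n10 + n00))
    if denom == 0.0:
        return math.nan  # undefined: one sequence has no variance
    return (n11 * n00 - n10 * n01) / denom

# Hypothetical encoding: 1 if a vocabulary word occurs in the text, else 0.
title_bits = [1, 1, 0, 0]
abstract_bits = [1, 1, 0, 0]
score = phi_coefficient(title_bits, abstract_bits)
```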

Water-Simulant Facility Installation for the Sodium-Cooled Fast Reactor KALIMER-600 and Global Flow Measurement (소듐냉각고속로 KALIMER-600 축소 물모의 열유동 가시화 실험장치 구축 및 거시 유동장 특성 측정)

  • Cha, Jae-Eun;Kim, Seong-O
    • Journal of the Korean Society of Visualization / v.9 no.4 / pp.54-62 / 2011
  • KAERI has developed KALIMER-600, a pool-type sodium-cooled fast reactor (SFR) with an electric generation capacity of 600 MWe. One of the main topics in SFR development is enhancing the safety of the reactor system. We therefore have a long-term plan to design a large sodium experimental facility to evaluate reactor safety and component performance. To extrapolate the thermal-hydraulic phenomena in a large sodium reactor, these phenomena are being investigated in a 1/10-scale water-simulant facility for KALIMER-600. In this paper, we briefly describe the experimental facility setup and the measurement of the isothermal global flow behavior. For the flow field measurement, the PIV method was used in a transparent Plexiglas reactor vessel model with water at around $20^{\circ}C$.

Estimation of Pollution Degree for Liveline Insulator with Leakage Current Measurement (누설전류 측정을 통한 활선 절연물의 오손도 추정)

  • Shim, Kyu-Il;Choi, Nam-Ho;Park, Kang-Sik;Han, Sang-Ok
    • Proceedings of the KIEE Conference / 2001.07c / pp.1472-1474 / 2001
  • This paper presents a method for estimating the contamination degree of an outdoor insulator by measuring its surface leakage current. Contamination is one of the most important factors determining the performance of an insulator, so it is very important to examine the contamination degree of outdoor insulators. Conventional methods have many limitations, such as the reliability of the data, the interval between measurements, and the similarity of environmental conditions. An estimation technique is therefore needed to monitor the pollution degree of an insulator accurately in situ. In this investigation, the phase difference was measured, and its variation was compared with the contamination degree and the relative humidity. The results confirm the capability of the estimation method.

Measurement of Flow Field in the Pebble Bed Type High Temperature Gas-cooled Reactor (페블 베드 타입 고온 가스 냉각 원자로 내부 유동장 측정)

  • Lee, Sa-Ya;Lee, Jae-Young
    • Proceedings of the KSME Conference / 2008.11b / pp.2088-2093 / 2008
  • In this study, the flow field of a Pebble Bed Reactor (PBR), a type of High Temperature Gas-cooled Reactor (HTGR), was measured. The large number of pebbles in the PBR core forms a complicated flow channel. Because of this complicated geometry, numerical analysis has been pursued far more intensively than experimental observation. However, validating computational simulation through experimental study is crucial for developing a sound design analysis method. In the present study, a wind tunnel containing stacked pebbles was constructed and equipped with Particle Image Velocimetry (PIV). The system was scaled up according to similarity so that it could be operated at room temperature. The PIV observations revealed stagnation points and low-speed regions, so that suspected high-temperature regions can be identified. With further supplementary experimental work, the present system may produce valuable data for validating Computational Fluid Dynamics (CFD) simulation methods.
