• 제목/요약/키워드: Similarity Measurement

검색결과 352건 처리시간 0.026초

벡터 공간 모델과 HAL에 기초한 단어 의미 유사성 군집 (Word Sense Similarity Clustering Based on Vector Space Model and HAL)

  • 김동성
    • 인지과학
    • /
    • 제23권3호
    • /
    • pp.295-322
    • /
    • 2012
  • 본 연구에서는 벡터 공간 모델과 HAL (Hyperspace Analog to Language)을 적용해서 단어 의미 유사성을 군집한다. 일정한 크기의 문맥을 통해서 단어 간의 상관성을 측정하는 HAL을 도입하고(Lund and Burgess 1996), 상관성 측정에서 고빈도와 저빈도에 다르게 측정되는 왜곡을 줄이기 위해서 벡터 공간 모델을 적용해서 단어 쌍의 코사인 유사도를 측정하였다(Salton et al. 1975, Widdows 2004). HAL과 벡터 공간 모델로 만들어지는 공간은 다차원이므로, 차원을 축소하기 위해서 PCA (Principal Component Analysis)와 SVD (Singular Value Decomposition)를 적용하였다. 유사성 군집을 위해서 비감독 방식과 감독 방식을 적용하였는데, 비감독 방식에는 클러스터링을 감독 방식에는 SVM (Support Vector Machine), 나이브 베이즈 구분자(Naive Bayes Classifier), 최대 엔트로피(Maximum Entropy) 방식을 적용하였다. 이 연구는 언어학적 측면에서 Harris (1954), Firth (1957)의 분포 가설(Distributional Hypothesis)을 활용한 의미 유사도를 측정하였으며, 심리언어학적 측면에서 의미 기억을 설명하기 위한 모델로 벡터 공간 모델과 HAL을 결합하였으며, 전산적 언어 처리 관점에서 기계학습 방식 중 감독 기반과 비감독 기반을 적용하였다.

  • PDF

터보펌프용 터빈 공기상사 성능시험 (Air Similarity Performance Test of Turbopump Turbine)

  • 임병준;홍창욱;김진한
    • 한국추진공학회지
    • /
    • 제10권2호
    • /
    • pp.39-45
    • /
    • 2006
  • 로켓 엔진 터보펌프용 터빈은 고온, 고압의 연소가스를 사용하기 때문에 실제 환경에서 성능시험을 수행하기가 매우 어렵다. 따라서 대부분의 경우, 시험에 따르는 위험을 줄이기 위하여 공기를 사용한 시험을 통하여 성능을 평가한다. 본 논문에서는 10 톤급 액체로켓엔진 터보펌프용 터빈에 대한 공기 상사 성능시험에 대하여 기술하였다. 터빈의 공기역학적인 성능을 평가하기 위한 성능시험설비를 구성하였으며, 성능시험설비는 고압공기 공급시스템, 유량측정용 노즐, 시험부, 동력계. 압력조절을 위한 출구 오리피스 그리고 측정 및 제어 시스템으로 구성된다. 본 논문에서는 터빈성능 시험을 위한 상사시험 조건을 결정하는 방법과 시험조건을 조절하는 방법에 대하여 기술하였다. 시험결과, 측정 변수들의 상대 표준오차는 1%이내였으며 측정된 터빈 효율은 해석결과와 2% 이내로 일치하였다.

Micro-seismic monitoring in mines based on cross wavelet transform

  • Huang, Linqi;Hao, Hong;Li, Xibing;Li, Jun
    • Earthquakes and Structures
    • /
    • 제11권6호
    • /
    • pp.1143-1164
    • /
    • 2016
  • Time Delay of Arrival (TDOA) estimation methods based on correlation function analysis play an important role in the micro-seismic event monitoring. It makes full use of the similarity in the recorded signals that are from the same source. However, those methods are subjected to the noise effect, particularly when the global similarity of the signals is low. This paper proposes a new approach for micro-seismic monitoring based on cross wavelet transform. The cross wavelet transform is utilized to analyse the measured signals under micro-seismic events, and the cross wavelet power spectrum is used to measure the similarity of two signals in a multi-scale dimension and subsequently identify TDOA. The offset time instant associated with the maximum cross wavelet transform spectrum power is identified as TDOA, and then the location of micro-seismic event can be identified. Individual and statistical identification tests are performed with measurement data from an in-field mine. Experimental studies demonstrate that the proposed approach significantly improves the robustness and accuracy of micro-seismic source locating in mines compared to several existing methods, such as the cross-correlation, multi-correlation, STA/LTA and Kurtosis methods.

B-Corr Model for Bot Group Activity Detection Based on Network Flows Traffic Analysis

  • Hostiadi, Dandy Pramana;Wibisono, Waskitho;Ahmad, Tohari
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • 제14권10호
    • /
    • pp.4176-4197
    • /
    • 2020
  • Botnet is a type of dangerous malware. Botnet attack with a collection of bots attacking a similar target and activity pattern is called bot group activities. The detection of bot group activities using intrusion detection models can only detect single bot activities but cannot detect bots' behavioral relation on bot group attack. Detection of bot group activities could help network administrators isolate an activity or access a bot group attacks and determine the relations between bots that can measure the correlation. This paper proposed a new model to measure the similarity between bot activities using the intersections-probability concept to define bot group activities called as B-Corr Model. The B-Corr model consisted of several stages, such as extraction feature from bot activity flows, measurement of intersections between bots, and similarity value production. B-Corr model categorizes similar bots with a similar target to specify bot group activities. To achieve a more comprehensive view, the B-Corr model visualizes the similarity values between bots in the form of a similar bot graph. Furthermore, extensive experiments have been conducted using real botnet datasets with high detection accuracy in various scenarios.

Plagiarism Detection among Source Codes using Adaptive Methods

  • Lee, Yun-Jung;Lim, Jin-Su;Ji, Jeong-Hoon;Cho, Hwaun-Gue;Woo, Gyun
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • 제6권6호
    • /
    • pp.1627-1648
    • /
    • 2012
  • We propose an adaptive method for detecting plagiarized pairs from a large set of source code. This method is adaptive in that it uses an adaptive algorithm and it provides an adaptive threshold for determining plagiarism. Conventional algorithms are based on greedy string tiling or on local alignments of two code strings. However, most of them are not adaptive; they do not consider the characteristics of the program set, thereby causing a problem for a program set in which all the programs are inherently similar. We propose adaptive local alignment-a variant of local alignment that uses an adaptive similarity matrix. Each entry of this matrix is the logarithm of the probabilities of the keywords based on their frequency in a given program set. We also propose an adaptive threshold based on the local outlier factor (LOF), which represents the likelihood of an entity being an outlier. Experimental results indicate that our method is more sensitive than JPlag, which uses greedy string tiling for detecting plagiarism-suspected code pairs. Further, the adaptive threshold based on the LOF is shown to be effective, and the detection performance shows high sensitivity with negligible loss of specificity, compared with that using a fixed threshold.

The Relationship between Other Customer Perception and Experience with Role of Interpersonal Mindfulness in Brand Distribution

  • Linh Thi Dieu NGUYEN;Anh Thuy TRINH
    • 유통과학연구
    • /
    • 제21권6호
    • /
    • pp.69-81
    • /
    • 2023
  • Purpose: The study investigates the moderating impact of interpersonal mindfulness (IM) on the link between perceived similarity (OPS), physical appearance (OPA), and suitable behavior (OSB) - three key factors of other consumer perception (OCP) and brand experience (BE) in distribution of OCP and brand. Research design, data, and methodology: This study collected data from 612 consumers at shopping malls. SmartPLS 3.3.9 software were used to assess the measurement model and structural model. Results: According to the study's findings, IM has a negative modality in the impact between BE and OPS, OPA, and OSB. That also demonstrates how distribution of OCP and brand can affect a person's brand experience. Conclusions: The distribution of OCP and IM interactions have a significant influence on the brand experience in brand distribution. The study's results show that IM including mindfulness will function as a moderator between perceived similarity, physical appearance, suitable behavior regarded proper by other consumers, and brand experiences; therefore, they impact to brand distribution. The findings give a foundation for further IM research and add to the brand distribution theory that already exists. The findings also have some managerial implications in brand distribution.

Similarity Measurement Between Titles and Abstracts Using Bijection Mapping and Phi-Correlation Coefficient

  • John N. Mlyahilu;Jong-Nam Kim
    • 융합신호처리학회논문지
    • /
    • 제23권3호
    • /
    • pp.143-149
    • /
    • 2022
  • This excerpt delineates a quantitative measure of relationship between a research title and its respective abstract extracted from different journal articles documented through a Korean Citation Index (KCI) database published through various journals. In this paper, we propose a machine learning-based similarity metric that does not assume normality on dataset, realizes the imbalanced dataset problem, and zero-variance problem that affects most of the rule-based algorithms. The advantage of using this algorithm is that, it eliminates the limitations experienced by Pearson correlation coefficient (r) and additionally, it solves imbalanced dataset problem. A total of 107 journal articles collected from the database were used to develop a corpus with authors, year of publication, title, and an abstract per each. Based on the experimental results, the proposed algorithm achieved high correlation coefficient values compared to others which are cosine similarity, euclidean, and pearson correlation coefficients by scoring a maximum correlation of 1, whereas others had obtained non-a-number value to some experiments. With these results, we found that an effective title must have high correlation coefficient with the respective abstract.

소듐냉각고속로 KALIMER-600 축소 물모의 열유동 가시화 실험장치 구축 및 거시 유동장 특성 측정 (Water-Simulant Facility Installation for the Sodium-Cooled Fast Reactor KALIMER-600 and Global Flow Measurement)

  • 차재은;김성오
    • 한국가시화정보학회지
    • /
    • 제9권4호
    • /
    • pp.54-62
    • /
    • 2011
  • KAERI has developed a KALIMER-600 which is a pool-type sodium-cooled fast reactor with a 600MWe electric generation capacity. For a SFR development, one of the main topics is an enhancement of the reactor system safety. Therefore, we have a long-term plan to design the large sodium experimental facility to evaluate the reactor safety and component performance. In order to extrapolate a thermal hydraulic phenomena in a large sodium reactor, the thermal hydraulics phenomena is under investigation in a 1/$10^{th}$ water-simulant facility for the KALIMER-600. In this paper, we shortly described the experimental facility setup and the measurement of the isothermal global flow behavior. For the flow field measurement, the PIV method was used in a transparent Plexiglas reactor vessel model at around $20^{\circ}C$ water condition.

누설전류 측정을 통한 활선 절연물의 오손도 추정 (Estimation of Pollution Degree for Liveline Insulator with Leakage Current Measurement)

  • 심규일;최남호;박강식;한상옥
    • 대한전기학회:학술대회논문집
    • /
    • 대한전기학회 2001년도 하계학술대회 논문집 C
    • /
    • pp.1472-1474
    • /
    • 2001
  • In this paper, a method was presented to estimate the contamination degree of outdoor insulator by the measurement of surface leakage current. Contamination is one of the most important factor to determine the performance of insulator. Thus, it is very important to exam the contamination degree on the outdoor insulator. There are many limits, such as reliability of data, interval of measurement and similarity of environmental conditions, in conventional method. So, the estimation technique for contamination has been needed to monitor the accurate pollution degree of insulator in situ. In this investigation, phase difference was measured to compare the variance of phase difference with the contamination degree and relative humidity. From the result, we could confirm the capability of the estimation method.

  • PDF

페블 베드 타입 고온 가스 냉각 원자로 내부 유동장 측정 (Measurement of Flow Field in the Pebble Bed Type High Temperature Gas-cooled Reactor)

  • 이사야;이재영
    • 대한기계학회:학술대회논문집
    • /
    • 대한기계학회 2008년도 추계학술대회B
    • /
    • pp.2088-2093
    • /
    • 2008
  • In this study, flow field measurement of the Pebble Bed Reactor(PBR) for the High Temperature Gas-cooled Reactor(HTGR) was performed. Large number of pebbles in the core of PBR provides complicated flow channel. Due to the complicated geometries, numerical analysis has been intensively made rather than experimental observation. However, the justification of computational simulation by the experimental study is crucial to develop solid analysis of design method. In the present study, a wind tunnel installed with pebbles stacked was constructed and equipped with the Particle Image Velocimetry(PIV). We designed the system scaled up to realize the room temperature condition according to the similarity. The PIV observation gave us stagnation points, low speed region so that the suspected high temperature region can be identified. With the further supplementary experimental works, the present system may produce valuable data to justify the Computational Fluid Dynamics(CFD) simulation method.

  • PDF