• Title/Abstract/Keywords: Distance measures

697 search results (processing time: 0.027 s)

Semantic Similarity Measures Between Words within a Document using WordNet

  • 강석훈;박종민
    • 한국산학기술학회논문지 / Vol. 16, No. 11 / pp.7718-7728 / 2015
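  • Semantic similarity between words can be applied in many fields, for example computational linguistics, artificial intelligence, and information processing. In this paper, we propose a method that applies in-document word weights to measure the semantic similarity between words. The method considers edge distance and depth in WordNet, and computes the semantic similarity between words based on information within the document. The in-document information consists of word frequency and word-sense frequency. For each word in the document, a weight is computed from its word frequency and sense frequency. The proposed method is a similarity measure that combines three factors: the distance between words, their depth, and the in-document word weight. We compared its performance with existing methods through experiments, and it improved on them. The method also makes it possible to compute word weights separately for each document. Existing methods based simply on the shortest path, and those that also consider depth, either failed to express the characteristics of the information properly or failed to fuse other information properly. In this paper, we considered the shortest path, the depth, and the information content of words within the document, and showed improved performance.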
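
The abstract above does not give the paper's formula, so the following is only a minimal sketch of the three ingredients it names (path distance, depth, and an in-document word weight). The taxonomy, the weights, and the `weighted_similarity` combination are all hypothetical; the depth-based score used here is the well-known Wu-Palmer measure, not necessarily the authors' measure.

```python
# Toy is-a taxonomy (hypothetical, NOT real WordNet data): child -> parent.
PARENT = {
    "entity": None,
    "animal": "entity",
    "vehicle": "entity",
    "dog": "animal",
    "cat": "animal",
    "car": "vehicle",
}


def ancestors(node):
    """Return the path from node up to the root, node first."""
    path = []
    while node is not None:
        path.append(node)
        node = PARENT[node]
    return path


def depth(node):
    """Depth of a node, counting the root as depth 1."""
    return len(ancestors(node))


def lowest_common_ancestor(a, b):
    """Deepest node that is an ancestor of both a and b."""
    seen = set(ancestors(a))
    for node in ancestors(b):
        if node in seen:
            return node
    raise ValueError("nodes share no ancestor")


def path_length(a, b):
    """Number of edges on the shortest path between a and b in the tree."""
    lca = lowest_common_ancestor(a, b)
    return (depth(a) - depth(lca)) + (depth(b) - depth(lca))


def wu_palmer(a, b):
    """Wu-Palmer similarity: 2 * depth(lca) / (depth(a) + depth(b))."""
    lca = lowest_common_ancestor(a, b)
    return 2.0 * depth(lca) / (depth(a) + depth(b))


def weighted_similarity(a, b, weights):
    """Hypothetical combination (an assumption, not the paper's formula):
    scale the depth-based score by the mean in-document weight of the words."""
    return wu_palmer(a, b) * (weights[a] + weights[b]) / 2.0
```

In this toy tree, `dog` and `cat` are two edges apart and share the ancestor `animal`, so they score higher than `dog` and `car`, which only meet at the root.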

Analysis of Movement Time and Trunk Motions According to Target Distances and Use of Sound and Affected Side During Upper Limb Reaching Task in Patients With Hemiplegia

  • 김기송;유환석;정도헌;전혜선
    • 한국전문물리치료학회지 / Vol. 17, No. 1 / pp.36-42 / 2010
  • The aim of this study was to investigate the effects of reaching distance on movement time and trunk kinematics in hemiplegic patients. Eight hemiplegic patients participated in this study. The independent variables were side (sound side vs. affected side) and target distance (70%, 90%, 110%, and 130% of upper limb length). The dependent variables were movement time, measured by a pressure switch, and trunk kinematics, measured by a motion analysis device. Two-way repeated-measures analysis of variance was used with a Bonferroni post-hoc test. (1) There were significant main effects of side and reaching distance on movement time (p=.01, p=.02). The post-hoc test revealed a significant difference between 110% and 130% of reaching distance (p=.01). (2) There were significant main effects of side and reaching distance on trunk flexion (p=.01, p=.00). The post-hoc test revealed significant differences in all pairwise comparisons of reaching distance. (3) There was a significant side by target distance interaction for trunk rotation (p=.04), and a significant main effect of target distance (p=.00). The post-hoc test revealed significant differences between 70% and 110%, 70% and 130%, 90% and 110%, and 90% and 130% of target distance. These findings indicate that hemiplegic patients use trunk flexion more than trunk rotation during reaching tasks. It is also recommended that, in patients with hemiplegia, reaching training limit trunk movement for targets within 90% of target distance and incorporate trunk movement for targets beyond 90%.

Literature Review on One-Handed Manual Material Handling

  • 모승민;곽종선;정명철
    • 대한인간공학회지 / Vol. 29, No. 5 / pp.819-829 / 2010
  • By reviewing thirty-seven previous studies on manual material handling (MMH), this paper analyzed the guidelines and main factors of one-handed tasks. The previous studies addressed the following main factors: distance, weight, frequency, posture, gender, age, training, direction of force, height of the exerted force, and object shape and size. Based on these factors, the criteria used to understand one-handed tasks were objective measures (maximum strength, reaction force, etc.), psychophysical measures (maximum acceptable frequency and weight, etc.), and physiological measures (oxygen uptake, heart rate, electromyography, etc.). An allowance threshold model of quantitative and objective fatigue and workload is suggested for future research. This study is expected to help establish Korean recommendations for one-handed tasks.

A Study on Decision Tree for Multiple Binary Responses

  • Lee, Seong-Keon
    • Communications for Statistical Applications and Methods / Vol. 10, No. 3 / pp.971-980 / 2003
  • The tree method can be extended to multivariate responses, such as repeated-measures and longitudinal data, by modifying the split function to accommodate multiple responses. Decision trees for multiple responses have been constructed by Segal (1992) and Zhang (1998). Segal suggested a tree that can analyze continuous longitudinal responses using the Mahalanobis distance as a within-node homogeneity measure, and Zhang suggested a tree that can analyze multiple binary responses using a generalized entropy criterion, which is proportional to the maximum likelihood of the joint distribution of the multiple binary responses. In this paper, we modify the CART procedure and suggest a new tree-based method that can analyze multiple binary responses using similarity measures.
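
As a rough illustration of the Mahalanobis-type within-node homogeneity measure attributed to Segal above, the sketch below sums squared Mahalanobis distances from each response vector in a node to the node mean. It is restricted to two-dimensional responses, and the function names and the choice to pass the covariance matrix in as an argument are assumptions made for the example, not details taken from either paper.

```python
def mean_vector(rows):
    """Component-wise mean of a list of equal-length response vectors."""
    n = len(rows)
    return [sum(r[j] for r in rows) / n for j in range(len(rows[0]))]


def invert_2x2(m):
    """Inverse of a 2x2 matrix given as [[a, b], [c, d]]."""
    (a, b), (c, d) = m
    det = a * d - b * c
    if det == 0:
        raise ValueError("covariance matrix is singular")
    return [[d / det, -b / det], [-c / det, a / det]]


def mahalanobis_sq(x, mu, cov_inv):
    """Squared Mahalanobis distance (x - mu)^T cov_inv (x - mu) in 2-D."""
    diff = [xi - mi for xi, mi in zip(x, mu)]
    return sum(diff[i] * cov_inv[i][j] * diff[j]
               for i in range(2) for j in range(2))


def node_impurity(rows, cov):
    """Sum of squared Mahalanobis distances to the node mean.
    Smaller values mean a more homogeneous node."""
    mu = mean_vector(rows)
    cov_inv = invert_2x2(cov)
    return sum(mahalanobis_sq(r, mu, cov_inv) for r in rows)
```

A candidate split could then be scored by how much it reduces this quantity, that is, the parent impurity minus the sum of the two child impurities.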

Recovery Levels of Clustering Algorithms Using Different Similarity Measures for Functional Data

  • Chae, Seong San;Kim, Chansoo;Warde, William D.
    • Communications for Statistical Applications and Methods / Vol. 11, No. 2 / pp.369-380 / 2004
  • Clustering algorithms with different similarity measures are commonly used to find an optimal clustering, or one close to the original clustering. The recovery levels obtained with Euclidean distance and with distances transformed from correlation coefficients are evaluated and compared using Rand's (1971) C statistic. The C values indicate how close the resulting clustering is to the original clustering. In the simulation study, the recovery level is improved by applying the correlation coefficients between objects. Using the data set from Spellman et al. (1998), the recovery levels with different similarity measures are also presented. In general, the recovery level of true clusters increased when the correlation coefficients were used.
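
The quantities compared in this abstract can be sketched as follows. Rand's C statistic is computed here as the usual Rand index (the fraction of object pairs on which two clusterings agree). The abstract does not say how correlations were transformed into distances, so `1 - r` is an assumption chosen for illustration.

```python
from itertools import combinations
from math import sqrt


def euclidean(x, y):
    """Euclidean distance between two equal-length profiles."""
    return sqrt(sum((a - b) ** 2 for a, b in zip(x, y)))


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length profiles."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


def correlation_distance(x, y):
    """Assumed transform: distance = 1 - correlation."""
    return 1.0 - pearson(x, y)


def rand_index(labels_a, labels_b):
    """Fraction of object pairs treated the same way by both clusterings
    (together in both, or apart in both)."""
    pairs = list(combinations(range(len(labels_a)), 2))
    agree = sum(
        (labels_a[i] == labels_a[j]) == (labels_b[i] == labels_b[j])
        for i, j in pairs
    )
    return agree / len(pairs)
```

Two curves with the same shape but different scale, such as `[1, 2, 3]` and `[10, 20, 30]`, are far apart in Euclidean distance yet have correlation distance 0, which is one reason correlation-based measures can behave differently on functional data.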

Characteristics of Groundwater Contamination in Uncontrolled Landfill and Pollution Control Measures

  • 구자중;윤석표
    • 한국지반공학회:학술대회논문집 / 한국지반공학회 1993년도 지반.환경 매립에 관한 학술발표회 논문집 / pp.28-44 / 1993
  • Remediation actions at an uncontrolled landfill site should be conducted after investigating the contamination status and the potential health risk or damage. Based on that investigation, proper control measures should be established and operated, followed by continuous monitoring. In this study, the status of groundwater contamination around the Nanji Landfill Site was investigated. Monitoring wells were installed around the landfill, and groundwater was sampled once a month and analyzed. The water quality of each monitoring well differed depending on the horizontal and vertical distance from the landfill, and the leachate characteristics did not change significantly with season because percolating water stayed for a long time in the deep waste layer. It was predicted that the major multivalent cations were mainly precipitated as metal carbonates, and that chemical mass balances (CMBs) could be applied to apportion leachate contamination to the groundwater quality of the areas surrounding the Nanji Landfill. The parameters required to estimate the pollutant flux to a receptor near the landfill were listed, and how to obtain them was discussed. Finally, based on these data, control measures for groundwater contamination were suggested and discussed.

PREVENTION STRATEGIES TO CONTROL AN EPIDEMIC USING A SEIQHRV MODEL

  • Mohit Soni;Rajesh Kumar Sharma;Shivram Sharma
    • 한국수학교육학회지시리즈B:순수및응용수학 / Vol. 31, No. 2 / pp.131-158 / 2024
  • This study investigates the impact of precautionary measures, such as isolating exposed individuals, wearing masks, and maintaining physical distance, on preventing infectious disease. A deterministic SEIQHRV epidemic model is employed for this purpose. The model's positivity and boundedness are established, and its disease-free and endemic equilibrium points are identified. A sensitivity test assesses the impact of preventive measures on the infected classes. Results show that a basic reproduction number less than unity drives disease eradication, while a value greater than unity encourages the adoption of preventive measures.

Semantic Process Retrieval with Similarity Algorithms

  • 이홍주
    • Asia pacific journal of information systems / Vol. 18, No. 1 / pp.79-96 / 2008
  • One of the roles of Semantic Web services is to execute dynamic intra-organizational services, including the integration and interoperation of business processes. Since different organizations design their processes differently, the retrieval of similar semantic business processes is necessary to support inter-organizational collaborations. Most approaches for finding services that have certain features and support certain business processes have relied on some type of logical reasoning and exact matching. This paper presents our approach of using imprecise matching to expand the results from an exact matching engine when querying the OWL (Web Ontology Language) MIT Process Handbook. The MIT Process Handbook is an electronic repository of best-practice business processes. The Handbook is intended to help people (1) redesign organizational processes, (2) invent new processes, and (3) share ideas about organizational practices. In order to use the MIT Process Handbook for process retrieval experiments, we had to export it into an OWL-based format. We model the Process Handbook meta-model in OWL and export the processes in the Handbook as instances of the meta-model. Next, we need to find a sizable number of queries and their corresponding correct answers in the Process Handbook. Many previous studies devised artificial datasets composed of randomly generated numbers without real meaning and used subjective ratings for correct answers and similarity values between processes. To generate a semantics-preserving test data set, we use mutation operators to create 20 variants of each target process that are syntactically different but semantically equivalent. These variants represent the correct answers for the target process. We devise diverse similarity algorithms based on the values of process attributes and the structures of business processes.
We use simple similarity algorithms for text retrieval, such as TF-IDF and Levenshtein edit distance, to devise our approaches, and we use a tree edit distance measure because semantic processes appear to have a graph structure. We also design similarity algorithms that consider the similarity of process structure, such as part process, goal, and exception. Since we can identify the relationships between a semantic process and its subcomponents, this information can be used to calculate similarities between processes. Dice's coefficient and Jaccard similarity measures are used to calculate the portion of overlap between processes in diverse ways. We perform retrieval experiments to compare the performance of the devised similarity algorithms. We measure retrieval performance in terms of precision, recall, and the F-measure, the harmonic mean of precision and recall. The tree edit distance shows the poorest performance on all measures. TF-IDF and the method combining the TF-IDF measure with Levenshtein edit distance perform better than the other devised methods. These two measures focus on the similarity between the names and descriptions of processes. In addition, we calculate a rank correlation coefficient, Kendall's tau-b, between the number of process mutations and the ranking of similarity values among the mutation sets. In this experiment, similarity measures based on process structure, such as Dice's, Jaccard, and derivatives of these measures, show greater coefficients than measures based on the values of process attributes. However, the Lev-TFIDF-JaccardAll measure, which considers process structure and attribute values together, shows reasonably better performance in both experiments. For retrieving semantic processes, it therefore appears better to consider diverse aspects of process similarity, such as process structure and the values of process attributes.
We generate semantic process data and a dataset for retrieval experiments from the MIT Process Handbook repository. We suggest imprecise query algorithms that expand the retrieval results from an exact matching engine such as SPARQL, and compare the retrieval performance of the similarity algorithms. As for limitations and future work, we need to perform experiments with datasets from other domains. Also, since there are many similarity values from diverse measures, we may find better ways to identify relevant processes by applying these values simultaneously.
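
Several of the measures named in this abstract have standard textbook definitions, sketched below: Levenshtein edit distance, the Jaccard and Dice overlap coefficients, and the F-measure as the harmonic mean of precision and recall. This is an illustrative sketch only; how the paper combines them (for example in Lev-TFIDF-JaccardAll) is not described in the abstract and is not reproduced here. All names and sample values are hypothetical.

```python
def levenshtein(a, b):
    """Minimum number of single-character insertions, deletions, and
    substitutions needed to turn string a into string b."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        curr = [i]
        for j, cb in enumerate(b, 1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def jaccard(x, y):
    """|X intersect Y| / |X union Y| for two collections of items."""
    x, y = set(x), set(y)
    if not x and not y:
        return 1.0
    return len(x & y) / len(x | y)


def dice(x, y):
    """2 * |X intersect Y| / (|X| + |Y|) for two collections of items."""
    x, y = set(x), set(y)
    if not x and not y:
        return 1.0
    return 2 * len(x & y) / (len(x) + len(y))


def f_measure(retrieved, relevant):
    """Harmonic mean of precision and recall for one query."""
    retrieved, relevant = set(retrieved), set(relevant)
    hits = len(retrieved & relevant)
    if hits == 0:
        return 0.0
    precision = hits / len(retrieved)
    recall = hits / len(relevant)
    return 2 * precision * recall / (precision + recall)
```

For instance, two processes whose sub-process sets are `{"a", "b", "c"}` and `{"b", "c", "d"}` share two of four distinct parts, so the Jaccard score is lower than the Dice score for the same overlap.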

Face Recognition Based on Weighted Hausdorff Distance for Profile Image

  • 이영학
    • 한국멀티미디어학회논문지 / Vol. 7, No. 4 / pp.474-483 / 2004
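  • In this paper, we propose a recognition algorithm that compares two profile images extracted from 3D frontal face images using a weighted Hausdorff distance (WHD) that reflects depth information. Unlike 2D images, 3D face images carry depth information, so profile images of a person's face can be extracted more accurately and from various face positions. Because the nose is the most protruding feature of the face, the position of the nose tip is found by an iterative selection method that uses the mean of the depth values in the 3D data. Taking this as a reference point, the profile image is extracted by representing the depth values of the vertical components on a 2D plane. To compare the similarity between the input image and the database images, the WHD method uses depth information as weights, and the distance between the two profile images is compared using L1. With the proposed method, the recognition rate within the top five ranks was 94.3%.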
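
For readers unfamiliar with the Hausdorff distance, the sketch below shows the classical definition on 2D point sets using the L1 (city-block) point distance, followed by one possible weighted variant. The abstract does not give the paper's WHD formula, so `weighted_directed` (a weight-averaged nearest-point distance) and the idea of passing one depth-derived weight per point are assumptions for illustration only.

```python
def l1(p, q):
    """L1 (city-block) distance between two points."""
    return sum(abs(a - b) for a, b in zip(p, q))


def directed_hausdorff(a_pts, b_pts):
    """Largest distance from any point of A to its nearest point in B."""
    return max(min(l1(a, b) for b in b_pts) for a in a_pts)


def hausdorff(a_pts, b_pts):
    """Symmetric Hausdorff distance: the larger of the two directed ones."""
    return max(directed_hausdorff(a_pts, b_pts),
               directed_hausdorff(b_pts, a_pts))


def weighted_directed(a_pts, b_pts, weights):
    """Hypothetical weighted variant (an assumption, not the paper's WHD):
    average each point's nearest-neighbour distance, weighted per point,
    e.g. by a value derived from depth."""
    total = sum(w * min(l1(a, b) for b in b_pts)
                for a, w in zip(a_pts, weights))
    return total / sum(weights)
```

With `A = [(0, 0), (1, 0)]` and `B = [(0, 0), (3, 0)]`, the directed distance from A to B is 1 but from B to A it is 2, so the symmetric Hausdorff distance is 2.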

The Development of Industrial Laser Range Finder

  • 배영철;김천석;김이곤;조의주;박종배
    • 한국전자통신학회논문지 / Vol. 2, No. 4 / pp.228-235 / 2007
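  • In the past, laser range finders were used mainly for military purposes, mounted on helicopters or tanks to measure the distance between the target and the projectile. For military use, the measurement range therefore spans from several km to several tens of km, and the target measurement error was around 5 m. Efforts continue to adapt these military laser range finders for industrial use. Accordingly, in this paper we develop a laser range finder suitable for industrial applications and verify its usefulness.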