• Title/Summary/Keyword: 유사도 비

An Adaptive Algorithm for Plagiarism Detection in a Controlled Program Source Set (제한된 프로그램 소스 집합에서 표절 탐색을 위한 적응적 알고리즘)

  • Ji, Jeong-Hoon;Woo, Gyun;Cho, Hwan-Gue
    • Journal of KIISE:Software and Applications / v.33 no.12 / pp.1090-1102 / 2006
  • This paper proposes a new algorithm for detecting plagiarism among a set of source codes that are constrained to be functionally equivalent, such as those submitted for a programming assignment or a programming contest problem. The algorithms most widely used so far are based either on Greedy-String Tiling, which seeks perfect matches of substrings, or on analyzing the similarity between strings through local alignment of the two strings. This paper introduces a new method for detecting similar intervals of the given programs based on an adaptive similarity matrix, each entry of which is the logarithm of the probability of a keyword, derived from keyword frequencies in the given set of programs. We evaluated this method on sets of programs submitted for more than 10 real programming contests. According to the experimental results, the method has several advantages over the previous one, which uses a fixed similarity matrix (+1 for match, -1 for mismatch, -2 for gap), and the adaptive similarity matrix can be used to detect various plagiarism cases.
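
The contrast between fixed and frequency-based scoring in the abstract above can be illustrated with a small sketch. It is not the authors' implementation: the abstract does not give the exact matrix, so the match score used here (the negative base-2 logarithm of a keyword's relative frequency), the fixed mismatch and gap penalties, and the function names `adaptive_scores` and `local_alignment` are all assumptions made for illustration.

```python
import math
from collections import Counter


def adaptive_scores(programs):
    """Assumed scoring: -log2 of each keyword's relative frequency in the set.

    Rare keywords get a high match score, common ones a low score.
    """
    counts = Counter(tok for prog in programs for tok in prog)
    total = sum(counts.values())
    return {tok: -math.log2(c / total) for tok, c in counts.items()}


def local_alignment(a, b, match_score, mismatch=-1.0, gap=-2.0):
    """Best Smith-Waterman local alignment score between token lists a and b.

    match_score is a function mapping a matched token to its score.
    """
    prev = [0.0] * (len(b) + 1)
    best = 0.0
    for x in a:
        cur = [0.0]
        for j, y in enumerate(b, start=1):
            diag = prev[j - 1] + (match_score(x) if x == y else mismatch)
            cell = max(0.0, diag, prev[j] + gap, cur[j - 1] + gap)
            cur.append(cell)
            best = max(best, cell)
        prev = cur
    return best


# Fixed matrix from the abstract: +1 match, -1 mismatch, -2 gap.
fixed = local_alignment(["for", "if", "x"], ["if", "x"], lambda tok: 1.0)

# Adaptive scoring computed from a (hypothetical) set of tokenized programs.
scores = adaptive_scores([["if", "if", "if", "return"], ["if", "while"]])
adaptive = local_alignment(["if", "return"], ["if", "return"], scores.__getitem__)
```

With the fixed matrix every matched keyword counts the same, whereas in this sketch a match on a rare keyword contributes more than a match on one that appears in nearly every submission.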

Measuring Web Page Similarity using Tags (태그를 이용한 웹 페이지간의 유사도 측정 방법)

  • Kang, Sang-Wook;Lee, Ki-Yong;Kim, Hyeon-Gyu;Kim, Myoung-Ho
    • Journal of KIISE:Databases / v.37 no.2 / pp.104-112 / 2010
  • Social bookmarking is one of the most interesting trends in the current web environment. In a social bookmarking system, users annotate a web page with tags, which describe the contents of the page. Numerous studies have been done using this information, mostly on enhancing the quality of web search. In this paper, we use this information to measure the semantic similarity between two web pages. Since web pages consist of various types of multimedia data, it is quite difficult to compare the semantics of two web pages by comparing the actual data contained in the pages. With the help of social bookmarks, this comparison can be performed very effectively. In this paper, we propose a new similarity measure between web pages, called Web Page Similarity Based on Entire Tags (WSET), based on social bookmarks. The experimental results show that the proposed measure yields more satisfactory results than the previous ones.
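
The abstract above does not define WSET, so the following is only a generic baseline for comparing two pages by their tags: cosine similarity over tag-frequency vectors. The function name `tag_cosine` and the sample tags are hypothetical.

```python
import math
from collections import Counter


def tag_cosine(tags_a, tags_b):
    """Cosine similarity between the tag-frequency vectors of two pages.

    This is a generic baseline, not the WSET measure from the paper.
    """
    a, b = Counter(tags_a), Counter(tags_b)
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0  # a page without tags is treated as dissimilar to everything
    return dot / (norm_a * norm_b)


# Tags as they might be collected from several users' bookmarks (made-up data).
page_1 = ["python", "tutorial", "python", "code"]
page_2 = ["python", "code", "reference"]
similarity = tag_cosine(page_1, page_2)
```

Because only the tags are compared, the pages' actual multimedia content never has to be analyzed, which is the advantage the abstract points to.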

Improvement on Similarity Calculation in Collaborative Filtering Recommendation using Demographic Information (인구 통계 정보를 이용한 협업 여과 추천의 유사도 개선 기법)

  • 이용준;이세훈;왕창종
    • Journal of KIISE:Computing Practices and Letters / v.9 no.5 / pp.521-529 / 2003
  • In this paper we present an improved method that uses demographic information to overcome the similarity miscalculation caused by the sparsity problem in collaborative filtering recommendation systems. The similarity between a pair of users is determined only by the ratings given to co-rated items, so items that have not been rated by both users are ignored. To solve this problem, we add virtual neighbors' ratings, derived from the demographic information of neighbors, to improve prediction accuracy. The method is an extension of traditional collaborative filtering methods that use the Pearson correlation coefficient. We used the GroupLens movie rating data in the experiments and compared the proposed method with conventional collaborative filtering methods in terms of mean absolute error (MAE) and receiver operating characteristic (ROC) values. The results show that the proposed method outperforms collaborative filtering methods using the Pearson correlation coefficient by about 9% in MAE and 13% in ROC sensitivity.
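
A minimal sketch of the baseline described in the abstract above: Pearson similarity computed only over co-rated items, plus the MAE metric. It does not include the paper's virtual-neighbor extension, whose details the abstract does not give. The function names are hypothetical, and taking each user's mean over the co-rated items only is an assumption (some formulations use the mean over all of a user's ratings).

```python
import math


def pearson_similarity(ratings_u, ratings_v):
    """Pearson correlation over items rated by both users (dicts: item -> rating)."""
    common = sorted(set(ratings_u) & set(ratings_v))
    if len(common) < 2:
        return 0.0  # too few co-rated items: the sparsity problem
    mean_u = sum(ratings_u[i] for i in common) / len(common)
    mean_v = sum(ratings_v[i] for i in common) / len(common)
    dev_u = [ratings_u[i] - mean_u for i in common]
    dev_v = [ratings_v[i] - mean_v for i in common]
    denom = math.sqrt(sum(x * x for x in dev_u)) * math.sqrt(sum(y * y for y in dev_v))
    if denom == 0.0:
        return 0.0  # one user rated every co-rated item identically
    return sum(x * y for x, y in zip(dev_u, dev_v)) / denom


def mean_absolute_error(predicted, actual):
    """Average absolute difference between predicted and actual ratings."""
    return sum(abs(p - a) for p, a in zip(predicted, actual)) / len(actual)


# Item "d" is rated by only one user, so it does not influence the similarity.
alice = {"a": 1, "b": 2, "c": 3}
bob = {"a": 2, "b": 4, "c": 6, "d": 1}
similarity = pearson_similarity(alice, bob)
```

The early return for fewer than two co-rated items shows where sparse data leaves the similarity undefined, which is the gap the paper's demographic information is meant to fill.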

Changes in benthic macroinvertebrates communities in response to biological mosquito larvae control techniques (생물학적 모기유충 방제기법 적용에 따른 저서성 대형무척추동물 군집 변동)

  • Han, Jung Soo;An, Chae Hui;Choi, Jun Kil;Lee, Hwang Goo
    • Korean Journal of Environmental Biology / v.37 no.4 / pp.600-606 / 2019
  • The study site was the camping area in the Hwarang Amusement Park in Danwon-gu, Ansan-si. Study activities were conducted three times a week from July 20, 2018, to August 1, 2018. A control site, a natural enemy site, and a Bti (Bacillus thuringiensis israelensis) site were selected. The analyses included habitat environment and species composition analyses, community analysis, correlation analysis, and similarity analysis. The water quality analysis found no significant difference in water quality over the study period (p>0.05). A total of 4,818 individuals, 38 species, 22 families, and 11 orders were observed during the study period. The natural enemy site had a species composition similar to that of the control site. The Bti site differed from the other sites in its low number of species and individuals. According to the community analysis, the natural enemy site was a stable community and the Bti site was an unstable community during the study period. Diptera showed negative associations with temperature and water temperature, and mosquito larvae showed significant correlations with temperature and water temperature. The similarity analysis showed 61.11-73.68% similarity between the control site and the natural enemy site, and 30.77-56.00% similarity for the Bti site.

A Study on Estimate of Sediment Yield Using Tank Model in Oship River Mouth of East Coast (Tank 모형을 이용한 동해안 오십천 하구의 유사량 평가에 관한 연구)

  • Kang, Sank-Hyeok;Ok, Yong-Sik;Kim, Sang-Ryul;Ji, Jeong-Hwan
    • Korean Journal of Environmental Agriculture / v.30 no.3 / pp.268-274 / 2011
  • BACKGROUND: Large sediment loads delivered from a watershed cause substantial waterway damage and water quality degradation. Controlling sediment loading requires knowledge of soil erosion and sedimentation. Various factors such as watershed size, slope, climate, and land use may affect sediment delivery processes. Traditionally, sediment delivery ratio prediction equations have been developed by relating watershed characteristics to measured sediment yield divided by predicted gross erosion. However, sediment prediction equations have been developed for only a few regions because of limited sediment data. Moreover, little research has been done on predicting the sediment delivery ratio during the Asian monsoon period in mountainous watersheds. METHODS AND RESULTS: In this study, the Tank model was expanded and applied to estimate sediment yield in the Oship River on the east coast. The rainfall-runoff in 2006 was verified using the Tank model, and good agreement between observed and calculated discharge was obtained for 2009 under the same conditions. With regard to sediment yield, the sediment delivery rate in 2006 was much higher than in 2009 regardless of the method used to estimate sediment load. This was thought to be caused by heavy rainfall due to the typhoon. CONCLUSION(s): Estimating sediment volume from a watershed requires long-term monitoring data on discharge and sediment. This model can be applied to predict discharge and sediment yield simultaneously in ungauged areas. This approach is more effective and less expensive than the traditional method, which requires extensive data collection.

A Study of Document Ranking Algorithms in a P-norm Retrieval System (P-norm 검색의 문헌 순위화 기법에 관한 실험적 연구)

  • 고미영;정영미
    • Journal of the Korean Society for Information Management / v.16 no.1 / pp.7-30 / 1999
  • This study aims to develop effective document ranking algorithms for the P-norm retrieval system, which can be implemented on top of a Boolean retrieval system without major difficulty, by using non-statistical term weights based on document structure. It also aims to enhance performance by introducing a rank adjustment process, which rearranges the ranks of retrieved documents according to the similarity between the top-ranked documents and the rest. Of the non-statistical term weighting algorithms, this study uses field weight and term pair distance weight. For the rank adjustment process, five retrieval experiments were performed, ranging from using one record for the similarity measurement to using the first five records. The results show that non-statistical term weights are highly effective and that the rank adjustment process enhances performance further.
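
For readers unfamiliar with P-norm retrieval, the standard extended Boolean similarity formulas can be sketched as follows. This shows only the general P-norm model, not the field weight or term pair distance weight studied in the paper: the document term weights are simply taken as inputs in the range 0 to 1, and the function names are hypothetical.

```python
def pnorm_or(doc_weights, query_weights, p):
    """P-norm similarity of a document to an OR query.

    doc_weights: document term weights in [0, 1], one per query term.
    query_weights: importance of each query term.
    p: norm parameter, p >= 1 (p = 1 gives a weighted average).
    """
    den = sum(q ** p for q in query_weights)
    if den == 0.0:
        return 0.0
    num = sum((q ** p) * (d ** p) for d, q in zip(doc_weights, query_weights))
    return (num / den) ** (1.0 / p)


def pnorm_and(doc_weights, query_weights, p):
    """P-norm similarity of a document to an AND query."""
    den = sum(q ** p for q in query_weights)
    if den == 0.0:
        return 0.0
    num = sum((q ** p) * ((1.0 - d) ** p) for d, q in zip(doc_weights, query_weights))
    return 1.0 - (num / den) ** (1.0 / p)


# A document containing only the first of two equally weighted query terms.
doc = [1.0, 0.0]
query = [1.0, 1.0]
or_score = pnorm_or(doc, query, 2.0)    # rewards the one matching term
and_score = pnorm_and(doc, query, 2.0)  # penalizes the missing term
```

At p = 1 both operators reduce to the same weighted average, and as p grows they approach strict Boolean OR and AND, which is why the model can sit on top of a Boolean retrieval system while still producing a ranking.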

MRS Pattern Classification Using Fusion Method based on SpPCA and MLP (SpPCA와 MLP에 기반을 둔 응합법칙에 의한 MRS 패턴분류)

  • Song Chang kyu;Lee Dae jong;Jeon Byeong seok;Ryu Jeong woong
    • The Journal of Korean Institute of Communications and Information Sciences / v.30 no.9C / pp.922-929 / 2005
  • In this paper, we propose an MRS pattern classification technique using a fusion scheme based on SpPCA and MLP. The conventional PCA technique for dimension reduction cannot find an optimal transformation matrix if the input data are nonlinear. To overcome this drawback, we extract features with the SpPCA technique, which uses local patterns rather than whole patterns. In the subsequent classification step, individual MLP-based classifiers calculate the similarity to each class for the local features. Finally, MRS patterns are classified by the fusion scheme, which effectively combines the individual results. In simulations conducted to verify its effectiveness, the proposed method showed better classification results than conventional methods.

A Dual Noise-Predictive Partial Response Decision-Feedback Equalizer for Perpendicular Magnetic Recording Channels (수직 자기기록 채널을 위한 쌍 잡음 예측 부분 응답 결정 궤환 등화기)

  • 우중재;조한규;이영일;홍대식
    • The Journal of Korean Institute of Communications and Information Sciences / v.28 no.9C / pp.891-897 / 2003
  • Partial response maximum likelihood (PRML) is a powerful and indispensable detection scheme for perpendicular magnetic recording channels. The performance of PRML can be improved by incorporating a noise prediction scheme into the branch metric computations of the Viterbi algorithm (VA). However, systems built on the VA suffer from high complexity and cost. To address this, a new, simple detection scheme is proposed that exploits the minimum run-length parameter d=1 of the RLL code. The proposed detection scheme has a slicer instead of a Viterbi detector, and a noise predictor as a feedback filter. To improve BER performance, the proposed scheme is then extended to a dual detection scheme. Simulation results show that the proposed scheme achieves performance comparable to that of the noise-predictive maximum likelihood (NPML) detector with less complexity when the partial response (PR) target is (1,2,1).

Pseudo Dynamic Test for the Seismic Performance Enhancement of Circular RC Bridge Piers Retrofitted with Fibers (섬유보강 원형 철근콘크리트 교각의 내진성능 향상에 관한 유사동적 실험)

  • 정영수;박종협;박희상;조창백
    • Journal of the Korea Concrete Institute / v.14 no.2 / pp.180-189 / 2002
  • The objective of this experimental research is to assess the seismic performance of fiber-retrofitted circular RC bridge pier specimens, which were designed as a prototype of the Hagal Bridge in the city of Suwon, Korea. Pseudo dynamic tests were carried out on four test specimens that were nonseismically or seismically designed according to the related provisions of the Korea roadway bridge design specification, and on four nonseismic test specimens retrofitted with fibers in the plastic hinge region. Glass and carbon fiber sheets were used to enhance the seismic capacity of the circular test specimens. The main test parameters were confinement steel ratio, load pattern, and retrofitting. The seismic behavior was analyzed in terms of displacement ductility, energy analysis, and capacity spectrum. A displacement ductility of approximately 7.7 to 8.7 was observed for the nonseismic test specimens retrofitted with fibers and subjected to Korea Highway Corporation artificial earthquake motions. It is concluded that these retrofitted test specimens could have sufficient seismic capacity in moderate seismic zones.

Iterative Low Rank Approximation for Image Denoising (영상 잡음 제거를 위한 반복적 저 계수 근사)

  • Kim, Seehyun
    • Journal of the Korea Institute of Information and Communication Engineering / v.25 no.10 / pp.1317-1322 / 2021
  • Because of the nonlocal similarity of natural images, a patch matrix whose columns are patches similar to a reference patch has a low rank. When an image is corrupted by additive white Gaussian noise (AWGN), its patch matrices have a higher rank. The noise in the image can therefore be reduced by computing low rank approximations of the patch matrices. In this paper, an image denoising algorithm is proposed that first constructs the patch matrices by grouping the patches similar to each reference patch, which is a part of the noisy image. For each patch matrix, the proposed algorithm calculates its low rank approximation, and then recovers the original image by aggregating the low rank estimates. Simulation results on widely accepted test images show that the proposed denoising algorithm outperforms four recent methods.
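
The core idea in the abstract above, that a low rank approximation of a patch matrix suppresses noise, can be illustrated with a small pure-Python sketch. It covers only a rank-1 approximation of a single patch matrix, computed by power iteration. The paper's patch matching, rank selection, iteration scheme, and aggregation step are not described in the abstract and are not reproduced here, and the function name and sample data are hypothetical.

```python
import math


def rank1_approximation(matrix, iterations=100):
    """Rank-1 approximation A v v^T, with v found by power iteration on A^T A.

    Assumes the uniform starting vector is not orthogonal to the dominant
    right singular vector, which holds for patch matrices with similar,
    non-negative columns.
    """
    rows, cols = len(matrix), len(matrix[0])
    v = [1.0 / math.sqrt(cols)] * cols
    for _ in range(iterations):
        u = [sum(matrix[i][j] * v[j] for j in range(cols)) for i in range(rows)]
        w = [sum(matrix[i][j] * u[i] for i in range(rows)) for j in range(cols)]
        norm = math.sqrt(sum(x * x for x in w))
        if norm == 0.0:
            return [[0.0] * cols for _ in range(rows)]
        v = [x / norm for x in w]
    # A v equals sigma * u for the dominant singular triplet (sigma, u, v).
    u = [sum(matrix[i][j] * v[j] for j in range(cols)) for i in range(rows)]
    return [[u[i] * v[j] for j in range(cols)] for i in range(rows)]


def frobenius_distance(a, b):
    """Frobenius norm of the difference between two equally sized matrices."""
    return math.sqrt(sum((x - y) ** 2 for ra, rb in zip(a, b) for x, y in zip(ra, rb)))


# Three identical patches as columns (rank 1), then a noisy copy (made-up values).
clean = [[1.0, 1.0, 1.0], [2.0, 2.0, 2.0], [3.0, 3.0, 3.0]]
noisy = [[1.1, 0.9, 1.0], [2.0, 2.1, 1.9], [2.9, 3.0, 3.1]]
denoised = rank1_approximation(noisy)
```

In this toy case the rank-1 estimate lies closer to the clean matrix than the noisy input does, because the part of the noise outside the dominant singular direction is discarded.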