• Title/Summary/Keyword: Index of similarity

Search Result 667, Processing Time 0.028 seconds

Naturalized Plants and Their Characteristics in Nakdong River Ecological Park in Busan Metropolitan City - Focused on Eulsukdo, Maekdo and Samnak ecological parks - (부산광역시 낙동강 생태공원의 귀화식물상과 특성 - 을숙도생태공원, 맥도생태공원, 삼락생태공원을 중심으로 -)

  • Gwak, Su-Bin;Jeong, Jae-Hyun;You, Ju-Han
    • Journal of the Korean Society of Environmental Restoration Technology
    • /
    • v.24 no.1
    • /
    • pp.81-96
    • /
    • 2021
  • The purpose of this study was conducted in order to provide the necessary basic data, to establish management solutions and to improve biodiversity by calculating similarity index, urbanization index (UI), and disturbed index (DI) to understand current status of naturalized and invasive alien plants in Eulsukdo, Maekdo and Samnak ecological parks in Busan, South Korea. The numbers of naturalized plants identified in these parks were 76 taxa; 20 families, 53 genera, and 76 species. As a result of the similarity index analysis, the most similarity level (83.0%) was obtained at Eulsukdo and Maekdo parks. The numbers of invasive plants identified in the two parks were 11 taxa; Rumex acetosella L., Sicyos angulatus L., Solanum carolinense L., Ambrosia artemisiifolia L., Ambrosia trifida L., Hypochaeris radicata L., Lactuca serriola L., Solidago altissima L., Symphyotrichum pilosum (Willd.) G.L.Nesom, Paspalum distichum L., and Humulus scandens (Lour.) Merr. Overall, UI and DI were 28.6% and 66.7%, respectively, indicating that the ecosystem disruption was serious.

Applying Different Similarity Measures based on Jaccard Index in Collaborative Filtering

  • Lee, Soojung
    • Journal of the Korea Society of Computer and Information
    • /
    • v.26 no.5
    • /
    • pp.47-53
    • /
    • 2021
  • Sparse ratings data hinder reliable similarity computation between users, which degrades the performance of memory-based collaborative filtering techniques for recommender systems. Many works in the literature have been developed for solving this data sparsity problem, where the most simple and representative ones are the methods of utilizing Jaccard index. This index reflects the number of commonly rated items between two users and is mostly integrated into traditional similarity measures to compute similarity more accurately between the users. However, such integration is very straightforward with no consideration of the degree of data sparsity. This study suggests a novel idea of applying different similarity measures depending on the numeric value of Jaccard index between two users. Performance experiments are conducted to obtain optimal values of the parameters used by the proposed method and evaluate it in comparison with other relevant methods. As a result, the proposed demonstrates the best and comparable performance in prediction and recommendation accuracies.

A Space-Efficient Inverted Index Technique using Data Rearrangement for String Similarity Searches (유사도 검색을 위한 데이터 재배열을 이용한 공간 효율적인 역 색인 기법)

  • Im, Manu;Kim, Jongik
    • Journal of KIISE
    • /
    • v.42 no.10
    • /
    • pp.1247-1253
    • /
    • 2015
  • An inverted index structure is widely used for efficient string similarity search. One of the main requirements of similarity search is a fast response time; to this end, most techniques use an in-memory index structure. Since the size of an inverted index structure usually very large, however, it is not practical to assume that an index structure will fit into the main memory. To alleviate this problem, we propose a novel technique that reduces the size of an inverted index. In order to reduce the size of an index, the proposed technique rearranges data strings so that the data strings containing the same q-grams can be placed close to one other. Then, the technique encodes those multiple strings into a range. Through an experimental study using real data sets, we show that our technique significantly reduces the size of an inverted index without sacrificing query processing time.

Vegetation Changes in Forest Restoration Areas in National Parks (국립공원 내 전국 우수 산림생태 복원지역 식생 회복 평가)

  • Jung, Tae-Jun;Kim, Young-Sun;Kim, Young-Jin;Kim, Yeon-Gyeong;Cho, Eun-Suk;Cho, Dong-gil
    • Journal of Environmental Science International
    • /
    • v.31 no.5
    • /
    • pp.389-404
    • /
    • 2022
  • The purpose of this study is to evaluate the vegetation recovery status of Mudeungsan National Park Jungmeorijae, Jeungsimsa district restoration site, and the Shimwon Valley ecological landscape restoration site in Jirisan National Park. Compared to the control plots, the Jungmeorijae restoration site was analyzed to have height growth of about 73.5%, the average species diversity index of about 75.2%. and the average similarity index was recovered to 7.75%. In the case of the restoration site in Jeungsimsa district, the height growth compared to the control plots was about 69.2%, the average species diversity index was about 55.0%. and the average similarity index was recovered to 25.65%. In the case of the Shimwon Valley ecological landscape restoration area, the height growth compared to the control plots was about 32.6%, the average species diversity index about 176.7%. and the average similarity index was recovered to 0.85%. The restoration site of the Jeungsimsa district was planted with relatively large trees during restoration work, and it took a relatively long time(20 years). Also, the site had less limiting factors due to the low elevation, allowing the degree of vegetation recovery to be higher than that of other sites.

Seabed Classification Using the K-L (Karhunen-Lo$\grave{e}$ve) Transform of Chirp Acoustic Profiling Data: An Effective Approach to Geoacoustic Modeling (광역주파수 음향반사자료의 K-L 변환을 이용한 해저면 분류: 지질음향 모델링을 위한 유용한 방법)

  • Chang, Jae-Kyeong;Kim, Han-Joon;Jou, Hyeong-Tae;Suk, Bong-Chool;Park, Gun-Tae;Yoo, Hai-Soo;Yang, Sung-Jin
    • The Sea:JOURNAL OF THE KOREAN SOCIETY OF OCEANOGRAPHY
    • /
    • v.3 no.3
    • /
    • pp.158-164
    • /
    • 1998
  • We introduce a statistical scheme to classify seabed from acoustic profiling data acquired using Chirp sonar system. The classification is based on grouping of signal traces by similarity index, which is computed using the K-L (Karhunen-Lo$\grave{e}$ve) transform of the Chirp profiling data. The similarity index represents the degree of coherence of bottom-reflected signals in consecutive traces, hence indicating the acoustic roughness of the seabed. The results of this study show that similarity index is a function of homogeneity, grain size of sediments and bottom hardness. The similarity index ranges from 0 to 1 for various types of seabed material. It increases in accordance with the homogeneity and softness of bottom sediments, whereas it is inversely proportional to the grain size of sediments. As a real data example, we classified the seabed off Cheju Island, Korea based on the similarity index and compared the result with side-scan sonar data and sediment samples. The comparison shows that the classification of seabed by the similarity index is in good agreement with the real sedimentary facies and can delineate acoustic response of the seabed in more detail. Therefore, this study presents an effective method for geoacoustic modeling to classify the seafloor directly from acoustic data.

  • PDF

Measurement of Document Similarity using Word and Word-Pair Frequencies (단어 및 단어쌍 별 빈도수를 이용한 문서간 유사도 측정)

  • 김혜숙;박상철;김수형
    • Proceedings of the IEEK Conference
    • /
    • 2003.07d
    • /
    • pp.1311-1314
    • /
    • 2003
  • In this paper, we propose a method to measure document similarity. First, we have exploited single-term method that extracts nouns by using a lexical analyzer as a preprocessing step to match one index to one noun. In spite of irrelevance between documents, possibility of increasing document similarity is high with this method. For this reason, a term-phrase method has been reported. This method constructs co-occurrence between two words as an index to measure document similarity. In this paper, we tried another method that combine these two methods to compensate the problems in these two methods. Six types of features are extracted from two input documents, and they are fed into a neural network to calculate the final value of document similarity. Reliability of our method has been proved by an experiment of document retrieval.

  • PDF

Species Diversity Analysis of Mushrooms Collected in Mt. Chiak

  • Lee, Byung-Kook;Kim, Kyoung Su;Eom, Ki-Cheol;Seok, Soon-Ja
    • 한국균학회소식:학술대회논문집
    • /
    • 2014.05a
    • /
    • pp.19-19
    • /
    • 2014
  • This study included the analysis of mushroom data collected from Mt. Chiak in Gangwon-do using various methods. Former studies of Korean mushrooms are limited by regional characters and there is less species diversity among the regions. This study tried to find a way for the forecast of mushroom distribution and appearance by indexes of species diversity. The indexes used in this study include the number of fungi (N), the number of species (S), similarity index (C), richness index (R1, R2), variety index (V1, V2), evenness index (E1, E2, E3, E4, E5), and dominance index (D1) to analyze variety of species diversity. Analyses of data of fungi using a multistage cluster sampling indicate that the average value of C for years was higher than the average value of C for areas. The mushrooms consisted of 208 species in 686 individuals in limited fungal collection from 2002 to 2003. One hundred thirty nine species in 393 individuals were collected in 2002, and 122 species 293 individuals were collected in 2003. The individuals collected in 2003 were smaller than 2002's individuals. Similarity, richness, and variety indexes' values of 2003 were reduced than 2002's values but dominance index of 2003 was increased than 2002's value. Generally the species diversity of the environment to evaluate the index of similarity, richness, and variety was a higher index; dominance index was lower than that of the surrounding environment, suggesting a good diversity. As a result, the occurrence of mushrooms in the surrounding environment and the various factors seem fell in 2002 compared to 2003. The majority genus of the limited fungal collection was Mycena genus in 63 individuals; the majority species was Laccaria laccata in 34 individuals. Ninety three species in 106 individuals were collected by the extended collection and the majority genus of the extended collection was Amanita genus in 17 individuals; the majority species was Amanita citrina (Schaeff.) Pers. which was found in 5 individuals. This demonstrates that periodical similarity's value was 0.159 is higher than special similarity's 0.119. This indicates that the probability of the appearance of same mushrooms in the same area in following year is higher than the probability of the appearance of same mushrooms in the surrounding area in same year. The value of coefficient of variation (CV), in which the amount of change is much or less by N is higher than the CV value by S. CV value of dominance index(D) was the highest r point among other indexes, and evenness index (E) was the lowest point among other indexes. The correlation matrix with 66 combinations between the indexes, the combinations with correlations was 46 combinations. These results revealed that indexes of R1, V2, and E1 were proper to represent species diversity of fungi based on the correlation matrix and the theory of statistical independence which means there is no or less mutual association. This research would contribute to the study about variable living creature by measuring method and in the future this would be used to figure out regulation about fungi with their correlation, values in ecosystem, develop improving new models about agricultural fungi species and numbers by investigating agricultural variable species.

  • PDF

A Novel Network Reduction Method based on Similarity Index between Bus Pairs (모선 간 유사지수에 근거한 새로운 계통축약 기법)

  • Chun, Yeong-Han;Lee, Dong-Su
    • The Transactions of the Korean Institute of Electrical Engineers A
    • /
    • v.55 no.4
    • /
    • pp.156-162
    • /
    • 2006
  • Transmission zones can be defined based on LMPs. Each zone consists of nodes with similar LMPs, and zonal price is determined by average nodal prices in each zone.[1] Network reduction is still important for the analysis of zonal systems under electricity market environments, even though the computing capability of computer system can deal with entire power systems. The Similarity Index is a good performance measure for the network reduction.[2] It can be applied to the network reduction between zones categorized by the nodal prices. This paper deals with a novel network reduction method between zones based on the similarity Index. Line admittances of reduced network were determined by using the least square method. The proposed method was verified by IEEE 39 bus test system.

Analysis of Priority Countries and Products for Indonesian Export Diversification in Latin America

  • Ramana, Febria;Retnosari, Lili
    • The Journal of Industrial Distribution & Business
    • /
    • v.9 no.8
    • /
    • pp.17-26
    • /
    • 2018
  • Purpose - Indonesian economy often receives negative impact from external factors, particularly through trade linkage. To mitigate that impact, the export market and product diversification should be established. Latin America is one of the potential regions to augment the Indonesian export market. Research design, data, and methodology - This study attempts to classify the potential market and product for Indonesian export, particularly in Latin America, by using panel regression, trade complementarity, and export similarity index over the period 2000-2015. Regression was also used to examine whether the presence of the Indonesian Trade Promotion Center (ITPC) can support diversification. Results - Based on regression results, those indexes established Chile, Uruguay, Suriname, and Ecuador as the priority countries with the products: animal and vegetable oils, fats and waxes; chemicals and related products; miscellaneous manufactured articles; commodities and transactions. Conclusions - The results of the regression concludes that the trade complementarity index gave a significant positive effect to boost Indonesian export, whereas, the export similarity index gave a significant negative effect. The regression also conclude that ITPC gave a significant positive impact on Indonesian export. For instance, the government should prioritize those countries and products and also develop ITPC there to optimize Indonesian export.

A Change of Vegetation at the Ecological Restoration Area of Simwon Valley in Jirisan National Park (지리산국립공원 심원계곡 생태경관 복원공사지역 식생 변화)

  • Jung, Tae-Jun;Kim, Yeon-Gyeong;Kim, Young-Jin;Jung, Myung-Hee;Park, Kyoung-Hee;Shin, Chang-Keun;Park, Seung-hong;Kim, Young-Sun
    • Korean Journal of Environment and Ecology
    • /
    • v.35 no.3
    • /
    • pp.294-304
    • /
    • 2021
  • This study aims to obtain basic data for systematic restoration by analyzing the monitoring results of the Shimwon Valley Ecological Landscape Restoration Project area in Jirisan National Park. In 2017, when the restoration project was completed, 12 monitoring plots and 4 control plots were installed for vegetation monitoring, and changes in the relative dominance, species diversity index and similarity between 2017 and 2020 were analyzed. The species diversity index of the surveyed areas where trees were planted during the restoration project was 0-1.4552, and the similarity index with the control group was 0% except for one survey area at 1.32%. The very low species diversity index and similarity index in the survey areas were attributed to the loss of trees planted during the restoration project due to death, damage by wild boars, or erosion by running water. On the other hand, the species diversity index was 0.9538-2.3222 in the monitoring plot where no tree was planted, and the similarity index was analyzed to be as high as 8.33%. It is necessary to continue the long-term monitoring for the development of ecological landscape restoration methods in the national park and analysis of the succession in monitoring plots where no trees were planted.