• Title/Summary/Keyword: Similarity sampling

Search Result 122, Processing Time 0.03 seconds

Visual Model of Pattern Design Based on Deep Convolutional Neural Network

  • Jingjing Ye;Jun Wang
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.18 no.2
    • /
    • pp.311-326
    • /
    • 2024
  • The rapid development of neural network technology promotes the neural network model driven by big data to overcome the texture effect of complex objects. Due to the limitations in complex scenes, it is necessary to establish custom template matching and apply it to the research of many fields of computational vision technology. The dependence on high-quality small label sample database data is not very strong, and the machine learning system of deep feature connection to complete the task of texture effect inference and speculation is relatively poor. The style transfer algorithm based on neural network collects and preserves the data of patterns, extracts and modernizes their features. Through the algorithm model, it is easier to present the texture color of patterns and display them digitally. In this paper, according to the texture effect reasoning of custom template matching, the 3D visualization of the target is transformed into a 3D model. The high similarity between the scene to be inferred and the user-defined template is calculated by the user-defined template of the multi-dimensional external feature label. The convolutional neural network is adopted to optimize the external area of the object to improve the sampling quality and computational performance of the sample pyramid structure. The results indicate that the proposed algorithm can accurately capture the significant target, achieve more ablation noise, and improve the visualization results. The proposed deep convolutional neural network optimization algorithm has good rapidity, data accuracy and robustness. The proposed algorithm can adapt to the calculation of more task scenes, display the redundant vision-related information of image conversion, enhance the powerful computing power, and further improve the computational efficiency and accuracy of convolutional networks, which has a high research significance for the study of image information conversion.

Spatial and Temporal Analysis of the Coleopteran Communities around 5.16 Road of Mt. Halla, Jeju Island, Korea (한라산 5.16 도로변에 분포하는 딱정벌레류(類)의 월별과 고도별 군집 분석)

  • Yang, Kyoung-Sik;Kim, Sang-Bum;Kim, Won-Taek
    • Korean Journal of Environmental Biology
    • /
    • v.24 no.4
    • /
    • pp.337-358
    • /
    • 2006
  • The field survey was conducted weekly from April to October in 2004 and 2005 on the sites along the 5.16 road. Sampling sites were made every 100 m height starting from 250 m altitude of both sides of Mt. Halla along 5.16 Road, which crosses the mountain from North to South. Totally 31,698 individuals of 76 species belonged to 25 families were collected. It was July that showed the largest number of species, as 48 species in the northern sloper 42 in the southern slope, and 22 at the highest site (at an altitude of 750 m), while it was April that showed the smallest as 17 species, 15 and 5, respectively. As for monthly fluctuation, the northern slope and the highest site reached their top in August, whereas it was June in the southern slope. In the analyses of similarity (chord distance) of any pair of temporal communities, the closest pair was between June and July in the northern slope area, between July and August in the southern slope and between July and September at the highest site, respectively.

Insect Fauna of Adjacent Areas of DMZ in Korea

  • Kim, Seung-Tae;Jung, Myung-Pyo;Kim, Hun-Sung;Shin, Joon-Hwan;Lim, Jong-Hwan;Kim, Tae-Woo;Lee, Joon-Ho
    • Journal of Ecology and Environment
    • /
    • v.29 no.2
    • /
    • pp.125-141
    • /
    • 2006
  • Insect fauna in adjacent areas of Demilitarized Zone (DMZ) in Korea was surveyed seasonally in $2001{\sim}2003$. The survey area was divided into 3 regions (eastern mountain, middle inland, and western coastal regions) in accordance with administrative districts and topography. Sampling methods such as sweeping, sieving, beating, brushing and suction were used depending on the environmental and military conditions. Total 361 genera and 437 species of 116 families belonging to 14 orders were identified. Among these, 46 species were new to insect fauna of DMZ areas. Species richness was the highest in the eastern mountain region. Numbers of habitat-common and -specific species were 96 (22%) and 195 (47.2%), respectively. The insect species community similarity was highest (0.64) between eastern mountain region and western coastal region. Insect orders showing high species richness were Coleoptera (38.9%), Lepidoptera (19.2%), Orthoptera (9.4%), and Hemiptera (9.2%). These results will be useful information for study of history on the change of insect fauna and future conservation in DMZ areas.

Windowed Wavelet Stereo Matching Using Shift ability (이동성(shift ability)을 이용한 윈도우 웨이블릿 스테레오 정합)

  • 신재민;이호근;하영호
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.28 no.1C
    • /
    • pp.56-63
    • /
    • 2003
  • In this paper, a wavelet-based stereo matching algorithm to obtain an accurate disparity map in wavelet transformed domain by using a shift ability property, a modified wavelet transform, the similarities for their sub-bands, and a hierarchical structure is proposed. New approaches for stereo matching by lots of feature information are to utilize translation-variant results of the sub-bands in the wavelet transformed domain because they cannot literally expect translation invariance in a system based on convolution and sub-sampling. After the similarity matching for each sub-band, we can easily find optimal matched-points because the sub-bands appearance of the shifted signals is definitely different from that of the original signal with no shift.

Recovery of aquatic insect communities after a catastrophic flood in a Korean stream

  • Lee, Hwang-Goo;Bae, Yeon-Jae
    • Animal cells and systems
    • /
    • v.15 no.2
    • /
    • pp.169-177
    • /
    • 2011
  • In August 2002, a heavy rainfall (445 mm in total for 5 consecutive days) resulted in a catastrophic flood, and it completely washed away the benthic fauna from the mainstream channel of the Gapyeong stream, a typical mid-sized stream in the central Korean peninsula. This study was to investigate the recovery patterns of aquatic insect communities that were damaged by the flood. Aquatic insects were sampled quantitatively using a Surber sampler ($50{\times}50$ cm, 1 riffle and 1 pool/run habitats per site) from three sites (4th-6th order) of the Gapyeong stream prior to 2000 and seasonally after the flood event from 2003 to 2006. Before the flood in the reference year (2000), a total of 77 species of aquatic insects were collected, whereas after the flood 47 species (2003), 51 species (2004), 64 species (2005) and 55 species (2006) were collected from the whole sampling sites. The aquatic insect density decreased to 26.85% (2003), 90.25% (2004), 52.53% (2005) and 54.95% (2006) of that recorded in the reference year. Although approximately 70% of the aquatic insect fauna has recovered since the flood event, the species composition in the most recent year differed substantially (similarity ca. 50%). On the other hand, the compositions of functional groups have not significantly changed. Aquatic insect communities at the riffle sites were affected more profoundly than those at the pool/run sites. The aquatic insect communities at the upstream site recovered more rapidly than those at the downstream sites.

Core Keywords Extraction forEvaluating Online Consumer Reviews Using a Decision Tree: Focusing on Star Ratings and Helpfulness Votes (의사결정나무를 활용한 온라인 소비자 리뷰 평가에 영향을 주는 핵심 키워드 도출 연구: 별점과 좋아요를 중심으로)

  • Min, Kyeong Su;Yoo, Dong Hee
    • The Journal of Information Systems
    • /
    • v.32 no.3
    • /
    • pp.133-150
    • /
    • 2023
  • Purpose This study aims to develop classification models using a decision tree algorithm to identify core keywords and rules influencing online consumer review evaluations for the robot vacuum cleaner on Amazon.com. The difference from previous studies is that we analyze core keywords that affect the evaluation results by dividing the subjects that evaluate online consumer reviews into self-evaluation (star ratings) and peer evaluation (helpfulness votes). We investigate whether the core keywords influencing star ratings and helpfulness votes vary across different products and whether there is a similarity in the core keywords related to star ratings or helpfulness votes across all products. Design/methodology/approach We used random under-sampling to balance the dataset. We progressively removed independent variables based on decreasing importance through backwards elimination to evaluate the classification model's performance. As a result, we identified classification models that best predict star ratings and helpfulness votes for each product's online consumer reviews. Findings We have identified that the core keywords influencing self-evaluation and peer evaluation vary across different products, and even for the same model or features, the core keywords are not consistent. Therefore, companies' producers and marketing managers need to analyze the core keywords of each product to highlight the advantages and prepare customized strategies that compensate for the shortcomings.

Selective Word Embedding for Sentence Classification by Considering Information Gain and Word Similarity (문장 분류를 위한 정보 이득 및 유사도에 따른 단어 제거와 선택적 단어 임베딩 방안)

  • Lee, Min Seok;Yang, Seok Woo;Lee, Hong Joo
    • Journal of Intelligence and Information Systems
    • /
    • v.25 no.4
    • /
    • pp.105-122
    • /
    • 2019
  • Dimensionality reduction is one of the methods to handle big data in text mining. For dimensionality reduction, we should consider the density of data, which has a significant influence on the performance of sentence classification. It requires lots of computations for data of higher dimensions. Eventually, it can cause lots of computational cost and overfitting in the model. Thus, the dimension reduction process is necessary to improve the performance of the model. Diverse methods have been proposed from only lessening the noise of data like misspelling or informal text to including semantic and syntactic information. On top of it, the expression and selection of the text features have impacts on the performance of the classifier for sentence classification, which is one of the fields of Natural Language Processing. The common goal of dimension reduction is to find latent space that is representative of raw data from observation space. Existing methods utilize various algorithms for dimensionality reduction, such as feature extraction and feature selection. In addition to these algorithms, word embeddings, learning low-dimensional vector space representations of words, that can capture semantic and syntactic information from data are also utilized. For improving performance, recent studies have suggested methods that the word dictionary is modified according to the positive and negative score of pre-defined words. The basic idea of this study is that similar words have similar vector representations. Once the feature selection algorithm selects the words that are not important, we thought the words that are similar to the selected words also have no impacts on sentence classification. This study proposes two ways to achieve more accurate classification that conduct selective word elimination under specific regulations and construct word embedding based on Word2Vec embedding. To select words having low importance from the text, we use information gain algorithm to measure the importance and cosine similarity to search for similar words. First, we eliminate words that have comparatively low information gain values from the raw text and form word embedding. Second, we select words additionally that are similar to the words that have a low level of information gain values and make word embedding. In the end, these filtered text and word embedding apply to the deep learning models; Convolutional Neural Network and Attention-Based Bidirectional LSTM. This study uses customer reviews on Kindle in Amazon.com, IMDB, and Yelp as datasets, and classify each data using the deep learning models. The reviews got more than five helpful votes, and the ratio of helpful votes was over 70% classified as helpful reviews. Also, Yelp only shows the number of helpful votes. We extracted 100,000 reviews which got more than five helpful votes using a random sampling method among 750,000 reviews. The minimal preprocessing was executed to each dataset, such as removing numbers and special characters from text data. To evaluate the proposed methods, we compared the performances of Word2Vec and GloVe word embeddings, which used all the words. We showed that one of the proposed methods is better than the embeddings with all the words. By removing unimportant words, we can get better performance. However, if we removed too many words, it showed that the performance was lowered. For future research, it is required to consider diverse ways of preprocessing and the in-depth analysis for the co-occurrence of words to measure similarity values among words. Also, we only applied the proposed method with Word2Vec. Other embedding methods such as GloVe, fastText, ELMo can be applied with the proposed methods, and it is possible to identify the possible combinations between word embedding methods and elimination methods.

섭제골 지역의 산화지 및 비산화지의 군락구조 비교

  • Sim, Hak-Bo;Kim, Woen
    • The Korean Journal of Ecology
    • /
    • v.16 no.4
    • /
    • pp.429-438
    • /
    • 1993
  • This is a report on the early vegetation and the secondary succession in the burned area of SeobJe-Go1 of $IIwasan-MY\v{o}n,\;Y\v{o}ngch\v{o}n-Gun,\;Ky\v{u}ngsangbuk-do$ Province. The forest fire occurred on April 8, 1982 and the pine forest and its floor vegetation were burned down. The investigation was done six times from August 20, 1982 to August 13, 1983. The results are summarized as follows: the floristic composition of burned areas $B_1,\;B_2$, and unburned areas $U_1,\;U_2$ were composed of 25, 23, 32, and 27 kinds of vascular plants. respectively. The biological spectra showed the $H-D_1-R_5-e$ type in both the burned and unburned areas. The species of Arundinella hirta, Miscanthus simnsis var. purpurascens and Cares hurnilis var. nana were dominant species in the burned area, while Pinus densiflorrr, Corex humilis var. nana and Rhododendron mucronulatum var. ciliafum were dominant species in the unburned area. Degree of succession of the unburned area was comparatively higher than that of the burned area. Species diversity index and evenness index of the burned area were similar to those of the unburned area. Indices of similarity in sampling sites showed that $B_1\;and\;B_2$ stands were the most similar. pH, total nitrogen, available phosphorus and exchangeable potassium of soil increased but organic matter and total organic carbon decreased after fire.

  • PDF

The Classification and Species Diversity of Forest Cover Types in the Natural Forest of the Middle Part of Baekdudaegan (백두대간 중부권역 천연림의 산림피복형 분류와 종다양성)

  • Hwang, Kwang-Mo;Chung, Sang-Hoon;Kim, Ji-Hong
    • Journal of Korean Society of Forest Science
    • /
    • v.104 no.1
    • /
    • pp.14-25
    • /
    • 2015
  • This study was carried out to classify forest communities and to aggregate forest cover types for the complex and diversified natural forest areas of Guryongsan, Sobaeksan, Baekhwasan, Sokrisan, and Baekhaksan in the middle part of Baekdudaegan. The vegetation data were collected by point-centered quarter sampling method. One thousand one hundred fourteen sample points were subjected to cluster analysis to classify 27 forest communities, which were aggregated into 7 representative forest cover types on the basis of community similarity from composition of canopy species. They were Quercus mongolica forest cover type, mixed mesophytic forest cover type, Q. variabilis forest cover type, Pinus densiflora forest cover type, the others deciduous forest cover type, Q. serrata forest cover type, and subalpine forest cover type. The Q. mongolica forest cover type was most widely distributed in the study areas. It was assumed that abundance of Q. mongolica might be negatively associated with species diversity. Mixed mesophytic forest cover type and the others deciduous forest cover type were commonly distributed in the areas of valley, on the other hand, Q. mongolica cover type and P. densiflora cover type tended to be distributed in the areas of ridge.

Comparison of Multi-angle TerraSAR-X Staring Mode Image Registration Method through Coarse to Fine Step (Coarse to Fine 단계를 통한 TerraSAR-X Staring Mode 다중 관측각 영상 정합기법 비교 분석)

  • Lee, Dongjun;Kim, Sang-Wan
    • Korean Journal of Remote Sensing
    • /
    • v.37 no.3
    • /
    • pp.475-491
    • /
    • 2021
  • With the recent increase in available high-resolution (< ~1 m) satellite SAR images, the demand for precise registration of SAR images is increasing in various fields including change detection. The registration between high-resolution SAR images acquired in different look angle is difficult due to speckle noise and geometric distortion caused by the characteristics of SAR images. In this study, registration is performed in two stages, coarse and fine, using the x-band SAR data imaged at staring spotlight mode of TerraSAR-X. For the coarse registration, a method combining the adaptive sampling method and SAR-SIFT (Scale Invariant Feature Transform) is applied, and three rigid methods (NCC: Normalized Cross Correlation, Phase Congruency-NCC, MI: Mutual Information) and one non-rigid (Gefolki: Geoscience extended Flow Optical Flow Lucas-Kanade Iterative), for the fine registration stage, was performed for performance comparison. The results were compared by using RMSE (Root Mean Square Error) and FSIM (Feature Similarity) index, and all rigid models showed poor results in all image combinations. It is confirmed that the rigid models have a large registration error in the rugged terrain area. As a result of applying the Gefolki algorithm, it was confirmed that the RMSE of Gefolki showed the best result as a 1~3 pixels, and the FSIM index also obtained a higher value than 0.02~0.03 compared to other rigid methods. It was confirmed that the mis-registration due to terrain effect could be sufficiently reduced by the Gefolki algorithm.