• Title/Summary/Keyword: similarity coefficient

Search Result 450, Processing Time 0.039 seconds

Studies on the Distribution of Ants(Formicidea) in Korea(6) -The Vegetation, the Species Composition and the Colony Density ants in Mt. Namsan, Seoul- (한국산 개미의 분포에 관한 연구(6) -서울 남산의 식생과 개미군집의 종조성 및 Colony 밀도-)

  • 최병문;박경숙
    • Korean journal of applied entomology
    • /
    • v.30 no.1
    • /
    • pp.65-79
    • /
    • 1991
  • To investigate the species composition and colony density of ants on Mt. Namsan, Seoul, 39 quadrats were installed in 13 vegetation types, and 443 ant colonies were collected from June 1989 to October 1990. As a result, 4 subfamilies, 23 genera, and 28 species were confirmed. Among them, Cerapachys humicola Ogata is new to the Korean fauna, along with the subfamily Cerapachyinae. Regarding the species composition of the ant communities in each vegetation type, Robinia pseudoacacia vegetation (3 subfamilies, 14 genera, 15 species; 53.6% of all species recorded on Mt. Namsan) and Quercus mongolica vegetation (3 subfamilies, 12 genera, 14 species; 50%) showed relatively rich composition, while Platanus orientalis vegetation (3 subfamilies, 3 genera, 3 species) showed the simplest composition. Colony density was highest in Prunus sargentii vegetation (7.875 colonies/$m^2$) and lowest in Platanus orientalis vegetation (1.000 colony/$m^2$). The relative density of Paratrechina flavipes was the highest (RD = 0.422), and those of Cerapachys humicola Ogata and Messor aciculatus were the lowest (RD = 0.002 each). In the analysis of community similarity between vegetation types by Sørensen's coefficient, Prunus sargentii vegetation was very similar to Sorbus alnifolia (0.745) and Pinus densiflora (0.736) vegetation, but had the lowest similarity to Metasequoia glyptostroboides and Chamaecyparis pisifera vegetation (0.164 each). Dominance of ants in each vegetation type, analyzed by Simpson's formula, was high in Platanus orientalis ($\lambda$ = 0.393) and Sorbus alnifolia ($\lambda$ = 0.392) vegetation and lowest in Metasequoia glyptostroboides vegetation ($\lambda$ = 0.067). Diversity analyzed by the inverse Simpson's coefficient was high in Metasequoia glyptostroboides ($d_s$ = 14.925) and Pinus rigida ($d_s$ = 7.874) vegetation and lowest in Platanus orientalis vegetation ($d_s$ = 2.545). Evenness, calculated from $d_s$ and $d_{max}$ (maximal diversity), was high in Metasequoia glyptostroboides ($E_s$ = 0.714) and Chamaecyparis pisifera ($E_s$ = 0.624) vegetation. On the contrary, Quercus mongolica vegetation had the lowest evenness ($E_s$ = 0.182).
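The indices named in this abstract can be illustrated with a minimal Python sketch. The abstract does not state the exact formulas, so the definitions below are assumptions: Sørensen's coefficient as 2c / (a + b) on species lists, Simpson's dominance as the sum of squared proportions, inverse Simpson diversity as 1 / λ, and evenness as d_s / d_max with d_max taken as the number of species present. All function names and the toy data are hypothetical.

```python
def sorensen_coefficient(species_a, species_b):
    """Sørensen similarity 2c / (a + b) between two species lists."""
    a, b = set(species_a), set(species_b)
    if not a and not b:
        raise ValueError("both species lists are empty")
    return 2 * len(a & b) / (len(a) + len(b))


def simpson_dominance(counts):
    """Simpson's dominance: lambda = sum(p_i ** 2), p_i = share of colonies."""
    total = sum(counts)
    if total <= 0:
        raise ValueError("counts must sum to a positive number")
    return sum((n / total) ** 2 for n in counts)


def inverse_simpson(counts):
    """Inverse Simpson diversity d_s = 1 / lambda."""
    return 1.0 / simpson_dominance(counts)


def simpson_evenness(counts):
    """Evenness E_s = d_s / d_max, assuming d_max = number of species present."""
    richness = sum(1 for n in counts if n > 0)
    return inverse_simpson(counts) / richness


# Toy colony counts per species in one quadrat (made-up numbers).
toy_counts = [10, 10, 10, 10]
print(simpson_dominance(toy_counts), inverse_simpson(toy_counts))
print(sorensen_coefficient(["A", "B", "C"], ["B", "C", "D"]))
```

The reported value pairs are consistent with d_s = 1 / λ: 1 / 0.067 ≈ 14.925 and 1 / 0.393 ≈ 2.545.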


Species Diversity Analysis of the Mushroom in Mt. Chiak (치악산 발생 버섯의 종 다양성 비교 분석)

  • Lee, Byung Kook;Eom, Ki Cheol;Seok, Soon Ja
    • The Korean Journal of Mycology
    • /
    • v.41 no.2
    • /
    • pp.57-66
    • /
    • 2013
  • Mushrooms collected at seven areas of Mt. Chiak in 2002 and 2003 were classified to analyze their distribution and species diversity. Frequency (number of mushrooms, N), number of species (S), relative species density (RSD), similarity index (C), richness index (R1), variety index (V1), evenness index (E2), and dominance index (D1) were investigated. Total N and S were 143 and 84, respectively. The RSD of the 7 areas ranged from 0.179 to 0.226. The yearly C of the total area (0.213) was 8.2 percentage points higher than the average C of the 7 areas (0.131). The order of the coefficient of variation (CV) of the indicators across the 7 areas was N (10.5%) > D1 (9.2%) > V1 (8.9%) > S (8.5%) > R1 (7.4%) > E2 (2.2%). The average R1 of the 7 areas was 5.36 (range 4.85 to 6.01), and R1 for the total area was 16.72. The average V1 of the 7 areas was 16.24 (range 14.44 to 18.66), and V1 for the total area was 68.82. The average E2 of the 7 areas was 0.95 (range 0.926 to 0.982), and E2 for the total area was 0.819. The average D1 of the 7 areas was 0.071 (range 0.055 to 0.073), and D1 for the total area was 0.081. The correlation between N and the 5 diversity indicators (S, R1, V1, E2, D1) was not statistically significant, but the correlations among R1, E2, and D1 were statistically significant.
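The coefficient of variation used to rank the indicators can be sketched as follows. This is a minimal illustration that assumes the usual definition, sample standard deviation divided by the mean and expressed as a percentage. The abstract does not say whether the sample or population standard deviation was used, and the function name is hypothetical.

```python
import statistics


def coefficient_of_variation(values):
    """CV in percent: sample standard deviation / mean * 100 (assumed form)."""
    mean = statistics.mean(values)
    if mean == 0:
        raise ValueError("CV is undefined when the mean is zero")
    return statistics.stdev(values) / mean * 100.0


# Toy indicator values for three areas (made-up numbers).
print(coefficient_of_variation([1, 2, 3]))
```

A smaller CV, such as the 2.2% reported for E2, means the indicator varies little from area to area relative to its mean.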

A Study on Market Size Estimation Method by Product Group Using Word2Vec Algorithm (Word2Vec을 활용한 제품군별 시장규모 추정 방법에 관한 연구)

  • Jung, Ye Lim;Kim, Ji Hui;Yoo, Hyoung Sun
    • Journal of Intelligence and Information Systems
    • /
    • v.26 no.1
    • /
    • pp.1-21
    • /
    • 2020
  • With the rapid development of artificial intelligence technology, various techniques have been developed to extract meaningful information from unstructured text data, which constitutes a large portion of big data. Over the past decades, text mining technologies have been utilized in various industries for practical applications. In the field of business intelligence, text mining has been employed to discover new market and/or technology opportunities and to support rational decision making by business participants. Market information such as market size, market growth rate, and market share is essential for setting companies' business strategies. There has been continuous demand in various fields for market information at the level of specific products. However, such information has generally been provided at the industry level or in broad categories based on classification standards, making it difficult to obtain specific and appropriate information. In this regard, we propose a new methodology that can estimate the market sizes of product groups at more detailed levels than previously offered. We applied the Word2Vec algorithm, a neural network based semantic word embedding model, to enable automatic market size estimation from individual companies' product information in a bottom-up manner. The overall process is as follows. First, data related to product information is collected, refined, and restructured into a form suitable for applying the Word2Vec model. Next, the preprocessed data is embedded into a vector space by Word2Vec, and product groups are derived by extracting similar product names based on cosine similarity. Finally, the sales data of the extracted products is summed to estimate the market size of each product group. As experimental data, product names from Statistics Korea's microdata (345,103 cases) were mapped into a multidimensional vector space by Word2Vec training. We optimized the training parameters and then applied a vector dimension of 300 and a window size of 15 as the optimized parameters for further experiments. We employed the index words of the Korean Standard Industry Classification (KSIC) as a product name dataset to cluster product groups more efficiently. Product names similar to the KSIC index words were extracted based on cosine similarity. The market size of the extracted products, treated as one product category, was calculated from individual companies' sales data. The market sizes of 11,654 specific product lines were automatically estimated by the proposed model. For performance verification, the results were compared with the actual market sizes of some items, and the Pearson correlation coefficient was 0.513. Our approach has several advantages over previous studies. First, text mining and machine learning techniques were applied for the first time to market size estimation, overcoming the limitations of traditional methods that rely on sampling or require multiple assumptions. In addition, the level of the market category can be easily and efficiently adjusted according to the purpose of information use by changing the cosine similarity threshold. Furthermore, the approach has high potential for practical application since it can resolve unmet needs for detailed market size information in the public and private sectors. Specifically, it can be utilized in technology evaluation and technology commercialization support programs conducted by governmental institutions, as well as in business strategy consulting and market analysis report publishing by private firms. The limitation of our study is that the presented model needs to be improved in terms of accuracy and reliability. The semantic word embedding module could be advanced by giving a proper order to the preprocessed dataset or by combining another algorithm, such as Jaccard similarity, with Word2Vec. The product group clustering method could also be changed to other types of unsupervised machine learning algorithms. Our group is currently working on subsequent studies, which we expect to further improve the performance of the basic model conceptually proposed in this study.
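The grouping and summation steps described in this abstract can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the two-dimensional vectors stand in for trained Word2Vec embeddings (the study used 300 dimensions), and the product names, sales figures, threshold values, and function names are all hypothetical.

```python
import math


def cosine_similarity(u, v):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)


def estimate_market_size(index_vector, product_vectors, sales, threshold):
    """Group products similar to an index word and sum their sales."""
    members = [
        name
        for name, vector in product_vectors.items()
        if cosine_similarity(index_vector, vector) >= threshold
    ]
    return members, sum(sales[name] for name in members)


# Toy stand-ins for embeddings and company sales (made-up values).
index_vector = [1.0, 0.0]
product_vectors = {
    "steel pipe": [0.9, 0.1],
    "steel tube": [0.8, 0.3],
    "apple juice": [0.0, 1.0],
}
sales = {"steel pipe": 100, "steel tube": 50, "apple juice": 70}

print(estimate_market_size(index_vector, product_vectors, sales, 0.90))
print(estimate_market_size(index_vector, product_vectors, sales, 0.95))
```

Raising the threshold narrows the product group, which is how the abstract describes adjusting the level of the market category.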

Ecological Characteristic between the Re-introduction Population and the Original Population (Jojong Stream, Sudong Stream) of Zacco koreanus in the Bongseonsa Stream, Korea (봉선사천의 참갈겨니(Zacco koreanus) 재도입 개체군과 원개체군(조종천, 수동천) 간 생태학적 특징)

  • Wang, Ju-Hyoun;Choi, Jun-Kil;Lee, Hyuk-Je;Lee, Hwang-Goo
    • Korean Journal of Environment and Ecology
    • /
    • v.31 no.6
    • /
    • pp.537-548
    • /
    • 2017
  • This study investigated the species composition and aquatic environment of Jojong Stream and Sudong Stream, the original habitats of the Zacco koreanus population, and of Bongseonsa Stream, where a restored population was re-introduced. It also compared and analyzed the growth and reproductive ability of Z. koreanus inhabiting each of the three streams. The investigation was conducted in June 2016, which is known as the spawning season of Z. koreanus. The physical aquatic environment showed slight differences in altitude, width, and water depth among the three streams, but the bottom structure differed considerably in the composition of boulder, cobble, and pebble. The physicochemical analysis showed no significant differences in water temperature, pH, DO, BOD, and EC among the three streams. In the fish fauna investigation, 530 individuals of 11 species in 3 families were collected in Bongseonsa Stream, 293 individuals of 12 species in 4 families in Jojong Stream, and 361 individuals of 11 species in 4 families in Sudong Stream. All three streams were dominated by Z. koreanus and Z. platypus. Six Korean endemic species appeared in each of the three streams, giving a high occurrence rate of endemic species of 50.0% or more. The community index analysis revealed that the mean dominance index ranged from 0.63 (${\pm}0.05$, BS) to 0.72 (${\pm}0.01$, JJ), the mean diversity index from 1.55 (${\pm}0.06$, JJ) to 1.78 (${\pm}0.11$, BS), the mean evenness index from 0.71 (${\pm}0.03$, JJ) to 0.76 (${\pm}0.02$, BS), and the mean richness index from 1.61 (${\pm}0.33$, JJ) to 1.73 (${\pm}0.24$, SD). The observed differences in the community indices among the streams were not statistically significant. The similarity analysis divided the sites into two groups, A and B, at 75.4% similarity and showed that the fish fauna at the analyzed points was similar. The quantitative habitat evaluation index (QHEI) analysis showed an average QHEI of 151.0 (${\pm}46.0$), which indicates a suboptimal habitat environment. The length-weight analysis of the Z. koreanus populations showed that the regression coefficient b of both the restored population and the original habitat populations was 3.0 or higher, while the condition factor had a positive slope. Moreover, the regression coefficient b and the slope of the condition factor were larger in the original habitat populations than in the restored population. The length frequency distribution of the Z. koreanus populations revealed that all three streams maintained a stable life cycle, although the growth rate of the original habitat populations was faster than that of the restored population in the one-year-old class. The gonadosomatic index (GSI) analysis showed that the median GSI of the Z. koreanus population in the restored habitat, Bongseonsa Stream, was higher than that of the populations in the original habitats, Jojong Stream and Sudong Stream, for both males and females.
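The regression coefficient b in the length-weight analysis can be illustrated with a short sketch. The abstract does not give the model, so this assumes the commonly used form W = a * L^b, fitted by ordinary least squares on log-transformed data. The function name and the synthetic data are hypothetical.

```python
import math


def fit_length_weight(lengths, weights):
    """Fit W = a * L**b by least squares on log(W) = log(a) + b * log(L)."""
    xs = [math.log(length) for length in lengths]
    ys = [math.log(weight) for weight in weights]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    if sxx == 0:
        raise ValueError("lengths must not all be equal")
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    b = sxy / sxx
    a = math.exp(mean_y - b * mean_x)
    return a, b


# Synthetic fish following W = 0.01 * L**3 exactly (made-up data).
toy_lengths = [5.0, 8.0, 10.0, 12.0]
toy_weights = [0.01 * length ** 3 for length in toy_lengths]
a, b = fit_length_weight(toy_lengths, toy_weights)
print(a, b)
```

Under this model, b close to 3 corresponds to weight growing in proportion to the cube of length, which is the benchmark the reported values of 3.0 or higher are compared against.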

Understanding the Protox Inhibition Activity of Novel 1-(5-methyl-3-phenylisoxazolin-5-yl)methoxy-2-chloro-4-fluorobenzene Derivatives Using Comparative Molecular Similarity Indices Analysis (CoMSIA) Methodology (비교 분자 유사성 지수분석(CoMSIA) 방법에 따른 1-(5-methyl-3-phenylisoxazolin-5-yl)methoxy-2-chlore-4-fluorobenzene 유도체들의 Protox 저해 활성에 관한 이해)

  • Song, Jong-Hwan;Park, Kyung-Yong;Sung, Nack-Do
    • Applied Biological Chemistry
    • /
    • v.47 no.4
    • /
    • pp.414-421
    • /
    • 2004
  • 3D QSAR studies of the protox inhibition activities of a series of new 1-(5-methyl-3-phenylisoxazolin-5-yl)methoxy-2-chloro-4-fluorobenzene derivatives against the root and shoot of the rice plant (Oryza sativa L.) and barnyardgrass (Echinochloa crus-galli) were conducted using the comparative molecular similarity indices analysis (CoMSIA) methodology, based on previous results (Sung, N. D. et al. (2004) J. Korean Soc. Appl. Biol. Chem. 47(3), 351-356). Four CoMSIA models, without the hydrogen bond donor field, for the protox inhibition activities against the root and shoot of the two plants were derived from combinations of the steric field, hydrophobic field, hydrogen bond acceptor field, and LUMO molecular orbital field, with dipole moment (DM) and molar refractivity (MR) as additional descriptors. The predictability and fitness of the CoMSIA models for protox inhibition activities against barnyardgrass were higher than those for the rice plant. The statistical results of these models showed the best predictability of the protox inhibition activities against barnyardgrass based on the cross-validated value $r^2_{cv}$ ($q^2 = 0.635 \sim 0.924$), the non-cross-validated conventional coefficient $r^2_{ncv}$ ($r^2 = 0.928 \sim 0.977$), and the PRESS value ($0.255 \sim 0.273$). The protox inhibition activities exhibited a strong correlation with the steric ($5.4 \sim 15.7\%$) and hydrophobic ($68.0 \sim 84.3\%$) factors of the molecules. In particular, the CoMSIA models indicated that groups of increasing steric bulk at the ortho-position of the C-phenyl ring will enhance the protox inhibition activities against barnyardgrass and subsequently increase the selectivity.

The Variation of Leaf Characterics in 6 Natural Populations of Stewartia koreana Nakai (노각나무 6개 천연집단(天然集團)의 엽형질(葉形質) 변이(變異))

  • Kim, Young-Jung;Kim, Kee-Chul;Lee, Byung Sil;Lee, Gab-Yeoun;Cho, Kyoung-Jin;Kang, Jin Taek;Kim, Tae-Dong
    • Journal of Korean Society of Forest Science
    • /
    • v.94 no.6
    • /
    • pp.446-452
    • /
    • 2005
  • To examine variation among naturally distributed groups of Stewartia koreana, the leaf characteristics at the investigation sites were analyzed for each group. As a result, the Mt. Kumsan group showed smaller values in leaf length, width, area, and number of veins, but not in petiole length and serration number. Among the characters, the coefficient of variation (CV) of all characters except petiole length and leaf area fell in a comparatively narrow range, from 11.6 to 17.4%. On the other hand, the CVs of petiole length and leaf area between the groups were 34.9% and 28.4%, respectively. The CVs of these characters within groups were also large: petiole length showed 29.5 to 42% and leaf area showed 27.7 to 40.7%. The simple correlation analysis between 12 leaf characteristics showed that the correlation between leaf width and leaf area was high (r = 0.975). The correlations between leaf length and leaf area and between leaf length and leaf width were 0.971 and 0.969, respectively. A negative correlation was found between the angle of the leaf base and the ratio of leaf length to leaf width (r = -0.843), meaning that the ratio of leaf length to leaf width decreases as the angle of the leaf base increases. A cluster analysis of the leaf characteristics of the selected groups was performed based on the similarity of quantitative and qualitative measurements. The results showed that at a distance level of 0.4, the groups could be classified into 4 clusters: cluster 1 was Mt. Jogyesan and Mt. Kayasan, cluster 2 was Mt. Paegunsan, cluster 3 was Mt. Unmunsan and Mt. Mudungsan, and cluster 4 was Mt. Kumsan. At a distance level of 0.6, the groups were classified into two clusters: cluster 1 was Mt. Kumsan, and cluster 2 was Mt. Mudungsan, Unmunsan, Paegunsan, Kayasan, and Jogyesan. In particular, the Mt. Kumsan group had the smallest values in leaf length, width, area, and number of veins, showing an obvious difference from the other five groups. Five principal components among the 12 extracted components had a meaningful eigenvalue over 1.0. The explanatory power of the top two principal components (leaf length and width) for the total variation was 52.7%, and it was 91.3% when all 5 principal components were included.

A Study on the Level of Stress Recognition of Urban Housewife and the Method of Coping to Stress (도시 주부의 스트레스 인지수준 및 적응 방법에 관한 연구)

  • 장병옥;이정우
    • Journal of Families and Better Life
    • /
    • v.4 no.1
    • /
    • pp.15-31
    • /
    • 1986
  • The purpose of this study is to investigate the relationship between the level of stress recognition of urban housewives and their method of coping, and to explore how these factors are influenced by socio-demographic variables such as the age of the housewife, level of education, employment status, number of children, duration of marriage, type of family, religion, and socio-economic status. The research was conducted on 431 housewives in Seoul in August 1985. The instrument was a 48-item questionnaire developed by the investigator, based on Holmes & Rahe's SRRS and Bell's 18-item Questionnaire, modified and supplemented to be appropriate to Korean culture. Data were analyzed by percentage, frequency, and mean; significant differences were tested by ANOVA, and Spearman's correlation coefficients were calculated. The results of this study are as follows. 1) There is some similarity in the distribution of the level of stress recognition of urban housewives. 2) The level of education and the duration of marriage influence the level of stress recognition of urban housewives. In each area, there are differences among groups: age, level of education, duration of marriage, number of children, and type of family in the area of education; age, employment status, and duration of marriage in the area of health; level of education, duration of marriage, number of children, and socio-economic status in the area of finance; and employment status in the area of household work. 3) Housewives use several methods of coping with stress, and the score for long-term coping methods is higher than that for short-term methods. 4) The level of education, number of children, religion, and socio-economic status were variables influencing the method of coping; for the short-term coping method, the level of education, religion, and socio-economic status were influential, and for the long-term coping method, the level of education, number of children, religion, and socio-economic status were influential. 5) There is a very low positive correlation between the level of stress recognition of urban housewives and the method of coping with stress (ρ = .10, P < .05). 6) In the relation between the socio-demographic variables and the method of coping with stress, at lower levels of stress recognition there is a negative correlation between religion and the method of coping (ρ = -.28, P < .01) and between the number of children and the method of coping (ρ = -.16, P < .05), and a positive correlation between socio-economic status and the method of coping.
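Spearman's correlation coefficient, reported here as ρ, can be sketched as the Pearson correlation of the ranks of the two variables. The following minimal Python illustration assigns average ranks to tied values. It is not the authors' procedure, and the function names and example data are hypothetical.

```python
import math


def average_ranks(values):
    """Rank values from 1..n, giving tied values the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = shared_rank
        i = j + 1
    return ranks


def pearson(x, y):
    """Pearson product-moment correlation of two equal-length sequences."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined for constant input")
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    return sxy / math.sqrt(sxx * syy)


def spearman_rho(x, y):
    """Spearman's rho: Pearson correlation computed on average ranks."""
    return pearson(average_ranks(x), average_ranks(y))


# Toy scores for five respondents (made-up numbers).
print(spearman_rho([1, 2, 3, 4, 5], [2, 4, 6, 8, 100]))
```

Because only ranks enter the calculation, the outlying value 100 in the toy data does not change the coefficient, which is one reason rank correlation suits ordinal questionnaire scores.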


Numerical Analysis and Verification of Sound Absorbing Properties of Perforated Plate (타공판의 등가 흡음 물성치 유도와 공명기로서의 흡음성능 해석)

  • Yoon, Gil-Ho;Kim, Ki-Hyun;Choi, Jung-Sik;Yun, Su-Hwan
    • Journal of the Computational Structural Engineering Institute of Korea
    • /
    • v.28 no.2
    • /
    • pp.139-144
    • /
    • 2015
  • At present, sound-absorbing structures are realized by inserting sound-absorbing materials into walls. Such designs are limited in shape because the sound-absorbing materials must be fixed in place, and the sound absorption changes with the environment in which the materials are used. On the other hand, the same effect can be obtained without sound-absorbing material if the wall itself is shaped into a sound-absorbing structure, which removes the limitations imposed by the materials. We therefore propose a perforated plate as an effective sound-absorbing structure. We confirmed the sound-absorbing function of this structure using equivalent properties, and then identified the similarity between the perforated plate and a resonator. We also verified the theory through computer simulation using the finite element method (FEM). Finally, we validated that the perforated plate absorbs sound without sound-absorbing material. The perforated plate can be used as a sound absorber in buildings and in transportation such as vehicles and trains, and the results could serve as a basic tool for the design of sound-absorbing structures.

A Correlation Analysis of the River Naturalness and Water Quality for Biological Habitat Evaluation (하천 생물 서식처 평가를 위한 하천 자연도와 수질의 상관성 분석)

  • Park Bong-Jin;Sung Young-Du;Jung Kwan-Sue
    • Journal of Korea Water Resources Association
    • /
    • v.39 no.8 s.169
    • /
    • pp.637-644
    • /
    • 2006
  • In this study, river naturalness and water quality were analyzed for the major rivers of the Nakdong River Basin. As a result, the assessment index of the General Evaluation ranged from 1.428 to 4.107, corresponding to grades $1^{st}$ to $4^{th}$. The River Shape index ranged from 1.929 to 4.429 (grades $2^{nd}$ to $4^{th}$), and the River Environment index from 1.774 to 3.643 (grades $1^{st}$ to $4^{th}$). In addition, the water quality evaluation showed that pH was 7.102 to 8.497, BOD was 0.748 to 5.271 mg/l, DO was 5.077 to 12.335 mg/l, and SS was 3.658 to 19.960 mg/l. The correlation between river naturalness and water quality was analyzed to investigate the similarity and independence of the river naturalness evaluation index. The correlation coefficients were low, ranging from -0.1503 to -0.5886, so the river naturalness evaluation index was judged to be independent of water quality.

A Synecological Study of the Alnus japonica Forests in Korea (우리나라 오리나무림의 군락생태학적 연구)

  • Cho, Joon-Hee;Bae, Kwan-Ho;Oh, Seung-Hwan;Kim, Jun-Soo;Cho, Hyun-Je
    • Journal of Korean Society of Forest Science
    • /
    • v.109 no.2
    • /
    • pp.124-135
    • /
    • 2020
  • Alder (Alnus japonica) forests are representative of the wetlands of East Asia, including Korea. In the past, alder forests were relatively common in various habitats such as mountains, riversides, back marshes, and alluvial plains. However, this plant community has recently become rare due to increasingly arid habitats and the influence of various land uses. In this study, we identify the synecological characteristics of alder (A. japonica) forests distributed naturally in the mountainous wetlands of Korea and provide basic data for their systematic conservation and management in the future. Based on vegetation survey data collected from 66 alder forests, community types were classified using the methods of the Zürich-Montpellier School of Phytosociology and two-way indicator species analysis. There were eight community types: Styrax obassia, Weigela subsessilis-Fraxinus mandschurica, Spiraea fritschiana, Viola verecunda, Impatiens textori-Spiraea salicifolia, Glyceria leptolepis, Molinia japonica, and Lindera obtusiloba-Quercus acutissima. These community types constituted a vegetation unit hierarchy of two communities, four subcommunities, and eight variants. In addition, the ecological characteristics of each community type were compared, including total coverage per 100 square meters, importance value index, constancy class, life-form composition, diversity indices, community similarity coefficient, and indicator species.