• Title/Summary/Keyword: Shannon's Entropy

Search Result 45, Processing Time 0.023 seconds

Diversity and Genotypic Structure of ECOR Collection Determined by Repetitive Extragenic Palindromic PCR Genome Fingerprinting

  • HWANG KEUM-OK;JANG HYO-MI;CHO JAE-CHANG
    • Journal of Microbiology and Biotechnology
    • /
    • v.15 no.3
    • /
    • pp.672-677
    • /
    • 2005
  • The standard reference collection of strains for E. coli, the ECOR collection, was analyzed by a genome-based typing method. Seventy-one ECOR strains were subjected to repetitive extragenic palindromic PCR genome fingerprinting with BOX primers (BOX-PCR). Using a similarity value of 0.8 or more after cluster analysis of BOX-PCR fingerprinting patterns to define the same genotypes, we identified 28 genotypes in the ECOR collection. Shannon's entropy-based diversity index was 3.07, and the incident-based coverage estimator indicated potentially 420 genotypes among E. coli populations. Chi-square test of goodness-of-fit showed statistically significant association between the genotypes defined by BOX-PCR fingerprinting and the groups previously defined by multi-locus enzyme electrophoresis. This study suggests that the diversification of E. coli strains in natural populations is actively ongoing, and rep-PCR fingerprinting is a convenient and reliable method to type E. coli strains for the purposes ranging from ecology to quarantine.ine.

A Study on measuring techniques of retrieval effectiveness (검색효율 측정척도에 관한 연구)

  • Yoon Koo Ho
    • Journal of the Korean Society for Library and Information Science
    • /
    • v.16
    • /
    • pp.177-205
    • /
    • 1989
  • Retrieval effectiveness is the principal criteria for measuring the performance of an information retrieval system. This paper deals with the characteristics of 'relevance' of information and various measuring techniques of retrieval effectivess. The outlines of this study are as follows: 1) Relevance decision for evaluation should be devided into the user-oriented and the system-oriented decisions. 2) The recall-precision measure seems to be user-oriented, and the recall-fallout measure to be system-oriented. 3) Many of composite measures can not be justified III any rational manner unfortunately. 4) The Swets model has demonstrated that it yields, in general, a straight line instead of a curve of varying curvature and emphasized the fundamentally probabilistic nature of information retrieval. 5) The Cooper model seems to be a good substitute for precision and a useful measure for systems which ranked documents. 6) The Rocchio model were proposed for the evaluation of retreval systems which ranked documents, and were designed to be independent of cut-off. 7) The Cawkell model suggested that the Shannon's equation for entropy can be applied to measuring of retrieval effectiveness.

  • PDF

Relationship between Diversity and Productivity at Ratargul Fresh Water Swamp Forest in Bangladesh

  • Sharmin, Mahmuda;Dey, Sunanda;Chowdhury, Sangita
    • Journal of Forest and Environmental Science
    • /
    • v.32 no.3
    • /
    • pp.291-301
    • /
    • 2016
  • One of the most concerned topics in ecology is the relationship between biodiversity and ecosystem functioning. However, there are few field studies, carried out in forests, although many studies have been done in controlled experiments in grasslands. In this paper, we describe the relationship pattern between three facets of diversity and productivity at Ratargul Fresh Water Swamp Forest (RFWSF) in Bangladesh, which is the only remaining fresh water swamp forest of the country. Sixty sample plots were selected from RFWSF and included six functional traits including leaf area (LA), specific leaf area (SLA), leaf dry matter content (LDMC), tree height, bark thickness and wood density. In analyzing TD, we used Shannon diversity and richness indices, functional diversity was measured by Rao's quadratic entropy (Rao 1982) and Faith's (1992) index was used for phylogenetic diversity (PD). It was found that, TD, FD and PD were positively related with productivity (basal area) due to resource use complementarity but surprisingly the best predictor of tree productivity was FD. The results contribute to the understanding the effects of biodiversity loss and it is essential for conservation decision-making and policy-making of Ratargul Fresh Water Swamp Forest.

Multi-objective Optimization of Pedestrian Wind Comfort and Natural Ventilation in a Residential Area

  • H.Y. Peng;S.F. Dai;D. Hu;H.J. Liu
    • International Journal of High-Rise Buildings
    • /
    • v.11 no.4
    • /
    • pp.315-320
    • /
    • 2022
  • With the rapid development of urbanization the problems of pedestrian-level wind comfort and natural ventilation of tall buildings are becoming increasingly prominent. The velocity at the pedestrian level ($\overline{MVR}$) and variation of wind pressure coefficients $\overline{{\Delta}C_p}$ between windward and leeward surfaces of tall buildings were investigated systematically through numerical simulations. The examined parameters included building density ρ, height ratio of building αH, width ratio of building αB, and wind direction θ. The linear and quadratic regression analyses of $\overline{MVR}$ and $\overline{{\Delta}C_p}$ were conducted. The quadratic regression had better performance in predicting $\overline{MVR}$ and $\overline{{\Delta}C_p}$ than the linear regression. $\overline{MVR}$ and $\overline{{\Delta}C_p}$ were optimized by the NSGA-II algorithm. The LINMAP and TOPSIS decision-making methods demonstrated better capability than the Shannon's entropy approach. The final optimal design parameters of buildings were ρ = 20%, αH = 4.5, and αB = 1, and the wind direction was θ = 10°. The proposed method could be used for the optimization of pedestrian-level wind comfort and natural ventilation in a residential area.

A Study on the One-Way Distance in the Longitudinal Section Using Probabilistic Theory (확률론적 이론을 이용한 종단면에서의 단방향 이동거리에 관한 연구)

  • Kim, Seong-Ryul;Moon, Ji-Hyun;Jeon, Hae-Sung;Sue, Jong-Chal;Choo, Yeon-Moon
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.21 no.12
    • /
    • pp.87-96
    • /
    • 2020
  • To use a hydraulic structure effectively, the velocity of a river should be known in detail. In reality, velocity measurements are not conducted sufficiently because of their high cost. The formulae to yield the flux and velocity of the river are commonly called the Manning and Chezy formulae, which are empirical equations applied to uniform flow. This study is based on Chiu (1987)'s paper using entropy theory to solve the limits of the existing velocity formula and distribution and suggests the velocity and distance formula derived from information entropy. The data of a channel having records of a spot's velocity was used to verify the derived formula's utility and showed R2 values of distance and velocity of 0.9993 and 0.8051~0.9483, respectively. The travel distance and velocity of a moving spot following the streamflow were calculated using some flow information, which solves the difficulty in frequent flood measurements when it is needed. This can be used to make a longitudinal section of a river composed of a horizontal distance and elevation. Moreover, GIS makes it possible to obtain accurate information, such as the characteristics of a river. The connection with flow information and GIS model can be used as alarming and expecting flood systems.

Acoustic emission source location and noise cancellation for crack detection in rail head

  • Kuanga, K.S.C.;Li, D.;Koh, C.G.
    • Smart Structures and Systems
    • /
    • v.18 no.5
    • /
    • pp.1063-1085
    • /
    • 2016
  • Taking advantage of the high sensitivity and long-distance detection capability of acoustic emission (AE) technique, this paper focuses on the crack detection in rail head, which is one of the most vulnerable parts of rail track. The AE source location and noise cancellation were studied on the basis of practical rail profile, material and operational noise. In order to simulate the actual AE events of rail head cracks, field tests were carried out to acquire the AE waves induced by pencil lead break (PLB) and operational noise of the railway system. Wavelet transform (WT) was first utilized to investigate the time-frequency characteristics and dispersion phenomena of AE waves. Here, the optimal mother wavelet was selected by minimizing the Shannon entropy of wavelet coefficients. Regarding the obvious dispersion of AE waves propagating along the rail head and the high operational noise, the wavelet transform-based modal analysis location (WTMAL) method was then proposed to locate the AE sources (i.e. simulated cracks) respectively for the PLB-induced AE signals with and without operational noise. For those AE signals inundated with operational noise, the Hilbert transform (HT)-based noise cancellation method was employed to improve the signal-to-noise ratio (SNR). Finally, the experimental results demonstrated that the proposed crack detection strategy could locate PLB-simulated AE sources effectively in the rail head even at high operational noise level, highlighting its potential for field application.

A Risk-Return Analysis of Loan Portfolio Diversification in the Vietnamese Banking System

  • HUYNH, Japan;DANG, Van Dan
    • The Journal of Asian Finance, Economics and Business
    • /
    • v.7 no.9
    • /
    • pp.105-115
    • /
    • 2020
  • The study empirically examines the effects of loan portfolio diversification on bank risk and return in the nascent banking market of Vietnam. Loan portfolio diversification is captured through the Hirschman-Herfindahl index and the Shannon Entropy with sectoral exposures. We access each bank's financial reports to collect the required data, especially the breakdown of sectoral loan portfolios, thus constituting a unique dataset. To compute bank return, we use the traditional accounting indicators, including return-on-assets, return-on-equity, and net-interest margin. For bank risk, we utilize the loan-loss provisions and non-performing loans relative to gross customer loans. Using a sample of 30 commercial banks over the period from 2008 to 2019 and the system generalized method of moments estimator for the dynamic panel, we indicate the downsides of portfolio diversification. Concretely, we observe that all diversification measures exhibit significantly negative signs in all regressions across different bank return proxies. At the same time, the estimates display the significant and positive impact of diversification on the non-performing loan ratio. Hence, sectoral loan portfolio diversification significantly hampers bank performance in both aspects of lower return and higher credit risk. The results are robust across a rich set of bank performance and portfolio diversification measures.

A Study on Detection of Malicious Android Apps based on LSTM and Information Gain (LSTM 및 정보이득 기반의 악성 안드로이드 앱 탐지연구)

  • Ahn, Yulim;Hong, Seungah;Kim, Jiyeon;Choi, Eunjung
    • Journal of Korea Multimedia Society
    • /
    • v.23 no.5
    • /
    • pp.641-649
    • /
    • 2020
  • As the usage of mobile devices extremely increases, malicious mobile apps(applications) that target mobile users are also increasing. It is challenging to detect these malicious apps using traditional malware detection techniques due to intelligence of today's attack mechanisms. Deep learning (DL) is an alternative technique of traditional signature and rule-based anomaly detection techniques and thus have actively been used in numerous recent studies on malware detection. In order to develop DL-based defense mechanisms against intelligent malicious apps, feeding recent datasets into DL models is important. In this paper, we develop a DL-based model for detecting intelligent malicious apps using KU-CISC 2018-Android, the most up-to-date dataset consisting of benign and malicious Android apps. This dataset has hardly been addressed in other studies so far. We extract OPcode sequences from the Android apps and preprocess the OPcode sequences using an N-gram model. We then feed the preprocessed data into LSTM and apply the concept of Information Gain to improve performance of detecting malicious apps. Furthermore, we evaluate our model with numerous scenarios in order to verify the model's design and performance.

The Optimal Partition of Initial Input Space for Fuzzy Neural System : Measure of Fuzziness (퍼지뉴럴 시스템을 위한 초기 입력공간분할의 최적화 : Measure of Fuzziness)

  • Baek, Deok-Soo;Park, In-Kue
    • Journal of the Institute of Electronics Engineers of Korea TE
    • /
    • v.39 no.3
    • /
    • pp.97-104
    • /
    • 2002
  • In this paper we describe the method which optimizes the partition of the input space by means of measure of fuzziness for fuzzy neural network. It covers its generation of fuzzy rules for input sub space. It verifies the performance of the system depended on the various time interval of the input. This method divides the input space into several fuzzy regions and assigns a degree of each of the generated rules for the partitioned subspaces from the given data using the Shannon function and fuzzy entropy function generating the optimal knowledge base without the irrelevant rules. In this scheme the basic idea of the fuzzy neural network is to realize the fuzzy rule base and the process of reasoning by neural network and to make the corresponding parameters of the fuzzy control rules be adapted by the steepest descent algorithm. According to the input interval the proposed inference procedure proves that the fast convergence of root mean square error (RMSE) owes to the optimal partition of the input space

Statistical Consideration on the Resources of the Countries in the World (세계 각국의 자원에 대한 통계적 고찰)

  • Huh, Moon-Yul;Choi, Byong-Su;Lee, Seung-Chun
    • The Korean Journal of Applied Statistics
    • /
    • v.22 no.1
    • /
    • pp.41-57
    • /
    • 2009
  • The paper investigates the resources of the 232 countries based on the 39 resources of these countries. The data used in this work is from various sources like UN, CIA, World bank, OECD reports and the home pages of each country. The purpose of the study is to evaluate what resources are most influential to the wealth of a country, to the well-bring of the country, or the status of the country's development. For this, data visualization method is applied. Data visualization technique, although powerful for exploratory purposes, is dependent upon the users expertize and the interpretation is also dependent on the of the users. For objective methods of investigation, mutual information based on the Shanon's entropy theory is applied here. All the statistical methods employed in this paper are processed with DAVIS (Huh and Song, 2002)