• Title/Summary/Keyword: Second chromosome

Search Result 75, Processing Time 0.022 seconds

Genetic Studies on Lethal and Sterility Genes Concealed in Natural Populations of Drosophila melanogaster (초파리의 자연집단에 보유되어 있는 치사유전자 및 불임유전자에 대한 유전학적 연구)

  • 이택준;이예옥
    • The Korean Journal of Zoology
    • /
    • v.27 no.3
    • /
    • pp.137-150
    • /
    • 1984
  • The present experiments were carried out to understand the genetic structure of the natural population by means of the frequencies of recessive lethal and sterility genes on the second chromosomes of Drosophila melanogaster. The natural populations used for experiment were Anyang, Kimpo and Ulsan populations in 1982 and 1983. The mean frequencies of deleterious gene (lethal plus semilethal) were estimated 29.01% in Anyang, 30.07% in Kimpo and 32.31% in Ulsan population. Allelism rates on the chromosome between lethals extracted from natural populations were examined within or between populations. The mean allelism rates were showed 2.28% in Anyang, 1.90% in Kimpo and 2.17% in Ulsan. The values of elimination $(IQ^2)$ were estimated by frequencies of deleterious genes and allelism rates. The mean values of elimination were 0.0020 in Anyang, 0.0019 in Kimpo and 0.0023 in Ulsan population. The effective population size was estimated by using a formula by Nei. Anyang, Kimpo and Ulsan populations were about 2, 900, 3, 600 and 3, 200, respecively. These data suggest that Korean populations of Drosophila melanogaster attained to stable breeding units of intermediate size, ranging from 2, 900 to 3, 600 pairs of fertile individuals.

  • PDF

Improved Method for Learning Context-Free Grammar using Tabular representation

  • Jung, Soon-Ho
    • Journal of the Korea Society of Computer and Information
    • /
    • v.27 no.2
    • /
    • pp.43-51
    • /
    • 2022
  • In this paper, we suggest the method to improve the existing method leaning context-free grammar(CFG) using tabular representation(TBL) as a chromosome of genetic algorithm in grammatical inference and show the more efficient experimental result. We have two improvements. The first is to improve the formula to reflect the learning evaluation of positive and negative examples at the same time for the fitness function. The second is to classify partitions corresponding to TBLs generated from positive learning examples according to the size of the learning string, proceed with the evolution process by class, and adjust the composition ratio according to the success rate to apply the learning method linked to survival in the next generation. These improvements provide better efficiency than the existing method by solving the complexity and difficulty in the crossover and generalization steps between several individuals according to the size of the learning examples. We experiment with the languages proposed in the existing method, and the results show a rather fast generation rate that takes fewer generations to complete learning with the same success rate than the existing method. In the future, this method can be tried for extended CYK, and furthermore, it suggests the possibility of being applied to more complex parsing tables.

Bankruptcy prediction using an improved bagging ensemble (개선된 배깅 앙상블을 활용한 기업부도예측)

  • Min, Sung-Hwan
    • Journal of Intelligence and Information Systems
    • /
    • v.20 no.4
    • /
    • pp.121-139
    • /
    • 2014
  • Predicting corporate failure has been an important topic in accounting and finance. The costs associated with bankruptcy are high, so the accuracy of bankruptcy prediction is greatly important for financial institutions. Lots of researchers have dealt with the topic associated with bankruptcy prediction in the past three decades. The current research attempts to use ensemble models for improving the performance of bankruptcy prediction. Ensemble classification is to combine individually trained classifiers in order to gain more accurate prediction than individual models. Ensemble techniques are shown to be very useful for improving the generalization ability of the classifier. Bagging is the most commonly used methods for constructing ensemble classifiers. In bagging, the different training data subsets are randomly drawn with replacement from the original training dataset. Base classifiers are trained on the different bootstrap samples. Instance selection is to select critical instances while deleting and removing irrelevant and harmful instances from the original set. Instance selection and bagging are quite well known in data mining. However, few studies have dealt with the integration of instance selection and bagging. This study proposes an improved bagging ensemble based on instance selection using genetic algorithms (GA) for improving the performance of SVM. GA is an efficient optimization procedure based on the theory of natural selection and evolution. GA uses the idea of survival of the fittest by progressively accepting better solutions to the problems. GA searches by maintaining a population of solutions from which better solutions are created rather than making incremental changes to a single solution to the problem. The initial solution population is generated randomly and evolves into the next generation by genetic operators such as selection, crossover and mutation. The solutions coded by strings are evaluated by the fitness function. The proposed model consists of two phases: GA based Instance Selection and Instance based Bagging. In the first phase, GA is used to select optimal instance subset that is used as input data of bagging model. In this study, the chromosome is encoded as a form of binary string for the instance subset. In this phase, the population size was set to 100 while maximum number of generations was set to 150. We set the crossover rate and mutation rate to 0.7 and 0.1 respectively. We used the prediction accuracy of model as the fitness function of GA. SVM model is trained on training data set using the selected instance subset. The prediction accuracy of SVM model over test data set is used as fitness value in order to avoid overfitting. In the second phase, we used the optimal instance subset selected in the first phase as input data of bagging model. We used SVM model as base classifier for bagging ensemble. The majority voting scheme was used as a combining method in this study. This study applies the proposed model to the bankruptcy prediction problem using a real data set from Korean companies. The research data used in this study contains 1832 externally non-audited firms which filed for bankruptcy (916 cases) and non-bankruptcy (916 cases). Financial ratios categorized as stability, profitability, growth, activity and cash flow were investigated through literature review and basic statistical methods and we selected 8 financial ratios as the final input variables. We separated the whole data into three subsets as training, test and validation data set. In this study, we compared the proposed model with several comparative models including the simple individual SVM model, the simple bagging model and the instance selection based SVM model. The McNemar tests were used to examine whether the proposed model significantly outperforms the other models. The experimental results show that the proposed model outperforms the other models.

Studies on Artificial Polyploid Forest Trees XIII -Some Morphological and Physiological Characteristics of Colchitetraploid Hibiscus syriacus L.- (인위배수성(人爲倍數性) 임목(林木)에 관(關)한 연구(硏究) XIII -Colchitetraploid인 자주무궁화와 단심무궁화의 몇 형태학적(形態學的) 및 생리학적(生理學的) 특성(特性)-)

  • Lee, Suk Koo;Kim, Chung Suk
    • Journal of Korean Society of Forest Science
    • /
    • v.32 no.1
    • /
    • pp.73-86
    • /
    • 1976
  • Two individuals ($sp_1$, $sp_2$) of purple and one individual ($sd_1$) of red hearted flower were selected from 18 years old Hibiscus syriacus trees obtained from the seeds treated with colchicine, and their morphological and physiological characteristics were investigated and following results were obtained. 1. The somatic chromosome number of the selected individuals, $sp_1$, $sp_2$, and $sd_1$ were 2n=160, while that of the check tree was 2n=80, indicating that the selected individuals, $sp_1$, $sp_2$ and $sd_1$ were tetraploid. 2. Peroxidase isoenzyme bands of high activity in selected individuals, $sp_1$, $sd_1$ and check tree were mostly in cathode, fixed band was f and v bands, and frequency of each band and their activity were not different between selected individuals, $sp_1$ and $sd_1$ and check tree. 3. The flowers of $sp_1$ individual were large in size and more dark purple than check tree's. The flowers of $sp_2$ individual were not increased in size, but they were dark purple and red heart at the base of the petal was expanded to 2/3 of the petal length. The flower of $sd_1$ individual was also large and some of the red lines from the petal base were extended to 2/3 of the petal length, which was much longer than those of the check tree. 4. Thickess of leaves, length of guard cells, diameter of pollens, wood fiber lengths and woody fiber widths were all increased in $sp_1$, $sp_2$ and $sd_1$ as compared to those of the check tree. 5. Survival percentage of cuttings was 80% with $sp_1$ and 36% with $sd_1$, and their growth performance were inferior to control in their second growing season.

  • PDF

Development of Intelligent ATP System Using Genetic Algorithm (유전 알고리듬을 적용한 지능형 ATP 시스템 개발)

  • Kim, Tai-Young
    • Journal of Intelligence and Information Systems
    • /
    • v.16 no.4
    • /
    • pp.131-145
    • /
    • 2010
  • The framework for making a coordinated decision for large-scale facilities has become an important issue in supply chain(SC) management research. The competitive business environment requires companies to continuously search for the ways to achieve high efficiency and lower operational costs. In the areas of production/distribution planning, many researchers and practitioners have developedand evaluated the deterministic models to coordinate important and interrelated logistic decisions such as capacity management, inventory allocation, and vehicle routing. They initially have investigated the various process of SC separately and later become more interested in such problems encompassing the whole SC system. The accurate quotation of ATP(Available-To-Promise) plays a very important role in enhancing customer satisfaction and fill rate maximization. The complexity for intelligent manufacturing system, which includes all the linkages among procurement, production, and distribution, makes the accurate quotation of ATP be a quite difficult job. In addition to, many researchers assumed ATP model with integer time. However, in industry practices, integer times are very rare and the model developed using integer times is therefore approximating the real system. Various alternative models for an ATP system with time lags have been developed and evaluated. In most cases, these models have assumed that the time lags are integer multiples of a unit time grid. However, integer time lags are very rare in practices, and therefore models developed using integer time lags only approximate real systems. The differences occurring by this approximation frequently result in significant accuracy degradations. To introduce the ATP model with time lags, we first introduce the dynamic production function. Hackman and Leachman's dynamic production function in initiated research directly related to the topic of this paper. They propose a modeling framework for a system with non-integer time lags and show how to apply the framework to a variety of systems including continues time series, manufacturing resource planning and critical path method. Their formulation requires no additional variables or constraints and is capable of representing real world systems more accurately. Previously, to cope with non-integer time lags, they usually model a concerned system either by rounding lags to the nearest integers or by subdividing the time grid to make the lags become integer multiples of the grid. But each approach has a critical weakness: the first approach underestimates, potentially leading to infeasibilities or overestimates lead times, potentially resulting in excessive work-inprocesses. The second approach drastically inflates the problem size. We consider an optimized ATP system with non-integer time lag in supply chain management. We focus on a worldwide headquarter, distribution centers, and manufacturing facilities are globally networked. We develop a mixed integer programming(MIP) model for ATP process, which has the definition of required data flow. The illustrative ATP module shows the proposed system is largely affected inSCM. The system we are concerned is composed of a multiple production facility with multiple products, multiple distribution centers and multiple customers. For the system, we consider an ATP scheduling and capacity allocationproblem. In this study, we proposed the model for the ATP system in SCM using the dynamic production function considering the non-integer time lags. The model is developed under the framework suitable for the non-integer lags and, therefore, is more accurate than the models we usually encounter. We developed intelligent ATP System for this model using genetic algorithm. We focus on a capacitated production planning and capacity allocation problem, develop a mixed integer programming model, and propose an efficient heuristic procedure using an evolutionary system to solve it efficiently. This method makes it possible for the population to reach the approximate solution easily. Moreover, we designed and utilized a representation scheme that allows the proposed models to represent real variables. The proposed regeneration procedures, which evaluate each infeasible chromosome, makes the solutions converge to the optimum quickly.