• Title/Summary/Keyword: DNA database

Search Result 415, Processing Time 0.023 seconds

Interspecific Transferability of Watermelon EST-SSRs Assessed by Genetic Relationship Analysis of Cucurbitaceous Crops (박과작물의 유연관계 분석을 통한 수박 EST-SSR 마커의 종간 적용성 검정)

  • Kim, Hyeogjun;Yeo, Sang-Seok;Han, Dong-Yeop;Park, Young-Hoon
    • Horticultural Science & Technology
    • /
    • v.33 no.1
    • /
    • pp.93-105
    • /
    • 2015
  • This study was performed to analyze genetic relationships of the four major cucurbitaceous crops including watermelon, melon, cucumber, and squash/pumpkin. Among 120 EST-SSR primer sets selected from the International Cucurbit Genomics Initiative (ICuGI) database, PCR was successful for 51 (49.17%) primer sets and 49 (40.8%) primer sets showed polymorphisms among eight Cucurbitaceae accessions. A total of 382 allele-specific PCR bands were produced by 49 EST-SSR primers from 24 Cucurbitaceae accessions and used for analysis of pairwise similarity and dendrogram construction. Assessment of the genetic relationships resulted in similarity indexes ranging from 0.01 to 0.85. In the dendrogram, 24 Cucurbitaceae accessions were classified into two major groups (Clade I and II) and 8 subgroups. Clade I comprised two subgroups, Clade I-1 for watermelon accessions [I-1a and I-1b-2: three wild-type watermelons (Citrullus lanatus var. citroides Mats. & Nakai), I-1b-1: six watermelon cultivars (Citrullus lanatus var. vulgaris S chrad.)] a nd C lade I -2 for melon and cucumber accessions [I-2a-1 : 4 melon cultivars(Cucumis melo var. cantalupensis Naudin.), I-2a-2: oriental melon cultivars (Cucumis melo var. conomon Makino.), and I-2b: five cucumber cultivars (Cucumis sativus L.)]. Squash and pumpkin accessions composed Clade II {II-1: two squash/ pumpkin cultivars [Cucurbita moschata (Duch. ex Lam.)/Duch. & Poir. and Cucurbita maxima Duch.] and II-2: two squash/pumpkin cultivars, Cucurbita pepo L./Cucurbita ficifolia Bouche.}. These results were in accordance with previously reported classification of Cucurbitaceae species, indicating that watermelon EST-SSRs show a high level of marker transferability and should be useful for genetic study in other cucurbit crops.

Analysis and Evaluation of Frequent Pattern Mining Technique based on Landmark Window (랜드마크 윈도우 기반의 빈발 패턴 마이닝 기법의 분석 및 성능평가)

  • Pyun, Gwangbum;Yun, Unil
    • Journal of Internet Computing and Services
    • /
    • v.15 no.3
    • /
    • pp.101-107
    • /
    • 2014
  • With the development of online service, recent forms of databases have been changed from static database structures to dynamic stream database structures. Previous data mining techniques have been used as tools of decision making such as establishment of marketing strategies and DNA analyses. However, the capability to analyze real-time data more quickly is necessary in the recent interesting areas such as sensor network, robotics, and artificial intelligence. Landmark window-based frequent pattern mining, one of the stream mining approaches, performs mining operations with respect to parts of databases or each transaction of them, instead of all the data. In this paper, we analyze and evaluate the techniques of the well-known landmark window-based frequent pattern mining algorithms, called Lossy counting and hMiner. When Lossy counting mines frequent patterns from a set of new transactions, it performs union operations between the previous and current mining results. hMiner, which is a state-of-the-art algorithm based on the landmark window model, conducts mining operations whenever a new transaction occurs. Since hMiner extracts frequent patterns as soon as a new transaction is entered, we can obtain the latest mining results reflecting real-time information. For this reason, such algorithms are also called online mining approaches. We evaluate and compare the performance of the primitive algorithm, Lossy counting and the latest one, hMiner. As the criteria of our performance analysis, we first consider algorithms' total runtime and average processing time per transaction. In addition, to compare the efficiency of storage structures between them, their maximum memory usage is also evaluated. Lastly, we show how stably the two algorithms conduct their mining works with respect to the databases that feature gradually increasing items. With respect to the evaluation results of mining time and transaction processing, hMiner has higher speed than that of Lossy counting. Since hMiner stores candidate frequent patterns in a hash method, it can directly access candidate frequent patterns. Meanwhile, Lossy counting stores them in a lattice manner; thus, it has to search for multiple nodes in order to access the candidate frequent patterns. On the other hand, hMiner shows worse performance than that of Lossy counting in terms of maximum memory usage. hMiner should have all of the information for candidate frequent patterns to store them to hash's buckets, while Lossy counting stores them, reducing their information by using the lattice method. Since the storage of Lossy counting can share items concurrently included in multiple patterns, its memory usage is more efficient than that of hMiner. However, hMiner presents better efficiency than that of Lossy counting with respect to scalability evaluation due to the following reasons. If the number of items is increased, shared items are decreased in contrast; thereby, Lossy counting's memory efficiency is weakened. Furthermore, if the number of transactions becomes higher, its pruning effect becomes worse. From the experimental results, we can determine that the landmark window-based frequent pattern mining algorithms are suitable for real-time systems although they require a significant amount of memory. Hence, we need to improve their data structures more efficiently in order to utilize them additionally in resource-constrained environments such as WSN(Wireless sensor network).

Delineating Transcription Factor Networks Governing Virulence of a Global Human Meningitis Fungal Pathogen, Cryptococcus neoformans

  • Jung, Kwang-Woo;Yang, Dong-Hoon;Maeng, Shinae;Lee, Kyung-Tae;So, Yee-Seul;Hong, Joohyeon;Choi, Jaeyoung;Byun, Hyo-Jeong;Kim, Hyelim;Bang, Soohyun;Song, Min-Hee;Lee, Jang-Won;Kim, Min Su;Kim, Seo-Young;Ji, Je-Hyun;Park, Goun;Kwon, Hyojeong;Cha, Sooyeon;Meyers, Gena Lee;Wang, Li Li;Jang, Jooyoung;Janbon, Guilhem;Adedoyin, Gloria;Kim, Taeyup;Averette, Anna K.;Heitman, Joseph;Cheong, Eunji;Lee, Yong-Hwan;Lee, Yin-Won;Bahn, Yong-Sun
    • 한국균학회소식:학술대회논문집
    • /
    • 2015.05a
    • /
    • pp.59-59
    • /
    • 2015
  • Cryptococcus neoformans causes life-threatening meningoencephalitis in humans, but the treatment of cryptococcosis remains challenging. To develop novel therapeutic targets and approaches, signaling cascades controlling pathogenicity of C. neoformans have been extensively studied but the underlying biological regulatory circuits remain elusive, particularly due to the presence of an evolutionarily divergent set of transcription factors (TFs) in this basidiomycetous fungus. In this study, we constructed a high-quality of 322 signature-tagged gene deletion strains for 155 putative TF genes, which were previously predicted using the DNA-binding domain TF database (http://www.transcriptionfactor.org/). We tested in vivo and in vitro phenotypic traits under 32 distinct growth conditions using 322 TF gene deletion strains. At least one phenotypic trait was exhibited by 145 out of 155 TF mutants (93%) and approximately 85% of the TFs (132/155) have been functionally characterized for the first time in this study. Through high-coverage phenome analysis, we discovered myriad novel TFs that play critical roles in growth, differentiation, virulence-factor (melanin, capsule, and urease) formation, stress responses, antifungal drug resistance, and virulence. Large-scale virulence and infectivity assays in insect (Galleria mellonella) and mouse host models identified 34 novel TFs that are critical for pathogenicity. The genotypic and phenotypic data for each TF are available in the C. neoformans TF phenome database (http://tf.cryptococcus.org). In conclusion, our phenome-based functional analysis of the C. neoformans TF mutant library provides key insights into transcriptional networks of basidiomycetous fungi and ubiquitous human fungal pathogens.

  • PDF

Finding Weighted Sequential Patterns over Data Streams via a Gap-based Weighting Approach (발생 간격 기반 가중치 부여 기법을 활용한 데이터 스트림에서 가중치 순차패턴 탐색)

  • Chang, Joong-Hyuk
    • Journal of Intelligence and Information Systems
    • /
    • v.16 no.3
    • /
    • pp.55-75
    • /
    • 2010
  • Sequential pattern mining aims to discover interesting sequential patterns in a sequence database, and it is one of the essential data mining tasks widely used in various application fields such as Web access pattern analysis, customer purchase pattern analysis, and DNA sequence analysis. In general sequential pattern mining, only the generation order of data element in a sequence is considered, so that it can easily find simple sequential patterns, but has a limit to find more interesting sequential patterns being widely used in real world applications. One of the essential research topics to compensate the limit is a topic of weighted sequential pattern mining. In weighted sequential pattern mining, not only the generation order of data element but also its weight is considered to get more interesting sequential patterns. In recent, data has been increasingly taking the form of continuous data streams rather than finite stored data sets in various application fields, the database research community has begun focusing its attention on processing over data streams. The data stream is a massive unbounded sequence of data elements continuously generated at a rapid rate. In data stream processing, each data element should be examined at most once to analyze the data stream, and the memory usage for data stream analysis should be restricted finitely although new data elements are continuously generated in a data stream. Moreover, newly generated data elements should be processed as fast as possible to produce the up-to-date analysis result of a data stream, so that it can be instantly utilized upon request. To satisfy these requirements, data stream processing sacrifices the correctness of its analysis result by allowing some error. Considering the changes in the form of data generated in real world application fields, many researches have been actively performed to find various kinds of knowledge embedded in data streams. They mainly focus on efficient mining of frequent itemsets and sequential patterns over data streams, which have been proven to be useful in conventional data mining for a finite data set. In addition, mining algorithms have also been proposed to efficiently reflect the changes of data streams over time into their mining results. However, they have been targeting on finding naively interesting patterns such as frequent patterns and simple sequential patterns, which are found intuitively, taking no interest in mining novel interesting patterns that express the characteristics of target data streams better. Therefore, it can be a valuable research topic in the field of mining data streams to define novel interesting patterns and develop a mining method finding the novel patterns, which will be effectively used to analyze recent data streams. This paper proposes a gap-based weighting approach for a sequential pattern and amining method of weighted sequential patterns over sequence data streams via the weighting approach. A gap-based weight of a sequential pattern can be computed from the gaps of data elements in the sequential pattern without any pre-defined weight information. That is, in the approach, the gaps of data elements in each sequential pattern as well as their generation orders are used to get the weight of the sequential pattern, therefore it can help to get more interesting and useful sequential patterns. Recently most of computer application fields generate data as a form of data streams rather than a finite data set. Considering the change of data, the proposed method is mainly focus on sequence data streams.

Association of SNPs in the HNF4α Gene with Growth Performance of Korean Native Chickens (한국 재래계의 HNF4α 유전자 내 SNP와 성장과의 연관성 분석)

  • Yang, Song-Yi;Choi, So-Young;Hong, Min-Wook;Kim, Hun;Kwak, Kyeongrok;Lee, Hyojeong;Jeong, Dong Kee;Sohn, Sea Hwan;Hong, Yeong Ho;Lee, Sung-Jin
    • Korean Journal of Poultry Science
    • /
    • v.45 no.4
    • /
    • pp.253-260
    • /
    • 2018
  • The hepatocyte nuclear factor 4 alpha ($HNF4{\alpha}$) gene is related to lipid transport, including abdominal fat and growth, in chickens. Interestingly, the A543G SNP within the $HNF4{\alpha}$ gene has previously been reported to be associated with body weight in both broilers and Korean native chickens (KNCs). However, its exact position within the HNF4 is not yet reported. This study aimed to identify the position of the A543G SNP and to identify additional SNPs that can be used as genetic markers in KNCs. A total of 128 KNCs were used for the sequencing and analysis of these genetic associations. As a result, A543G SNP was located in intron 4 of the $HNF4{\alpha}$ gene; it is reported as rs731246957 in the NCBI database. Fourteen SNPs were detected in the sequenced portion of the $HNF4{\alpha}$ gene; three of these, rs731246957, rs736159604 and new SNP, intron 6 (249), were significantly related with growth in the chickens. In this study, the TT genotype of rs731246957, previously reported as A543G SNP, the GG genotype of rs736159604 and GT of new SNP have are highly associated with body weight from birth to 40 weeks of age in KNCs (P<0.01). These results suggest that rs736159604, rs731246957 and intron 6 (249) SNPs within the $HNF4{\alpha}$ gene could function as growth-related markers in the selective breeding of KNCs.