• Title/Summary/Keyword: 유전자 데이터베이스

Search Result 182, Processing Time 0.026 seconds

Development of Local Animal BLAST Search System Using Bioinformatics Tools (생물정보시스템을 이용한 Local Animal BLAST Search System 구축)

  • Kim, Byeong-Woo;Lee, Geun-Woo;Kim, Hyo-Seon;No, Seung-Hui;Lee, Yun-Ho;Kim, Si-Dong;Jeon, Jin-Tae;Lee, Ji-Ung;Jo, Yong-Min;Jeong, Il-Jeong;Lee, Jeong-Gyu
    • Bioinformatics and Biosystems
    • /
    • v.1 no.2
    • /
    • pp.99-102
    • /
    • 2006
  • The Basic Local Alignment Search Tool (BLAST) is one of the most established software in bioinformatics research and it compares a query sequence against the libraries of known sequences in order to investigate sequence similarity. Expressed Sequence Tags (ESTs) are single-pass sequence reads from mRNA (or cDNA) and represent the expression for a given cDNA library and the snapshot of genes expressed in a given tissue and/or at a given developmental stage. Therefore, ESTs can be very valuable information for functional genomics and bioinformatics researches. Although major bio database (DB) websites including NCBI are providing BLAST services and EST data, local DB and search system is demanding for better performance and security issue. Here we present animal EST DBs and local BLAST search system. The animal ESTs DB in NCBI Genbank were divided by animal species using the Perl script we developed. and we also built the new extended DB search systems fur the new data (Local Animal BLAST Search System: http://bioinfo.kohost.net), which was constructed on the high-capacity PC Cluster system fur the best performance. The new local DB contains 650,046 sequences for Bos taurus(cattle), 368,120 sequences for Sus scrofa (pig), 693,005 sequences for Gallus gallus (fowl), respectively.

  • PDF

Cancer subtype's classifier based on Hybrid Samples Balanced Genetic Algorithm and Extreme Learning Machine (하이브리드 균형 표본 유전 알고리즘과 극한 기계학습에 기반한 암 아류형 분류기)

  • Sachnev, Vasily;Suresh, Sundaram;Choi, Yong Soo
    • Journal of Digital Contents Society
    • /
    • v.17 no.6
    • /
    • pp.565-579
    • /
    • 2016
  • In this paper a novel cancer subtype's classifier based on Hybrid Samples Balanced Genetic Algorithm with Extreme Learning Machine (hSBGA-ELM) is presented. Proposed cancer subtype's classifier uses genes' expression data of 16063 genes from open Global Cancer Map (GCM) data base for accurate cancer subtype's classification. Proposed method efficiently classifies 14 subtypes of cancer (breast, prostate, lung, colorectal, lymphoma, bladder, melanoma, uterus, leukemia, renal, pancreas, ovary, mesothelioma and CNS). Proposed hSBGA-ELM unifies genes' selection procedure and cancer subtype's classification into one framework. Proposed Hybrid Samples Balanced Genetic Algorithm searches a reduced robust set of genes responsible for cancer subtype's classification from 16063 genes available in GCM data base. Selected reduced set of genes is used to build cancer subtype's classifier using Extreme Learning Machine (ELM). As a result, reduced set of robust genes guarantees stable generalization performance of the proposed cancer subtype's classifier. Proposed hSBGA-ELM discovers 95 genes probably responsible for cancer. Comparison with existing cancer subtype's classifiers clear indicates efficiency of the proposed method.

The Optimal Distributed Database System Design Using the Genetic Algorithm (유전자 알고리즘을 이용한 최적의 분산 데이터베이스 시스템 설계)

  • Ko, Suc-Bum;Youn, Sung-Dae
    • The Transactions of the Korea Information Processing Society
    • /
    • v.7 no.9
    • /
    • pp.2797-2806
    • /
    • 2000
  • Recently, DDSs(Distributed Database Systems) have been implemented on V AN(V alue Added Network) as we know the amazing expansion of information network. DDS can yield significant cost and response time advantages over centralized systems for geographically distributed organizations. However, inappropriate design can result in high cost and poor response time to maintain the database at each site. In a DDS design, the main problem is how to select proper computer and how to allocate data fragment into a proper site. In this paper, we address DDS design problem of selecting the proper class of computers and the allocating data files on VAN. Also, the formulated model includes two objectives, the waited response time and the investment cost to include their relationship. Specially, the formulation of waited response time is based on M/M/1 queueing system to evaluate more precisely. GA(Genetic Algorithm), a kind of heuristic search method, is developed to search an optimal solution in the proposed design model and we show the simulation result to examine the algorithm performance.

  • PDF

Testis-specific transcripts in the chicken

  • Kim, Duk-Kyung
    • Proceedings of the Korea Society of Poultry Science Conference
    • /
    • 2005.11a
    • /
    • pp.53-59
    • /
    • 2005
  • Sequences of candidate chicken testis-specific genes were analyzed in order to develop a resource for functional genomic studies of the testis and male germ cells. Tentative consensus sequences (TCs) containing ESTs expressed in testis libraries only were selected from the TIGR Gallus gallus Gene Index, resulting in a total of 292 TCs. The transcriptional expression of these genes were evaluated in a variety of chicken tissues, including testis and ovary, Of the panel of 292 TCs, 110 were expressed in a testis-specific manner. The correlation between the number of ESTs assembled into each TC and the number of testis-specific TCs was not significant. Annotation of the TCs using the Gene Ontology database terms showed that the proportion of testis-specific TCs that were classified as having catalytic activity (within the Molecular Function branch) was larger than the proportion of total chicken TCs classified in the same way. Our results might facilitate the investigation of testis-specific genes and their functional analysis in the chicken as well as in other avian species.

  • PDF

Implementation of Cognitive Radio System with Genetic Algorithm Using USRP 2 (유전자 알고리즘이 적용된 USRP 2를 이용한 인지무선 시스템 구현)

  • Yong, Seul-Ba-Ro;Jang, Sung-Jeen;Lee, In-Sun;Kim, Jae-Moung
    • Journal of Satellite, Information and Communications
    • /
    • v.7 no.1
    • /
    • pp.39-47
    • /
    • 2012
  • Currently, most of the frequency spectrum resources are allocated and due to the lack of frequency, low frequency band, optimal for wireless communication environment is not used. Therefore, Cognitive Radio (CR) is a critical issue to solve the spectrum scarcity and to improve frequency spectrum utilization in wireless communication. In this paper, we implement data transmission and receive in a real CR system using the USRP(Universal Software Radio Peripheral) board and GNU Radio package of an open source development kit. Concretely, we detect the Primary User by spectrum sensing, and then we send Primary User information to the database. After receiving the information, because the database already sent optimal transmit power, bandwidth and channel information to CR equipment, CR can communicate without any interference to Primary User.

Integrated Model Design of Microarray Data Using miRNA, PPI, Disease Information (miRNA, PPI, 질병 정보를 이용한 마이크로어레이 데이터 통합 모델 설계)

  • Ha, Kyung-Sik;Lim, Jin-Muk;Kim, Hong-Gee
    • Journal of the Korean Institute of Intelligent Systems
    • /
    • v.22 no.6
    • /
    • pp.786-792
    • /
    • 2012
  • A microarray is a collection of thousands of DNAs or RNAs arranged on a substrate, and it enables one to navigate large amounts of gene expression. However, a researcher uses his designed experimental methods to focus on particular phenotypes from the available mass of data. In this paper, we used MicroRNAs(miRNAs) and Protein-Protein Interation(PPI) databases to enhance and expand meanings in microarray data. Further, the expanded data are linked with the Online Mendelian Inheritance in Man(OMIM), and International Statistical Classification of Diseases and Related Health Problems, $10^{th}$ Revision(ICD-10), in order to extract common genetic relationships between diseases. This approach, we expect, should provide new biological views.

Sequencing analysis of the OFC1 gene on the nonsyndromic cleft lip and palate patient in Korean (한국인 비증후군성 구순구개열 환자의 OFC1 유전자의 서열 분석)

  • Kim, Sung-Sik;Son, Woo-Sung
    • The korean journal of orthodontics
    • /
    • v.33 no.3 s.98
    • /
    • pp.185-197
    • /
    • 2003
  • This study was performed to identify the characteristics of the OFC1 gene (locus: chromosome 6p24.3) in Korean patients, which is assumed to be the major gene behind the nonsyndromic cleft lip and palate. The sample consisted of 80 subjects: 40 nonsyndromic cleft lip and palate patients (proband, 20 males and females, mean age 14.2 years); and 40 normal adults (20 males and 20 females, mean age 25.6 years). Using PCR-based assay, the OFC1 gene was amplified, sequenced, and then searched for similar protein structures. Results were as follows: 1. The OFC1 gene contains the microsatellite marker 'CA' repeats. The number of the reference 'CA' repeats was 21 times, and formed as TA(CA)11TA(CA)10. But, in Koreans, the number of tandem 'CA' repeats was varied from 17 to 26 except 18, and 'CA' repeats consisted of TA(CA)n. 2. Nine allelic variants were found. Distribution of the OFC1 allele was similar between the patients and control group. 3. There was a replacement of the base 'T' to 'C' after 11 tandem 'CA' repeats in Koreans compared with Weissenbach's report. However, the difference did not seem to be the ORF prediction results between Koreans and Weissenbach's report. 4. The BLAST search results showed the Telomerase reverse transcriptase (TERT) and the Nucleotide binding protein 2 (NBP2) as similar proteins. The TERT was a protein product by the hTERT gene in the locus 5p15.33 (NCBI Genome Annotation; NT023089) The NBP2 was a protein product by the ABCC3 (ATP-binding cassette, sub-family C) gene in the locus 17q22 (NCBI Genome Annotation; NT010783). 5. In the Pedant-Pro database analysis, the predictable protein structure of the OFC1 gene had at least one transmembrane region and one non-globular region.

Analysis of Putative Downstream Genes of Arabidopsis AtERF71/HRE2 Transcription Factor using a Microarray (마이크로어레이를 이용한 애기장대 AtERF71/HRE2 전사인자의 하위 유전자 분석)

  • Seok, Hye-Yeon;Lee, Sun-Young;Woo, Dong-Hyuk;Park, Hee-Yeon;Moon, Yong-Hwan
    • Journal of Life Science
    • /
    • v.22 no.10
    • /
    • pp.1359-1370
    • /
    • 2012
  • Arabidopsis AtERF71/HRE2, a transcription activator, is located in the nucleus and is involved in the signal transduction of low oxygen and osmotic stresses. In this study, microarray analysis using AtERF71/HRE2-overexpressing transgenic plants was performed to identify genes downstream of AtERF71/HRE2. A total of 161 different genes as well as AtERF71/HRE2 showed more than a twofold higher expression in AtERF71/HRE2-overexpressing transgenic plants compared with wild-type plants. Among the 161 genes, 24 genes were transcriptional regulators, such as transcription factors and DNA-binding proteins, based on gene ontology annotations, suggesting that AtERF71/HRE2 is an upstream transcription factor that regulates the activities of various downstream genes via these transcription regulators. RT-PCR analysis of 15 genes selected out of the 161 genes showed higher expression in AtERF71/HRE2-overexpressing transgenic plants, validating the microarray data. On the basis of Genevestigator database analysis, 51 genes among the 161 genes were highly expressed under low oxygen and/or osmotic stresses. RT-PCR analysis showed that the expression levels of three genes among the selected 15 genes increased under low oxygen stress and another three genes increased under high salt stress, suggesting that these genes might be downstream genes of AtERF71/HRE2 in low oxygen or high salt stress signal transduction. Microarray analysis results indicated that AtERF71/HRE2 might also be involved in the responses to other abiotic stresses and also in the regulation of plant developmental processes.

Pickprimer: A Graphic User Interface Program for Primer Design on the Gene Target Region (픽프라이머 : 유전자 목표 구간 탐색 모듈을 포함한 프라이머 제작 그래픽 프로그램)

  • Chung, Hee;Mun, Jeong-Hwan;Lee, Seung-Chan;Yu, Hee-Ju
    • Horticultural Science & Technology
    • /
    • v.29 no.5
    • /
    • pp.461-466
    • /
    • 2011
  • In genetic and molecular breeding studies of plants, researchers need to design various kinds of primers based on their research purposes. So far many kinds of web- or script-based non-commercial programs for primer design are available. Because most of them do not include user interface for multipurpose usage including gene structure prediction and direct target selection on sequences, it has been a laborious work to design primers targeting on the exon or intron regions of interesting genes. Here we report a primer designing graphic user interface program, Pickprimer, that includes gene structure prediction and primer design modules by combining source codes of the Spidey and Primer3 programs. This program provides simple graphic user interface to input sequences and design primers. Genomic sequence and mRNA or coding sequence of genes can be copy and pasted or input as fasta or text files. Based on alignment of the input sequences using the Spidey module, a putative gene structure is graphically visualized along with exon-intron sequences of color codes. Primer design can be easily performed by dragging mouse on the displayed sequences or input primer targeting position with desirable values of primers. The output of designed primers with detailed information is provided by the Primer3 module. PCR evaluation of 24 selected primer sets successfully amplified single amplicons from six Brassica rapa cultivars. The Pickprimer will be a convenient tool for genetic and molecular breeding studies of plants.

Functional Analysis of Expressed Sequence Tags from Hanwoo (Korean Cattle) cDNA Libraries (한우 cDNA 라이브러리에서 발현된 ESTs의 기능분석)

  • Lim, Da-Jeong;Byun, Mi-Jeong;Cho, Yong-Min;Yoon, Du-Hak;Lee, Seung-Hwan;Shin, Youn-Hee;Im, Seok-Ki
    • Journal of Animal Science and Technology
    • /
    • v.51 no.1
    • /
    • pp.1-8
    • /
    • 2009
  • We generated 57,598 expressed sequence tags (ESTs) from 3 cDNA libraries of Hanwooo (Korean Cattle), fat, loin, liver. Liver, intermuscular fat and longissimus dorsi tissues were obtained from a 24-month-old Hanwoo steer immediately after slaughter. cDNA library was constructed according to the oligocapped method. The EST data were clustered and assembled into unique sequences, 4,759 contigs and 7,587 singletons. To carry out functional analysis, Gene Ontology annotation and identification of significant leaf nodes were performed that were detected by searching significant p-values from $2^{nd}$ level GO terms to leaf nodes using Bonferroni correction. We found that 13, 26 and 8 significant leaf nodes are unique in the transcripts according to 3 GO categories, molecular function, biological process and cellular component. Also digital gene expression profiling using the Audic's test was performed and tissue specific genes were detected in the above 3 libraries.