• Title/Abstract/Keywords: genome database

BioSubroutine: an Open Web Server for Bioinformatics Algorithms and Subroutines

  • Lee, Joowon;Kim, Hana;Lee, Wonhye;Chung, Dongil;Bhak, Jong
    • Genomics & Informatics / Vol. 3, No. 1 / pp. 35-38 / 2005
  • We present BioSubroutine, an open depository server that automatically categorizes various subroutines frequently used in bioinformatics research. We processed a large bioinformatics subroutine library called Bio.pl that was the first Bioperl subroutine library built in 1995. Over 1000 subroutines were processed automatically and an HTML interface has been created. BioSubroutine can accept new subroutines and algorithms from any such subroutine library, as well as provide interactive user forms. The subroutines are stored in an SQL database for quick searching and accessing. BioSubroutine is an open access project under the BioLicense license scheme.
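
As a rough illustration of storing subroutines in an SQL database for quick searching, here is a minimal Python sketch using the standard-library `sqlite3` module. The table name, column names, and sample entries are assumptions made for this example; the abstract does not describe the actual BioSubroutine schema.

```python
import sqlite3


def build_db(subroutines):
    """Create an in-memory database holding (name, category, source) rows.

    The schema is hypothetical; it is not taken from BioSubroutine itself.
    """
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE subroutine ("
        "name TEXT PRIMARY KEY, category TEXT, source TEXT)"
    )
    conn.executemany("INSERT INTO subroutine VALUES (?, ?, ?)", subroutines)
    conn.commit()
    return conn


def search(conn, keyword):
    """Return names of subroutines whose name or category contains keyword."""
    pattern = f"%{keyword}%"
    rows = conn.execute(
        "SELECT name FROM subroutine "
        "WHERE name LIKE ? OR category LIKE ? ORDER BY name",
        (pattern, pattern),
    )
    return [row[0] for row in rows]


if __name__ == "__main__":
    # Illustrative entries only; the source strings are placeholders.
    demo = build_db([
        ("read_fasta", "sequence-io", "sub read_fasta { }"),
        ("rev_comp", "sequence", "sub rev_comp { }"),
        ("parse_pdb", "structure", "sub parse_pdb { }"),
    ])
    print(search(demo, "sequence"))
```

Parameterized queries (`?` placeholders) keep user-supplied keywords from being interpreted as SQL, which matters for a server that accepts input through web forms.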

NBLAST: a graphical user interface-based two-way BLAST software with a dot plot viewer

  • Choi, Beom-Soon;Choi, Seon Kang;Kim, Nam-Soo;Choi, Ik-Young
    • Genomics & Informatics / Vol. 20, No. 3 / pp. 36.1-36.6 / 2022
  • BLAST, a basic bioinformatics tool for searching local sequence similarity, has been one of the most widely used bioinformatics programs since its introduction in 1990. Users generally rely on the web-based NCBI-BLAST program for BLAST analysis. However, users with large sequence data sets often run into upload size limits when using the web-based program. This is inconvenient, as scientists often want to run BLAST on their own data, such as transcriptome or whole-genome sequences. To overcome this issue, we developed NBLAST, a graphical user interface-based BLAST program that employs a two-way system, allowing input sequences to be used as either the "query" or the "target" in the BLAST analysis. NBLAST is also equipped with a dot plot viewer, allowing researchers to create a custom database for BLAST and run a dot plot similarity analysis within a single program. NBLAST is available at http://nbitglobal.com/nblast.
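
To make the idea of a dot plot concrete, the following Python sketch marks every position where a short word from one sequence matches a word in the other exactly. It is a generic textbook-style illustration, not NBLAST's implementation; the function names and the `word` parameter are assumptions for this example.

```python
def dot_plot(query, target, word=3):
    """Return (i, j) pairs where query[i:i+word] equals target[j:j+word]."""
    # Index every word of the target by its start positions.
    index = {}
    for j in range(len(target) - word + 1):
        index.setdefault(target[j:j + word], []).append(j)
    # Look up each query word in the index.
    hits = []
    for i in range(len(query) - word + 1):
        for j in index.get(query[i:i + word], []):
            hits.append((i, j))
    return hits


def render(query, target, word=3):
    """Draw the dot plot as text rows: '*' for a word match, '.' otherwise."""
    hit_set = set(dot_plot(query, target, word))
    n_rows = len(query) - word + 1
    n_cols = len(target) - word + 1
    return [
        "".join("*" if (i, j) in hit_set else "." for j in range(n_cols))
        for i in range(n_rows)
    ]


if __name__ == "__main__":
    for row in render("ACGTAC", "ACGTAC"):
        print(row)
```

Identical sequences produce an unbroken diagonal; insertions, deletions, and repeats show up as shifted or parallel diagonals. Swapping the two arguments transposes the plot, which loosely mirrors treating the same input as either query or target.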

Thoroughbred Horse Single Nucleotide Polymorphism and Expression Database: HSDB

  • Lee, Joon-Ho;Lee, Taeheon;Lee, Hak-Kyo;Cho, Byung-Wook;Shin, Dong-Hyun;Do, Kyoung-Tag;Sung, Samsun;Kwak, Woori;Kim, Hyeon Jeong;Kim, Heebal;Cho, Seoae;Park, Kyung-Do
    • Asian-Australasian Journal of Animal Sciences / Vol. 27, No. 9 / pp. 1236-1243 / 2014
  • Genetics is important for the breeding and selection of horses, but there is a lack of well-established horse-related browsers and databases. To better understand horses, more variants and other integrated information are needed. We therefore constructed a horse genomic variant database that includes expression and other information. The Horse Single Nucleotide Polymorphism and Expression Database (HSDB) (http://snugenome2.snu.ac.kr/HSDB) provides the number of genomic variants, including rare variants, that remain to be identified in the horse genome, based on population genome sequences of eighteen horses and RNA-seq data from four horses. The identified single nucleotide polymorphisms (SNPs) were confirmed by comparing them with SNP chip data and with RNA-seq variants, which showed concordance levels of 99.02% and 96.6%, respectively. Moreover, the database provides the genomic variants together with their corresponding transcriptional profiles from the same individuals to help understand the functional aspects of these variants. The database will contribute to genetic improvement and breeding strategies for Thoroughbreds.
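
A concordance level of this kind is typically the fraction of shared sites at which two call sets agree. The sketch below shows that arithmetic in Python under simple assumptions: genotypes are unphased two-letter strings keyed by a site identifier, and only sites present in both sets are compared. The abstract does not specify the authors' exact procedure, so the function names and data layout here are hypothetical.

```python
def normalize(genotype):
    """Treat genotypes as unphased: 'GA' and 'AG' are the same call."""
    return "".join(sorted(genotype.upper()))


def concordance(calls_a, calls_b):
    """Fraction of shared sites where both call sets give the same genotype."""
    shared = sorted(set(calls_a) & set(calls_b))
    if not shared:
        raise ValueError("no shared sites to compare")
    matches = sum(
        normalize(calls_a[site]) == normalize(calls_b[site]) for site in shared
    )
    return matches / len(shared)


if __name__ == "__main__":
    # Made-up example calls, not data from HSDB.
    sequencing = {"chr1:100": "AG", "chr1:200": "CC", "chr2:50": "TT"}
    chip = {"chr1:100": "GA", "chr1:200": "CT", "chr3:7": "GG"}
    print(f"{concordance(sequencing, chip):.2%}")
```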

"The Korean Genome for Asian Health": A Commercialization Strategy of the Korean Genome Projects

  • Hyun, Jaehwan
    • Journal of Science and Technology Studies (과학기술학연구) / Vol. 19, No. 2 / pp. 117-167 / 2019
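  • Since the publication of the draft of the Human Genome Project, several Korean genome projects have been pursued. One of the interesting discourses that emerged around the Korean genome is the claim that sequencing the "Korean genome" can realize "personalized medicine for Asians." This paper treats this claim as a strategy that Korean genome scientists developed in their efforts to commercialize genomic data on their own population, and it historically examines the background against which this "Korean Genome for Asian Health" strategy emerged. The paper argues that the strategy of the Korean genome projects is the product of "genome sovereignty" policy, frequently found in postcolonial states, combined with the Asian regionalism that has risen as a major policy agenda in Korea since the early 2000s. In doing so, the study points out that science and technology studies (STS) research on genomics and the making of Asians, which has so far centered on the Pan-Asian Single Nucleotide Polymorphism Consortium, has overgeneralized the experience of Singapore, which has its own local notion of Asians and of Asian regionalism. It also offers an opportunity to reflect on the roles that STS scholars can play in the governance of Korean genomics.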

Bioinformatics Resources of the Korean Bioinformation Center (KOBIC)

  • Lee, Byung-Wook;Chu, In-Sun;Kim, Nam-Shin;Lee, Jin-Hyuk;Kim, Seon-Yong;Kim, Wan-Kyu;Lee, Sang-Hyuk
    • Genomics & Informatics / Vol. 8, No. 4 / pp. 165-169 / 2010
  • The Korean Bioinformation Center (KOBIC) is a national bioinformatics research center in Korea. We developed many bioinformatics algorithms and applications to facilitate the biological interpretation of OMICS data. Here we present an introduction to major bioinformatics resources of databases and tools developed at KOBIC. These resources are classified into three main fields: genome, proteome, and literature. In the genomic resources, we constructed several pipelines for next generation sequencing (NGS) data processing and developed analysis algorithms and web-based database servers including miRGator, ESTpass, and CleanEST. We also built integrated databases and servers for microarray expression data such as MDCDP. As for the proteome data, VnD database, WDAC, Localizome, and CHARMM_HM web servers are available for various purposes. We constructed IntoPub server and Patome database in the literature field. We continue constructing and maintaining the bioinformatics infrastructure and developing algorithms.

A Genome-wide Approach for Functional Analysis Using Rice Mutant

  • Yim, Won-Cheol;Kim, Dong-Sub;Moon, Jun-Cheol;Jang, Cheol-Seong;Lee, Byung-Moo
    • Korean Journal of Crop Science (한국작물학회지) / Vol. 54, No. 3 / pp. 332-338 / 2009
  • The rapid expansion of genomic databases has led to remarkable advances in functional genomics. This study proposes a novel methodology for functional analysis that combines a 5-methyltryptophan (5 MT) mutant with 2-DE analysis and public microarray databases. A total of 24 proteins were changed in the 5 MT mutant, and four markedly differentially expressed proteins were identified. Among them, three spots were mapped to Affymetrix probes. A total of 155 microarray samples were retrieved from the Gene Expression Omnibus (GEO) at NCBI and used to construct gene co-expression networks over a broad range of biological conditions with the Self-Organising Tree Algorithm. Three co-expressed gene clusters were retrieved, and the functional categorization of each, with its differential expression pattern, was shown for the 5 MT-resistant mutant rice. This indicated new co-expression networks in the mutant. The study suggests the possibility of linking 2-DE results to public microarray databases to exploit their full potential.

Nucleotide and protein researches on anaerobic fungi during four decades

  • Chang, Jongsoo;Park, Hyunjin
    • Journal of Animal Science and Technology / Vol. 62, No. 2 / pp. 121-140 / 2020
  • Anaerobic fungi inhabit the gastrointestinal tract of foregut and hindgut fermenters and degrade fibrous plant biomass through hydrolysis reactions with a wide variety of cellulolytic enzymes and through physical penetration of the fiber matrix with their rhizoids. To date, seventeen genera have been described in family Neocallimasticaceae, class Neocallimastigomycetes, phylum Neocallimastigomycota and one genus has been described in phylum Neocallimastigomycota. In the National Center for Biotechnology Information (NCBI) database (DB), 23,830 nucleotide sequences and 59,512 protein sequences have been deposited, and most of them originated from Piromyces, Neocallimastix and Anaeromyces. Most of the protein sequences (44,025) were acquired with the PacBio next-generation sequencing system. The whole genome sequences of Anaeromyces robustus, Neocallimastix californiae, Pecoramyces ruminantium, Piromyces finnis and Piromyces sp. E2 are available in the Joint Genome Institute (JGI) database. According to the results of protein prediction, average isoelectric points (pIs) ranged from 5.88 (Anaeromyces) to 6.57 (Piromyces), and average molecular weights ranged from 38.7 kDa (Orpinomyces) to 56.6 kDa (Piromyces). In the Carbohydrate-Active enZYmes (CAZY) database, glycoside hydrolases (36), carbohydrate binding modules (11), carbohydrate esterases (8), glycosyltransferases (5) and polysaccharide lyases (3) from anaerobic fungi were registered. Over four decades, 1,031 research articles about anaerobic fungi were published, and 444 and 719 articles were available in the PubMed (PM) and PubMed Central (PMC) DBs, respectively.

Prediction of Mammalian MicroRNA Targets - Comparative Genomics Approach with Longer 3' UTR Databases

  • Nam, Seungyoon;Kim, Young-Kook;Kim, Pora;Kim, V. Narry;Shin, Seokmin;Lee, Sanghyuk
    • Genomics & Informatics / Vol. 3, No. 3 / pp. 53-62 / 2005
  • MicroRNAs play an important role in regulating gene expression, but identifying their targets is difficult because of their short length and imperfect complementarity. Burge and coworkers developed a program called TargetScan that allows imperfect complementarity and established a procedure favoring targets with multiple binding sites conserved in multiple organisms. We improved their algorithm in two major respects: (i) using a well-defined UTR (untranslated region) database, and (ii) examining the extent of conservation specifically inside the 3' UTR. The average UTR length in our database, based on the ECgene annotation, is more than twice that of Ensembl. TargetScan was then used to identify putative binding sites. The extent of conservation varies significantly inside the 3' UTR. We used the 'tight' tracks in the UCSC genome browser to select binding sites conserved in multiple species. By combining the longer 3' UTR data, TargetScan, and tightly conserved blocks of genomic DNA, we identified 107 putative target genes with multiple binding sites conserved in multiple species, of which 85 putative targets are novel.
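
The core idea of seed-based target search with a multi-site, multi-species filter can be sketched in a few lines of Python. This is a deliberately simplified illustration, not the TargetScan algorithm or the authors' pipeline: it assumes a 7-nucleotide seed at miRNA positions 2-8, requires perfect seed complementarity, and approximates conservation as "enough sites found in every species' 3' UTR" rather than using UCSC conservation tracks. All function names and parameters are hypothetical.

```python
# Watson-Crick complement table for RNA.
COMPLEMENT = str.maketrans("ACGU", "UGCA")


def seed_site(mirna, start=1, end=8):
    """Return the mRNA sequence that pairs with the miRNA seed.

    The default slice covers nucleotides 2-8 (1-based) of the miRNA;
    the site is the reverse complement of that seed.
    """
    seed = mirna.upper()[start:end]
    return seed.translate(COMPLEMENT)[::-1]


def find_sites(utr, site):
    """Return all start positions of site in the UTR (overlaps allowed)."""
    utr = utr.upper().replace("T", "U")  # accept DNA-style input
    positions = []
    pos = utr.find(site)
    while pos != -1:
        positions.append(pos)
        pos = utr.find(site, pos + 1)
    return positions


def is_conserved_multi_site_target(utrs_by_species, mirna, min_sites=2):
    """True if every species' UTR carries at least min_sites seed matches."""
    site = seed_site(mirna)
    return all(
        len(find_sites(utr, site)) >= min_sites
        for utr in utrs_by_species.values()
    )


if __name__ == "__main__":
    let7a = "UGAGGUAGUAGGUUGUAUAGUU"
    # Toy UTRs invented for the example.
    utrs = {
        "human": "AAACUACCUCGGGCUACCUCAA",
        "mouse": "CUACCUCUUUUCUACCUC",
    }
    print(seed_site(let7a), is_conserved_multi_site_target(utrs, let7a))
```

Because the search space is the 3' UTR, a longer and better-annotated UTR set directly increases the number of candidate sites that a scan like this can see, which is the motivation the abstract gives for the ECgene-based database.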

Genome-Based Virus Taxonomy with the ICTV Database Extension

  • Kang, Shinduck;Kim, Young-Chang
    • Genomics & Informatics / Vol. 16, No. 4 / pp. 22.1-22.5 / 2018
  • In 1966, the International Committee on Nomenclature of Viruses (ICNV) was established to standardize the naming of viruses. In 1975, the organization was renamed the "International Committee on Taxonomy of Viruses (ICTV)," by which it is still known today. The primary virus classification provided by ICTV in 1971 was for viruses infecting vertebrates and included 19 genera, 2 families, and 24 unclassified groups. Presently, the 10th virus taxonomy has been published. However, the early classification of viruses was based on clinical results "in vivo" and "in vitro," as well as on the phenotypic shape of the virus. With the development of next-generation sequencing and the accompanying bioinformatics analysis pipelines, a reconstruction of the classification system has been proposed. At a meeting held in Boston, USA from June 9 to 11, 2016, there was an in-depth discussion of virus classification using metagenomic data. One suggestion that arose from the meeting was that viral taxonomy should be reconstructed on the basis of genotype and "in silico" bioinformatics analysis. This article describes our efforts toward this goal through the construction of a web-based system and the extension of an associated database based on ICTV taxonomy. The virus taxonomy web system was designed specifically to extend the virus taxonomy down to the strain and isolate levels, and it is connected with the NCBI database to facilitate searches for specific viral genes. It also links to journal articles through the EMBL RESTful API, which improves accessibility for academic groups.