• 제목/요약/키워드: gene annotation

검색결과 182건 처리시간 0.025초

Mouse phenogenomics, toolbox for functional annotation of human genome

  • Kim, Il-Yong;Shin, Jae-Hoon;Seong, Je-Kyung
    • BMB Reports
    • /
    • 제43권2호
    • /
    • pp.79-90
    • /
    • 2010
  • Mouse models are crucial for the functional annotation of human genome. Gene modification techniques including gene targeting and gene trap in mouse have provided powerful tools in the form of genetically engineered mice (GEM) for understanding the molecular pathogenesis of human diseases. Several international consortium and programs are under way to deliver mutations in every gene in mouse genome. The information from studying these GEM can be shared through international collaboration. However, there are many limitations in utility because not all human genes are knocked out in mouse and they are not yet phenotypically characterized by standardized ways which is required for sharing and evaluating data from GEM. The recent improvement in mouse genetics has now moved the bottleneck in mouse functional genomics from the production of GEM to the systematic mouse phenotype analysis of GEM. Enhanced, reproducible and comprehensive mouse phenotype analysis has thus emerged as a prerequisite for effectively engaging the phenotyping bottleneck. In this review, current information on systematic mouse phenotype analysis and an issue-oriented perspective will be provided.

Molecular Identification and Fine Mapping of a Major Quantitative Trait Locus, OsGPq3 for Seed Low-Temperature Germinability in Rice

  • Nari Kim;Rahmatullah Jan;Jae-Ryoung Park;Saleem Asif;Kyung-Min Kim
    • 한국작물학회:학술대회논문집
    • /
    • 한국작물학회 2022년도 추계학술대회
    • /
    • pp.283-283
    • /
    • 2022
  • Abiotic stresses such as high/low temperature, drought, salinity, and submergence directly or indirectly influence the physiological status and molecular mechanisms of rice which badly affect yield. Especially, the low temperature causes harmful influences in the overall process of rice growth such as uneven germination and the establishment of seedlings, which has become one of the main limiting factors affecting rice production in the world. It is of great significance to find the candidate genes controlling low-temperature tolerance during seed germination and study their functions for breeding new rice cultivars with immense low-temperature tolerance during seed germination. In this study, 120 lines of Cheongcheong/Nagdong double haploid population were used for quantitative trait locus analysis of low-temperature germinability. The results showed significant difference in germination under low different temperature conditions. In total, 4 QTLs were detected on chromosome 3, 6, and 8. A total of 41 genes were identified from all the 4 QTLs, among them, 25 genes were selected by gene function annotation and further screened through quantitative real time polymerase chain reaction. Based on gene function annotation and level of expression under low-temperature, our study suggested OsGPq3 gene as a candidate gene controlling viviparous germination, ABA and GA signaling under low-temperature. This study will provide a theoretical basis for marker-assisted breeding.

  • PDF

GORank: Gene Ontology를 이용한 유전자 산물의 의미적 유사성 검색 (GORank: Semantic Similarity Search for Gene Products using Gene Ontology)

  • 김기성;유상원;김형주
    • 한국정보과학회논문지:데이타베이스
    • /
    • 제33권7호
    • /
    • pp.682-692
    • /
    • 2006
  • 유사한 생물학적 특성을 가진 유전자 산물을 검색하는 것은 생물정보학 연구에 필수적인 기술이다. 현재 대부분의 생물학 데이타베이스에서 Gene Ontology의 용어를 사용하여 유전자 산물의 생물학적 특성을 기술하고 있다. 본 논문에서는 이런 유전자 산물의 주석 정보를 사용해 의미적으로 유사한 유전자 산물을 검색하는 방법을 제안한다. 이를 위해 우선 정보 이론에 기반한 유전자 산물간의 의미적 유사도를 정의하였다. 그리고 이 유사도를 이용한 의미적 유사성 검색 알고리즘을 제안하였다. 의미적 유사성 검색을 처리하기 위해 Fagin의 문턱값 알고리즘(threshold algorithm)을 다음과 같이 변형한 기법을 사용하였다. 우선 사용하는 유사도 함수가 단조 증가 성질을 갖지 않기 때문에 유사도 함수에 맞는 문턱값을 재정의 하였다. 또 역색인 리스트의 구조를 사용하여 중간 검색을 생략할 수 있는 클러스터 스키핑 기법과 역색인 리스트 액세스 순서를 제안하였다. 실제 GO와 주석 정보를 이용하여 성능 평가를 했으며 제안한 알고리즘은 효율적인 알고리즘임을 보였다.

통합형 미생물 유전자 예측 시스템의 구축에 관한 연구 (A Study on Construction of Integrated Prokaryotes Gene Prediction System)

  • 장종원;류윤규;구자효;윤영우
    • 융합신호처리학회논문지
    • /
    • 제6권1호
    • /
    • pp.27-32
    • /
    • 2005
  • 유전자 서열 분석기의 발달로 유전체 서열 데이터는 급속도로 증가하여 자동적으로 유전체에 주석을 첨부하는 과정이 필요하다. 유전체에 주석을 다는 작업 중 가장 어려운 과정이 유전체내에 존재하는 단백질을 코드화하고 있는 유전자의 탐색이다. 진핵생물과 원핵생물은 유전자 구조에서 현격한 차이를 보이고 있으므로 유전자를 예측하는 방법도 각각 달라야 한다. 지금까지 전체 유전체 서열이 밝혀진 231종의 생물에서 200종이 원핵생물이다. 그러므로 비교 유전체학을 통한 생물공학 연구에서 진핵생물보다 원핵생물이 더 적합하다 할 것이다. 게다가 원핵생물의 경우 intron이라는 구조를 가지고 있지 않아 유전자 예측이 더 간단하다. 이전에 연구된 원핵생물의 유전자 예측 정확성은 80%~90%에 이르고 있고 최근의 연구에서는 유전자 예측 정확도 100%를 목표로 하고 있고, 본 논문에서는 E. coli K-12와 S. typhi 유전체의 경우, 유전체 예측 정확도가 각각 98.5%와 98.7%를 보여 기존의 GLIMMER보다 더 우수한 결과를 나타내었다.

  • PDF

Transcriptome analysis of internal and external stress mechanisms in Aster spathulifolius Maxim.

  • Sivagami, Jean Claude;Park, SeonJoo
    • 한국자원식물학회:학술대회논문집
    • /
    • 한국자원식물학회 2019년도 춘계학술대회
    • /
    • pp.35-35
    • /
    • 2019
  • Aster spathulifolius Maxim. is belongs to the Asteraceae family which is distributed only in Korea and Japan. It is recognize as a traditionally medicinal plants and economically valuable in ornamental field. However, among the Asteraceae family, the Aster genus, which is lacks in genomic resources and information of molecular function. Therefore, we used high throughput RNA-sequencing transcriptome data of the A. spathulifolius to know molecular level function. DeNovo assembly produced 98,660 unigene with N50 value 1126 bp. Unigenes was performed to analyses the functional annotation against NCBI database like plant database of nucleotide (Nt) and non-redundant protein (Nr), Pfam, Uniprot, KEGG and Transcriptional factor (TF). In addition, Distribution of SSR markers also analyzed for future perfectives. Further, Comparing with other two Asteraceae family species like, Karelinia caspica and Chrysanthemum morifolium to the A. spathulifolius shows the number of gene that regulated in internal and external stress respectively salt-tolerant and heat and drought stress to understand the molecular basis related to the different environments stress.

  • PDF

Annotation of Genes Having Candidate Somatic Mutations in Acute Myeloid Leukemia with Whole-Exome Sequencing Using Concept Lattice Analysis

  • Lee, Kye Hwa;Lim, Jae Hyeun;Kim, Ju Han
    • Genomics & Informatics
    • /
    • 제11권1호
    • /
    • pp.38-45
    • /
    • 2013
  • In cancer genome studies, the annotation of newly detected oncogene/tumor suppressor gene candidates is a challenging process. We propose using concept lattice analysis for the annotation and interpretation of genes having candidate somatic mutations in whole-exome sequencing in acute myeloid leukemia (AML). We selected 45 highly mutated genes with whole-exome sequencing in 10 normal matched samples of the AML-M2 subtype. To evaluate these genes, we performed concept lattice analysis and annotated these genes with existing knowledge databases.

Vibrio 속 16S rRNA 유전자 염기서열의 이질성 분석 (Heterogeneity Analysis of the 16S rRNA Gene Sequences of the Genus Vibrio)

  • 기장서
    • 미생물학회지
    • /
    • 제45권4호
    • /
    • pp.430-434
    • /
    • 2009
  • 세균 16S rRNA 유전자 염기서열은 분자계통분류, 진화역사 규명, 미생물 검출 등 다양한 목적으로 이용되어 왔다. 세균 제놈(genome)은 multiple rRNA 오페론을 갖고 있으며, 이들 유전자 염기서열은 일부 변이가 있는 것으로 알려져 있다. 본 연구에서는 Vibrio 속의 16S rRNA 유전자 염기서열을 이용하여 세포 내 16S rRNA의 이질성을 규명하였다. 분석은 GenBank 자료 중에서 제놈 염기서열 annotation이 완료된 V. cholerae, V. harveyi, V. parahaemolyticus, V. splendidus, V. vulnificus를 이용하여 실시하였다. Vibrio 속은 1번 염색체에 7~10개의 16S rRNA 유전자 copy를 갖고 있으며, 이들의 세포 내 유전자 변이는 0.9% 이하 상이성(99.1%이상 DNA 상동성)을 보였다. 2번 염색체에서는 16S rRNA 유전자가 1개 이하로 존재하였다. 유전체내 16S rRNA 유전형은 최소 5개(V. vulnificus #CMCP6)에서 최대 8개(V. parahaemolyticus #RIMD 2210633, V. harveyi #ATCC BAA-1116)로 조사되었다. 본 결과는 Vibrio 속의 16S rRNA 유전자 염기서열이 높은 이질성을 갖는 것을 제시해 준다.

Genetic structure analysis of domestic companion dogs using high-density SNP chip

  • Gwang Hyeon Lee;Jae Don Oh;Hong Sik Kong
    • 한국동물생명공학회지
    • /
    • 제39권2호
    • /
    • pp.138-144
    • /
    • 2024
  • Background: As the number of households raising companion dogs increases, the pet genetic analysis market also continues to grow. However, most studies have focused on specific purposes or native breeds. This study aimed to collect genomic data through single nucleotide polymorphism (SNP) chip analysis of companion dogs in South Korea and perform genetic diversity analysis and SNP annotation. Methods: We collected samples from 95 dogs belonging to 26 breeds, including mixed breeds, in South Korea. The SNP genotypes were obtained for each sample using an AxiomTM Canine HD Array. Quality control (QC) was performed to enhance the accuracy of the analysis. A genetic diversity analysis was performed for each SNP. Results: QC initially selected SNPs, and after excluding non-diverse ones, 621,672 SNPs were identified. Genetic diversity analysis revealed minor allele frequencies, polymorphism information content, expected heterozygosity, and observed heterozygosity values of 0.220, 0.244, 0.301, and 0.261, respectively. The SNP annotation indicated that most variations had an uncertain or minimal impact on gene function. However, approximately 16,000 non-synonymous SNPs (nsSNPs) have been found to significantly alter gene function or affect exons by changing translated amino acids. Conclusions: This study obtained data on SNP genetic diversity and functional SNPs in companion dogs raised in South Korea. The results suggest that establishing an SNP set for individual identification could enable a gene-based registration system. Furthermore, identifying and researching nsSNPs related to behavior and diseases could improve dog care and prevent abandonment.

LitCovid-AGAC: cellular and molecular level annotation data set based on COVID-19

  • Ouyang, Sizhuo;Wang, Yuxing;Zhou, Kaiyin;Xia, Jingbo
    • Genomics & Informatics
    • /
    • 제19권3호
    • /
    • pp.23.1-23.7
    • /
    • 2021
  • Currently, coronavirus disease 2019 (COVID-19) literature has been increasing dramatically, and the increased text amount make it possible to perform large scale text mining and knowledge discovery. Therefore, curation of these texts becomes a crucial issue for Bio-medical Natural Language Processing (BioNLP) community, so as to retrieve the important information about the mechanism of COVID-19. PubAnnotation is an aligned annotation system which provides an efficient platform for biological curators to upload their annotations or merge other external annotations. Inspired by the integration among multiple useful COVID-19 annotations, we merged three annotations resources to LitCovid data set, and constructed a cross-annotated corpus, LitCovid-AGAC. This corpus consists of 12 labels including Mutation, Species, Gene, Disease from PubTator, GO, CHEBI from OGER, Var, MPA, CPA, NegReg, PosReg, Reg from AGAC, upon 50,018 COVID-19 abstracts in LitCovid. Contain sufficient abundant information being possible to unveil the hidden knowledge in the pathological mechanism of COVID-19.

Functional Annotation and Analysis of Korean Patented Biological Sequences Using Bioinformatics

  • Lee, Byung Wook;Kim, Tae Hyung;Kim, Seon Kyu;Kim, Sang Soo;Ryu, Gee Chan;Bhak, Jong
    • Molecules and Cells
    • /
    • 제21권2호
    • /
    • pp.269-275
    • /
    • 2006
  • A recent report of the Korean Intellectual Property Office(KIPO) showed that the number of biological sequence-based patents is rapidly increasing in Korea. We present biological features of Korean patented sequences though bioinformatic analysis. The analysis is divided into two steps. The first is an annotation step in which the patented sequences were annotated with the Reference Sequence (RefSeq) database. The second is an association step in which the patented sequences were linked to genes, diseases, pathway, and biological functions. We used Entrez Gene, Online Mendelian Inheritance in Man (OMIM), Kyoto Encyclopedia of Genes and Genomes (KEGG), and Gene Ontology (GO) databases. Through the association analysis, we found that nearly 2.6% of human genes were associated with Korean patenting, compared to 20% of human genes in the U.S. patent. The association between the biological functions and the patented sequences indicated that genes whose products act as hormones on defense responses in the extra-cellular environments were the most highly targeted for patenting. The analysis data are available at http://www.patome.net