• 제목/요약/키워드: Gene Prediction

검색결과 295건 처리시간 0.024초

The Application of Machine Learning Algorithm In The Analysis of Tissue Microarray; for the Prediction of Clinical Status

  • Cho, Sung-Bum;Kim, Woo-Ho;Kim, Ju-Han
    • 한국생물정보학회:학술대회논문집
    • /
    • 한국생물정보시스템생물학회 2005년도 BIOINFO 2005
    • /
    • pp.366-370
    • /
    • 2005
  • Tissue microarry is one of the high throughput technologies in the post-genomic era. Using tissue microarray, the researchers are able to investigate large amount of gene expressions at the level of DNA, RNA, and protein The important aspect of tissue microarry is its ability to assess a lot of biomarkers which have been used in clinical practice. To manipulate the categorical data of tissue microarray, we applied Bayesian network classifier algorithm. We identified that Bayesian network classifier algorithm could analyze tissue microarray data and integrating prior knowledge about gastric cancer could achieve better performance result. The results showed that relevant integration of prior knowledge promote the prediction accuracy of survival status of the immunohistochemical tissue microarray data of 18 tumor suppressor genes. In conclusion, the application of Bayesian network classifier seemed appropriate for the analysis of the tissue microarray data with clinical information.

  • PDF

증가 계수의 직접 계산법을 이용한 항공기 유동장 효과의 예측 (PREDICTION OF AIRCRAFT FLOW FIELD EFFECT BY DIRECT CALCULATION OF INCREMENTAL COEFFICIENTS)

  • 김유진;권장혁
    • 한국전산유체공학회:학술대회논문집
    • /
    • 한국전산유체공학회 2006년도 추계 학술대회논문집
    • /
    • pp.41-46
    • /
    • 2006
  • When new weapons are introduced, the target points estimation is one of the important objectives in the flight test as well as the safe separation. The prediction methods help to design the flight test schedule. However, the incremental aerodynamic coefficients in the aircraft flow field so-called BSE are difficult to predict. Generally, the semiempirical methods such as the grid methods, IFM and Flow TGP using database are used for estimation of BSE. However, these methods are quasi-steady methods using static aerodynamic loads. Nowadays the time-accurate CFD method is often used to predict the store separation event. In the current process, the incremental aerodynamic coefficients in BSE regime are calculated directly, and the elimination of delta coefficients is checked simultaneously. This stage can be used for the initial condition of Flow TGP with freestream database. Two dimensional supersonic and subsonic store separation problems have been simulated and incremental coefficients are calculated. The results show the time when the store gets out of BSE region.

  • PDF

Discovering cis-regulatory motifs by combining multiple predictors

  • Chang, Hye-Shik;Hwang, Kyu-Woong;Kim, Dong-Sup
    • Bioinformatics and Biosystems
    • /
    • 제2권2호
    • /
    • pp.52-57
    • /
    • 2007
  • The computational discovery of transcription factor binding site is one of the important tools in the genetic and genomic analysis. Rough prediction of gene regulation network and finding possible co-regulated genes are typical applications of the technique. Countless motif-discovery algorithms have been proposed for the past years. However, there is no dominant algorithm yet. Each algorithm does not give enough accuracy without extensive information. In this paper, we explore the possibility of combining multiple algorithms for the one integrated result in order to improve the performance and the convenience of researchers. Moreover, we apply new high order information that is reorganized from the set of basis predictions to the final prediction.

  • PDF

IoT-based systemic lupus erythematosus prediction model using hybrid genetic algorithm integrated with ANN

  • Edison Prabhu K;Surendran D
    • ETRI Journal
    • /
    • 제45권4호
    • /
    • pp.594-602
    • /
    • 2023
  • Internet of things (IoT) is commonly employed to detect different kinds of diseases in the health sector. Systemic lupus erythematosus (SLE) is an autoimmune illness that occurs when the body's immune system attacks its own connective tissues and organs. Because of the complicated interconnections between illness trigger exposure levels across time, humans have trouble predicting SLE symptom severity levels. An effective automated machine learning model that intakes IoT data was created to forecast SLE symptoms to solve this issue. IoT has several advantages in the healthcare industry, including interoperability, information exchange, machine-to-machine networking, and data transmission. An SLE symptom-predicting machine learning model was designed by integrating the hybrid marine predator algorithm and atom search optimization with an artificial neural network. The network is trained by the Gene Expression Omnibus dataset as input, and the patients' data are used as input to predict symptoms. The experimental results demonstrate that the proposed model's accuracy is higher than state-of-the-art prediction models at approximately 99.70%.

BRCA1 Gene Mutation Screening for the Hereditary Breast and/or Ovarian Cancer Syndrome in Breast Cancer Cases: a First High Resolution DNA Melting Analysis in Indonesia

  • Mundhofir, Farmaditya EP;Wulandari, Catharina Endah;Prajoko, Yan Wisnu;Winarni, Tri Indah
    • Asian Pacific Journal of Cancer Prevention
    • /
    • 제17권3호
    • /
    • pp.1539-1546
    • /
    • 2016
  • Specific patterns of the hereditary breast and ovarian cancer (HBOC) syndrome are related to mutations in the BRCA1 gene. One hundred unrelated breast cancer patients were interviewed to obtain clinical symptoms and signs, pedigree and familial history of HBOC syndrome related cancer. Subsequently, data were calculated using the Breast and Ovarian Analysis of Disease Incidence and Carrier Estimation Algorithm (BOADICEA) risk prediction model. Patients with high score of BOADICEA were offered genetic testing. Eleven patients with high score of BOADICEA, 2 patients with low score of BOADICEA, 2 patient's family members and 15 controls underwent BRCA1 genetic testing. Mutation screening using PCR-HRM was carried out in 22 exons (41 amplicons) of BRCA1 gene. Sanger sequencing was subjected in all samples with aberrant graph. This study identified 10 variants in the BRCA1 gene, consisting of 6 missense mutations (c.1480C>A, c.2612C>T, c.2566T>C, c.3113A>G, c.3548 A>G, c.4837 A>G), 3 synonymous mutations (c.2082 C>T, c.2311 T>C and c.4308T>C) and one intronic mutation (c.134+35 G>T). All variants tend to be polymorphisms and unclassified variants. However, no known pathogenic mutations were found.

데이터마이닝을 이용한 한우의 우수 지방산합성효소 유전자 조합 선별 (Major gene identification for FASN gene in Korean cattles by data mining)

  • 김병두;김현지;이성원;이제영
    • Journal of the Korean Data and Information Science Society
    • /
    • 제25권6호
    • /
    • pp.1385-1395
    • /
    • 2014
  • 가축의 경제적인 특성은 환경적인 요인과 유전적인 요인의 영향을 받으며, 또한 하나의 유전자가 아닌 여러 유전자의 상호작용의 영향을 받는다고 알려져 있다. 본 논문에서는 선형회귀모형을 활용하여 환경적인 요인을 보정한 자료로 한우의 맛과 육질에 영향을 준다고 밝혀진 지방산합성효소의 단일염기다형성 5개를 이용해 한우의 경제 형질에 영향을 미치는 우수 유전자 조합을 선별하고 우수 유전자형을 밝힌다. 이를 위해 데이터마이닝 기법인 인공신경망, 로지스틱 회귀모형, C5.0, CART 기법을 이용하였다. 공정한 모형 평가를 위해 전체 데이터를 훈련용 데이터 (60%)와 검증용 데이터 (40%)로 나누었고, 훈련용 데이터에서 설정된 모형을 검증용 데이터에 적용시켜 정확도를 비교하였다. 그 결과 C5.0이 최적 모형으로 선정되었으며, C5.0의 의사결정나무를 통해 우수 유전자 조합을 선별하였다.

Molecular and Morphological Evidence of Hepatotoxicity after Silver Nanoparticle Exposure: A Systematic Review, In Silico, and Ultrastructure Investigation

  • Sooklert, Kanidta;Wongjarupong, Asarn;Cherdchom, Sarocha;Wongjarupong, Nicha;Jindatip, Depicha;Phungnoi, Yupa;Rojanathanes, Rojrit;Sereemaspun, Amornpun
    • Toxicological Research
    • /
    • 제35권3호
    • /
    • pp.257-270
    • /
    • 2019
  • Silver nanoparticles (AgNPs) have been widely used in a variety of applications in innovative development; consequently, people are more exposed to this particle. Growing concern about toxicity from AgNP exposure has attracted greater attention, while questions about nanosilver-responsive genes and consequences for human health remain unanswered. By considering early detection and prevention of nanotoxicology at the genetic level, this study aimed to identify 1) changes in gene expression levels that could be potential indicators for AgNP toxicity and 2) morphological phenotypes correlating to toxicity of HepG2 cells. To detect possible nanosilver-responsive genes in xenogenic targeted organs, a comprehensive systematic literature review of changes in gene expression in HepG2 cells after AgNP exposure and in silico method, connection up- and down-regulation expression analysis of microarrays (CU-DREAM), were performed. In addition, cells were extracted and processed for transmission electron microscopy to examine ultrastructural alterations. From the Gene Expression Omnibus (GEO) Series database, we selected genes that were up- and down-regulated in AgNPs, but not up- and down-regulated in silver ion exposed cells, as nanosilver-responsive genes. HepG2 cells in the AgNP-treated group showed distinct ultrastructural alterations. Our results suggested potential representative gene data after AgNPs exposure provide insight into assessment and prediction of toxicity from nanosilver exposure.

In silico genome wide identification and expression analysis of the WUSCHEL-related homeobox gene family in Medicago sativa

  • Yang, Tianhui;Gao, Ting;Wang, Chuang;Wang, Xiaochun;Chen, Caijin;Tian, Mei;Yang, Weidi
    • Genomics & Informatics
    • /
    • 제20권2호
    • /
    • pp.19.1-19.15
    • /
    • 2022
  • Alfalfa (Medicago sativa) is an important food and feed crop which rich in mineral sources. The WUSCHEL-related homeobox (WOX) gene family plays important roles in plant development and identification of putative gene families, their structure, and potential functions is a primary step for not only understanding the genetic mechanisms behind various biological process but also for genetic improvement. A variety of computational tools, including MAFFT, HMMER, hidden Markov models, Pfam, SMART, MEGA, ProtTest, BLASTn, and BRAD, among others, were used. We identified 34 MsWOX genes based on a systematic analysis of the alfalfa plant genome spread in eight chromosomes. This is an expansion of the gene family which we attribute to observed chromosomal duplications. Sequence alignment analysis revealed 61 conserved proteins containing a homeodomain. Phylogenetic study sung reveal five evolutionary clades with 15 motif distributions. Gene structure analysis reveals various exon, intron, and untranslated structures which are consistent in genes from similar clades. Functional analysis prediction of promoter regions reveals various transcription binding sites containing key growth, development, and stress-responsive transcription factor families such as MYB, ERF, AP2, and NAC which are spread across the genes. Most of the genes are predicted to be in the nucleus. Also, there are duplication events in some genes which explain the expansion of the family. The present research provides a clue on the potential roles of MsWOX family genes that will be useful for further understanding their functional roles in alfalfa plants.

Draft Genome Sequence of the Reference Strain of the Korean Medicinal Mushroom Wolfiporia cocos KMCC03342

  • Bogun Kim;Byoungnam Min;Jae-Gu Han;Hongjae Park;Seungwoo Baek;Subin Jeong;In-Geol Choi
    • Mycobiology
    • /
    • 제50권4호
    • /
    • pp.254-257
    • /
    • 2022
  • Wolfiporia cocos is a wood-decay brown rot fungus belonging to the family Polyporaceae. While the fungus grows, the sclerotium body of the strain, dubbed Bokryeong in Korean, is formed around the roots of conifer trees. The dried sclerotium has been widely used as a key component of many medicinal recipes in East Asia. Wolfiporia cocos strain KMCC03342 is the reference strain registered and maintained by the Korea Seed and Variety Service for commercial uses. Here, we present the first draft genome sequence of W. cocos KMCC03342 using a hybrid assembly technique combining both short- and long-read sequences. The genome has a total length of 55.5 Mb comprised of 343 contigs with N50 of 332 kb and 95.8% BUSCO completeness. The GC ratio was 52.2%. We predicted 14,296 protein-coding gene models based on ab initio gene prediction and evidence-based annotation procedure using RNAseq data. The annotated genome was predicted to have 19 terpene biosynthesis gene clusters, which was the same number as the previously sequenced W. cocos strain MD-104 genome but higher than Chinese W. cocos strains. The genome sequence and the predicted gene clusters allow us to study biosynthetic pathways for the active ingredients of W. cocos.

Functional annotation of uncharacterized proteins from Fusobacterium nucleatum: identification of virulence factors

  • Kanchan Rauthan;Saranya Joshi;Lokesh Kumar;Divya Goel;Sudhir Kumar
    • Genomics & Informatics
    • /
    • 제21권2호
    • /
    • pp.21.1-21.14
    • /
    • 2023
  • Fusobacterium nucleatum is a gram-negative bacteria associated with diverse infections like appendicitis and colorectal cancer. It mainly attacks the epithelial cells in the oral cavity and throat of the infected individual. It has a single circular genome of 2.7 Mb. Many proteins in F. nucleatum genome are listed as "Uncharacterized." Annotation of these proteins is crucial for obtaining new facts about the pathogen and deciphering the gene regulation, functions, and pathways along with discovery of novel target proteins. In the light of new genomic information, an armoury of bioinformatic tools were used for predicting the physicochemical parameters, domain and motif search, pattern search, and localization of the uncharacterized proteins. The programs such as receiver operating characteristics determine the efficacy of the databases that have been employed for prediction of different parameters at 83.6%. Functions were successfully assigned to 46 uncharacterized proteins which included enzymes, transporter proteins, membrane proteins, binding proteins, etc. Apart from the function prediction, the proteins were also subjected to string analysis to reveal the interacting partners. The annotated proteins were also put through homology-based structure prediction and modeling using Swiss PDB and Phyre2 servers. Two probable virulent factors were also identified which could be investigated further for potential drug-related studies. The assigning of functions to uncharacterized proteins has shown that some of these proteins are important for cell survival inside the host and can act as effective drug targets.