• Title/Summary/Keyword: seq2seq

Search Result 224, Processing Time 0.024 seconds

News Recommendation Exploiting Document Summarization based on Deep Learning (딥러닝 기반의 문서요약기법을 활용한 뉴스 추천)

  • Heu, Jee-Uk
    • The Journal of the Institute of Internet, Broadcasting and Communication
    • /
    • v.22 no.4
    • /
    • pp.23-28
    • /
    • 2022
  • Recently smart device(such as smart phone and tablet PC) become a role as an information gateway, using of the web news by multiple users from the web portal has been more important things. However, the quantity of creating web news on the web makes hard to catch the information which the user wants and confuse the users cause of the similar and repeated contents. In this paper, we propose the news recommend system using the document summarization based on KoBART which gives the selected news to users from the candidate news on the news portal. As a result, our proposed system shows higher performance and recommending the news efficiently by pre-training and fine-tuning the KoBART using collected news data.

Epigenetic Silencing of CHOP Expression by the Histone Methyltransferase EHMT1 Regulates Apoptosis in Colorectal Cancer Cells

  • Kim, Kwangho;Ryu, Tae Young;Lee, Jinkwon;Son, Mi-Young;Kim, Dae-Soo;Kim, Sang Kyum;Cho, Hyun-Soo
    • Molecules and Cells
    • /
    • v.45 no.9
    • /
    • pp.622-630
    • /
    • 2022
  • Colorectal cancer (CRC) has a high mortality rate among cancers worldwide. To reduce this mortality rate, chemotherapy (5-fluorouracil, oxaliplatin, and irinotecan) or targeted therapy (bevacizumab, cetuximab, and panitumumab) has been used to treat CRC. However, due to various side effects and poor responses to CRC treatment, novel therapeutic targets for drug development are needed. In this study, we identified the overexpression of EHMT1 in CRC using RNA sequencing (RNA-seq) data derived from TCGA, and we observed that knocking down EHMT1 expression suppressed cell growth by inducing cell apoptosis in CRC cell lines. In Gene Ontology (GO) term analysis using RNA-seq data, apoptosis-related terms were enriched after EHMT1 knockdown. Moreover, we identified the CHOP gene as a direct target of EHMT1 using a ChIP (chromatin immunoprecipitation) assay with an anti-histone 3 lysine 9 dimethylation (H3K9me2) antibody. Finally, after cotransfection with siEHMT1 and siCHOP, we again confirmed that CHOP-mediated cell apoptosis was induced by EHMT1 knockdown. Our findings reveal that EHMT1 plays a key role in regulating CRC cell apoptosis, suggesting that EHMT1 may be a therapeutic target for the development of cancer inhibitors.

Transcriptomic Profile Analysis of Jeju Buckwheat using RNA-Seq Data (NA-Seq를 이용한 제주산 메밀의 발아초기 전사체 프로파일 분석)

  • Han, Song-I;Chung, Sung Jin;Oh, Dae-Ju;Jung, Yong-Hwan;Kim, Chan-Shick;Kim, Jae-hoon
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.19 no.1
    • /
    • pp.537-545
    • /
    • 2018
  • In this study, transcriptome analysis was conducted to collect various information from Fagopyrum esculentum and Fagopyrum tataricum during the early germination stage. Total RNA was extracted from the seeds and at 12, 24, and 36 hrs after germination of Jeju native Fagopyrum esculentum and Fagopyrum tataricum and sequenced using the Illumina Hiseq 2000 platform. Raw data analysis was conducted using the Dynamic Trim and Lengths ORT programs in the SolexaQA package, and assembly and annotation were performed. Based on RNA-seq raw data, we obtained 16.5 Gb and 16.2 Gb of transcriptome data corresponding to about 84.2% and 81.5% of raw data, respectively. De novo assembly and annotation revealed 43,494 representative transcripts corresponding to 47.5Mb. Among them, 23,165 sequences were shown to have similar sequences with annotation DB. Moreover, Gene Ontology (GO) analysis of buckwheat representative transcripts confirmed that the gene is involved in metabolic processes (49.49%) of biological processes, as well as cell function (46.12%) in metabolic process, and catalytic activity (80.43%) in molecular function In the case of gibberellin receptor GID1C, which is related to germination of seeds, the expression levels increased with time after germination in both F. esculentum and F. tataricum. The expression levels of gibberellin 20-oxidase 1 were increased within 12 hrs of gemination in F. esculentum but continuously until 36 hrs in F. tataricum. This buckwheat transcriptome profile analysis of the early germination stage will help to identify the mechanism causing functional and morphological differences between species.

Skin Transcriptome Profiling of the Blass Bloched Rockfish (Sebastes pachycephalus) with Different Body Color Patterns (체색 패턴이 다른 개볼락(Sebastes pachycephalus) 피부 전사체 프로파일링)

  • Jang, Yo-Soon
    • Korean Journal of Ichthyology
    • /
    • v.32 no.3
    • /
    • pp.117-129
    • /
    • 2020
  • The body color pattern in fish is a distinctive feature for species identification. The blass bloched rockfish Sebastes pachycephalus is a commercially important marine fish species, distributed in the central and southern parts of Korea and south Hokkaido of Japan. It has a morphological feature divided into four subspecies according to with or lacking distinct spots on the body surface, and to the location of markings on the body surface. However, the genetic basis of body color pattern of S. pachycephalus is still unknown. Thus we analyzed the transcriptome of S. pachycephalus skin samples using RNA-seq analysis to investigate functional genes related to body color patterns. The experimental skin samples were prepared by classified into 'Wild type' (lacking distinct spots and markings) and 'Color type' (with distinct spots and marking). Two skin sample transcriptomes were compared pairwise and the results revealed that were 164 differentially expressed unigenes in the skin samples of 'Wild type' and 'Color type'. Gene Ontology analysis of 164 differentially expressed unigenes showed that these genes were included in the functional group of molecular function (2 genes), biological process (46 genes), and cellular component (6 genes). There were several genes that body color type skin specific expression and the genes were CTL (Galactose-specific lectin nattectin), CUL1 (Cullin-1), CMAS (N-acylneuraminate cytidylyltransferase), NMRK2 (Nicotinamide riboside kinase 2), ALOXE3 (Hydroperoxide isomerase ALOXE3), SLC4A7 (sodium bicarbonate cotransporter 3). Our study is the first attempt to search for functional genes involved in the formation of body color patterns in S. pachycephalus. The differentially expressed unigenes obtained in this study can be used as candidate genes for functional gene study related to body coloration of fish.

Integration and Reanalysis of Four RNA-Seq Datasets Including BALF, Nasopharyngeal Swabs, Lung Biopsy, and Mouse Models Reveals Common Immune Features of COVID-19

  • Rudi Alberts;Sze Chun Chan;Qian-Fang Meng;Shan He;Lang Rao;Xindong Liu;Yongliang Zhang
    • IMMUNE NETWORK
    • /
    • v.22 no.3
    • /
    • pp.22.1-22.25
    • /
    • 2022
  • Coronavirus disease 2019 (COVID-19), caused by severe acute respiratory syndromecoronavirus-2 (SARS-CoV-2), has spread over the world causing a pandemic which is still ongoing since its emergence in late 2019. A great amount of effort has been devoted to understanding the pathogenesis of COVID-19 with the hope of developing better therapeutic strategies. Transcriptome analysis using technologies such as RNA sequencing became a commonly used approach in study of host immune responses to SARS-CoV-2. Although substantial amount of information can be gathered from transcriptome analysis, different analysis tools used in these studies may lead to conclusions that differ dramatically from each other. Here, we re-analyzed four RNA-sequencing datasets of COVID-19 samples including human bronchoalveolar lavage fluid, nasopharyngeal swabs, lung biopsy and hACE2 transgenic mice using the same standardized method. The results showed that common features of COVID-19 include upregulation of chemokines including CCL2, CXCL1, and CXCL10, inflammatory cytokine IL-1β and alarmin S100A8/S100A9, which are associated with dysregulated innate immunity marked by abundant neutrophil and mast cell accumulation. Downregulation of chemokine receptor genes that are associated with impaired adaptive immunity such as lymphopenia is another common feather of COVID-19 observed. In addition, a few interferon-stimulated genes but no type I IFN genes were identified to be enriched in COVID-19 samples compared to their respective control in these datasets. These features are in line with results from single-cell RNA sequencing studies in the field. Therefore, our re-analysis of the RNA-seq datasets revealed common features of dysregulated immune responses to SARS-CoV-2 and shed light to the pathogenesis of COVID-19.

Genome-wide survey and expression analysis of F-box genes in wheat

  • Kim, Dae Yeon;Hong, Min Jeong;Seo, Yong Weon
    • Proceedings of the Korean Society of Crop Science Conference
    • /
    • 2017.06a
    • /
    • pp.141-141
    • /
    • 2017
  • The ubiquitin-proteasome pathway is the major regulatory mechanism in a number of cellular processes for selective degradation of proteins and involves three steps: (1) ATP dependent activation of ubiquitin by E1 enzyme, (2) transfer of activated ubiquitin to E2 and (3) transfer of ubiquitin to the protein to be degraded by E3 complex. F-box proteins are subunit of SCF complex and involved in specificity for a target substrate to be degraded. F-box proteins regulate many important biological processes such as embryogenesis, floral development, plant growth and development, biotic and abiotic stress, hormonal responses and senescence. However, little is known about the F-box genes in wheat. The draft genome sequence of wheat (IWGSC Reference Sequence v1.0 assembly) used to analysis a genome-wide survey of the F-box gene family in wheat. The Hidden Markov Model (HMM) profiles of F-box (PF00646), F-box-like (PF12937), F-box-like 2 (PF13013), FBA (PF04300), FBA_1 (PF07734), FBA_2 (PF07735), FBA_3 (PF08268) and FBD (PF08387) domains were downloaded from Pfam database were searched against IWGSC Reference Sequence v1.0 assembly. RNA-seq paired-end libraries from different stages of wheat, such as stages of seedling, tillering, booting, day after flowering (DAF) 1, DAF 10, DAF 20, and DAF 30 were conducted and sequenced by Illumina HiSeq2000 for expression analysis of F-box protein genes. Basic analysis including Hisat, HTseq, DEseq, gene ontology analysis and KEGG mapping were conducted for differentially expressed gene analysis and their annotation mappings of DEGs from various stages. About 950 F-box domain proteins identified by Pfam were mapped to wheat reference genome sequence by blastX (e-value < 0.05). Among them, more than 140 putative F-box protein genes were selected by fold changes cut-offs of > 2, significance p-value < 0.01, and FDR<0.01. Expression profiling of selected F-box protein genes were shown by heatmap analysis, and average linkage and squared Euclidean distance of putative 144 F-box protein genes by expression patterns were calculated for clustering analysis. This work may provide valuable and basic information for further investigation of protein degradation mechanism by ubiquitin proteasome system using F-box proteins during wheat development stages.

  • PDF

BSA-Seq Technologies Identify a Major QTL for Clubroot Resistance in Chinese Cabbage (Brassica rapa ssp. pekinesis)

  • Yuan, Yu-Xiang;Wei, Xiao-Chun;Zhang, Qiang;Zhao, Yan-Yan;Jiang, Wu-Sheng;Yao, Qiu-Ju;Wang, Zhi-Yong;Zhang, Ying;Tan, Yafei;Li, Yang;Xu, Qian;Zhang, Xiao-Wei
    • 한국균학회소식:학술대회논문집
    • /
    • 2015.05a
    • /
    • pp.41-41
    • /
    • 2015
  • BSA-seq technologies, combined Bulked Segregant Analysis (BSA) and Next-Generation Sequencing (NGS), are making it faster and more efficient to establish the association of agronomic traits with molecular markers or candidate genes, which is the requirement for marker-assisted selection in molecular breeding. Clubroot disease, caused by Plasmodiophora brassicae, is a serious threat to Brassica crops. Even we have breed new clubroot resistant varieties of Chinese cabbage (B. rapa ssp. pekinesis), the underlying genetic mechanism is unclear. In this study, an $F_2$ population of 340 plants were inoculated with P. brassicae from Xinye (Pathotype 2 on the differentials of Williams). Resistance phenotype segregation ratio for the populations fit a 3:1 (R:S) segregation model, consistent with a single dominant gene model. Super-BSA, using re-sequencing the parents, extremely R and S DNA pools with each 50 plants, revealed 3 potential candidate regions on the chromosome A03, with the most significant region falling between 24.30 Mb and 24.75 Mb. A linkage map with 31 markers in this region was constructed with several closely linked markers identified. A Major QTL for clubroot resistance, CRq, which was identified with the peak LOD score at 169.3, explaining 89.9% of the phenotypic variation. And we developed a new co-segregated InDel marker BrQ-2. Joint BSA-seq and traditional QTL analysis delimited CRq to an 250 kb genomic region, where four TIR-NBS-LRR genes (Bra019409, Bra019410, Bra019412 and Bra019413) clustered. The CR gene CRq and closely linked markers will be highly useful for breeding new resistant Chinese cabbage cultivars.

  • PDF

Whole genome MBD-seq and RRBS analyses reveal that hypermethylation of gastrointestinal hormone receptors is associated with gastric carcinogenesis

  • Kim, Hee-Jin;Kang, Tae-Wook;Haam, Keeok;Kim, Mirang;Kim, Seon-Kyu;Kim, Seon-Young;Lee, Sang-Il;Song, Kyu-Sang;Jeong, Hyun-Yong;Kim, Yong Sung
    • Experimental and Molecular Medicine
    • /
    • v.50 no.12
    • /
    • pp.1.1-1.14
    • /
    • 2018
  • DNA methylation is a regulatory mechanism in epigenetics that is frequently altered during human carcinogenesis. To detect critical methylation events associated with gastric cancer (GC), we compared three DNA methylomes from gastric mucosa (GM), intestinal metaplasia (IM), and gastric tumor (GT) cells that were microscopically dissected from an intestinal-type early gastric cancer (EGC) using methylated DNA binding domain sequencing (MBD-seq) and reduced representation bisulfite sequencing (RRBS) analysis. In this study, we focused on differentially methylated promoters (DMPs) that could be directly associated with gene expression. We detected 2,761 and 677 DMPs between the GT and GM by MBD-seq and RRBS, respectively, and for a total of 3,035 DMPs. Then, 514 (17%) of all DMPs were detected in the IM genome, which is a precancer of GC, supporting that some DMPs might represent an early event in gastric carcinogenesis. A pathway analysis of all DMPs demonstrated that 59 G protein-coupled receptor (GPCR) genes linked to the hypermethylated DMPs were significantly enriched in a neuroactive ligand-receptor interaction pathway. Furthermore, among the 59 GPCRs, six GI hormone receptor genes (NPY1R, PPYR1, PTGDR, PTGER2, PTGER3, and SSTR2) that play an inhibitory role in the secretion of gastrin or gastric acid were selected and validated as potential biomarkers for the diagnosis or prognosis of GC patients in two cohorts. These data suggest that the loss of function of gastrointestinal (GI) hormone receptors by promoter methylation may lead to gastric carcinogenesis because gastrin and gastric acid have been known to play a role in cell differentiation and carcinogenesis in the GI tract.

Genome Survey and Microsatellite Marker Selection of Tegillarca granosa (꼬막(Tegillarca granosa)의 유전적 다양성 분석을 위한 드래프트 게놈분석과 마이크로새틀라이트 마커 발굴)

  • Kim, Jinmu;Lee, Seung Jae;Jo, Euna;Choi, Eunkyung;Kim, Hyeon Jin;Lee, Jung Sick;Park, Hyun
    • Journal of Marine Life Science
    • /
    • v.6 no.1
    • /
    • pp.38-46
    • /
    • 2021
  • The blood clam, Tegillarca granosa, is economically important in marine bivalve and is used in fisheries industry among western Pacific Ocean Coasts especially in Korea, China, and Japan. The number of chromosomes in the blood clam is known as 2n=38, but the genome size and genetic information of the genome are not still clear. In order to predict the genomic size of the T. granosa, the in-silico analysis analysed the genomic size using short DNA sequence information obtained using the NGS Illumina HiSeq platform. As a result, the genomic size of T. granosa was estimated to be 770.61 Mb. Subsequently, a draft genome assembly was performed through the MaSuRCA assembler, and a simple sequence repeat (SSR) analysis was done by using the QDD pipeline. 43,944 SSRs were detected from the genome of T. granosa and 69.51% di-nucleotide, 16.68% trinucleotide, 12.96% tetra-nucleotide, 0.82% penta-nucleotide, and 0.03% hexa-nucleotide were consisted. 100 primer sets that could be used for genetic diversity studies were selected. In the future, this study will help identify the genetic diversity of T. granosa and population genetic studies, and further identify the classification of origin between homogenous groups.

Single-cell RNA sequencing reveals the heterogeneity of adipose tissue-derived mesenchymal stem cells under chondrogenic induction

  • Jeewan Chun;Ji-Hoi Moon;Kyu Hwan Kwack;Eun-Young Jang;Saebyeol Lee;Hak Kyun Kim;Jae-Hyung Lee
    • BMB Reports
    • /
    • v.57 no.5
    • /
    • pp.232-237
    • /
    • 2024
  • This study investigated how adipose tissue-derived mesenchymal stem cells (AT-MSCs) respond to chondrogenic induction using droplet-based single-cell RNA sequencing (scRNA-seq). We analyzed 37,219 high-quality transcripts from control cells and cells induced for 1 week (1W) and 2 weeks (2W). Four distinct cell clusters (0-3), undetectable by bulk analysis, exhibited varying proportions. Cluster 1 dominated in control and 1W cells, whereas clusters (3, 2, and 0) exclusively dominated in control, 1W, and 2W cells, respectively. Furthermore, heterogeneous chondrogenic markers expression within clusters emerged. Gene ontology (GO) enrichment analysis of differentially expressed genes unveiled cluster-specific variations in key biological processes (BP): (1) Cluster 1 exhibited up-regulation of GO-BP terms related to ribosome biogenesis and translational control, crucial for maintaining stem cell properties and homeostasis; (2) Additionally, cluster 1 showed up-regulation of GO-BP terms associated with mitochondrial oxidative metabolism; (3) Cluster 3 displayed up-regulation of GO-BP terms related to cell proliferation; (4) Clusters 0 and 2 demonstrated similar up-regulation of GO-BP terms linked to collagen fibril organization and supramolecular fiber organization. However, only cluster 0 showed a significant decrease in GO-BP terms related to ribosome production, implying a potential correlation between ribosome regulation and the differentiation stages of AT-MSCs. Overall, our findings highlight heterogeneous cell clusters with varying balances between proliferation and differentiation before, and after, chondrogenic stimulation. This provides enhanced insights into the single-cell dynamics of AT-MSCs during chondrogenic differentiation.