• Title/Summary/Keyword: Data Amplification

Search Result 298, Processing Time 0.046 seconds

Machine Learning Language Model Implementation Using Literary Texts (문학 텍스트를 활용한 머신러닝 언어모델 구현)

  • Jeon, Hyeongu;Jung, Kichul;Kwon, Kyoungah;Lee, Insung
    • The Journal of the Convergence on Culture Technology
    • /
    • v.7 no.2
    • /
    • pp.427-436
    • /
    • 2021
  • The purpose of this study is to implement a machine learning language model that learns literary texts. Literary texts have an important characteristic that pairs of question-and-answer are not frequently clearly distinguished. Also, literary texts consist of pronouns, figurative expressions, soliloquies, etc. They hinder the necessity of machine learning using literary texts by making it difficult to learn algorithms. Algorithms that learn literary texts can show more human-friendly interactions than algorithms that learn general sentences. For this goal, this paper proposes three text correction tasks that must be preceded in researches using literary texts for machine learning language model: pronoun processing, dialogue pair expansion, and data amplification. Learning data for artificial intelligence should have clear meanings to facilitate machine learning and to ensure high effectiveness. The introduction of special genres of texts such as literature into natural language processing research is expected not only to expand the learning area of machine learning, but to show a new language learning method.

Prediction of Genomic Relationship Matrices using Single Nucleotide Polymorphisms in Hanwoo (한우의 유전체 표지인자 활용 개체 혈연관계 추정)

  • Lee, Deuk-Hwan;Cho, Chung-Il;Kim, Nae-Soo
    • Journal of Animal Science and Technology
    • /
    • v.52 no.5
    • /
    • pp.357-366
    • /
    • 2010
  • The emergence of next-generation sequencing technologies has lead to application of new computational and statistical methodologies that allow incorporating genetic information from entire genomes of many individuals composing the population. For example, using single-nucleotide polymorphisms (SNP) obtained from whole genome amplification platforms such as the Ilummina BovineSNP50 chip, many researchers are actively engaged in the genetic evaluation of cattle livestock using whole genome relationship analyses. In this study, we estimated the genomic relationship matrix (GRM) and compared it with one computed using a pedigree relationship matrix (PRM) using a population of Hanwoo. This project is a preliminary study that will eventually include future work on genomic selection and prediction. Data used in this study were obtained from 187 blood samples consisting of the progeny of 20 young bulls collected after parentage testing from the Hanwoo improvement center, National Agriculture Cooperative Federation as well as 103 blood samples from the progeny of 12 proven bulls collected from farms around the Kyong-buk area in South Korea. The data set was divided into two cases for analysis. In the first case missing genotypes were included. In the second case missing genotypes were excluded. The effect of missing genotypes on the accuracy of genomic relationship estimation was investigated. Estimation of relationships using genomic information was also carried out chromosome by chromosome for whole genomic SNP markers based on the regression method using allele frequencies across loci. The average correlation coefficient and standard deviation between relationships using pedigree information and chromosomal genomic information using data which was verified using a parentage test andeliminated missing genotypes was $0.81{\pm}0.04$ and their correlation coefficient when using whole genomic information was 0.98, which was higher. Variation in relationships between non-inbred half sibs was $0.22{\pm}0.17$ on chromosomal and $0.22{\pm}0.04$ on whole genomic SNP markers. The variations were larger and unusual values were observed when non-parentage test data were included. So, relationship matrix by genomic information can be useful for genetic evaluation of animal breeding.

Development of a Species-specific PCR Assay for Three Xanthomonas Species, Causing Bulb and Flower Diseases, Based on Their Genome Sequences

  • Back, Chang-Gi;Lee, Seung-Yeol;Lee, Boo-Ja;Yea, Mi-Chi;Kim, Sang-Mok;Kang, In-Kyu;Cha, Jae-Soon;Jung, Hee-Young
    • The Plant Pathology Journal
    • /
    • v.31 no.3
    • /
    • pp.212-218
    • /
    • 2015
  • In this study, we developed a species-specific PCR assay for rapid and accurate detection of three Xanthomonas species, X. axonopodis pv. poinsettiicola (XAP), X. hyacinthi (XH) and X. campestris pv. zantedeschiae (XCZ), based on their draft genome sequences. XAP, XH and XCZ genomes consist of single chromosomes that contain 5,221, 4,395 and 7,986 protein coding genes, respectively. Species-specific primers were designed from variable regions of the draft genome sequence data and assessed by a PCR-based detection method. These primers were also tested for specificity against 17 allied Xanthomonas species as well as against the host DNA and the microbial community of the host surface. Three primer sets were found to be very specific and no amplification product was obtained with the host DNA and the microbial community of the host surface. In addition, a detection limit of $1pg/{\mu}l$ per PCR reaction was detected when these primer sets were used to amplify corresponding bacterial DNAs. Therefore, these primer sets and the developed species-specific PCR assay represent a valuable, sensitive, and rapid diagnostic tool that can be used to detect three specific pathogens at early stages of infection and may help control diseases.

Expression of Fatty Acid Synthase Negatively Correlates with PTEN and Predicts Peritoneal Dissemination of Human Gastric Cancer

  • Xiang, Hong-Gang;Hao, Jun;Zhang, Wen-Jie;Lu, Wen-Jie;Dong, Ping;Liu, Ying-Bin;Chen, Lei
    • Asian Pacific Journal of Cancer Prevention
    • /
    • v.16 no.16
    • /
    • pp.6851-6855
    • /
    • 2015
  • Background: This study aimed to examine the clinical significance of fatty acid synthase (FASN) expression in gastric cancer (GC), and investigate any prognostic role. Materials and Methods: FASN expression was assessed in gastric cancers by immunohistochemistry using 60 paraffin-embedded tissue specimens, and clinical data were collected by retrospective chart review. Moreover, FASN mRNA expression in 15 fresh resected specimens was evaluated by the reverse transcription-polymerase chain reaction (RT-PCR). Immunohistochemical staining of PTEN was performed to assess the correlation of PTEN with FASN in gastric cancer. Results: Increased expression of FASN was noted in gastric cancers. The frequency of FASN gene amplification was also significantly higher in gastric cancer than in adjacent normal tissue. FASN expression in human gastric cancer tissues was significantly correlated with patient TNM stage and peritoneal dissemination (p<0.05). Moreover, higher FASN expression significantly correlated with shorter overall survival (p<0.05). Here, upregulation of FASN negatively correlated with PTEN expression in gastric cancer. Conclusions: These findings indicate that FASN expression is upregulated in gastric cancer, and increased FASN may be critical to th peritoneal metastasis and survival. Our results suggest that FASN upregulation and PTEN downregualtion may be involved in peritoneal dissemination for gastric cancer progression.

Inhibitors of DNA methylation support TGF-β1-induced IL11 expression in gingival fibroblasts

  • Sufaru, Irina-Georgeta;Beikircher, Gabriel;Weinhaeusel, Andreas;Gruber, Reinhard
    • Journal of Periodontal and Implant Science
    • /
    • v.47 no.2
    • /
    • pp.66-76
    • /
    • 2017
  • Purpose: Oral wound healing requires gingival fibroblasts to respond to local growth factors. Epigenetic silencing through DNA methylation can potentially decrease the responsiveness of gingival fibroblasts to local growth factors. In this study, our aim was to determine whether the inhibition of DNA methylation sensitized gingival fibroblasts to transforming growth factor-${\beta}1$ (TGF-${\beta}1$). Methods: Gingival fibroblasts were exposed to 5-aza-2'-deoxycytidine (5-aza), a clinically approved demethylating agent, before stimulation with TGF-${\beta}1$. Gene expression changes were evaluated using quantitative polymerase chain reaction (PCR) analysis. DNA methylation was detected by methylation-sensitive restriction enzymes and PCR amplification. Results: We found that 5-aza enhanced TGF-${\beta}1$-induced interleukin-11 (IL11) expression in gingival fibroblasts 2.37-fold (P=0.008). 5-aza had no significant effects on the expression of proteoglycan 4 (PRG4) and NADPH oxidase 4 (NOX4). Consistent with this, 5-aza caused demethylation of the IL11 gene commonly next to a guanosine (CpG) island in gingival fibroblasts. The TGF-${\beta}$ type I receptor kinase inhibitor SB431542 impeded the changes in IL11 expression, indicating that the effects of 5-aza require TGF-${\beta}$ signaling. 5-aza moderately increased the expression of TGF-${\beta}$ type II receptor (1.40-fold; P=0.009), possibly enhancing the responsiveness of fibroblasts to TGF-${\beta}1$. As part of the feedback response, 5-aza increased the expression of the DNA methyltransferases 1 (DNMT1) (P=0.005) and DNMT3B (P=0.002), which are enzymes responsible for gene methylation. Conclusions: These in vitro data suggest that the inhibition of DNA methylation by 5-aza supports TGF-${\beta}$-induced IL11 expression in gingival fibroblasts.

MAGNETIC FIELD IN THE LOCAL UNIVERSE AND THE PROPAGATION OF UHECRS

  • DOLAG KLAUS;GRASSO DARIO;SPRINGEL VOLKER;TKACHEV IGOR
    • Journal of The Korean Astronomical Society
    • /
    • v.37 no.5
    • /
    • pp.427-431
    • /
    • 2004
  • We use simulations of large-scale structure formation to study the build-up of magnetic fields (MFs) in the intergalactic medium. Our basic assumption is that cosmological MFs grow in a magnetohy-drodynamical (MHD) amplification process driven by structure formation out of a magnetic seed field present at high redshift. This approach is motivated by previous simulations of the MFs in galaxy clusters which, under the same hypothesis that we adopt here, succeeded in reproducing Faraday rotation measurements (RMs) in clusters of galaxies. Our ACDM initial conditions for the dark matter density fluctuations have been statistically constrained by the observed large-scale density field within a sphere of 110 Mpc around the Milky Way, based on the IRAS 1.2-Jy all-sky redshift survey. As a result, the positions and masses of prominent galaxy clusters in our simulation coincide closely with their real counterparts in the Local Universe. We find excellent agreement between RMs of our simulated galaxy clusters and observational data. The improved numerical resolution of our simulations compared to previous work also allows us to study the MF in large-scale filaments, sheets and voids. By tracing the propagation of ultra high energy (UHE) protons in the simulated MF we construct full-sky maps of expected deflection angles of protons with arrival energies $E = 10^{20}\;eV$ and $4 {\times} 10^{19}\;eV$, respectively. Accounting only for the structures within 110 Mpc, we find that strong deflections are only produced if UHE protons cross galaxy clusters. The total area on the sky covered by these structures is however very small. Over still larger distances, multiple crossings of sheets and filaments may give rise to noticeable deflections over a significant fraction of the sky; the exact amount and angular distribution depends on the model adopted for the magnetic seed field. Based on our results we argue that over a large fraction of the sky the deflections are likely to remain smaller than the present experimental angular sensitivity. Therefore, we conclude that forthcoming air shower experiments should be able to locate sources of UHE protons and shed more light on the nature of cosmological MFs.

Development of PCR-Based Sequence Characterized DNA Markers for the Identification and Detection, Genetic Diversity of Didymella bryoniae with Random Amplified polymorphic DNA(RAPD)

  • Kyo, Seo-Il;Shim, Chang-Ki;Kim, Dong-Kil;Baep, Dong-Won;Lee, Seon-Chul;Kim, Hee-Kyu
    • Proceedings of the Korean Society of Plant Pathology Conference
    • /
    • 2003.10a
    • /
    • pp.130-130
    • /
    • 2003
  • Gummy stem blight pathogen is very difficult not only to monitor the inoculum levels prior to host infection, and also it is destructive and hard to control in field condition. We have applied RAPD technique to elucidate the genetic diversity of the genomic DNA of Didymella bryoniae and also to generate specific diagnostic DNA probe useful for identification and detection. The 40 primers produced clear bands consistently from the genomic DNA of twenty isolates of Didymella bryoniae, and two hundred seventy-three amplified fragments were produced with 40 primers. The combined data from 273 bands was analyzed by a cluster analysis using UPGMA method with an arithmetic average program of NTSYS-PC (Version 1.80) to generate a dendrogram. At the distance level of 0.7, two major RAPD groups were differentiated among 20 strains. RAPD group (RG) I included 8 isolates from watermelon except one isolate from melon. RAPD group (RG) IV included 12 isolates from squash, cucumber, watermelon and melon.. In amplification experiment with SCAR specific primer RG1F-RG1R resulted in a single band of 650bp fragment only for 8 isolates out of 20 isolates that should be designated as RAPD Group 1. However, same set of experiment done with RGIIF-RGIIR did not result in any amplified product.. Our attempts to detect intraspecific diversity of ITS region of rDNA by amplifying ITS region and 17s rDNA region for 20 isolates and restriction digestion of amplified fragment with 12 enzymes did not reveal polymorphic band. In order to develop RAPD markers for RGIV specific primer, a candidate PCR fragment( ≒1.4kb) was purified and Southern hybridized to the amplified fragment RGIV isolates. This promising candidate probe recognized only RGIV isolates

  • PDF

Molecular Cloning and Phylogeny of the Human Endogenous Retrovirus HERV-W LTR Family in cDNA Library of Human Fetal Brain (인간 태아의 뇌로부터 만들어진 cDNA library에서 내생 레트로바이러스 HERV-W LTR의 클로닝 및 분자계통분류)

  • 이주민;허재원;신경미;이지원;이영춘;백인호;장경립;김희수
    • Journal of Life Science
    • /
    • v.11 no.4
    • /
    • pp.379-384
    • /
    • 2001
  • Long terminal repeats(LTRs) of the human endogenous retrovirus(HERV) heve been found to be coexpresed with genes located nearby. It has been suggested that the LTR elements have contributed to the genetic variation of human genome connected to various diseases. Recently, HERV-W family was identified in the cerebrospinal fluids and brains of individuals with schizophrenia. Using cHNA library derived from human fetal brain, we performed PCR amplification and identified seven new HERV-W LTR elements. Those LTR elements showed a high degree of sequence similarity(98∼99%) with HERV-W (AF072500). A phylogentic tree obtained by the neighbor-joining method revealed that seven new HERV-W LTR elements(FB-1, 2, 4, 8, 9, 10, 12) were closely related to the AX000960, AF072504, and AF072506 from Gen Bank database. Our data suggest that several copy numbers of the HERV-W LTR elements are expressed in human feta brain and may contribute to an understanding of biological function connected to neuropsychiatric diseases.

  • PDF

Molecular Characterization, Chromosomal Localizations, Expression Profile, and Association Analysis of the Porcine PECI Gene with Carcass Traits

  • Gao, H.;Fan, B.;Zhu, M.J.;Liu, Bang
    • Asian-Australasian Journal of Animal Sciences
    • /
    • v.23 no.1
    • /
    • pp.7-12
    • /
    • 2010
  • The full-length cDNA of the porcine peroxisomal ${\Delta}^3$,${\Delta}^2$-enoyl-CoA isomerase (PECI) gene encodes a monofunctional peroxisomal ${\Delta}^3$,${\Delta}^2$-enoyl-CoA isomerase. Cloning and sequencing of the porcine PECI cDNA revealed the presence of an 1185-base pair open reading frame predicted to encode a 394-amino acid protein by the 5'rapid amplification of cDNA ends (5'RACE) and EST sequences. The porcine PECI gene was expressed in seven tissues (heart, liver, spleen, lung, kidney, skeletal muscle, fat) which was revealed by reverse transcriptase-polymerase chain reaction (RT-PCR). The porcine PECI was mapped to SSC71/2 p11-13 using the somatic cell hybrid panel (SCHP) and the radiation hybrid panel (RH) (LOD score 12.84). The data showed that PECI was closely linked to marker S0383. A C/T single nucleotide polymorphism in PECI exon 10 (3'UTR) was detected as a PvuII PCR-RFLP. Association analysis in our experimental pig population showed that different genotypes of PECI gene were significantly associated with the Average Backfat thickness (ABF) (p<0.05) and Buttock backfat thickness (p<0.01).

Cloning and Expression Analysis of a Novel Mouse Zinc Finger Protein Gene Znf313 Abundantly Expressed in Testis

  • Li, Na;Sun, Huaqin;Wu, Qiaqing;Tao, Dachang;Zhang, Sizhong;Ma, Yongxin
    • BMB Reports
    • /
    • v.40 no.2
    • /
    • pp.270-276
    • /
    • 2007
  • We have cloned a novel mouse zinc finger protein gene Znf313 by rapid amplification of cDNA ends (RACE) according to the homologue of human ZNF313 gene. The cDNA is 2,163 base pairs (bp) in length and encodes a 229 amino acids (aa) protein with a $C_3HC_4$ ring finger domain and three $C_2H_2$ domains. 89% and 93% nucleotide (nt) and aa sequence identity is observed with its human homologue. Revealed by Northern blot and RT-PCR, full mRNA consists of 2.16 kb and widely expresses in tissues as a single transcript, most abundantly in heart, liver, kidney and testis. The expression of Znf313 in testis is detected in all development stages. Western blot analysis also reveals that Znf313 is expressed in the tissues. Immunohistochemical staining and subcellular localization demonstrate that Znf313 is expressed both in the cytoplasm and nucleus whereas predominantly localized in the nucleus. Present data suggests that Znf313 gene might play a fundamental role in gene transcription and regulation in organism and relates to spermatogenesis.