• Title/Abstract/Keywords: Next Generation Sequence

172 search results (processing time: 0.031 s)

Whole genome sequencing of foot-and-mouth disease virus using benchtop next generation sequencing (NGS) system

  • Moon, Sung-Hyun;Oh, Yeonsu;Tark, Dongseob;Cho, Ho-Seong
    • 한국동물위생학회지 / Vol. 42, No. 4 / pp.297-300 / 2019
  • In countries with FMD vaccination, as in Korea, typical clinical signs do not appear, and even in FMD-positive cases it is difficult to isolate FMDV or obtain its whole genome sequence. To overcome this problem, a more rapid and simple NGS system is required to control FMD in Korea. FMDV (O/Boeun/SKR/2017) RNA was extracted and sequenced using Ion Torrent's bench-top sequencer with an amplicon panel and optimized bioinformatics pipelines. Whole genome sequencing generated 1,839,864 reads (mean read length 283 bp) comprising a total of 521,641,058 bases (≥Q20: 475,327,721). Compared with FMDV (GenBank accession No. MG983730), the FMDV sequences in this study showed 99.83% nucleotide identity. Further study is needed to identify these differences. In this study, a fast and robust method based on a benchtop next generation sequencing (NGS) system was developed for analysis of foot-and-mouth disease virus (FMDV) whole genome sequences.
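
The read statistics quoted in this abstract can be cross-checked with simple arithmetic. The sketch below assumes that the 521,641,058 total and the ≥Q20 figure are base counts, which the abstract does not state explicitly; the variable names are illustrative only.

```python
# Figures quoted in the abstract; treating the totals as base counts is an assumption.
total_reads = 1_839_864
total_bases = 521_641_058
q20_bases = 475_327_721

# Average number of bases per read implied by the two totals.
mean_read_length = total_bases / total_reads

# Share of all bases with a quality score of Q20 or higher.
q20_fraction = q20_bases / total_bases

print(f"mean read length: {mean_read_length:.1f} bp")
print(f"bases at >=Q20: {q20_fraction:.1%}")
```

Under that assumption the implied mean read length is close to the reported 283 bp, which suggests the two totals are mutually consistent.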

Genome-Wide SNP Calling Using Next Generation Sequencing Data in Tomato

  • Kim, Ji-Eun;Oh, Sang-Keun;Lee, Jeong-Hee;Lee, Bo-Mi;Jo, Sung-Hwan
    • Molecules and Cells / Vol. 37, No. 1 / pp.36-42 / 2014
  • The tomato (Solanum lycopersicum L.) is a model plant for genome research in Solanaceae, as well as for studying crop breeding. Genome-wide single nucleotide polymorphisms (SNPs) are a valuable resource in genetic research and breeding. However, most methods for discovering genome-wide SNPs require expensive high-depth sequencing. Here, we describe a method for SNP calling using a modified version of SAMtools with improved sensitivity. We analyzed 90 Gb of raw sequence data from next-generation sequencing of two resequencing and seven transcriptome data sets from several tomato accessions. Our study identified 4,812,432 non-redundant SNPs. Moreover, the SNP calling workflow was improved by aligning the reference genome with its own raw data. Using this approach, 131,785 SNPs were discovered from transcriptome data of seven accessions. In addition, 4,680,647 SNPs were identified from the genome of S. pimpinellifolium, which is 60 times more than the 71,637 found in the PI212816 transcriptome. SNP distribution was compared between the whole genome and transcriptome of S. pimpinellifolium. Moreover, we surveyed the location of SNPs within genic and intergenic regions. Our results indicate that sufficient genome-wide SNP markers and a very sensitive SNP calling method allow for the application of marker-assisted breeding and genome-wide association studies.

Screening of Genetic Variations in Korean Native Duck using Next-Generation Resequencing Data

  • Eunjin Cho;Minjun Kim;Hyo Jun Choo;Jun Heon Lee
    • 한국가금학회지 / Vol. 50, No. 3 / pp.187-191 / 2023
  • Korean native ducks (KNDs) remain popular with consumers because of their excellent meat quality and taste. However, because of low productivity and a fixed plumage color phenotype, they have not secured a large share of the domestic market compared with imported species. To improve the market share of KNDs, the genetic characteristics of the breed should be identified and used for improvement and selection. Therefore, this study was conducted to identify the genetic information of colored and white KNDs using next-generation resequencing data and to screen for differences between the two groups. The genetic variants that differed significantly between the colored and white KND groups were mainly identified as mutations related to tyrosine activity. The variants were located in genes that affect melanin synthesis and regulation, such as EGFR, PDGFRA, and DDR2, which have been reported as candidate genes related to plumage pigmentation in poultry. Therefore, the results of this study are expected to be useful as a basis for understanding and utilizing the genetic characteristics of KNDs for genetic improvement and selection of white broiler KNDs.

misMM: An Integrated Pipeline for Misassembly Detection Using Genotyping-by-Sequencing and Its Validation with BAC End Library Sequences and Gene Synteny

  • Ko, Young-Joon;Kim, Jung Sun;Kim, Sangsoo
    • Genomics & Informatics / Vol. 15, No. 4 / pp.128-135 / 2017
  • As next-generation sequencing technologies have advanced, enormous amounts of whole-genome sequence information in various species have been released. However, it is still difficult to assemble the whole genome precisely, due to inherent limitations of short-read sequencing technologies. In particular, the complexities of plants are incomparable to those of microorganisms or animals because of whole-genome duplications, repeat insertions, and Numt insertions, etc. In this study, we describe a new method for detecting misassembly sequence regions of Brassica rapa with genotyping-by-sequencing, followed by MadMapper clustering. The misassembly candidate regions were cross-checked with BAC clone paired-ends library sequences that have been mapped to the reference genome. The results were further verified with gene synteny relations between Brassica rapa and Arabidopsis thaliana. We conclude that this method will help detect misassembly regions and be applicable to incompletely assembled reference genomes from a variety of species.

Flanking Sequence and Copy-Number Analysis of Transformation Events by Integrating Next-Generation Sequencing Technology with Southern Blot Hybridization

  • Qin, Yang;Woo, Hee-Jong;Shin, Kong-Sik;Lim, Myung-Ho;Cho, Hyun-Suk;Lee, Seong-Kon
    • Plant Breeding and Biotechnology / Vol. 5, No. 4 / pp.269-281 / 2017
  • With the continual development of genetically modified (GM) crops, it has become necessary to develop detailed and effective molecular characterization methods to select candidate events from a large pool of transformation events. Relative to traditional molecular analysis methods such as the polymerase chain reaction (PCR) and Southern blot hybridization, next generation sequencing (NGS) technology for whole-genome sequencing of complex crop genomes has proven comparatively useful for in-depth molecular characterization. In this study, four transformation events, including one in Bacillus thuringiensis (Bt)-resistant rice, one in resveratrol-producing rice, and two in beta-carotene-enhanced soybeans, were selected for molecular characterization. By merging NGS analysis and Southern blot hybridization results, we confirmed the transgene insertion sites, insertion construction, and insertion numbers of these four transformation events. In addition, the read-coverage depth assessed by NGS analysis for inserted genes might provide consistent results in terms of inserted T-DNA numbers in cases of complex insertion structures and highly duplicated donor genomes, whereas PCR-based methods can produce incorrect conclusions. Our combined method provides an effective and complete analytical approach for whole-genome visual inspection of transformation events that require biosafety assessment.

조립 공정계획을 위한 지식기반 시스템 (A Knowledge-based System for Assembly Process Planning)

  • 박홍석;손석배
    • 한국정밀공학회지 / Vol. 16, No. 5 (Serial No. 98) / pp.29-39 / 1999
  • Many industrial products can be assembled in various sequences of assembly operations. To save time and cost in the assembly process and to increase product quality, it is very important to choose an optimal assembly sequence. In this paper, we propose a methodology that generates an optimal assembly sequence by using the knowledge of experts. First, a product is divided into several sub-assemblies. Next, the disassembly sequences of each sub-assembly are generated using disassembly rules, and special information can be extracted through the disassembly process. By combining every assembly sequence of the sub-assemblies, we can generate all the possible assembly sequences of a product. Finally, the expert system evaluates all the possible assembly sequences and finds an optimal assembly sequence. The evaluation considers parameters such as assembly operation, tool change, safety of part, base part location, setup change, distance, and orientation. The developed system is applied to a UBR (Unit Bath Room) example.
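
The combine-then-evaluate step described in this abstract can be sketched as follows. This is only an illustration under stated assumptions, not the authors' expert system: the part names, tools, and candidate sequences are hypothetical, and the number of tool changes stands in for the full set of evaluation parameters the abstract lists.

```python
from itertools import product

def tool_changes(sequence):
    """Count how often the tool differs between consecutive (part, tool) steps."""
    return sum(1 for a, b in zip(sequence, sequence[1:]) if a[1] != b[1])

def best_assembly_sequence(subassembly_options):
    """Pick one candidate sequence per sub-assembly, concatenate them in order,
    and return (cost, sequence) for the combination with the fewest tool changes."""
    best = None
    for choice in product(*subassembly_options):
        full = [step for seq in choice for step in seq]
        cost = tool_changes(full)
        if best is None or cost < best[0]:
            best = (cost, full)
    return best

# Hypothetical candidate sequences for two sub-assemblies.
options = [
    [[("frame", "gripper"), ("panel", "screwdriver")],
     [("panel", "screwdriver"), ("frame", "gripper")]],
    [[("pipe", "gripper"), ("valve", "screwdriver")],
     [("valve", "screwdriver"), ("pipe", "gripper")]],
]
cost, sequence = best_assembly_sequence(options)
print(cost, sequence)
```

Exhaustive enumeration grows multiplicatively with the number of candidates per sub-assembly, which is one reason a rule-based evaluation like the one in the abstract would be needed for realistic products.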

Identification of copy number variations using high density whole-genome single nucleotide polymorphism markers in Chinese Dongxiang spotted pigs

  • Wang, Chengbin;Chen, Hao;Wang, Xiaopeng;Wu, Zhongping;Liu, Weiwei;Guo, Yuanmei;Ren, Jun;Ding, Nengshui
    • Asian-Australasian Journal of Animal Sciences / Vol. 32, No. 12 / pp.1809-1815 / 2019
  • Objective: Copy number variations (CNVs) are a major source of genetic diversity complementary to single nucleotide polymorphism (SNP) in animals. The aim of the study was to perform a comprehensive genomic analysis of CNVs based on high density whole-genome SNP markers in Chinese Dongxiang spotted pigs. Methods: We used customized Affymetrix Axiom Pig1.4M array plates containing 1.4 million SNPs and the PennCNV algorithm to identify porcine CNVs on autosomes in Chinese Dongxiang spotted pigs. Then, next generation sequence data were used to confirm the detected CNVs. Next, functional analysis was performed for gene contents in copy number variation regions (CNVRs). In addition, we compared the identified CNVRs with previously reported CNVRs and with quantitative trait loci (QTL) in the pig QTL database. Results: We identified 871 putative CNVs belonging to 2,221 CNVRs on 17 autosomes. We further discarded CNVRs that were detected in only one individual, leaving 166 CNVRs in total. The 166 CNVRs ranged from 2.89 kb to 617.53 kb with a mean value of 93.65 kb and a genome coverage of 15.55 Mb, corresponding to 0.58% of the pig genome. A total of 119 (71.69%) of the identified CNVRs were confirmed by next generation sequence data. Moreover, functional annotation showed that these CNVRs are involved in a variety of molecular functions. More than half (56.63%) of the CNVRs (n = 94) have been reported in previous studies, while 72 CNVRs are reported for the first time. In addition, 162 (97.59%) CNVRs were found to overlap with 2,765 previously reported QTLs affecting 378 phenotypic traits. Conclusion: The findings improve the catalog of pig CNVs and provide insights and novel molecular markers for further genetic analyses of Chinese indigenous pigs.

Flexible Voltage Support Control with Imbalance Mitigation Capability for Inverter-Based Distributed Generation Power Plants under Grid Faults

  • Wang, Yuewu;Yang, Ping;Xu, Zhirong
    • Journal of Power Electronics / Vol. 16, No. 4 / pp.1551-1564 / 2016
  • The high penetration level of inverter-based distributed generation (DG) power plants is challenging the low-voltage ride-through requirements, especially under unbalanced voltage sags. Recently, a flexible injection of both positive- (PS) and negative-sequence (NS) reactive currents has been suggested for the next generation of grid codes. This can enhance the ancillary services for voltage support at the point of common coupling (PCC). In light of this, considering distant grid faults that occur in a mainly inductive grid, this paper proposes a complete voltage support control scheme for the interface inverters of medium or high-rated DG power plants. The first contribution is the development of a reactive current reference generator combining PS and NS, with a feature to increase the PS voltage and simultaneously decrease the NS voltage, to mitigate voltage imbalance. The second contribution is the design of a voltage support control loop with two flexible PCC voltage set points, which can ensure continuous operation within the limits required in grid codes. In addition, a current saturation strategy is also considered for deep voltage sags to avoid overcurrent protection. Finally, simulation and experimental results are presented to validate the effectiveness of the proposed control scheme.

Subword Neural Language Generation with Unlikelihood Training

  • Iqbal, Salahuddin Muhammad;Kang, Dae-Ki
    • International Journal of Internet, Broadcasting and Communication / Vol. 12, No. 2 / pp.45-50 / 2020
  • Neural language models are commonly trained with a likelihood loss so that the model can learn the sequence of human text. State-of-the-art results have been achieved in various language generation tasks, e.g., text summarization, dialogue response generation, and text generation, by utilizing the language model's next-token output probabilities. Monotonous and boring outputs are a well-known problem of such models, yet only a few solutions have been proposed to address it. Several decoding techniques have been proposed to suppress repetitive tokens. Unlikelihood training approaches this problem by penalizing the probabilities of candidate tokens that have already been seen in previous steps. While the method produces less repetitive output, it consumes a large amount of memory because training requires a large vocabulary. We effectively reduce the memory footprint by encoding words as sequences of subword units. Finally, we report results competitive with token-level unlikelihood training in several automatic evaluations compared to the previous work.
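
The penalty on previously seen tokens that this abstract describes can be illustrated for a single decoding step. The abstract gives no formula, so this minimal sketch assumes the commonly cited token-level form, a negative log-likelihood term plus a weighted sum of -log(1 - p(c)) over earlier context tokens c other than the target. The function name, the `alpha` weight, and the toy distribution are hypothetical.

```python
import math

def token_unlikelihood_loss(probs, target, previous_tokens, alpha=1.0):
    """Loss for one step. `probs` maps each token to its predicted probability."""
    # Negative candidates: tokens already seen in the context, except the true target.
    candidates = set(previous_tokens) - {target}
    # Standard likelihood term: reward probability mass on the target.
    likelihood_term = -math.log(probs[target])
    # Unlikelihood term: penalize probability mass on already-seen tokens.
    # The floor of 1e-12 only guards against log(0); it is an implementation choice.
    unlikelihood_term = -sum(
        math.log(max(1.0 - probs.get(c, 0.0), 1e-12)) for c in candidates
    )
    return likelihood_term + alpha * unlikelihood_term

# Toy next-token distribution over a three-word vocabulary.
probs = {"the": 0.5, "cat": 0.3, "sat": 0.2}
loss = token_unlikelihood_loss(probs, target="sat", previous_tokens=["the", "cat", "the"])
print(round(loss, 4))
```

With `alpha=0` the function reduces to the ordinary likelihood loss, so the extra term is what discourages repetition.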

Construction of PANM Database (Protostome DB) for rapid annotation of NGS data in Mollusks

  • Kang, Se Won;Park, So Young;Patnaik, Bharat Bhusan;Hwang, Hee Ju;Kim, Changmu;Kim, Soonok;Lee, Jun Sang;Han, Yeon Soo;Lee, Yong Seok
    • 한국패류학회지 / Vol. 31, No. 3 / pp.243-247 / 2015
  • A stand-alone BLAST server is available that provides a convenient and amenable platform for the analysis of molluscan sequence information, especially the EST sequences generated by traditional sequencing methods. However, the server has limitations in the annotation of molluscan sequences generated using next-generation sequencing (NGS) platforms due to inconsistencies in the molluscan sequences available at NCBI. We constructed a web-based interface for a new stand-alone BLAST, called PANM-DB (Protostome DB), for the analysis of molluscan NGS data. PANM-DB includes the amino acid sequences from the protostome groups Arthropoda, Nematoda, and Mollusca, downloaded from GenBank with the NCBI Taxonomy Browser. The sequences were translated into multi-FASTA format and stored in the database by using the formatdb program at NCBI. PANM-DB contains 6% of NCBInr database sequences (as of 24-06-2015), and for an input of 10,000 RNA-seq sequences the processing speed was 15 times faster with PANM-DB than with NCBInr DB. It was also noted that PANM-DB shows two times more significant hits with diverse annotation profiles compared with Mollusks DB. Hence, the construction of PANM-DB is a significant step in the annotation of molluscan sequence information obtained from NGS platforms. PANM-DB is freely downloadable from the web-based interface (Malacological Society of Korea, http://malacol.or/kr/blast) as a compressed file system and can run on any compatible operating system.