• Title/Summary/Keyword: Intensity-dependent Normalization

Search Result 4, Processing Time 0.021 seconds

Combining Support Vector Machine Recursive Feature Elimination and Intensity-dependent Normalization for Gene Selection in RNAseq (RNAseq 빅데이터에서 유전자 선택을 위한 밀집도-의존 정규화 기반의 서포트-벡터 머신 병합법)

  • Kim, Chayoung
    • Journal of Internet Computing and Services
    • /
    • v.18 no.5
    • /
    • pp.47-53
    • /
    • 2017
  • In past few years, high-throughput sequencing, big-data generation, cloud computing, and computational biology are revolutionary. RNA sequencing is emerging as an attractive alternative to DNA microarrays. And the methods for constructing Gene Regulatory Network (GRN) from RNA-Seq are extremely lacking and urgently required. Because GRN has obtained substantial observation from genomics and bioinformatics, an elementary requirement of the GRN has been to maximize distinguishable genes. Despite of RNA sequencing techniques to generate a big amount of data, there are few computational methods to exploit the huge amount of the big data. Therefore, we have suggested a novel gene selection algorithm combining Support Vector Machines and Intensity-dependent normalization, which uses log differential expression ratio in RNAseq. It is an extended variation of support vector machine recursive feature elimination (SVM-RFE) algorithm. This algorithm accomplishes minimum relevancy with subsets of Big-Data, such as NCBI-GEO. The proposed algorithm was compared to the existing one which uses gene expression profiling DNA microarrays. It finds that the proposed algorithm have provided as convenient and quick method than previous because it uses all functions in R package and have more improvement with regard to the classification accuracy based on gene ontology and time consuming in terms of Big-Data. The comparison was performed based on the number of genes selected in RNAseq Big-Data.

A MA-plot-based Feature Selection by MRMR in SVM-RFE in RNA-Sequencing Data

  • Kim, Chayoung
    • The Journal of Korean Institute of Information Technology
    • /
    • v.16 no.12
    • /
    • pp.25-30
    • /
    • 2018
  • It is extremely lacking and urgently required that the method of constructing the Gene Regulatory Network (GRN) from RNA-Sequencing data (RNA-Seq) because of Big-Data and GRN in Big-Data has obtained substantial observation as the interactions among relevant featured genes and their regulations. We propose newly the computational comparative feature patterns selection method by implementing a minimum-redundancy maximum-relevancy (MRMR) filter the support vector machine-recursive feature elimination (SVM-RFE) with Intensity-dependent normalization (DEGSEQ) as a preprocessor for emphasizing equal preciseness in RNA-seq in Big-Data. We found out the proposed algorithm might be more scalable and convenient because of all libraries in R package and be more improved in terms of the time consuming in Big-Data and minimum-redundancy maximum-relevancy of a set of feature patterns at the same time.

Availability of Normalized Spectra of Landsat/TM Data by Their Band Sum

  • Ono, Akiko;Kajiwara, Koji;Honda, Yoshiaki;Ono, Atsuo
    • Proceedings of the KSRS Conference
    • /
    • 2003.11a
    • /
    • pp.573-575
    • /
    • 2003
  • In satellite spectra, Though the magnitude varies with intensity of sunstroke, dip angle of land so on, the shape is less deformed with these effects. from this point of view, we have developed a spectral shape-dependent analysis utilizing a normalization procedure by the spectral integral and applied it to Landsat/TM spectra. Inevitable topographic and atmospheric effects can be suppressed. The correction algorithm is very simple and timesaving and the suppression of topographic effects is especially effective. Normalized band 4 is almost linear to NDVI values, and is available to the vegetation index.

  • PDF

Establishing meteorological drought severity considering the level of emergency water supply (비상급수의 규모를 고려한 기상학적 가뭄 강도 수립)

  • Lee, Seungmin;Wang, Wonjoon;Kim, Donghyun;Han, Heechan;Kim, Soojun;Kim, Hung Soo
    • Journal of Korea Water Resources Association
    • /
    • v.56 no.10
    • /
    • pp.619-629
    • /
    • 2023
  • Recent intensification of climate change has led to an increase in damages caused by droughts. Currently, in Korea, the Standardized Precipitation Index (SPI) is used as a criterion to classify the intensity of droughts. Based on the accumulated precipitation over the past six months (SPI-6), meteorological drought intensities are classified into four categories: concern, caution, alert, and severe. However, there is a limitation in classifying drought intensity solely based on precipitation. To overcome the limitations of the meteorological drought warning criteria based on SPI, this study collected emergency water supply damage data from the National Drought Information Portal (NDIP) to classify drought intensity. Factors of SPI, such as precipitation, and factors used to calculate evapotranspiration, such as temperature and humidity, were indexed using min-max normalization. Coefficients for each factor were determined based on the Genetic Algorithm (GA). The drought intensity based on emergency water supply was used as the dependent variable, and the coefficients of each meteorological factor determined by GA were used as coefficients to derive a new Drought Severity Classification Index (DSCI). After deriving the DSCI, cumulative distribution functions were used to present intensity stage classification boundaries. It is anticipated that using the proposed DSCI in this study will allow for more accurate drought intensity classification than the traditional SPI, supporting decision-making for disaster management personnel.