• Title/Summary/Keyword: Feature selection


A Study on MRI Semi-Automatically Selected Biomarkers for Predicting Risk of Rectal Cancer Surgery Based on Radiomics (라디오믹스 기반 직장암 수술 위험도 예측을 위한 MRI 반자동 선택 바이오마커 검증 연구)

  • Young Seo, Baik;Young Jae, Kim;Youngbae, Jeon;Tae-sik, Hwang;Jeong-Heum, Baek;Kwang Gi, Kim
    • Journal of Biomedical Engineering Research / v.44 no.1 / pp.11-18 / 2023
  • Currently, studies that predict the risk of rectal cancer surgery select MRI image slices based on the clinical experience of surgeons. The purpose of this study is to semi-automatically select and classify 2D MRI image slices for predicting the risk of rectal cancer surgery using biomarkers. The data were retrospectively collected MRI images of 50 patients who underwent laparoscopic surgery for rectal cancer at Gachon University Gil Medical Center. Expert-selected MRI image slices and non-selected slices were screened, and radiomics was used to extract a total of 102 features. A total of 16 approaches were evaluated, combining 4 classifiers with 4 feature selection methods. The combination of Random Forest and Ridge achieved a sensitivity of 0.83, a specificity of 0.88, an accuracy of 0.85, and an AUC of 0.89±0.09. Differences between expert-selected and non-selected MRI image slices were analyzed by extracting the top five significant features. The selected quantitative features help expedite decision making and improve efficiency in studies that predict the risk of rectal cancer surgery.
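
The abstract above reports sensitivity, specificity, and accuracy for a binary slice classifier. As a minimal sketch of how those three metrics follow from predicted and true labels, the function below counts the confusion-matrix cells by hand. The function name `binary_metrics` and the toy labels are assumptions for illustration only; they are not the study's data or code.

```python
def binary_metrics(y_true, y_pred):
    """Return sensitivity, specificity, and accuracy for 0/1 labels."""
    if len(y_true) != len(y_pred) or not y_true:
        raise ValueError("label lists must be non-empty and of equal length")

    # Count the four confusion-matrix cells.
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)

    # Guard against classes that never occur in y_true.
    sensitivity = tp / (tp + fn) if (tp + fn) else 0.0
    specificity = tn / (tn + fp) if (tn + fp) else 0.0
    accuracy = (tp + tn) / len(y_true)
    return {
        "sensitivity": sensitivity,
        "specificity": specificity,
        "accuracy": accuracy,
    }


# Hypothetical labels: 1 = expert-selected slice, 0 = non-selected slice.
toy_true = [1, 1, 1, 1, 0, 0, 0, 0]
toy_pred = [1, 1, 1, 0, 0, 0, 0, 1]
print(binary_metrics(toy_true, toy_pred))
```

With 16 classifier and feature-selection combinations, the same function could be called once per combination to compare them on equal terms.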

A Detecting Technique for the Climatic Factors that Aided the Spread of COVID-19 using Deep and Machine Learning Algorithms

  • Al-Sharari, Waad;Mahmood, Mahmood A.;Abd El-Aziz, A.A.;Azim, Nesrine A.
    • International Journal of Computer Science & Network Security / v.22 no.6 / pp.131-138 / 2022
  • The novel coronavirus disease (COVID-19) is regarded as one of the main public health threats worldwide. Because of the sudden nature of the outbreak and the infectiousness of the virus, it causes anxiety, depression, and other stress reactions. The prevention and control of novel coronavirus pneumonia have entered a critical stage. Predicting and forecasting the outbreak early is essential during this difficult time to control its morbidity and mortality. The entire world is investing enormous effort in fighting the spread of this lethal infection. In this paper, we used machine learning and deep learning techniques to analyze the current situation using data shared by countries and to detect the climatic factors that affect the spread of COVID-19, such as humidity, sunny hours, temperature, and wind speed, in order to understand its dramatic day-to-day behavior and to forecast the future reach of COVID-19 around the world. We used data collected and published by Kaggle and the Johns Hopkins Center for Systems Science. The dataset has 25 attributes and 9566 objects. Our experiment consists of two phases. In phase one, we preprocessed the dataset for the DL model and reduced the features to four (humidity, sunny hours, temperature, and wind speed) using the Pearson correlation coefficient technique (correlation attribute feature selection). In phase two, we used six well-known traditional machine learning techniques for numerical datasets and a Dense Net deep learning model to predict and detect the climatic factors that aid the disease outbreak. We validated the model using a confusion matrix (CM) and measured performance with four metrics: accuracy, F-measure, recall, and precision.
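
The abstract above reduces the attribute set with Pearson correlation feature selection. The sketch below shows one common form of that idea: score each feature by the absolute Pearson correlation with the target and keep the top k. The function names, the `noise` column, and all numbers are hypothetical and chosen only for illustration; the paper does not state its exact ranking or threshold rule.

```python
import math


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    norm_x = math.sqrt(sum((x - mean_x) ** 2 for x in xs))
    norm_y = math.sqrt(sum((y - mean_y) ** 2 for y in ys))
    if norm_x == 0 or norm_y == 0:
        return 0.0  # a constant column carries no linear signal
    return cov / (norm_x * norm_y)


def select_top_k(features, target, k):
    """Rank features by |r| against the target and return the k best names."""
    scores = {name: abs(pearson(col, target)) for name, col in features.items()}
    return sorted(scores, key=scores.get, reverse=True)[:k]


# Hypothetical toy data, not values from the paper's dataset.
toy_features = {
    "temperature": [1, 2, 3, 4, 5],
    "humidity": [5, 4, 3, 2, 2],
    "noise": [2, -1, 0, 1, -2],
}
toy_target = [2, 4, 6, 8, 10]
print(select_top_k(toy_features, toy_target, k=2))
```

Ranking by absolute value keeps strongly negative correlations as well as positive ones, which matters when a factor such as humidity may suppress rather than promote spread.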

Machine Learning-Based Malicious URL Detection Technique (머신러닝 기반 악성 URL 탐지 기법)

  • Han, Chae-rim;Yun, Su-hyun;Han, Myeong-jin;Lee, Il-Gu
    • Journal of the Korea Institute of Information Security & Cryptology / v.32 no.3 / pp.555-564 / 2022
  • Recently, cyberattacks that use intelligent and advanced malicious code have been targeting non-face-to-face environments such as telecommuting, telemedicine, and automated industrial facilities, and the resulting damage is increasing. Traditional information protection systems, such as antivirus software, detect known malicious URLs based on signature patterns, so they cannot detect unknown malicious URLs. In addition, conventional malicious URL detection based on static analysis is vulnerable to dynamic loading and cryptographic attacks. This study proposes a technique for efficiently detecting malicious URLs by dynamically learning malicious URL data. In the proposed detection technique, malicious code is classified using machine learning-based feature selection algorithms, and accuracy is improved by removing obfuscation elements after preprocessing with Weighted Euclidean Distance (WED). According to the experimental results, the proposed machine learning-based malicious URL detection technique achieves an accuracy of 89.17%, an improvement of 2.82% over the conventional method.
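
The abstract above mentions preprocessing with Weighted Euclidean Distance (WED) but does not say how the weights are chosen. As a minimal sketch of the general definition only, the function below computes sqrt(sum_i w_i * (x_i - y_i)^2); the function name, the feature vectors, and the weights are assumptions for illustration and are not taken from the paper.

```python
import math


def weighted_euclidean(x, y, weights):
    """Weighted Euclidean distance between two equal-length feature vectors."""
    if not (len(x) == len(y) == len(weights)):
        raise ValueError("x, y, and weights must have the same length")
    if any(w < 0 for w in weights):
        raise ValueError("weights must be non-negative")
    return math.sqrt(sum(w * (a - b) ** 2 for a, b, w in zip(x, y, weights)))


# Hypothetical two-feature URL vectors, e.g. (URL length, digit count).
print(weighted_euclidean([0, 0], [3, 4], [1, 1]))  # unweighted case
print(weighted_euclidean([0, 0], [3, 4], [1, 0]))  # second feature ignored
```

Setting a weight to zero removes that feature from the distance entirely, which is one way a weighting scheme could down-rank features dominated by obfuscation.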

A Box Office Type Classification and Prediction Model Based on Automated Machine Learning for Maximizing the Commercial Success of the Korean Film Industry (한국 영화의 산업의 흥행 극대화를 위한 AutoML 기반의 박스오피스 유형 분류 및 예측 모델)

  • Subeen Leem;Jihoon Moon;Seungmin Rho
    • Journal of Platform Technology / v.11 no.3 / pp.45-55 / 2023
  • This paper presents a model that supports decision-makers in the Korean film industry to maximize the success of online movies. To achieve this, we collected historical box office movies and clustered them into types to propose a model predicting each type's online box office performance. We considered various features to identify factors contributing to movie success and reduced feature dimensionality for computational efficiency. We systematically classified the movies into types and predicted each type's online box office performance while analyzing the contributing factors. We used automated machine learning (AutoML) techniques to automatically propose and select machine learning algorithms optimized for the problem, allowing for easy experimentation and selection of multiple algorithms. This approach is expected to provide a foundation for informed decision-making and contribute to better performance in the film industry.


Investigation of AI-based dual-model strategy for monitoring cyanobacterial blooms from Sentinel-3 in Korean inland waters

  • Hoang Hai Nguyen;Dalgeun Lee;Sunghwa Choi;Daeyun Shin
    • Proceedings of the Korea Water Resources Association Conference / 2023.05a / pp.168-168 / 2023
  • The frequent occurrence of cyanobacterial harmful algal blooms (CHABs) in inland waters under climate change seriously damages ecosystems and human health and is becoming a major problem in South Korea. Satellite remote sensing is suggested for effectively monitoring CHABs over water bodies at a larger scale, since the traditional method based on sparse in-situ networks is limited in spatial coverage. However, using satellite reflectances as the sole variable in common CHABs dual-models, which rely on both chlorophyll-a (Chl-a) and phycocyanin or cyanobacteria cells (Cyano-cell), is not fully beneficial because their seasonal variation is strongly affected by surrounding meteorological and bio-environmental factors. With the development of Artificial Intelligence (AI), monitoring CHABs from space while analyzing the effects of environmental factors has become feasible. This study aimed to investigate the potential application of AI in the dual-model strategy (with Chl-a and Cyano-cell as output parameters) for monitoring the seasonal dynamics of CHABs from satellites over Korean inland waters. The Sentinel-3 satellite was selected because of its variety of spectral bands and its unique band (620 nm), which is sensitive to cyanobacteria. Via AI-based feature selection, we analyzed the relationships between the two output parameters and the major parameters (satellite water-leaving reflectances at different spectral bands), together with auxiliary (meteorological and bio-environmental) parameters, to select the most important ones. Several AI models were then employed to model Chl-a and Cyano-cell concentrations from the selected parameters. Performance evaluation of the AI models and their comparison with traditional semi-analytical models were conducted to determine whether AI models (using water-leaving reflectances and environmental variables) outperform traditional models (using water-leaving reflectances only) and which AI models are superior for monitoring CHABs from the Sentinel-3 satellite over a Korean inland water body.


Enhancing the Quality of Service by GBSO Splay Tree Routing Framework in Wireless Sensor Network

  • Majidha Fathima K. M.;M. Suganthi;N. Santhiyakumari
    • KSII Transactions on Internet and Information Systems (TIIS) / v.17 no.8 / pp.2188-2208 / 2023
  • Quality of Service (QoS) is a critical feature of Wireless Sensor Networks (WSNs) with routing algorithms. Data packets are moved between cluster heads with QoS using a number of energy-efficient routing techniques. However, sustaining high scalability while extending the lifetime of a WSN remains a challenging task. Thus, this research aims to develop an energy-balancing component that ensures equal energy consumption across all network sensors while offering flexible routing without congestion, even at peak hours. This work proposes a Gravitational Blackhole Search Optimised (GBSO) splay tree routing framework. Based on the splay tree topology, the proposed method carries out routing in three distinct steps. First, in the initiation phase, the proposed GBSO decides the optimal route by choosing the root node with optimum energy in the splay tree. In the selection stage, the energy update and trust update steps are completed by evaluating a novel reliance function that uses the Parent Reliance (PR) and Grand Parent Reliance (GPR). Finally, in the routing phase, the GBSO algorithm determines the best route for data broadcast using the fitness measure and the minimal distance. The results demonstrated the efficacy of the suggested technique, with a 99.52% packet delivery ratio, a minimum delay of 0.19 s, and a network lifetime of 1750 rounds with 200 nodes. The comparative analysis also showed that the suggested algorithm surpasses the existing algorithm in all aspects and guarantees end-to-end delivery of packets.

Comparison of Stock Price Prediction Using Time Series and Non-Time Series Data

  • Min-Seob Song;Junghye Min
    • Journal of the Korea Society of Computer and Information / v.28 no.8 / pp.67-75 / 2023
  • Stock price prediction is an important topic extensively discussed in the financial market, but it is considered a challenging subject because of the numerous factors that can influence it. In this research, time series prediction models (LSTM, GRU) and non-time series prediction models (RF, SVR, KNN, LGBM), which do not take into account the temporal dependence of data, were applied to stock price prediction, and their performance was compared and analyzed. In addition, various data such as stock price data, technical indicators, financial statement indicators, buy-sell indicators, short selling, and foreign indicators were combined to find optimal predictors and to analyze the major factors affecting stock price prediction by industry. Hyperparameter optimization was also carried out for each algorithm to improve prediction performance and to analyze the factors affecting it. As a result of feature selection and hyperparameter optimization, the forecast accuracy of the time series prediction algorithms GRU and LSTM+GRU was found to be the highest.

An Efficient Data Collection Method for Deep Learning-based Wireless Signal Identification in Unlicensed Spectrum (딥 러닝 기반의 이기종 무선 신호 구분을 위한 데이터 수집 효율화 기법)

  • Choi, Jaehyuk
    • Journal of IKEEE / v.26 no.1 / pp.62-66 / 2022
  • Recently, there have been many research efforts based on data-driven deep learning technologies to deal with the interference problem between heterogeneous wireless communication devices in unlicensed frequency bands. However, existing approaches are commonly based on complex neural network models, which require high computational power, limiting their efficiency in resource-constrained network interfaces and Internet of Things (IoT) devices. In this study, we address the problem of classifying heterogeneous wireless technologies, including Wi-Fi and ZigBee, in unlicensed spectrum bands. We focus on a data-driven approach that employs a supervised-learning method using received signal strength indicator (RSSI) data to train Deep Convolutional Neural Networks (CNNs). We propose a simple measurement methodology for collecting RSSI training data that preserves the temporal and spectral properties of the target signal. Real-world experimental results using Ubertooth, an open-source 2.4 GHz wireless development platform, show that the proposed sampling method maintains the same accuracy with only 10% of the sampling data for the same neural network architecture.

The Formation of Compact Elliptical Galaxies: Nature or Nurture?

  • Kim, Suk;Jeong, Hyunjin;Rey, Soo-Chang;Lee, Youngdae;Joo, Seok-Joo;Kim, Hak-Sub
    • The Bulletin of The Korean Astronomical Society / v.44 no.2 / pp.77.3-77.3 / 2019
  • We present an analysis of the stellar populations of compact elliptical galaxies (cEs) in various environments. Following conventional selection criteria for cEs, we created a list of cE candidates in the redshift range z < 0.05 using the SDSS DR12 catalog. We finally selected cEs with low luminosity (Mg > -18.7 mag), small effective radius (Re < 600 pc), and high velocity dispersion (> 60 km s⁻¹). We divide our cE sample into those inside and outside one virial radius of a bright (Mr < -21 mag) nearby host galaxy, defined as cEs with (cEw) and without (cEw/o) a host galaxy, respectively. We investigated the stellar population properties of cEs based on the Hβ, Mg b, Fe 5270, and Fe 5335 line strengths from the OSSY catalog. We found that cEw have a systematically higher metallicity than cEw/o. In the velocity dispersion-Mg b distribution, while cEw/o follow the relation of early-type galaxies, cEw are found to have a systematically higher metallicity than cEw/o at a given velocity dispersion. The difference in metallicity between cEw and cEw/o suggests two different formation scenarios for cEs. cEw would be the remnant cores of massive progenitor galaxies whose outer parts have been tidally stripped by massive neighbor galaxies (i.e., nurture origin). On the other hand, cEw/o are likely to be the faint end of early-type galaxies that have evolved in situ (i.e., nature origin).
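
The selection cuts and the host-based split described in the abstract above can be sketched as a simple filter. This is an illustration under stated assumptions, not the authors' pipeline: the `Galaxy` fields, the function names, and the toy objects are hypothetical, and the luminosity cut is assumed to be an absolute g-band magnitude fainter than -18.7 mag.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Galaxy:
    name: str
    abs_mag_g: float   # absolute g-band magnitude (mag)
    r_eff_pc: float    # effective radius (pc)
    sigma_kms: float   # velocity dispersion (km/s)
    # Separation from the nearest bright (Mr < -21 mag) host, in units of
    # that host's virial radius; None when no bright host is nearby.
    host_sep_rvir: Optional[float] = None


def is_compact_elliptical(g: Galaxy) -> bool:
    """Apply the three cuts: low luminosity, small radius, high dispersion."""
    return g.abs_mag_g > -18.7 and g.r_eff_pc < 600 and g.sigma_kms > 60


def classify(g: Galaxy) -> Optional[str]:
    """Return 'cEw', 'cEw/o', or None if the galaxy fails the cE cuts."""
    if not is_compact_elliptical(g):
        return None
    inside_host = g.host_sep_rvir is not None and g.host_sep_rvir <= 1.0
    return "cEw" if inside_host else "cEw/o"


# Hypothetical catalog rows, not objects from SDSS DR12.
toy_sample = [
    Galaxy("toy-A", -17.5, 400, 80, host_sep_rvir=0.5),
    Galaxy("toy-B", -17.5, 400, 80),
    Galaxy("toy-C", -20.0, 400, 80, host_sep_rvir=0.5),
]
for galaxy in toy_sample:
    print(galaxy.name, classify(galaxy))
```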


On the Analysis of the Aesthetic Style of Huo Jianqi's Local-themed Films : Take Nuan and Postman in the Mountains as Examples (호젠기 향토를 소재로 한 영화의 미학적 스타일 분석에 관한 연구 : '훈'과 '그 산, 저 사람, 저 개'를 예로 들자면)

  • Zhang, Yi
    • Journal of Korea Entertainment Industry Association / v.13 no.6 / pp.95-102 / 2019
  • This paper studies two films by Huo Jianqi, Nuan and Postman in the Mountains, and analyzes the aesthetic style of Huo Jianqi's local movies in three parts. The first part deals with the characteristics of Huo Jianqi's local movies and discusses the theme selection, theme performance, character creation, and emotional expression of the films. The second part examines Huo Jianqi's film and television works from three aspects, picture language, voice language, and color language, which further reflect his aesthetic style. The third part analyzes the difficulties in the development of local films and how to develop them. With their unique oriental aesthetics, perspectives, rural scenery, and characteristics, Chinese local movies have become the most representative of the overall level and style of Chinese movies. Chinese local film reflects and cares for Chinese local culture and shows a strong regional culture; regionality is its distinct feature. Since the reform and opening up, with the changes of the times and the renewal of film directors, local films have shown distinct characteristics of the times. Through the analysis of the aesthetic style of Huo Jianqi's local movies, this paper hopes to provide valuable reference and inspiration for the development of Chinese local movies.