Search | Korea Science

Feature Selection with Ensemble Learning for Prostate Cancer Prediction from Gene Expression

Abass, Yusuf Aleshinloye;Adeshina, Steve A.
- International Journal of Computer Science & Network Security / v.21 no.12spc / pp.526-538 / 2021
Machine and deep learning-based models are emerging techniques that are being used to address prediction problems in biomedical data analysis. DNA sequence prediction is a critical problem that has attracted a great deal of attention in the biomedical domain. Machine and deep learning-based models have been shown to provide more accurate results when compared to conventional regression-based models. The prediction of the gene sequence that leads to cancerous diseases, such as prostate cancer, is crucial. Identifying the most important features in a gene sequence is a challenging task. Extracting the components of the gene sequence that can provide an insight into the types of mutation in the gene is of great importance as it will lead to effective drug design and the promotion of the new concept of personalised medicine. In this work, we extracted the exons in the prostate gene sequences that were used in the experiment. We built a Deep Neural Network (DNN) and Bi-directional Long-Short Term Memory (Bi-LSTM) model using a k-mer encoding for the DNA sequence and one-hot encoding for the class label. The models were evaluated using different classification metrics. Our experimental results show that DNN model prediction offers a training accuracy of 99 percent and validation accuracy of 96 percent. The bi-LSTM model also has a training accuracy of 95 percent and validation accuracy of 91 percent.
https://doi.org/10.22937/IJCSNS.2021.21.12.73
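The k-mer encoding of DNA sequences and the one-hot encoding of class labels mentioned in the abstract above can be illustrated with a minimal sketch. The abstract does not state the value of k, the stride, or the vocabulary layout, so `k=3`, the stride of 1, the reserved index 0 for unknown k-mers, and all function names here are assumptions made for illustration only.

```python
from itertools import product


def kmers(sequence, k=3):
    """Split a DNA sequence into overlapping k-mers (stride 1, an assumption)."""
    sequence = sequence.upper()
    return [sequence[i:i + k] for i in range(len(sequence) - k + 1)]


def build_vocab(k=3, alphabet="ACGT"):
    """Map every possible k-mer to an integer index; 0 is reserved for unknowns."""
    return {"".join(p): i + 1 for i, p in enumerate(product(alphabet, repeat=k))}


def encode(sequence, vocab, k=3):
    """Turn a sequence into a list of k-mer indices (0 for k-mers not in vocab)."""
    return [vocab.get(km, 0) for km in kmers(sequence, k)]


def one_hot(label, classes):
    """One-hot encode a class label against a fixed, ordered list of classes."""
    return [1 if c == label else 0 for c in classes]
```

The resulting integer sequences could then be fed to an embedding layer or otherwise vectorized before training a classifier; that step depends on the chosen framework and is not shown.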

Crop Leaf Disease Identification Using Deep Transfer Learning

Changjian Zhou;Yutong Zhang;Wenzhong Zhao
- Journal of Information Processing Systems / v.20 no.2 / pp.149-158 / 2024
Traditional manual identification of crop leaf diseases is challenging. Owing to the limitations in manpower and resources, it is challenging to explore crop diseases on a large scale. The emergence of artificial intelligence technologies, particularly the extensive application of deep learning technologies, is expected to overcome these challenges and greatly improve the accuracy and efficiency of crop disease identification. Crop leaf disease identification models have been designed and trained using large-scale training data, enabling them to predict different categories of diseases from unlabeled crop leaves. However, these models, which possess strong feature representation capabilities, require substantial training data, and there is often a shortage of such datasets in practical farming scenarios. To address this issue and improve the feature learning abilities of models, this study proposes a deep transfer learning adaptation strategy. The novel proposed method aims to transfer the weights and parameters from pre-trained models in similar large-scale training datasets, such as ImageNet. ImageNet pre-trained weights are adopted and fine-tuned with the features of crop leaf diseases to improve prediction ability. In this study, we collected 16,060 crop leaf disease images, spanning 12 categories, for training. The experimental results demonstrate that an impressive accuracy of 98% is achieved using the proposed method on the transferred ResNet-50 model, thereby confirming the effectiveness of our transfer learning approach.
https://doi.org/10.3745/JIPS.04.0305

Study on predictive model and mechanism analysis for martensite transformation temperatures through explainable artificial intelligence (설명가능한 인공지능을 통한 마르텐사이트 변태 온도 예측 모델 및 거동 분석 연구)

Junhyub Jeon;Seung Bae Son;Jae-Gil Jung;Seok-Jae Lee
- Journal of the Korean Society for Heat Treatment / v.37 no.3 / pp.103-113 / 2024
Martensite volume fraction significantly affects the mechanical properties of alloy steels. The martensite start temperature (Mₛ) and the temperatures at which 50 vol.% (M₅₀) and 90 vol.% (M₉₀) of martensite have formed are important transformation temperatures for controlling the martensite phase fraction. Several researchers have proposed empirical equations and machine learning models to predict the Mₛ temperature. These numerical approaches can predict the Mₛ temperature without additional experiments or cost. However, to control the martensite phase fraction more precisely, the prediction error of the Mₛ model must be reduced and prediction models are needed for the other martensite transformation temperatures (M₅₀, M₉₀). In the present study, machine learning models were applied to build predictive models for the Mₛ, M₅₀, and M₉₀ temperatures. Explainable artificial intelligence (XAI) was employed to explain the prediction mechanisms and to identify the feature importance for the martensite transformation temperatures. Among the machine learning models compared, random forest regression (RFR) showed the best performance for predicting the Mₛ, M₅₀, and M₉₀ temperatures. The feature importance was presented and the prediction mechanisms were discussed using XAI.
https://doi.org/10.12656/jksht.2024.37.3.103

A Flexible Feature Matching for Automatic Facial Feature Points Detection (얼굴 특징점 자동 검출을 위한 탄력적 특징 정합)

Hwang, Suen-Ki;Bae, Cheol-Soo
- The Journal of Korea Institute of Information, Electronics, and Communication Technology / v.3 no.2 / pp.12-17 / 2010
An automatic facial feature point (FFP) detection system is proposed. A face is represented as a graph in which the nodes are placed at facial feature points (FFPs) labeled by their Gabor features and the edges describe their spatial relations. An innovative flexible feature matching is proposed to establish feature correspondences between the models and the input image. This matching model works like a random diffusion process in the image space by employing a locally competitive and globally cooperative mechanism. The system works well on face images with complicated backgrounds, pose variations, and distortion by facial accessories. We demonstrate the benefits of our approach by its implementation on the system.

A Flexible Feature Matching for Automatic face and Facial feature Points Detection (얼굴과 얼굴 특징점 자동 검출을 위한 탄력적 특징 정합)

박호식;손형경;정연길;배철수
- Proceedings of the Korean Institute of Information and Communication Sciences Conference / 2002.05a / pp.608-612 / 2002
An automatic face and facial feature point (FFP) detection system is proposed. A face is represented as a graph in which the nodes are placed at facial feature points (FFPs) labeled by their Gabor features and the edges describe their spatial relations. An innovative flexible feature matching is proposed to establish feature correspondences between the models and the input image. This matching model works like a random diffusion process in the image space by employing a locally competitive and globally cooperative mechanism. The system works well on face images with complicated backgrounds, pose variations, and distortion by facial accessories. We demonstrate the benefits of our approach by its implementation on the face identification system.

Navigable Space-Relation Model for Indoor Space Analysis (실내 공간 분석을 위한 보행 공간관계 모델)

Lee, Seul-Ji;Lee, Ji-Yeong
- Spatial Information Research / v.19 no.5 / pp.75-86 / 2011
Three-dimensional modeling of real-world cities is an essential task for city planning and decision-making. Many three-dimensional city models are being developed as wireless Internet and location-based services, which identify the location of users and provide them with information, become more widespread. In the urban areas of Korea in particular, indoor as well as outdoor space modeling is needed because of the high density of high-rise buildings. Location-based services should also be provided through spatial analysis, such as shortest-path analysis, based on a space model. Many studies of three-dimensional city models use feature models. In a feature model, space is represented by combining primitives, and relationships among spaces are represented only if shared primitives are detected. Relationships between complex three-dimensional objects in space are therefore difficult to define through feature models. In this study, a navigable space-relation model (NSRM) is developed, which is a topological data model for the efficient representation of spatial relationships between objects based on a network structure.

Facial Expression Recognition with Instance-based Learning Based on Regional-Variation Characteristics Using Models-based Feature Extraction (모델기반 특징추출을 이용한 지역변화 특성에 따른 개체기반 표정인식)

Park, Mi-Ae;Ko, Jae-Pil
- Journal of Korea Multimedia Society / v.9 no.11 / pp.1465-1473 / 2006
In this paper, we present an approach for facial expression recognition in image sequences using Active Shape Models (ASM) and a state-based model. Given an image frame, we use ASM to obtain the shape parameter vector of the model while locating facial feature points. We thus obtain the shape parameter vector set for all the frames of an image sequence. This vector set is converted by the state-based model into a state vector, which takes one of three states. In the classification step, we use k-NN with a proposed similarity measure motivated by the observation that the variation regions of an expression sequence differ from those of other expression sequences. In experiments with the public database KCFD, we demonstrate that the proposed measure slightly outperforms the binary measure: k-NN achieves recognition rates of 89.1% with the proposed measure and 86.2% with the existing binary measure when k is 1.

Simultaneous Optimization of KNN Ensemble Model for Bankruptcy Prediction (부도예측을 위한 KNN 앙상블 모형의 동시 최적화)

Min, Sung-Hwan
- Journal of Intelligence and Information Systems / v.22 no.1 / pp.139-157 / 2016
Bankruptcy involves considerable costs, so it can have significant effects on a country's economy. Thus, bankruptcy prediction is an important issue. Over the past several decades, many researchers have addressed topics associated with bankruptcy prediction. Early research on bankruptcy prediction employed conventional statistical methods such as univariate analysis, discriminant analysis, multiple regression, and logistic regression. Later on, many studies began utilizing artificial intelligence techniques such as inductive learning, neural networks, and case-based reasoning. Currently, ensemble models are being utilized to enhance the accuracy of bankruptcy prediction. Ensemble classification involves combining multiple classifiers to obtain more accurate predictions than those obtained using individual models. Ensemble learning techniques are known to be very useful for improving the generalization ability of the classifier. Base classifiers in the ensemble must be as accurate and diverse as possible in order to enhance the generalization ability of an ensemble model. Commonly used methods for constructing ensemble classifiers include bagging, boosting, and random subspace. The random subspace method selects a random feature subset for each classifier from the original feature space to diversify the base classifiers of an ensemble. Each ensemble member is trained by a randomly chosen feature subspace from the original feature set, and predictions from each ensemble member are combined by an aggregation method. The k-nearest neighbors (KNN) classifier is robust with respect to variations in the dataset but is very sensitive to changes in the feature space. For this reason, KNN is a good classifier for the random subspace method. The KNN random subspace ensemble model has been shown to be very effective for improving an individual KNN model. 
The k parameter of KNN base classifiers and selected feature subsets for base classifiers play an important role in determining the performance of the KNN ensemble model. However, few studies have focused on optimizing the k parameter and feature subsets of base classifiers in the ensemble. This study proposed a new ensemble method that improves upon the performance of the KNN ensemble model by optimizing both the k parameters and the feature subsets of base classifiers. A genetic algorithm was used to optimize the KNN ensemble model and improve the prediction accuracy of the ensemble model. The proposed model was applied to a bankruptcy prediction problem by using a real dataset from Korean companies. The research data included 1800 externally non-audited firms that filed for bankruptcy (900 cases) or non-bankruptcy (900 cases). Initially, the dataset consisted of 134 financial ratios. Prior to the experiments, 75 financial ratios were selected based on an independent sample t-test of each financial ratio as an input variable and bankruptcy or non-bankruptcy as an output variable. Of these, 24 financial ratios were selected by using a logistic regression backward feature selection method. The complete dataset was separated into two parts: training and validation. The training dataset was further divided into two portions: one for training the model and the other to avoid overfitting. The prediction accuracy against this dataset was used to determine the fitness value in order to avoid overfitting. The validation dataset was used to evaluate the effectiveness of the final model. A 10-fold cross-validation was implemented to compare the performances of the proposed model and other models. To evaluate the effectiveness of the proposed model, the classification accuracy of the proposed model was compared with that of other models. The Q-statistic values and average classification accuracies of base classifiers were investigated.
The experimental results showed that the proposed model outperformed other models, such as the single model and random subspace ensemble model.
https://doi.org/10.13088/jiis.2016.22.1.139
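The random subspace KNN ensemble described in the abstract above can be sketched in plain Python. This is a minimal illustration, not the authors' implementation: each member here draws its k value and feature subset at random, whereas the study optimizes both with a genetic algorithm (that search is not shown). The function names, the squared Euclidean distance, and the majority-vote aggregation are assumptions.

```python
import random
from collections import Counter


def knn_predict(train_X, train_y, x, k, features):
    """Majority vote among the k nearest training points, with squared
    Euclidean distance computed on the given feature subset only."""
    dists = sorted(
        (sum((row[f] - x[f]) ** 2 for f in features), label)
        for row, label in zip(train_X, train_y)
    )
    votes = Counter(label for _, label in dists[:k])
    return votes.most_common(1)[0][0]


def build_ensemble(n_features, n_members, subset_size, k_choices, seed=0):
    """Give each member a (k, feature subset) pair. Both are drawn at random
    here; a genetic algorithm would search over these pairs instead."""
    rng = random.Random(seed)
    return [
        (rng.choice(k_choices), rng.sample(range(n_features), subset_size))
        for _ in range(n_members)
    ]


def ensemble_predict(members, train_X, train_y, x):
    """Aggregate the members' predictions by majority vote (an assumption)."""
    votes = Counter(
        knn_predict(train_X, train_y, x, k, feats) for k, feats in members
    )
    return votes.most_common(1)[0][0]
```

In a genetic-algorithm variant, each chromosome could encode the list returned by `build_ensemble`, with fitness measured as accuracy on a held-out portion of the training data, in line with the overfitting safeguard the abstract describes.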

Improved Statistical Grey-Level Models for PCB Inspection (PCB 검사를 위한 개선된 통계적 그레이레벨 모델)

Bok, Jin Seop;Cho, Tai-Hoon
- Journal of the Semiconductor & Display Technology / v.12 no.1 / pp.1-7 / 2013
Grey-level statistical models have been widely used in many applications for object location and identification. However, conventional models suffer from problems in model refinement when training images are not properly aligned, and have difficulty with real-time recognition of arbitrarily rotated models. This paper presents improved grey-level statistical models that align training images using image or feature matching to overcome the model refinement problems of conventional models, and that enable real-time recognition of arbitrarily rotated objects using efficient hierarchical search methods. Edges or features extracted from a mean training image are used for accurate alignment of models in the search image. At the aligned position and orientation, a fitness measure based on grey-level statistical models is computed for object recognition. Various experiments in PCB inspection demonstrate that the proposed methods are superior to conventional methods in recognition accuracy and speed.

The Geometric Averaging Technique for Long Bone (긴뼈의 형상 평균화 기법)

Kwak Dai-Soon;Lee U-Young;Han Seung-Ho;Choi Kwang-Nam;Kim Tae-Joong
- Proceedings of the Korean Society of Precision Engineering Conference / 2006.05a / pp.177-178 / 2006
Many authors have proposed feature-preserving averaging techniques based on positioning and scaling with landmarks, which represent the geometric characteristics of three-dimensional surface models. Such techniques require a manual procedure: choosing and marking the landmarks on each bone surface before the averaging process. In this study, we developed another averaging technique that does not require such a manual procedure, and built average models from three-dimensional surface data reconstructed from computerized tomography images of the Digital Korean Project. The bone models were placed in an orthogonal coordinate system. These models were transformed so that their centers of mass coincided and their principal axes were aligned. Then, the bone models were scaled according to the average length of the sample bone models along each axis (x, y, z). After establishing a hexahedral voxel space that contains all the sample bone models, we counted the number of overlaps for each voxel. We generated the three-dimensional average surface by displaying the voxels whose overlap count exceeds a boundary number. The boundary number was chosen so that the volume of the averaged bone equals the average volume of the sample bones. Using this technique, we can make a feature-preserving average volume of bones.
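The voxel-overlap step of this abstract can be sketched as follows. This is a rough illustration under stated assumptions: each sample bone is given as a set of integer voxel coordinates that has already been aligned and scaled, the threshold test is "count at least t" rather than a strict inequality, and because an exact volume match is rarely possible with integer counts, the sketch picks the threshold whose volume is closest to the mean sample volume. The name `average_shape` is hypothetical.

```python
from collections import Counter


def average_shape(samples):
    """Return (threshold, voxels) for the averaged shape.

    samples: list of sets of (x, y, z) voxel coordinates, assumed to be
    already aligned (mass center, principal axes) and scaled.
    """
    # Number of sample bones that occupy each voxel.
    counts = Counter(voxel for sample in samples for voxel in sample)
    mean_volume = sum(len(sample) for sample in samples) / len(samples)

    best_t, best_shape = None, None
    for t in range(1, len(samples) + 1):
        shape = {voxel for voxel, c in counts.items() if c >= t}
        # Keep the threshold whose volume is closest to the mean volume.
        if best_shape is None or (
            abs(len(shape) - mean_volume) < abs(len(best_shape) - mean_volume)
        ):
            best_t, best_shape = t, shape
    return best_t, best_shape
```

A surface could then be extracted from the returned voxel set with any standard isosurface method; that step is outside this sketch.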

