• 제목/요약/키워드: Optimal Variable Selection

검색결과 95건 처리시간 0.023초

Development of an Artificial Neural Network Model for a Predictive Control of Cooling Systems (건물 냉방시스템의 예측제어를 위한 인공신경망 모델 개발)

  • Kang, In-Sung;Yang, Young-Kwon;Lee, Hyo-Eun;Park, Jin-Chul;Moon, Jin-Woo
    • KIEAE Journal
    • /
    • 제17권5호
    • /
    • pp.69-76
    • /
    • 2017
  • Purpose: This study aimed at developing an Artificial Neural Network (ANN) model for predicting the amount of cooling energy consumption of the variable refrigerant flow (VRF) cooling system by the different set-points of the control variables, such as supply air temperature of air handling unit (AHU), condenser fluid temperature, condenser fluid pressure, and refrigerant evaporation temperature. Applying the predicted results for the different set-points, the control algorithm, which embedded the ANN model, will determine the most energy efficient control strategy. Method: The ANN model was developed and tested its prediction accuracy by using matrix laboratory (MATLAB) and its neural network toolbox. The field data sets were collected for the model training and performance evaluation. For completing the prediction model, three major steps were conducted - i) initial model development including input variable selection, ii) model optimization, and iii) performance evaluation. Result: Eight meaningful input variables were selected in the initial model development such as outdoor temperature, outdoor humidity, indoor temperature, cooling load of the previous cycle, supply air temperature of AHU, condenser fluid temperature, condenser fluid pressure, and refrigerant evaporation temperature. The initial model was optimized to have 2 hidden layers with 15 hidden neurons each, 0.3 learning rate, and 0.3 momentum. The optimized model proved its prediction accuracy with stable prediction results.

Evaluation of benzene residue in edible oils using Fourier transform infrared (FTIR) spectroscopy

  • Joshi, Ritu;Cho, Byoung-Kwan;Lohumi, Santosh;Joshi, Rahul;Lee, Jayoung;Lee, Hoonsoo;Mo, Changyeun
    • Korean Journal of Agricultural Science
    • /
    • 제46권2호
    • /
    • pp.257-271
    • /
    • 2019
  • The use of food grade hexane (FGH) for edible oil extraction is responsible for the presence of benzene in the crude oil. Benzene is a Group 1 carcinogen and could pose a serious threat to the health of consumer. However, its detection still depends on classical methods using chromatography which requires a rapid non-destructive detection method. Hence, the aim of this study was to investigate the feasibility of using Fourier transform infrared (FTIR) spectroscopy combined with multivariate analysis to detect and quantify the benzene residue in edible oil (sesame and cottonseed oil). Oil samples were adulterated with varying quantities of benzene, and their FTIR spectra were acquired with an attenuated total reflectance (ATR) method. Optimal variables for a partial least-squares regression (PLSR) model were selected using the variable importance in projection (VIP) and the selectivity ratio (SR) methods. The developed PLS models with whole variables and the VIP- and SR-selected variables were validated against an independent data set which resulted in $R^2$ values of 0.95, 0.96, and 0.95 and standard error of prediction (SEP) values of 38.5, 33.7, and 41.7 mg/L, respectively. The proposed technique of FTIR combined with multivariate analysis and variable selection methods can detect benzene residuals in edible oils with the advantages of being fast and simple and thus, can replace the conventional methods used for the same purpose.

Minimum Path Planning for Mobile Robot using Distribution Density (분포 밀도를 이용한 이동 로봇의 최단 경로 설정)

  • Kwak Jae-Hyuk;Lim Joon-Hong
    • Journal of the Institute of Electronics Engineers of Korea SC
    • /
    • 제43권3호
    • /
    • pp.31-40
    • /
    • 2006
  • Many researches on path planning and obstacle avoidance for the fundamentals of mobile robot have been done. Informations from various sensors can find obstacles and make path. In spite of many solutions of finding optimal path, each can be applied to only a constrained condition. This means that it is difficult to find a universal algorithm. A optimal path with a complicated computation generates a time delay which cannot avoid moving obstacles. In this paper, we propose the algorithm of path planning and obstacle avoidance for mobile robot. We call the proposed method Random Access Sequence(RAS) method. In the proposed method, a small region is set first and numbers are assigned to its neighbors, then the path is selected using these numbers. It has an advantage of fast planning and simple operation. This means that new path selection may be possible within short time and that helps a robot to avoid obstacle in any direction. When a robot meets moving obstacles, it avoids obstacles in a random direction. RAS method using obstacle information from variable sensors is useful to get minimum path length to goal.

A Study on Optimal Location Selection and Analytic Method of Landmark Element in terms of Visual Perception (시각적 측면에서 랜드마크 요소의 최적입지선정 분석방법에 관한 연구)

  • Kim, Suk-Tae
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • 제16권9호
    • /
    • pp.6360-6367
    • /
    • 2015
  • The location selection of the element that should guarantee easy visual perception, like the landmark, is the a topic that appears much in the design process. Recently, a graph analysis technique using computers has been applied in order to evaluate the visibility of the visual element, but the analytic frame is flat and the setting of the visual pont and the matrix are fixed so there were great limitations in obtaining the results of the practical analysis. Thus, this study presented Nondirectional Multi-Dimensional Calculation (MDVC-N), an analytic methodology available for the analysis of the dynamic visual point in the 3D environment. It thus attempted to establish the analytic application using the 3D computer graphics technology and designed a script structure to set the visual point and the matrix. In addition to that, this study tried to verify the analytic methodology by applying the complex land as an example model, where buildings in various heights of terrains with a high-differences are located, verifying the same analytic methodology. It thus tried to identify the visual characteristics of each alternative location. The following results were gained from the study. 1) The visibility can be measured quantitatively trough the application of the 6-alternatives. 2) Using the 3dimensional graph, intuitive analysis was possible. 3) It attempted to improve the analytic applicability by calculating the results corrected as a variable behavior from the local integration variable of the space syntax.

Design of Multi-FPNN Model Using Clustering and Genetic Algorithms and Its Application to Nonlinear Process Systems (HCM 클러스처링과 유전자 알고리즘을 이용한 다중 FPNN 모델 설계와 비선형 공정으로의 응용)

  • 박호성;오성권;안태천
    • Journal of the Korean Institute of Intelligent Systems
    • /
    • 제10권4호
    • /
    • pp.343-350
    • /
    • 2000
  • In this paper, we propose the Multi-FPNN(Fuzzy Polynomial Neural Networks) model based on FNN and PNN(Polyomial Neural Networks) for optimal system identifacation. Here FNN structure is designed using fuzzy input space divided by each separated input variable, and urilized both in order to get better output performace. Each node of PNN structure based on GMDH(Group Method of Data handing) method uses two types of high-order polynomials such as linearane and quadratic, and the input of that node uses three kinds of multi-variable inputs such as linear and quadratic, and the input of that node and Genetic Algorithms(GAs) to identify both the structure and the prepocessing of parameters of a Multi-FPNN model. Here, HCM clustering method, which is carried out for data preproessing of process system, is utilized to determine the structure method, which is carried out for data preprocessing of process system, is utilized to determance index with a weighting factor is used to according to the divisions of input-output space. A aggregate performance inddex with a wegihting factor is used to achieve a sound balance between approximation and generalization abilities of the model. According to the selection and adjustment of a weighting factor of this aggregate abjective function which it is acailable and effective to design to design and optimal Multi-FPNN model. The study is illustrated with the aid of two representative numerical examples and the aggregate performance index related to the approximation and generalization abilities of the model is evaluated and discussed.

  • PDF

Development of Nondestructive Detection Method for Adulterated Powder Products Using Raman Spectroscopy and Partial Least Squares Regression (라만 분광법과 부분최소자승법을 이용한 불량 분말식품 비파괴검사 기술 개발)

  • Lee, Sangdae;Lohumi, Santosh;Cho, Byoung-Kwan;Kim, Moon S.;Lee, Soo-Hee
    • Journal of the Korean Society for Nondestructive Testing
    • /
    • 제34권4호
    • /
    • pp.283-289
    • /
    • 2014
  • This study was conducted to develop a non-destructive detection method for adulterated powder products using Raman spectroscopy and partial least squares regression(PLSR). Garlic and ginger powder, which are used as natural seasoning and in health supplement foods, were selected for this experiment. Samples were adulterated with corn starch in concentrations of 5-35%. PLSR models for adulterated garlic and ginger powders were developed and their performances evaluated using cross validation. The $R^2_c$ and SEC of an optimal PLSR model were 0.99 and 2.16 for the garlic powder samples, and 0.99 and 0.84 for the ginger samples, respectively. The variable importance in projection (VIP) score is a useful and simple tool for the evaluation of the importance of each variable in a PLSR model. After the VIP scores were taken pre-selection, the Raman spectrum data was reduced by one third. New PLSR models, based on a reduced number of wavelengths selected by the VIP scores technique, gave good predictions for the adulterated garlic and ginger powder samples.

Optimal Location Allocation of CCTV Using 3D Simulation (3차원 시뮬레이션을 활용한 CCTV 최적입지선정)

  • PARK, Jeong-Woo;LEE, Seong-Ho
    • Journal of the Korean Association of Geographic Information Studies
    • /
    • 제19권4호
    • /
    • pp.92-105
    • /
    • 2016
  • This study aims to establish a simulation method for CCTV (Closed Circuit Television) sight area. The simulation incorporates variables for computing CCTV sight area including CCTV specifications and installation. Currently CCTV is used for traffic, crime prevention and fire prevention by local governments. However, new locations are selected by administrator decision rather than analysis of the optimal location. In order to determine optimum location, a method to CCTV compute range is needed, which incorporates specifications according to CCTV purpose. For this purpose, limitations of previous research methods must be recognized and the simulation method must supplement these limitations. Here in this study, we derived CCTV sight area variables for realistic analysis to complement the limitations of previous studies. A total of eight elements were derived from image device sensors and installation: wide angle, height, angle, setting height, setting angle, and others. This research implemented a 3D simulation technique that can be applied to the derived factors and automate them using ArcObject and Visual C#. This simulation method can calculate sight range in accordance with CCTV specifications. Furthermore, when installing additional CCTVs, it can derive optimal allocation position. The results of this study will provide rational choices for specification selection and CCTV location by interagency collaborative projects.

Improving the Accuracy of Early Diagnosis of Thyroid Nodule Type Based on the SCAD Method

  • Shahraki, Hadi Raeisi;Pourahmad, Saeedeh;Paydar, Shahram;Azad, Mohsen
    • Asian Pacific Journal of Cancer Prevention
    • /
    • 제17권4호
    • /
    • pp.1861-1864
    • /
    • 2016
  • Although early diagnosis of thyroid nodule type is very important, the diagnostic accuracy of standard tests is a challenging issue. We here aimed to find an optimal combination of factors to improve diagnostic accuracy for distinguishing malignant from benign thyroid nodules before surgery. In a prospective study from 2008 to 2012, 345 patients referred for thyroidectomy were enrolled. The sample size was split into a training set and testing set as a ratio of 7:3. The former was used for estimation and variable selection and obtaining a linear combination of factors. We utilized smoothly clipped absolute deviation (SCAD) logistic regression to achieve the sparse optimal combination of factors. To evaluate the performance of the estimated model in the testing set, a receiver operating characteristic (ROC) curve was utilized. The mean age of the examined patients (66 male and 279 female) was $40.9{\pm}13.4years$ (range 15- 90 years). Some 54.8% of the patients (24.3% male and 75.7% female) had benign and 45.2% (14% male and 86% female) malignant thyroid nodules. In addition to maximum diameters of nodules and lobes, their volumes were considered as related factors for malignancy prediction (a total of 16 factors). However, the SCAD method estimated the coefficients of 8 factors to be zero and eliminated them from the model. Hence a sparse model which combined the effects of 8 factors to distinguish malignant from benign thyroid nodules was generated. An optimal cut off point of the ROC curve for our estimated model was obtained (p=0.44) and the area under the curve (AUC) was equal to 77% (95% CI: 68%-85%). Sensitivity, specificity, positive predictive value and negative predictive values for this model were 70%, 72%, 71% and 76%, respectively. An increase of 10 percent and a greater accuracy rate in early diagnosis of thyroid nodule type by statistical methods (SCAD and ANN methods) compared with the results of FNA testing revealed that the statistical modeling methods are helpful in disease diagnosis. In addition, the factor ranking offered by these methods is valuable in the clinical context.

Optimization of Multiclass Support Vector Machine using Genetic Algorithm: Application to the Prediction of Corporate Credit Rating (유전자 알고리즘을 이용한 다분류 SVM의 최적화: 기업신용등급 예측에의 응용)

  • Ahn, Hyunchul
    • Information Systems Review
    • /
    • 제16권3호
    • /
    • pp.161-177
    • /
    • 2014
  • Corporate credit rating assessment consists of complicated processes in which various factors describing a company are taken into consideration. Such assessment is known to be very expensive since domain experts should be employed to assess the ratings. As a result, the data-driven corporate credit rating prediction using statistical and artificial intelligence (AI) techniques has received considerable attention from researchers and practitioners. In particular, statistical methods such as multiple discriminant analysis (MDA) and multinomial logistic regression analysis (MLOGIT), and AI methods including case-based reasoning (CBR), artificial neural network (ANN), and multiclass support vector machine (MSVM) have been applied to corporate credit rating.2) Among them, MSVM has recently become popular because of its robustness and high prediction accuracy. In this study, we propose a novel optimized MSVM model, and appy it to corporate credit rating prediction in order to enhance the accuracy. Our model, named 'GAMSVM (Genetic Algorithm-optimized Multiclass Support Vector Machine),' is designed to simultaneously optimize the kernel parameters and the feature subset selection. Prior studies like Lorena and de Carvalho (2008), and Chatterjee (2013) show that proper kernel parameters may improve the performance of MSVMs. Also, the results from the studies such as Shieh and Yang (2008) and Chatterjee (2013) imply that appropriate feature selection may lead to higher prediction accuracy. Based on these prior studies, we propose to apply GAMSVM to corporate credit rating prediction. As a tool for optimizing the kernel parameters and the feature subset selection, we suggest genetic algorithm (GA). GA is known as an efficient and effective search method that attempts to simulate the biological evolution phenomenon. By applying genetic operations such as selection, crossover, and mutation, it is designed to gradually improve the search results. Especially, mutation operator prevents GA from falling into the local optima, thus we can find the globally optimal or near-optimal solution using it. GA has popularly been applied to search optimal parameters or feature subset selections of AI techniques including MSVM. With these reasons, we also adopt GA as an optimization tool. To empirically validate the usefulness of GAMSVM, we applied it to a real-world case of credit rating in Korea. Our application is in bond rating, which is the most frequently studied area of credit rating for specific debt issues or other financial obligations. The experimental dataset was collected from a large credit rating company in South Korea. It contained 39 financial ratios of 1,295 companies in the manufacturing industry, and their credit ratings. Using various statistical methods including the one-way ANOVA and the stepwise MDA, we selected 14 financial ratios as the candidate independent variables. The dependent variable, i.e. credit rating, was labeled as four classes: 1(A1); 2(A2); 3(A3); 4(B and C). 80 percent of total data for each class was used for training, and remaining 20 percent was used for validation. And, to overcome small sample size, we applied five-fold cross validation to our dataset. In order to examine the competitiveness of the proposed model, we also experimented several comparative models including MDA, MLOGIT, CBR, ANN and MSVM. In case of MSVM, we adopted One-Against-One (OAO) and DAGSVM (Directed Acyclic Graph SVM) approaches because they are known to be the most accurate approaches among various MSVM approaches. GAMSVM was implemented using LIBSVM-an open-source software, and Evolver 5.5-a commercial software enables GA. Other comparative models were experimented using various statistical and AI packages such as SPSS for Windows, Neuroshell, and Microsoft Excel VBA (Visual Basic for Applications). Experimental results showed that the proposed model-GAMSVM-outperformed all the competitive models. In addition, the model was found to use less independent variables, but to show higher accuracy. In our experiments, five variables such as X7 (total debt), X9 (sales per employee), X13 (years after founded), X15 (accumulated earning to total asset), and X39 (the index related to the cash flows from operating activity) were found to be the most important factors in predicting the corporate credit ratings. However, the values of the finally selected kernel parameters were found to be almost same among the data subsets. To examine whether the predictive performance of GAMSVM was significantly greater than those of other models, we used the McNemar test. As a result, we found that GAMSVM was better than MDA, MLOGIT, CBR, and ANN at the 1% significance level, and better than OAO and DAGSVM at the 5% significance level.

Efficient Resource Management Framework on Grid Service (그리드 서비스 환경에서 효율적인 자원 관리 프레임워크)

  • Song, Eun-Ha;Jeong, Young-Sik
    • Journal of KIISE:Computer Systems and Theory
    • /
    • 제35권5호
    • /
    • pp.187-198
    • /
    • 2008
  • This paper develops a framework for efficient resource management within the grid service environment. Resource management is the core element of the grid service; therefore, GridRMF(Grid Resource Management Framework) is modeled and developed in order to respond to such variable characteristics of resources as accordingly as possible. GridRMF uses the participation level of grid resource as a basis of its hierarchical management. This hierarchical management divides managing domains into two parts: VMS(Virtual Organization Management System) for virtual organization management and RMS(Resource Management System) for metadata management. VMS mediates resources according to optimal virtual organization selection mechanism, and responds to malfunctions of the virtual organization by LRM(Local Resource Manager) automatic recovery mechanism. RMS, on the other hand, responds to load balance and fault by applying resource status monitoring information into adaptive performance-based task allocation algorithm.