• Title/Summary/Keyword: Automatic model selection

Search Result 103, Processing Time 0.028 seconds

Compositional Feature Selection and Its Effects on Bandgap Prediction by Machine Learning (기계학습을 이용한 밴드갭 예측과 소재의 조성기반 특성인자의 효과)

  • Chunghee Nam
    • Korean Journal of Materials Research
    • /
    • v.33 no.4
    • /
    • pp.164-174
    • /
    • 2023
  • The bandgap characteristics of semiconductor materials are an important factor when utilizing semiconductor materials for various applications. In this study, based on data provided by AFLOW (Automatic-FLOW for Materials Discovery), the bandgap of a semiconductor material was predicted using only the material's compositional features. The compositional features were generated using the python module of 'Pymatgen' and 'Matminer'. Pearson's correlation coefficients (PCC) between the compositional features were calculated and those with a correlation coefficient value larger than 0.95 were removed in order to avoid overfitting. The bandgap prediction performance was compared using the metrics of R2 score and root-mean-squared error. By predicting the bandgap with randomforest and xgboost as representatives of the ensemble algorithm, it was found that xgboost gave better results after cross-validation and hyper-parameter tuning. To investigate the effect of compositional feature selection on the bandgap prediction of the machine learning model, the prediction performance was studied according to the number of features based on feature importance methods. It was found that there were no significant changes in prediction performance beyond the appropriate feature. Furthermore, artificial neural networks were employed to compare the prediction performance by adjusting the number of features guided by the PCC values, resulting in the best R2 score of 0.811. By comparing and analyzing the bandgap distribution and prediction performance according to the material group containing specific elements (F, N, Yb, Eu, Zn, B, Si, Ge, Fe Al), various information for material design was obtained.

Grid Strut-Tie Model Approach for Structural Concrete Design (콘크리트 구조부재의 설계를 위한 격자 스트럿-타이 모델 방법)

  • Yun, Young Mook;Kim, Byung Hun
    • KSCE Journal of Civil and Environmental Engineering Research
    • /
    • v.26 no.4A
    • /
    • pp.621-637
    • /
    • 2006
  • Although the approaches implementing strut-tie models are the valuable tools for designing discontinuity regions of structural concrete, the approaches of the current design codes have to be improved for the design of structural concrete subjected to complex loading and geometrical conditions because of the uncertainties in the selection of strut-tie model, in the use of an indeterminate strut-tie model, and in the effective strengths of struts and nodal zones. To improve the uncertainties, a grid struttie model approach is proposed in this study. The proposed approach, allowing to perform a consistent and effective design of structural concrete, employs an initial grid strut-tie model in which various load combinations can be considered. In addition, the approach performs an automatic selection of an optimal strut-tie model by evaluating the capacities of struts and ties using a simple optimization algorithm. The validity and effectiveness of the proposed approach is verified by conducting the analysis of the four reinforced concrete deep beams tested to failure and the design of shearwalls with two openings.

시뮬레이션 코드 자동 생성을 위한 생산공정 모델링

  • 김대송;조현보;정무영
    • Proceedings of the Korean Operations and Management Science Society Conference
    • /
    • 1996.04a
    • /
    • pp.627-630
    • /
    • 1996
  • One of the most common communication mechanisms to describe a situation or a process is a story written as an ordered sequence of events or activities. For example, a shop floor supervisor may present the operations of his manufacturing system by describing the processes of manunfacturing a product in his shop. Although IDEF3 is one of the most commonly used methods for describing a business process, but it is not common for a manufacturing process. In this study, we tried to apply IDEF3 for describing a manufacturing process. Problems and suggestions such as selection probability, programmable process modeling, manufacturing resource model were presented.

  • PDF

Credit Scoring Using Splines (스플라인을 이용한 신용 평점화)

  • Koo Ja-Yong;Choi Daewoo;Choi Min-Sung
    • The Korean Journal of Applied Statistics
    • /
    • v.18 no.3
    • /
    • pp.543-553
    • /
    • 2005
  • Linear logistic regression is one of the most widely used method for credit scoring in credit risk management. This paper deals with credit scoring using splines based on Logistic regression. Linear splines and an automatic basis selection algorithm are adopted. The final model is an example of the generalized additive model. A simulation using a real data set is used to illustrate the performance of the spline method.

An Image Contrast Enhancement Technique Using the Improved Integrated Adaptive Fuzzy Clustering Model (개선된 IAFC 모델을 이용한 영상 대비 향상 기법)

  • 이금분;김용수
    • Journal of the Korean Institute of Intelligent Systems
    • /
    • v.11 no.9
    • /
    • pp.777-781
    • /
    • 2001
  • This paper presents an image contrast enhancement technique for improving the low contrast images using the improved IAFC(Integrated Adaptive Fuzzy Clustering) model. The low pictorial information of a low contrast image is due to the vagueness or fuzziness of the multivalued levels of brightness rather than randomness. Fuzzy image processing has three main stages, namely, image fuzzification, modification of membership values, and image defuzzification. Using a new model of automatic crossover point selection, optimal crossover point is selected automatically. The problem of crossover point selection can be considered as the two-category classification problem. The improved IAFC model is used to classify the image into two classes. The proposed method is applied to several experimental images with 256 gray levels and the results are compared with those of the histogram equalization technique. We utilized the index of fuzziness as a measure of image quality. The results show that the proposed method is better than the histogram equalization technique.

  • PDF

A Study on Optimal Site Selection for Automatic Mountain Meteorology Observation System (AMOS): the Case of Honam and Jeju Areas (최적의 산악기상관측망 적정위치 선정 연구 - 호남·제주 권역을 대상으로)

  • Yoon, Sukhee;Won, Myoungsoo;Jang, Keunchang
    • Korean Journal of Agricultural and Forest Meteorology
    • /
    • v.18 no.4
    • /
    • pp.208-220
    • /
    • 2016
  • Automatic Mountain Meteorology Observation System (AMOS) is an important ingredient for several climatological and forest disaster prediction studies. In this study, we select the optimal sites for AMOS in the mountain areas of Honam and Jeju in order to prevent forest disasters such as forest fires and landslides. So, this study used spatial dataset such as national forest map, forest roads, hiking trails and 30m DEM(Digital Elevation Model) as well as forest risk map(forest fire and landslide), national AWS information to extract optimal site selection of AMOS. Technical methods for optimal site selection of the AMOS was the firstly used multifractal model, IDW interpolation, spatial redundancy for 2.5km AWS buffering analysis, and 200m buffering analysis by using ArcGIS. Secondly, optimal sites selected by spatial analysis were estimated site accessibility, observatory environment of solar power and wireless communication through field survey. The threshold score for the final selection of the sites have to be higher than 70 points in the field assessment. In the result, a total of 159 polygons in national forest map were extracted by the spatial analysis and a total of 64 secondary candidate sites were selected for the ridge and the top of the area using Google Earth. Finally, a total of 26 optimal sites were selected by quantitative assessment based on field survey. Our selection criteria will serve for the establishment of the AMOS network for the best observations of weather conditions in the national forests. The effective observation network may enhance the mountain weather observations, which leads to accurate prediction of forest disasters.

Research on damage detection and assessment of civil engineering structures based on DeepLabV3+ deep learning model

  • Chengyan Song
    • Structural Engineering and Mechanics
    • /
    • v.91 no.5
    • /
    • pp.443-457
    • /
    • 2024
  • At present, the traditional concrete surface inspection methods based on artificial vision have the problems of high cost and insecurity, while the computer vision methods rely on artificial selection features in the case of sensitive environmental changes and difficult promotion. In order to solve these problems, this paper introduces deep learning technology in the field of computer vision to achieve automatic feature extraction of structural damage, with excellent detection speed and strong generalization ability. The main contents of this study are as follows: (1) A method based on DeepLabV3+ convolutional neural network model is proposed for surface detection of post-earthquake structural damage, including surface damage such as concrete cracks, spaling and exposed steel bars. The key semantic information is extracted by different backbone networks, and the data sets containing various surface damage are trained, tested and evaluated. The intersection ratios of 54.4%, 44.2%, and 89.9% in the test set demonstrate the network's capability to accurately identify different types of structural surface damages in pixel-level segmentation, highlighting its effectiveness in varied testing scenarios. (2) A semantic segmentation model based on DeepLabV3+ convolutional neural network is proposed for the detection and evaluation of post-earthquake structural components. Using a dataset that includes building structural components and their damage degrees for training, testing, and evaluation, semantic segmentation detection accuracies were recorded at 98.5% and 56.9%. To provide a comprehensive assessment that considers both false positives and false negatives, the Mean Intersection over Union (Mean IoU) was employed as the primary evaluation metric. This choice ensures that the network's performance in detecting and evaluating pixel-level damage in post-earthquake structural components is evaluated uniformly across all experiments. By incorporating deep learning technology, this study not only offers an innovative solution for accurately identifying post-earthquake damage in civil engineering structures but also contributes significantly to empirical research in automated detection and evaluation within the field of structural health monitoring.

Development of Selection Model of Subway Station Influence Area (SIA) in Seoul City using Chi-square Automatic Interaction Detection (CHAID) (CHAID분석을 이용한 서울시 지하철 역세권 지가 영향모형 개발)

  • Choi, Yu-Ran;Kim, Tae-Ho;Park, Jung-Soo
    • Journal of the Korean Society for Railway
    • /
    • v.11 no.5
    • /
    • pp.504-512
    • /
    • 2008
  • In general, based on criteria of subway law, radius 500m from subway station is defined as SIA (Subway Station Influence Area). Therefore, in this paper, selection models of SIA are developed to identify appropriate SIA for specific legions in Seoul metropolitan city based on CHAID analysis. As a result, following outputs are obtained; (1) walking distance from subway station is the most influential factor to define SIA (2) SIAs vary with regions (i. e. Gangnam area: 767m, Gangbuk area: 452m), and (3) walking distance from subway station is influential to land price of SIA. In addition, in Gangnam, the structure of land price of the closest section has a polynomial trend curve rather than linear compared in comparison with other sections. Therefore, it is desirable for current definition of SIA (radius 500m from subway station) to be redefined to reflect characteristics of land use and walking distance according to each region respectively.

Automatic Searching Algorithm for Galactic HI at Forbidden Velocities in the Inner-Galaxy ALFA Low-Latitude HI (I-GALFA) Survey

  • Kang, Ji-Hyun;Koo, Bon-Chul;Gibson, S.J.;Douglas, K.A.;Park, Geum-Sook;Peek, J.E.G.;Korpela, E.J.;Heiles, C.E.
    • The Bulletin of The Korean Astronomical Society
    • /
    • v.36 no.1
    • /
    • pp.86.2-86.2
    • /
    • 2011
  • The faint wing-like features at velocities beyond the velocity boundaries of the Galactic rotation (Forbidden-Velocity Wings, FVWs) in the large-scale position-velocity diagrams of the HI surveys are thought to be associated with dynamical Galactic events. The primary candidates of these FVWs are rapidly expanding HI shells of the old Galactic supernova remnants (SNRs), which are too faint to be visible in other frequencies. The unprecedented sensitivity and resolution of the I-GALFA survey enable detection of "all" HI shells of Galactic SNRs at forbidden velocities predicted by Koo and Kang (2004). Therefore, comparing the distribution of the FVWs visible in the I-GALFA survey and that of the model will improve our understanding on the interstellar medium and the evolution of SNRs. We have been developing an automatic searching algorithm for FVWs in the I-GALFA survey to minimize the selection effects of visual inspection used in the previous FVW study. We present the searching mechanism for FVWs and the statistical properties of the automatically searched FVWs. Also, we discuss the similarities and the differences between the distribution of the identified FVWs and that of the SNRs predicted by Koo and Kang (2004).

  • PDF

Development of Selection Model of Interchange Influence Area in Seoul Belt Expressway Using Chi-square Automatic Interaction Detection (CHAID) (CHAID분석을 이용한 나들목 주변 지가의 공간분포 영향모형 개발 - 서울외곽순환고속도로를 중심으로 -)

  • Kim, Tae Ho;Park, Je Jin;Kim, Young Il;Rho, Jeong Hyun
    • KSCE Journal of Civil and Environmental Engineering Research
    • /
    • v.29 no.6D
    • /
    • pp.711-717
    • /
    • 2009
  • This study develops model for analysis of relationship between major node (Interchange in expressway) and land price formation of apartments along with Seoul Belt Expressway by using CHAID analysis. The results show that first, regions(outer side: Gyeongido, inner side: Seoul) on the line of Seoul Belt Expressway are different and a graph generally show llinear relationships between land price and traffic node but it does not; second, CHAID analysis shows two different spatial distribution at the point of 2.6km in the outer side, but three different spatial distribution at the point of 1.4km and 3.8km in the inner side. In other words, traffic access does not necessarily guarantee high housing price since the graphs shows land price related to composite spatial distribution. This implies that residential environments (highway noise and regional discontinuity) and traffic accessibility cause mutual interaction to generate this phenomenon. Therefore, the highway IC landprice model will be beneficial for calculation of land price in New Town which constantly is being built along the highway.