• Title/Summary/Keyword: Model Interpretability

Search Result 47, Processing Time 0.019 seconds

Explainable Photovoltaic Power Forecasting Scheme Using BiLSTM (BiLSTM 기반의 설명 가능한 태양광 발전량 예측 기법)

  • Park, Sungwoo;Jung, Seungmin;Moon, Jaeuk;Hwang, Eenjun
    • KIPS Transactions on Software and Data Engineering
    • /
    • v.11 no.8
    • /
    • pp.339-346
    • /
    • 2022
  • Recently, the resource depletion and climate change problem caused by the massive usage of fossil fuels for electric power generation has become a critical issue worldwide. According to this issue, interest in renewable energy resources that can replace fossil fuels is increasing. Especially, photovoltaic power has gaining much attention because there is no risk of resource exhaustion compared to other energy resources and there are low restrictions on installation of photovoltaic system. In order to use the power generated by the photovoltaic system efficiently, a more accurate photovoltaic power forecasting model is required. So far, even though many machine learning and deep learning-based photovoltaic power forecasting models have been proposed, they showed limited success in terms of interpretability. Deep learning-based forecasting models have the disadvantage of being difficult to explain how the forecasting results are derived. To solve this problem, many studies are being conducted on explainable artificial intelligence technique. The reliability of the model can be secured if it is possible to interpret how the model derives the results. Also, the model can be improved to increase the forecasting accuracy based on the analysis results. Therefore, in this paper, we propose an explainable photovoltaic power forecasting scheme based on BiLSTM (Bidirectional Long Short-Term Memory) and SHAP (SHapley Additive exPlanations).

The Prediction of the Helpfulness of Online Review Based on Review Content Using an Explainable Graph Neural Network (설명가능한 그래프 신경망을 활용한 리뷰 콘텐츠 기반의 유용성 예측모형)

  • Eunmi Kim;Yao Ziyan;Taeho Hong
    • Journal of Intelligence and Information Systems
    • /
    • v.29 no.4
    • /
    • pp.309-323
    • /
    • 2023
  • As the role of online reviews has become increasingly crucial, numerous studies have been conducted to utilize helpful reviews. Helpful reviews, perceived by customers, have been verified in various research studies to be influenced by factors such as ratings, review length, review content, and so on. The determination of a review's helpfulness is generally based on the number of 'helpful' votes from consumers, with more 'helpful' votes considered to have a more significant impact on consumers' purchasing decisions. However, recently written reviews that have not been exposed to many customers may have relatively few 'helpful' votes and may lack 'helpful' votes altogether due to a lack of participation. Therefore, rather than relying on the number of 'helpful' votes to assess the helpfulness of reviews, we aim to classify them based on review content. In addition, the text of the review emerges as the most influential factor in review helpfulness. This study employs text mining techniques, including topic modeling and sentiment analysis, to analyze the diverse impacts of content and emotions embedded in the review text. In this study, we propose a review helpfulness prediction model based on review content, utilizing movie reviews from IMDb, a global movie information site. We construct a review helpfulness prediction model by using an explainable Graph Neural Network (GNN), while addressing the interpretability limitations of the machine learning model. The explainable graph neural network is expected to provide more reliable information about helpful or non-helpful reviews as it can identify connections between reviews.

Automatic scoring of mathematics descriptive assessment using random forest algorithm (랜덤 포레스트 알고리즘을 활용한 수학 서술형 자동 채점)

  • Inyong Choi;Hwa Kyung Kim;In Woo Chung;Min Ho Song
    • The Mathematical Education
    • /
    • v.63 no.2
    • /
    • pp.165-186
    • /
    • 2024
  • Despite the growing attention on artificial intelligence-based automated scoring technology as a support method for the introduction of descriptive items in school environments and large-scale assessments, there is a noticeable lack of foundational research in mathematics compared to other subjects. This study developed an automated scoring model for two descriptive items in first-year middle school mathematics using the Random Forest algorithm, evaluated its performance, and explored ways to enhance this performance. The accuracy of the final models for the two items was found to be between 0.95 to 1.00 and 0.73 to 0.89, respectively, which is relatively high compared to automated scoring models in other subjects. We discovered that the strategic selection of the number of evaluation categories, taking into account the amount of data, is crucial for the effective development and performance of automated scoring models. Additionally, text preprocessing by mathematics education experts proved effective in improving both the performance and interpretability of the automated scoring model. Selecting a vectorization method that matches the characteristics of the items and data was identified as one way to enhance model performance. Furthermore, we confirmed that oversampling is a useful method to supplement performance in situations where practical limitations hinder balanced data collection. To enhance educational utility, further research is needed on how to utilize feature importance derived from the Random Forest-based automated scoring model to generate useful information for teaching and learning, such as feedback. This study is significant as foundational research in the field of mathematics descriptive automatic scoring, and there is a need for various subsequent studies through close collaboration between AI experts and math education experts.

A Method of Machine Learning-based Defective Health Functional Food Detection System for Efficient Inspection of Imported Food (효율적 수입식품 검사를 위한 머신러닝 기반 부적합 건강기능식품 탐지 방법)

  • Lee, Kyoungsu;Bak, Yerin;Shin, Yoonjong;Sohn, Kwonsang;Kwon, Ohbyung
    • Journal of Intelligence and Information Systems
    • /
    • v.28 no.3
    • /
    • pp.139-159
    • /
    • 2022
  • As interest in health functional foods has increased since COVID-19, the importance of imported food safety inspections is growing. However, in contrast to the annual increase in imports of health functional foods, the budget and manpower required for inspections for import and export are reaching their limit. Hence, the purpose of this study is to propose a machine learning model that efficiently detects unsuitable food suitable for the characteristics of data possessed by government offices on imported food. First, the components of food import/export inspections data that affect the judgment of nonconformity were examined and derived variables were newly created. Second, in order to select features for the machine learning, class imbalance and nonlinearity were considered when performing exploratory analysis on imported food-related data. Third, we try to compare the performance and interpretability of each model by applying various machine learning techniques. In particular, the ensemble model was the best, and it was confirmed that the derived variables and models proposed in this study can be helpful to the system used in import/export inspections.

The Validity and Reliability of Communication Skills Attitude Scale (CSAS) for Nursing Students (간호대학생의 의사소통 태도 측정도구 타당도 및 신뢰도 검증)

  • Song, Mi-Ok;Yun, So-Young
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.18 no.3
    • /
    • pp.345-355
    • /
    • 2017
  • The purpose of this study was to examine the validity and reliability of the Communication Skills Attitude Scale, which is used to examine communication learning attitudes for domestic nursing students. Study subjects were 401 nursing students at two nursing college who completed the CSAS scale consisting of 26 items from June 1 to 15. Data were analyzed using exploratory factor analysis, confirmatory factor analysis, internal consistency with IBM Statistics SPSS 21.0, and the IBM Statistics AMOS 21.0 program. To verify the construction factor of the scale, exploratory factor analysis with varimax rotation was performed, resulting in four factors but confirmed positive and negative attitudes two factors with 19 items considering the construct of theory and interpretability. The internal structure of the scale was schematized using confirmatory factor analysis, and goodness of fit of the final research model was very appropriate as shown by ${\chi}^2=446.475$ (df=148, p<0.001), TLI=.90, CFI=.91, RMSEA=.07, SRMR=.05. The final scale consisted of 19 items and two factors based on the confirmatory factor analysis. Cronbach's ${\alpha}$ for final scale was .90, showing internal consistency. The CSAS is expected to be useful to monitor the effectiveness of multiple teaching strategies about communication for domestic nursing students.

A Study on the Educational Meaning of eXplainable Artificial Intelligence for Elementary Artificial Intelligence Education (초등 인공지능 교육을 위한 설명 가능한 인공지능의 교육적 의미 연구)

  • Park, Dabin;Shin, Seungki
    • Journal of The Korean Association of Information Education
    • /
    • v.25 no.5
    • /
    • pp.803-812
    • /
    • 2021
  • This study explored the concept of artificial intelligence and the problem-solving process that can be explained through literature research. Through this study, the educational meaning and application plan of artificial intelligence that can be explained were presented. XAI education is a human-centered artificial intelligence education that deals with human-related artificial intelligence problems, and students can cultivate problem-solving skills. In addition, through algorithmic education, it is possible to understand the principles of artificial intelligence, explain artificial intelligence models related to real-life problem situations, and expand to the field of application of artificial intelligence. In order for such XAI education to be applied in elementary schools, examples related to real world must be used, and it is recommended to utilize those that the algorithm itself has interpretability. In addition, various teaching and learning methods and tools should be used for understanding to move toward explanation. Ahead of the introduction of artificial intelligence in the revised curriculum in 2022, we hope that this study will be meaningfully used as the basis for actual classes.

The Relationships between Dry Matter Yield and Days of Summer Depression in different Regions with Mixed Pasture (혼파초지에서 지역별 건물수량과 하고일수 간 관계)

  • Oh, Seung Min;Kim, Moonju;Peng, Jinglun;Lee, Bae Hun;Kim, Ji Yung;Chemere, Befekadu;Kim, Si Chul;Kim, Kyeong Dae;Kim, Byong Wan;Jo, Mu Hwan;Sung, Kyung Il
    • Journal of The Korean Society of Grassland and Forage Science
    • /
    • v.38 no.1
    • /
    • pp.53-60
    • /
    • 2018
  • Yield prediction model for mixed pasture was developed with a shortage that the relationship between dry matter yield (DMY) and days of summer depression (DSD) was not properly reflected in the model in the previous research. Therefore, this study was designed to eliminate the data of the regions with distinctly different climatic conditions and then investigate their relationships DMY and DSD using the data in each region separately of regions with distinct climatic characteristics and classify the data based on regions for further analysis based on the previous mixed pasture prediction model. The data set used in the research kept 582 data points from 11 regions and 41 mixed pasture types. The relationship between DMY and DSD in each region were analyzed through scatter plot, correlation analysis and multiple regression analysis in each region separately. In the statistical analysis, DMY was taken as the response variable and 5 climatic variables including DSD were taken as explanatory variables. The results of scatter plot showed that negative correlations between DMY and DSD were observed in 7 out of 9 regions. Therefore, it was confirmed that analyzing the relationship between DMY and DSD based on each region is necessary and 5 regions were selected (Hwaseong, Suwon, Daejeon, Siheung and Gwangju) since the data size in these regions is large enough to perform the further statistical analysis based on large sample approximation theory. Correlation analysis showed that negative correlations were found between DMY and DSD in 3 (Hwaseong, Suwon and Siheung) out of the 5 regions, meanwhile the negative relationship in Hwaseong was confirmed through multiple regression analysis. Therefore, it was concluded that the interpretability of the yield prediction model for mixed pasture could be improved based on constructing the models using the data from each region separately instead of using the pooled data from different regions.