• 제목/요약/키워드: Statistical learning model

검색결과 541건 처리시간 0.026초

Utilization of deep learning-based metamodel for probabilistic seismic damage analysis of railway bridges considering the geometric variation

  • Xi Song;Chunhee Cho;Joonam Park
    • Earthquakes and Structures
    • /
    • 제25권6호
    • /
    • pp.469-479
    • /
    • 2023
  • A probabilistic seismic damage analysis is an essential procedure to identify seismically vulnerable structures, prioritize the seismic retrofit, and ultimately minimize the overall seismic risk. To assess the seismic risk of multiple structures within a region, a large number of nonlinear time-history structural analyses must be conducted and studied. As a result, each assessment requires high computing resources. To overcome this limitation, we explore a deep learning-based metamodel to enable the prediction of the mean and the standard deviation of the seismic damage distribution of track-on steel-plate girder railway bridges in Korea considering the geometric variation. For machine learning training, nonlinear dynamic time-history analyses are performed to generate 800 high-fidelity datasets on the seismic response. Through intensive trial and error, the study is concentrated on developing an optimal machine learning architecture with the pre-identified variables of the physical configuration of the bridge. Additionally, the prediction performance of the proposed method is compared with a previous, well-defined, response surface model. Finally, the statistical testing results indicate that the overall performance of the deep-learning model is improved compared to the response surface model, as its errors are reduced by as much as 61%. In conclusion, the model proposed in this study can be effectively deployed for the seismic fragility and risk assessment of a region with a large number of structures.

Machine Learning Approaches to Corn Yield Estimation Using Satellite Images and Climate Data: A Case of Iowa State

  • Kim, Nari;Lee, Yang-Won
    • 한국측량학회지
    • /
    • 제34권4호
    • /
    • pp.383-390
    • /
    • 2016
  • Remote sensing data has been widely used in the estimation of crop yields by employing statistical methods such as regression model. Machine learning, which is an efficient empirical method for classification and prediction, is another approach to crop yield estimation. This paper described the corn yield estimation in Iowa State using four machine learning approaches such as SVM (Support Vector Machine), RF (Random Forest), ERT (Extremely Randomized Trees) and DL (Deep Learning). Also, comparisons of the validation statistics among them were presented. To examine the seasonal sensitivities of the corn yields, three period groups were set up: (1) MJJAS (May to September), (2) JA (July and August) and (3) OC (optimal combination of month). In overall, the DL method showed the highest accuracies in terms of the correlation coefficient for the three period groups. The accuracies were relatively favorable in the OC group, which indicates the optimal combination of month can be significant in statistical modeling of crop yields. The differences between our predictions and USDA (United States Department of Agriculture) statistics were about 6-8 %, which shows the machine learning approaches can be a viable option for crop yield modeling. In particular, the DL showed more stable results by overcoming the overfitting problem of generic machine learning methods.

Optimized Chinese Pronunciation Prediction by Component-Based Statistical Machine Translation

  • Zhu, Shunle
    • Journal of Information Processing Systems
    • /
    • 제17권1호
    • /
    • pp.203-212
    • /
    • 2021
  • To eliminate ambiguities in the existing methods to simplify Chinese pronunciation learning, we propose a model that can predict the pronunciation of Chinese characters automatically. The proposed model relies on a statistical machine translation (SMT) framework. In particular, we consider the components of Chinese characters as the basic unit and consider the pronunciation prediction as a machine translation procedure (the component sequence as a source sentence, the pronunciation, pinyin, as a target sentence). In addition to traditional features such as the bidirectional word translation and the n-gram language model, we also implement a component similarity feature to overcome some typos during practical use. We incorporate these features into a log-linear model. The experimental results show that our approach significantly outperforms other baseline models.

기온 데이터를 반영한 전력수요 예측 딥러닝 모델 (Electric Power Demand Prediction Using Deep Learning Model with Temperature Data)

  • 윤협상;정석봉
    • 정보처리학회논문지:소프트웨어 및 데이터공학
    • /
    • 제11권7호
    • /
    • pp.307-314
    • /
    • 2022
  • 최근 전력수요를 예측하기 위해 통계기반 시계열 분석 기법을 대체하기 위해 딥러닝 기법을 활용한 연구가 활발히 진행되고 있다. 딥러닝 기반 전력수요 예측 연구 결과를 분석한 결과, LSTM 기반 예측 모델의 성능이 우수한 것으로 규명되었으나 장기간의 지역 범위 전력수요 예측에 대해 LSTM 기반 모델의 성능이 충분하지 않음을 확인할 수 있다. 본 연구에서는 기온 데이터를 반영하여 24시간 이전에 전력수요를 예측하는 WaveNet 기반 딥러닝 모델을 개발하여, 실제 사용하고 있는 통계적 시계열 예측 기법의 정확도(MAPE 값 2%)보다 우수한 예측 성능을 달성하는 모델을 개발하고자 한다. 먼저 WaveNet의 핵심 구조인 팽창인과 1차원 합성곱 신경망 구조를 소개하고, 전력수요와 기온 데이터를 입력값으로 모델에 주입하기 위한 데이터 전처리 과정을 제시한다. 다음으로, 개선된 WaveNet 모델을 학습하고 검증하는 방법을 제시한다. 성능 비교 결과, WaveNet 기반 모델에 기온 데이터를 반영한 방법은 전체 검증데이터에 대해 MAPE 값 1.33%를 달성하였고, 동일한 구조의 모델에서 기온 데이터를 반영하지 않는 것(MAPE 값 2.31%)보다 우수한 전력수요 예측 결과를 나타내고 있음을 확인할 수 있다.

유비쿼터스 컴퓨팅 환경에서의 킬러서비스 사례연구: 현장체험 학습을 중심으로 (A Study on Killer Services in Ubiquitous Computing: The Case of the Scene of Labor Learning)

  • 김경규;박성국;류성렬;김문오;장항배
    • 한국IT서비스학회지
    • /
    • 제6권2호
    • /
    • pp.99-112
    • /
    • 2007
  • In this study we designed the killer services for the scene of labor learning in ubiquitous computing. To achieve this study, we have explored the unmet needs of teachers in the scene of labor learning and examined whether the unmet needs could be served by the resources and capabilities of ubiquitous computing. Then, we have crafted a detail killer services that includes value propositions and resource maps by using statistical methodology. Finally, the killer services for the scene of labor learning proposed to serve educational users with the service architecture. The result of this study will be applied to develop new business model in ubiquitous computing as the basic research.

The Analysis of Association between Learning Styles and a Model of IoT-based Education : Chi-Square Test for Association

  • Sayassatov, Dulan;Cho, Namjae
    • Journal of Information Technology Applications and Management
    • /
    • 제27권3호
    • /
    • pp.19-36
    • /
    • 2020
  • The Internet of things (IoT) is a system of interrelated computed devices, digital machines and any physical objects which are provided with unique identifiers and the potential to transmit data to people or machine (M2M) without requiring human interaction. IoT devices can be used to monitor and control the electrical and electronic systems used in different fields like smart home, smart city, smart healthcare and etc. In this study we introduce four imaginary IoT devices as a learning support assistants according to students' dominant learning styles measured by Honey and Mumford Learning Styles: Activists, Reflectors, Theorists and Pragmatists. This research emphasizes the association between students' strong learning styles and a preference to appropriate IoT devices with specific characteristics. Moreover, different levels of IoT devices' architecture are clearly explained in this study where all the artificial devices are designed based on this structure. Data analysis of experiment were measured by the use of chi square test for association and research results showed the statistical significance of the estimated model and the impacts of each category over the model where we finally got accurate estimates for our research variables. This study revealed the importance of considering the students' dominant learning styles before inventing a new IoT device.

Application of Reinforcement Learning in Detecting Fraudulent Insurance Claims

  • Choi, Jung-Moon;Kim, Ji-Hyeok;Kim, Sung-Jun
    • International Journal of Computer Science & Network Security
    • /
    • 제21권9호
    • /
    • pp.125-131
    • /
    • 2021
  • Detecting fraudulent insurance claims is difficult due to small and unbalanced data. Some research has been carried out to better cope with various types of fraudulent claims. Nowadays, technology for detecting fraudulent insurance claims has been increasingly utilized in insurance and technology fields, thanks to the use of artificial intelligence (AI) methods in addition to traditional statistical detection and rule-based methods. This study obtained meaningful results for a fraudulent insurance claim detection model based on machine learning (ML) and deep learning (DL) technologies, using fraudulent insurance claim data from previous research. In our search for a method to enhance the detection of fraudulent insurance claims, we investigated the reinforcement learning (RL) method. We examined how we could apply the RL method to the detection of fraudulent insurance claims. There are limited previous cases of applying the RL method. Thus, we first had to define the RL essential elements based on previous research on detecting anomalies. We applied the deep Q-network (DQN) and double deep Q-network (DDQN) in the learning fraudulent insurance claim detection model. By doing so, we confirmed that our model demonstrated better performance than previous machine learning models.

중학생의 자기효능감, 자기주도학습, 학교적응과 학습몰입 간의 관계 분석 (Structural Relationship among the Self-Efficacy, Self-Directed Learning Ability, School Adjustment, and Leaning Flow in Middle School Students)

  • 강승희
    • 수산해양교육연구
    • /
    • 제24권6호
    • /
    • pp.935-949
    • /
    • 2012
  • The purpose of this study was to investigate the structural relationship among the self-efficacy, self-directed learning ability, school adjustment and learning flow in middle school students by the structural equation modeling analysis. The subjects of this study consisted of 553 middle school students. The data were analyzed with descriptive statistics, Pearson correlations and structural equation modeling analysis by using the SPSS 12.0 and AMOS 5.0 statistical program. The results of this study were as followed: First, there were significant correlations among the self-efficacy, self-directed learning ability, school adjustment and learning flow. Second, the self-directed learning ability and school adjustment directly affected the learning flow. Third, self-efficacy and school adjustment variables indirectly affected learning flow. The indices of the best fit model on these variable were adequate. This study shows that the self-efficacy, self-directed learning ability, school adjustment are the significant predictor for the learning flow during adolescent.

통계분석 기법과 머신러닝 기법의 비교분석을 통한 건물의 지진취약도 공간분석 (A Spatial Analysis of Seismic Vulnerability of Buildings Using Statistical and Machine Learning Techniques Comparative Analysis)

  • 김성훈;김상빈;김대현
    • 산업융합연구
    • /
    • 제21권1호
    • /
    • pp.159-165
    • /
    • 2023
  • 최근 지진 발생 빈도가 증가하고 있는 반면 국내 지진 대응 체계는 취약한 현실에서, 본 연구의 목적은 통계분석 기법과 머신러닝 기법을 활용한 공간분석을 통해 건물의 지진취약도를 비교분석 하는 것이다. 통계분석 기법을 활용한 결과, 최적화척도법을 활용해 개발된 모델의 예측정확도는 약 87%로 도출되었다. 머신러닝 기법을 활용한 결과, 분석된 4가지 방법 중, Random Forest의 정확도가 Train Set의 경우 94%, Test Set의 경우 76.7%로 가장 높아, 최종적으로 Random Forest가 선정되었다. 따라서, 예측정확도는 통계분석 기법이 약 87%, 머신러닝 기법이 76.7%로, 통계분석 기법의 예측정확도가 더 높은 것으로 분석되었다. 최종 결과로, 건물의 지진취약도는 분석된 건물데이터 총 22,296개 중, 1,627(0.1%)개의 건물데이터는 통계분석 기법 사용 시 더 위험하다고 도출되었고, 10,146(49%)개의 건물데이터는 동일하게 도출되었으며, 나머지 10,523(50%)개의 건물데이터는 머신러닝 기법 사용 시 더 위험하게 도출되었다. 기존 통계분석 기법에 첨단 머신러닝 기법활용결과가 추가로 비교검토 됨으로써 공간분석 의사결정에 있어서, 좀더 신뢰도가 높은 지진대응책 마련에 도움이 되길 기대한다.

스파크에서 스칼라와 R을 이용한 머신러닝의 비교 (Comparison of Scala and R for Machine Learning in Spark)

  • 류우석
    • 한국전자통신학회논문지
    • /
    • 제18권1호
    • /
    • pp.85-90
    • /
    • 2023
  • 보건의료분야 데이터 분석 방법론이 기존의 통계 중심의 연구방법에서 머신러닝을 이용한 예측 연구로 전환되고 있다. 본 연구에서는 다양한 머신러닝 도구들을 살펴보고, 보건의료분야에서 많이 사용하고 있는 통계 도구인 R을 빅데이터 머신러닝에 적용하기 위해 R과 스파크를 연계한 프로그래밍 모델들을 비교한다. 그리고, R을 스파크 환경에서 수행하는 SparkR을 이용한 선형회귀모델 학습의 성능을 스파크의 기본 언어인 스칼라를 이용한 모델과 비교한다. 실험 결과 SparkR을 이용할 때의 학습 수행 시간이 스칼라와 비교하여 10~20% 정도 증가하였다. 결과로 제시된 성능 저하를 감안한다면 기존의 통계분석 도구인 R을 그대로 활용 가능하다는 측면에서 SparkR의 분산 처리의 유용성을 확인하였다.