• Title/Summary/Keyword: Big 5 Model

Search Result 445, Processing Time 0.11 seconds

Influence on overfitting and reliability due to change in training data

  • Kim, Sung-Hyeock;Oh, Sang-Jin;Yoon, Geun-Young;Jung, Yong-Gyu;Kang, Min-Soo
    • International Journal of Advanced Culture Technology
    • /
    • v.5 no.2
    • /
    • pp.82-89
    • /
    • 2017
  • The range of problems that can be handled by the activation of big data and the development of hardware has been rapidly expanded and machine learning such as deep learning has become a very versatile technology. In this paper, mnist data set is used as experimental data, and the Cross Entropy function is used as a loss model for evaluating the efficiency of machine learning, and the value of the loss function in the steepest descent method is We applied the GradientDescentOptimize algorithm to minimize and updated weight and bias via backpropagation. In this way we analyze optimal reliability value corresponding to the number of exercises and optimal reliability value without overfitting. And comparing the overfitting time according to the number of data changes based on the number of training times, when the training frequency was 1110 times, we obtained the result of 92%, which is the optimal reliability value without overfitting.

Prediction of Remaining Useful Life of Lithium-ion Battery based on Multi-kernel Support Vector Machine with Particle Swarm Optimization

  • Gao, Dong;Huang, Miaohua
    • Journal of Power Electronics
    • /
    • v.17 no.5
    • /
    • pp.1288-1297
    • /
    • 2017
  • The estimation of the remaining useful life (RUL) of lithium-ion (Li-ion) batteries is important for intelligent battery management system (BMS). Data mining technology is becoming increasingly mature, and the RUL estimation of Li-ion batteries based on data-driven prognostics is more accurate with the arrival of the era of big data. However, the support vector machine (SVM), which is applied to predict the RUL of Li-ion batteries, uses the traditional single-radial basis kernel function. This type of classifier has weak generalization ability, and it easily shows the problem of data migration, which results in inaccurate prediction of the RUL of Li-ion batteries. In this study, a novel multi-kernel SVM (MSVM) based on polynomial kernel and radial basis kernel function is proposed. Moreover, the particle swarm optimization algorithm is used to search the kernel parameters, penalty factor, and weight coefficient of the MSVM model. Finally, this paper utilizes the NASA battery dataset to form the observed data sequence for regression prediction. Results show that the improved algorithm not only has better prediction accuracy and stronger generalization ability but also decreases training time and computational complexity.

Exploring inter-media agenda-setting effects: Network agenda-setting model by using big-data analysis (자살 보도에 대한 미디어 간 의제 설정 분석: 빅데이터를 이용한 네트워크 의제 설정 모델 분석을 중심으로)

  • Kim, Daewook
    • Journal of the Korea Convergence Society
    • /
    • v.12 no.5
    • /
    • pp.121-126
    • /
    • 2021
  • Based on network agenda-setting theory, this study attempted to analyze media reports about suicide from 2000 to 2020 in order to find solutions for suicide problem in the Korean society. Results showed that top 10 key words in media were suicide, death leap, death, attempt, supposition, discovery, men, pessimism. Those key words were appeared similarly and contunually in the media. In addition, both newspapers and broadcastings had similar reports trend, so it is plausible to consider inter-media agenda setting relations between newspapers and broadcasings.

Field measurement and numerical simulation of snow deposition on an embankment in snowdrift

  • Ma, Wenyong;Li, Feiqiang;Sun, Yuanchun;Li, Jianglong;Zhou, Xuanyi
    • Wind and Structures
    • /
    • v.32 no.5
    • /
    • pp.453-469
    • /
    • 2021
  • Snow accumulation on the road frequently induces a big traffic problem in the cold snowy region. Accurate prediction on snow distribution is fundamental for solving drifting snow disasters on roads. The present study adopts the transient method to simulate the wind-induced snow distribution on embankment based on the mixture multiphase model and dynamic mesh technique. The simulation and field measurement are compared to confirm the applicability of the simulation. Furthermore, the process of snow accumulation is revealed. The effects of friction velocity and snow concentration on snow accumulation are analyzed to clarify its mechanism. The results show that the simulation agrees well with the field measurement in trends. Moreover, the snow accumulation on the embankment can be approximately divided into three stages with time, the snow firstly deposited on the windward side, then, accumulation occurs on the leeward side which induced by the wake vortex, finally, the snow distribution reaches an equilibrium state with the slope of approximately 7°. The friction velocity and duration have a significant influence on the snow accumulation, and the vortex scale directly affected the snow deposition range on the embankment leeward side.

HBase based Business Process Event Log Schema Design of Hadoop Framework

  • Ham, Seonghun;Ahn, Hyun;Kim, Kwanghoon Pio
    • Journal of Internet Computing and Services
    • /
    • v.20 no.5
    • /
    • pp.49-55
    • /
    • 2019
  • Organizations design and operate business process models to achieve their goals efficiently and systematically. With the advancement of IT technology, the number of items that computer systems can participate in and the process becomes huge and complicated. This phenomenon created a more complex and subdivide flow of business process.The process instances that contain workcase and events are larger and have more data. This is an essential resource for process mining and is used directly in model discovery, analysis, and improvement of processes. This event log is getting bigger and broader, which leads to problems such as capacity management and I / O load in management of existing row level program or management through a relational database. In this paper, as the event log becomes big data, we have found the problem of management limit based on the existing original file or relational database. Design and apply schemes to archive and analyze large event logs through Hadoop, an open source distributed file system, and HBase, a NoSQL database system.

A Study on the Prediction of Strawberry Production in Machine Learning Infrastructure (머신러닝 기반 시설재배 딸기 생산량 예측 연구)

  • Oh, HanByeol;Lim, JongHyun;Yang, SeungWeon;Cho, YongYun;Shin, ChangSun
    • Smart Media Journal
    • /
    • v.11 no.5
    • /
    • pp.9-16
    • /
    • 2022
  • Recently, agricultural sites are automating into digital agricultural smart farms by applying technologies such as big data and Internet of Things (IoT). These smart farms aim to increase production and improve crop quality by measuring the environment of crops, investigating and processing data. Production prediction is an important study in smart farm digital agriculture, which is a high-tech agriculture, and it is necessary to analyze environmental data using big data and further standardized research to manage the quality of growth information data. In this paper, environmental and production data collected from smart farm strawberry farms were analyzed and studied. Based on regression analysis, crop production prediction models were analyzed using Ridge Regression, LightGBM, and XGBoost. Among the three models, the optimal model was XGBoost, and R2 showed 82.5 percent explanatory power. As a result of the study, the correlation between the amount of positive fluid absorption and environmental data was confirmed, and significant results were obtained for the production prediction study. In the future, it is expected to contribute to the prevention of environmental pollution and reduction of sheep through the management of sheep by studying the amount of sheep absorption, such as information on the growing environment of crops and the ingredients of sheep.

Efficient Outlier Detection of the Water Temperature Monitoring Data (수온 관측 자료의 효율적인 이상 자료 탐지)

  • Cho, Hongyeon;Jeong, Shin Taek;Ko, Dong Hui;Son, Kyeong-Pyo
    • Journal of Korean Society of Coastal and Ocean Engineers
    • /
    • v.26 no.5
    • /
    • pp.285-291
    • /
    • 2014
  • The statistical information of the coastal water temperature monitoring data can be biased because of outliers and missing intervals. Though a number of outlier detection methods have been developed, their applications are very limited to the in-situ monitoring data because of the assumptions of the a prior information of the outliers and no-missing condition, and the excessive computational time for some methods. In this study, the practical robust method is developed that can be efficiently and effectively detect the outliers in case of the big-data. This model is composed of these two parts, one part is the construction part of the approximate components of the monitoring data using the robust smoothing and data re-sampling method, and the other part is the main iterative outlier detection part using the detailed components of the data estimated by the approximate components. This model is tested using the two-years 5-minute interval water temperature data in Lake Saemangeum. It can be estimated that the outlier proportion of the data is about 1.6-3.7%. It shows that most of the outliers in the data are detected and removed with satisfaction by the model. In order to effectively detect and remove the outliers, the outlier detection using the long-span smoothing should be applied earlier than that using the short-span smoothing.

An Empirical Study on the Failure Prediction for KOSDAQ Firms (코스닥기업의 부실예측에 대한 실증 분석)

  • Park, Hee-Jung;Kang, Ho-Jung
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.10 no.3
    • /
    • pp.670-676
    • /
    • 2009
  • Bankruptcy of firms in Korea can cause distress of financial institutions because these institutions have disterssed bond. Accordingly, social and economical spill-over effects by these results are very big. Even after the difficult times of IMF crisis had ended, bankruptcy of information-based small-medium companies and venture firms listed on the KOSDAQ has been continued. In this context, this study developed and adopted failure prediction models for which discriminant analysis was used. Samples of this study was 81 firms respectively for both failed and non-failed firms listed on the KOSDAQ between the year of 2000 and 2007. The results of this study are as follows. First, the accuracy of classification of the model by years was $74.5%{\sim}76.5%$, and the accuracy of classification of the mean model was $69.6%{\sim}80.4%$. Among the models, the mean model of -one year, -two years, and -three years was highest in accuracy of classification (80.4%). Second, accuracy of prediction of final model adopted on validation samples showed 85% before one year of bankruptcy. The results of this study may be significant in that the results may be used as early warning system for bankruptcy prediction of KOSDAQ firms.

The long-term agricultural weather forcast methods using machine learning and GloSea5 : on the cultivation zone of Chinese cabbage. (기계학습과 GloSea5를 이용한 장기 농업기상 예측 : 고랭지배추 재배 지역을 중심으로)

  • Kim, Junseok;Yang, Miyeon;Yoon, Sanghoo
    • Journal of Digital Convergence
    • /
    • v.18 no.4
    • /
    • pp.243-250
    • /
    • 2020
  • Systematic farming can be planned and managed if long-term agricultural weather information of the plantation is available. Because the greatest risk factor for crop cultivation is the weather. In this study, a method for long-term predicting of agricultural weather using the GloSea5 and machine learning is presented for the cultivation of Chinese cabbage. The GloSea5 is a long-term weather forecast that is available up to 240 days. The deep neural networks and the spatial randomforest were considered as the method of machine learning. The longterm prediction performance of the deep neural networks was slightly better than the spatial randomforest in the sense of root mean squared error and mean absolute error. However, the spatial randomforest has the advantage of predicting temperatures with a global model, which reduces the computation time.

A Study on Design Parameters for Ready-made Ear Shell of Hearing Aids (보청기용 범용 이어쉘을 위한 설계 파라미터에 관한 연구)

  • Urtnasan, Erdenebayar;Jeon, Yu-Yong;Park, Gyu-Seok;Song, Young-Rok;Lee, Sang-Min
    • The Transactions of The Korean Institute of Electrical Engineers
    • /
    • v.60 no.5
    • /
    • pp.1055-1061
    • /
    • 2011
  • In this study, main parameters: aperture, first bend and second bend which express a structure of ear canal are extracted in order to modeling and manufacture the ready-made ear shells of hearing aids. The proposed parameter extraction method consists of 2 important algorithms, aperture detection and feature detection. In the aperture detection algorithm, aperture of 3-D scanned virtual ear impression and parameters relating to ear shell of hearing aid are determined. The feature detection algorithm detects first bend, second bend, and related parameters. Through these two algorithms, parameters for aperture, first bend, and second bend are extracted to model the ready-made ear shell of hearing aid. The values of these extracted parameters from 36 people's right ear impression are analyzed and measured statistically. As a result of the analysis, it has been found that it is possible to classify ready-made ear shell parameters by age and size. The ready-made ear shell parameters are classified 3-size for 20 years old and 2-size for 60 years olde. Using 3D rhino program, virtual ready-made ear shell is reconstructed by parameters of every type, and simulated to model it. A final product was produced by transferring simulation result with rapid prototyping system. The modeled ready-made ear shell is evaluated with the objective and subjective method. Objective method is the comparison volume ratio and overlapped volume ratio of ear impression from randomly chosen 18 people and ready-made ear shell. And subjective method is that the final product of ready-made ear shell is used by users and the satisfaction number drawn from well fitting and comfortable testing was evaluated. In the result of the evaluation, it has been found that volume ration is 70%, big and middle size ready-made ear shell products are possible, and the satisfaction number is high.