• Title/Summary/Keyword: Big 5 Model

Search Result 445, Processing Time 0.039 seconds

Development of Type 2 Prediction Prediction Based on Big Data (빅데이터 기반 2형 당뇨 예측 알고리즘 개발)

  • Hyun Sim;HyunWook Kim
    • The Journal of the Korea institute of electronic communication sciences
    • /
    • v.18 no.5
    • /
    • pp.999-1008
    • /
    • 2023
  • Early prediction of chronic diseases such as diabetes is an important issue, and improving the accuracy of diabetes prediction is especially important. Various machine learning and deep learning-based methodologies are being introduced for diabetes prediction, but these technologies require large amounts of data for better performance than other methodologies, and the learning cost is high due to complex data models. In this study, we aim to verify the claim that DNN using the pima dataset and k-fold cross-validation reduces the efficiency of diabetes diagnosis models. Machine learning classification methods such as decision trees, SVM, random forests, logistic regression, KNN, and various ensemble techniques were used to determine which algorithm produces the best prediction results. After training and testing all classification models, the proposed system provided the best results on XGBoost classifier with ADASYN method, with accuracy of 81%, F1 coefficient of 0.81, and AUC of 0.84. Additionally, a domain adaptation method was implemented to demonstrate the versatility of the proposed system. An explainable AI approach using the LIME and SHAP frameworks was implemented to understand how the model predicts the final outcome.

Predicting Regional Soybean Yield using Crop Growth Simulation Model (작물 생육 모델을 이용한 지역단위 콩 수량 예측)

  • Ban, Ho-Young;Choi, Doug-Hwan;Ahn, Joong-Bae;Lee, Byun-Woo
    • Korean Journal of Remote Sensing
    • /
    • v.33 no.5_2
    • /
    • pp.699-708
    • /
    • 2017
  • The present study was to develop an approach for predicting soybean yield using a crop growth simulation model at the regional level where the detailed and site-specific information on cultivation management practices is not easily accessible for model input. CROPGRO-Soybean model included in Decision Support System for Agrotechnology Transfer (DSSAT) was employed for this study, and Illinois which is a major soybean production region of USA was selected as a study region. As a first step to predict soybean yield of Illinois using CROPGRO-Soybean model, genetic coefficients representative for each soybean maturity group (MG I~VI) were estimated through sowing date experiments using domestic and foreign cultivars with diverse maturity in Seoul National University Farm ($37.27^{\circ}N$, $126.99^{\circ}E$) for two years. The model using the representative genetic coefficients simulated the developmental stages of cultivars within each maturity group fairly well. Soybean yields for the grids of $10km{\times}10km$ in Illinois state were simulated from 2,000 to 2,011 with weather data under 18 simulation conditions including the combinations of three maturity groups, three seeding dates and two irrigation regimes. Planting dates and maturity groups were assigned differently to the three sub-regions divided longitudinally. The yearly state yields that were estimated by averaging all the grid yields simulated under non-irrigated and fully-Irrigated conditions showed a big difference from the statistical yields and did not explain the annual trend of yield increase due to the improved cultivation technologies. Using the grain yield data of 9 agricultural districts in Illinois observed and estimated from the simulated grid yield under 18 simulation conditions, a multiple regression model was constructed to estimate soybean yield at agricultural district level. In this model a year variable was also added to reflect the yearly yield trend. This model explained the yearly and district yield variation fairly well with a determination coefficients of $R^2=0.61$ (n = 108). Yearly state yields which were calculated by weighting the model-estimated yearly average agricultural district yield by the cultivation area of each agricultural district showed very close correspondence ($R^2=0.80$) to the yearly statistical state yields. Furthermore, the model predicted state yield fairly well in 2012 in which data were not used for the model construction and severe yield reduction was recorded due to drought.

Evaluation of Mobility and Safety of Operating an Overlap Phase on a Shared-Left-Turn Lane Using a Microscopic Traffic Simulation Model (미시교통시뮬레이션모형을 이용한 공용 좌회전 차로의 중첩현시운영의 이동성과 안전성 평가 연구)

  • Yun, Il-Soo;Han, Eum;Woo, Seok-Cheol;Yoon, Jung-Eun;Park, Sung-Ho
    • The Journal of The Korea Institute of Intelligent Transport Systems
    • /
    • v.11 no.5
    • /
    • pp.15-26
    • /
    • 2012
  • Government agencies including the national police agency have executed diverse efforts including continuous improvements of traffic facilities and operation methods, education, enforcements in order to improve traffic operation systems; nevertheless there have been continuous criticisms on irrationality in traffic signal and road facility operation. One of the reasons may be the lack of systematic preliminary evaluations on various alternatives. However, there was no appropriate tool to evaluate the mobility and safety of thus alternatives in a systematic way. Therefore, this study proposes the systematic use of microscopic traffic simulation models as a comprehensive evaluation tool. In addition, this study verified the potential of using a microscopic traffic simulation model using the case of operating an overlap phase on a shared-left-turn lane through a systematic way where the evaluation was conducted through data collection, building networks, calibrating microscopic simulation models, producing performance measures, evaluating mobility and safety, and so on. As a result, the operation of overlap phase on a shared-left-turn lane showed no big difference from other operation scenarios such as leading left-turn on exclusive left turn lane in terms of mobility. However the operation of overlap phase on a shared-left-turn lane decreased safety by increasing potential conflicts.

A study on service model for unified data transmission in a subway and railway (차지상간 통합전송시스템의 서비스 모델에 관한 연구)

  • An, Tae-Kil;Kim, Back-Hyun;Jeong, Sang-Guk;Nam, Myung-Woo;Lee, Young-Seock;Oh, Myung-Kwan
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.11 no.5
    • /
    • pp.1573-1579
    • /
    • 2010
  • In this paper, we studied efficient design of wireless transmission system for unified data transmission in a subway and railway. It is increased that need of broadband multimedia service to make useful environment for users and to support the operation of railway system. High bandwidth is better if we need more services. But, high bandwidth requires more cost at tunnel of subway. And more bandwidth makes received antenna sensitivity bad. So it needs more wireless stations. We deduced best bandwidth for subway wireless transmission system using the cost of installation and efficiency of system. Consequently, we proposed efficient service model for broadband wireless system at a subway. Subway broadband wireless transmission system is testing and extended to province subway. The cost of subway broadband wireless transmission system is saved, because the system can be efficiently designed using proposed service model. Therefore, the effectiveness of it will be expected to be very big.

A Research on PV-connected ESS dissemination strategy considering the effects of GHG reduction (온실가스감축효과를 고려한 태양광 연계형 에너지저장장치(ESS) 보급전략에 대한 연구)

  • Lee, Wongoo;KIM, Kang-Won;KIM, Balho H.
    • Journal of Energy Engineering
    • /
    • v.25 no.2
    • /
    • pp.94-100
    • /
    • 2016
  • ESS(Energy Storage System) is an important source that keeps power supply stable and utilizes electricity efficiently. For example, ESS contributes to resolve power supply imbalance, stabilize new renewable energy output and regulate frequency. ESS is predicted to be expanded to 55.9GWh of installed capacity by 2023, which is 30 times more than that of 2014. To raise competitiveness of domestic ESS industry in this increasing world market, we have disseminated load-shift ESS for continuous power supply imbalance with FR ESS, and also necessity to secure domestic track record is required. However in case of FR ESS, utility of installing thermal power plant is generally generated within 5% range of rated capacity, so that scalability of domestic market is low without dramatic increase of thermal power plant. Necessity of load-shift ESS dissemination is also decreasing effected by surplus backup power securement policy, raising demand for new dissemination model. New dissemination model is promising for $CO_2$ reduction effect in spite of intermittent output. By stabilizing new renewable energy output in connection with new renewable energy, and regulating system input timing of new renewable energy generation rate, it is prospected model for 'post-2020' regime and energy industry. This research presents a policy alternatives of REC multiplier calculation method to induce investment after outlining PV-connected ESS charge/discharge mode to reduce GHG emission, This alternative is projected to utilize GHG emission reduction methodology for 'Post-2020' regime, big issue of new energy policy.

Research for facial model of Korean female using physiognomy (성격에 따른 한국인 여성(20대) 캐릭터의 얼굴 모델 연구(관상학을 이용하여))

  • 송은화;최유미
    • Archives of design research
    • /
    • v.17 no.3
    • /
    • pp.39-50
    • /
    • 2004
  • According to the survey done among the 20s of university students in Oct. 2002, the female animation characters from Korea and Japan have almost no distinction. Speaking of the features of the characters, they have big eyes, a long nose, an oval face. However, their eyes and mouths are expressed exaggeratedly, which makes them hard to get a good impression from the people. It seems that the identity of Korean female animation characters is disappearing in terms of the copycat from western animation characters. Through the research, the features of 20s of Korean and Japanese girls are compared and they were reviewed from the ideal and biological point of view. Then, the animation characters, which physiognomy has been applied, are created based on result, which I am trying to find the standard model of Korea female animation character. As an ideal point of view, it turns out that Korean has a positive reaction on the female animation characters with the cute & introspective characteristics. On the contrary, Korean has a negative reactions on the ones that were created in a biological viewpoint and has a highly sensitive and discontented characteristics. This shows that Korean do not prefer the female animation characters with highly sensitive and discontented characters. According to the survey, 73.5% of people could find the same 7 kinds of female characters which are shown frequently in the animation. This means that the oriental physiognomy is an effective way to approach to the features of the character's face depending on the characteristics of each female characters. The characters in this study are based on real face of female, but we can create the various animation characters if we apply the partial feature of distinctive each personal face.

  • PDF

Fandom-Persona Design based on Social Network Analysis (소셜 네트워크 분석을 이용한 팬덤 페르소나 디자인)

  • Sul, Sanghun;Seong, Kihun
    • Journal of Internet Computing and Services
    • /
    • v.20 no.5
    • /
    • pp.87-94
    • /
    • 2019
  • In this paper, the method of analyzing the unformatted data of consumers accumulated on social networks in the era of the Fourth Industrial Revolution by utilizing data from the service design and social psychology aspects was proposed. First, the fandom phenomenon, which shows subjective and collective behavior in a space on a social network rather than physical space, was defined from a data service perspective. The fandom model has been transformed into a collective level of customer Persona that has been analyzed at a personal level in traditional service design, and social network analysis that analyzes consumers' big data has been presented as an efficient way to pattern and visually analyze it. Consumer data collected through social leasing were pre-processed by column based on correlation, stability, missing, and ID-ness. Based on the above data, the company's brand strategy was divided into active and passive interventions and the effect of this strategic attitude on the growth direction of the consumer's fandom community was analyzed. To this end, the fandom model of consumers was proposed by dividing it into four strategies that the brand strategy had: stand-alone, decentralized, integrated and centralized, and the fandom shape of consumers was proposed as a growth model analysis technique that analyzes changes over time.

A Study on Project-based Smart Learning Tool Model (프로젝트 기반 스마트 학습 도구 모델에 관한 연구)

  • Lee, Keun-Ho
    • Journal of Internet of Things and Convergence
    • /
    • v.8 no.5
    • /
    • pp.93-98
    • /
    • 2022
  • With the development of new digital technologies, research on various learning tools is being actively conducted. These learning tools are also being developed so that they can be applied to various environments by applying the technology of artificial intelligence or using smart functions to which big data technology is applied. These smart learning tools are contributing a lot to increasing educational effectiveness and learning efficiency. Recently, various learning tools have been applied in universities, and solutions for smart learning from smart attendance are introduced to improve student learning efficiency. This study intends to propose a design for a smart learning tool that can increase the efficiency of project progress and increase the scalability of the results when conducting a company's customized project through such a university's smart learning tool. The proposed smart learning tool is expected to have the advantage of being able to easily adapt to the practical business project as the company-customized projects that can improve practical skills are smoothly used as a learning tool. The proposed project-based smart learning tool model is later built as a related LMS and applied to actual project progress to check its utility, and to revise and supplement the proposed smart learning tool model to provide a project-based smart learning function want to strengthen.

A Study on the Application of Virtual Space Design Using the Blended Education Method - A La Carte Model Based on the Creation of Infographic - (블렌디드 교육방식을 활용한 가상공간 디자인 적용에 관한 연구 -알 라 카르테 모델 (A La Carte) 인포그래픽 가상공간 제작을 중심으로-)

  • Cho, Hyun Kyung
    • The Journal of the Convergence on Culture Technology
    • /
    • v.8 no.5
    • /
    • pp.279-284
    • /
    • 2022
  • As a study of the blended learning method on design education through the blended learning method, I would like to propose that more advanced learner-led customized design education is possible. Understanding in face-to-face classes and advantages in non-face-to-face classes can be supplemented in an appropriate way in remote classes. Advanced artificial intelligence and big data technology can provide personalized and subdivided learning materials and effective learning methods tailored to learners' levels and interests based on quantified data in design classes. In this paper, it was proposed to maximize the efficiency of the class by applying a method that exceeds the limitations of time and space through the proposal of the A La Carte model (A La Carte). It is a remote class that can be heard anytime, anywhere, and it is also possible to bridge the educational quality and educational gap provided to students living in underprivileged areas. As the goal of fostering creative convergence-type future talents, it is changing with a rapid technological development speed. It is necessary to adapt to the change in learning methods in line with this. An analysis of the infographic virtual space design and construction process through the A La Carte model (A La Carte) proposal was presented. Rather than simply acquiring knowledge, it is expected that knowledge can be sorted, distinguished, learned, and easily reborn with its own knowledge.

Clickstream Big Data Mining for Demographics based Digital Marketing (인구통계특성 기반 디지털 마케팅을 위한 클릭스트림 빅데이터 마이닝)

  • Park, Jiae;Cho, Yoonho
    • Journal of Intelligence and Information Systems
    • /
    • v.22 no.3
    • /
    • pp.143-163
    • /
    • 2016
  • The demographics of Internet users are the most basic and important sources for target marketing or personalized advertisements on the digital marketing channels which include email, mobile, and social media. However, it gradually has become difficult to collect the demographics of Internet users because their activities are anonymous in many cases. Although the marketing department is able to get the demographics using online or offline surveys, these approaches are very expensive, long processes, and likely to include false statements. Clickstream data is the recording an Internet user leaves behind while visiting websites. As the user clicks anywhere in the webpage, the activity is logged in semi-structured website log files. Such data allows us to see what pages users visited, how long they stayed there, how often they visited, when they usually visited, which site they prefer, what keywords they used to find the site, whether they purchased any, and so forth. For such a reason, some researchers tried to guess the demographics of Internet users by using their clickstream data. They derived various independent variables likely to be correlated to the demographics. The variables include search keyword, frequency and intensity for time, day and month, variety of websites visited, text information for web pages visited, etc. The demographic attributes to predict are also diverse according to the paper, and cover gender, age, job, location, income, education, marital status, presence of children. A variety of data mining methods, such as LSA, SVM, decision tree, neural network, logistic regression, and k-nearest neighbors, were used for prediction model building. However, this research has not yet identified which data mining method is appropriate to predict each demographic variable. Moreover, it is required to review independent variables studied so far and combine them as needed, and evaluate them for building the best prediction model. The objective of this study is to choose clickstream attributes mostly likely to be correlated to the demographics from the results of previous research, and then to identify which data mining method is fitting to predict each demographic attribute. Among the demographic attributes, this paper focus on predicting gender, age, marital status, residence, and job. And from the results of previous research, 64 clickstream attributes are applied to predict the demographic attributes. The overall process of predictive model building is compose of 4 steps. In the first step, we create user profiles which include 64 clickstream attributes and 5 demographic attributes. The second step performs the dimension reduction of clickstream variables to solve the curse of dimensionality and overfitting problem. We utilize three approaches which are based on decision tree, PCA, and cluster analysis. We build alternative predictive models for each demographic variable in the third step. SVM, neural network, and logistic regression are used for modeling. The last step evaluates the alternative models in view of model accuracy and selects the best model. For the experiments, we used clickstream data which represents 5 demographics and 16,962,705 online activities for 5,000 Internet users. IBM SPSS Modeler 17.0 was used for our prediction process, and the 5-fold cross validation was conducted to enhance the reliability of our experiments. As the experimental results, we can verify that there are a specific data mining method well-suited for each demographic variable. For example, age prediction is best performed when using the decision tree based dimension reduction and neural network whereas the prediction of gender and marital status is the most accurate by applying SVM without dimension reduction. We conclude that the online behaviors of the Internet users, captured from the clickstream data analysis, could be well used to predict their demographics, thereby being utilized to the digital marketing.