Search | Korea Science

A Multimodal Profile Ensemble Approach to Development of Recommender Systems Using Big Data (빅데이터 기반 추천시스템 구현을 위한 다중 프로파일 앙상블 기법)

Kim, Minjeong;Cho, Yoonho
- Journal of Intelligence and Information Systems
- /
- v.21 no.4
- /
- pp.93-110
- /
- 2015
The recommender system is a system which recommends products to the customers who are likely to be interested in. Based on automated information filtering technology, various recommender systems have been developed. Collaborative filtering (CF), one of the most successful recommendation algorithms, has been applied in a number of different domains such as recommending Web pages, books, movies, music and products. But, it has been known that CF has a critical shortcoming. CF finds neighbors whose preferences are like those of the target customer and recommends products those customers have most liked. Thus, CF works properly only when there's a sufficient number of ratings on common product from customers. When there's a shortage of customer ratings, CF makes the formation of a neighborhood inaccurate, thereby resulting in poor recommendations. To improve the performance of CF based recommender systems, most of the related studies have been focused on the development of novel algorithms under the assumption of using a single profile, which is created from user's rating information for items, purchase transactions, or Web access logs. With the advent of big data, companies got to collect more data and to use a variety of information with big size. So, many companies recognize it very importantly to utilize big data because it makes companies to improve their competitiveness and to create new value. In particular, on the rise is the issue of utilizing personal big data in the recommender system. It is why personal big data facilitate more accurate identification of the preferences or behaviors of users. The proposed recommendation methodology is as follows: First, multimodal user profiles are created from personal big data in order to grasp the preferences and behavior of users from various viewpoints. We derive five user profiles based on the personal information such as rating, site preference, demographic, Internet usage, and topic in text. Next, the similarity between users is calculated based on the profiles and then neighbors of users are found from the results. One of three ensemble approaches is applied to calculate the similarity. Each ensemble approach uses the similarity of combined profile, the average similarity of each profile, and the weighted average similarity of each profile, respectively. Finally, the products that people among the neighborhood prefer most to are recommended to the target users. For the experiments, we used the demographic data and a very large volume of Web log transaction for 5,000 panel users of a company that is specialized to analyzing ranks of Web sites. R and SAS E-miner was used to implement the proposed recommender system and to conduct the topic analysis using the keyword search, respectively. To evaluate the recommendation performance, we used 60% of data for training and 40% of data for test. The 5-fold cross validation was also conducted to enhance the reliability of our experiments. A widely used combination metric called F1 metric that gives equal weight to both recall and precision was employed for our evaluation. As the results of evaluation, the proposed methodology achieved the significant improvement over the single profile based CF algorithm. In particular, the ensemble approach using weighted average similarity shows the highest performance. That is, the rate of improvement in F1 is 16.9 percent for the ensemble approach using weighted average similarity and 8.1 percent for the ensemble approach using average similarity of each profile. From these results, we conclude that the multimodal profile ensemble approach is a viable solution to the problems encountered when there's a shortage of customer ratings. This study has significance in suggesting what kind of information could we use to create profile in the environment of big data and how could we combine and utilize them effectively. However, our methodology should be further studied to consider for its real-world application. We need to compare the differences in recommendation accuracy by applying the proposed method to different recommendation algorithms and then to identify which combination of them would show the best performance.
https://doi.org/10.13088/jiis.2015.21.4.093 인용 PDF KSCI

Sex Role Identity and Health Behavior among University Student (대학생의 성역할정체감과 건강행위)

Park, Eun-Ok;Park, Young-Sook
- Women's Health Nursing
- /
- v.5 no.3
- /
- pp.362-378
- /
- 1999
This study is to investigate sex role identity and health behavior among university students in Seoul, during the late of 1999. The instruments for data obtainment were KSRI of Kim(1994), and Health Style : A Self-Test provided by ODPHP National Health Information Center. These instrument were reliable, showing Cronbach $\alpha$ .98 and .77. Frequency, t-test, $x^2$-test, stepwise regression were conducted for data analysis, using SAS 6.12 program. The major findings were as follows : 1. For female student. Androgeny(34.0%)type was most common and subjects of feminity type were 28.7%. In contrast, Masculinity type(41.8%) was most prevalent. and undifferentiated type was 30.1% among male students. There were significant difference between male and female student in the type of sex role identity. 2. 89.6% of all subjects were included in risk group for exercise and physical activity, 86.4% for diet habit, 43.2% for alcohol drinking and drug use, 35.6% for stress control, 32.8% for safety behavior, 24.8% for smoking. The big health risk behavior problem of male students were smoking, drinking, diet habit, and exercise. The important health risk behavior problem were diet habit and exercise. There were significant difference in smoking, drinking, exercise between sex. 3. Analysis of the distribution by sex role identity type and health behavior revealed that subjects who were undifferentiated typed group had high risk behavior in stress control, safety, exercise, drinking. Smoking and drinking were more problematic for masculinity typed group had high risk behavior in diet and exercise. The data showed that androginy typed group had more healthy behavior, compared with other sex role identity typed group for all of health behavior. Further research is need to understand the role of sex role identity in health behavior, the variables associated with them. And sex role identity has to be considered in research and practice about health promotion.
PDF

The efficient data-driven solution to nonlinear continuum thermo-mechanics behavior of structural concrete panel reinforced by nanocomposites: Development of building construction in engineering

Hengbin Zheng;Wenjun Dai;Zeyu Wang;Adham E. Ragab
- Advances in nano research
- /
- v.16 no.3
- /
- pp.231-249
- /
- 2024
When the amplitude of the vibrations is equivalent to that clearance, the vibrations for small amplitudes will really be significantly nonlinear. Nonlinearities will not be significant for amplitudes that are rather modest. Finally, nonlinearities will become crucial once again for big amplitudes. Therefore, the concrete panel system may experience a big amplitude in this work as a result of the high temperature. Based on the 3D modeling of the shell theory, the current work shows the influences of the von Kármán strain-displacement kinematic nonlinearity on the constitutive laws of the structure. The system's governing Equations in the nonlinear form are solved using Kronecker and Hadamard products, the discretization of Equations on the space domain, and Duffing-type Equations. Thermo-elasticity Equations. are used to represent the system's temperature. The harmonic solution technique for the displacement domain and the multiple-scale approach for the time domain are both covered in the section on solution procedures for solving nonlinear Equations. An effective data-driven solution is often utilized to predict how different systems would behave. The number of hidden layers and the learning rate are two hyperparameters for the network that are often chosen manually when required. Additionally, the data-driven method is offered for addressing the nonlinear vibration issue in order to reduce the computing cost of the current study. The conclusions of the present study may be validated by contrasting them with those of data-driven solutions and other published articles. The findings show that certain physical and geometrical characteristics have a significant effect on the existing concrete panel structure's susceptibility to temperature change and GPL weight fraction. For building construction industries, several useful recommendations for improving the thermo-mechanics' behavior of structural concrete panels are presented.
https://doi.org/10.12989/anr.2024.16.3.231 인용

A Study of Stress, Coping Behaviors and Health Problems in School Age Children (학령기 아동의 스트레스와 대처행동 및 건강문제)

Kim Mi-Ye
- Child Health Nursing Research
- /
- v.11 no.1
- /
- pp.83-89
- /
- 2005
Purpose: The purpose of this study was to investigate the stress level, coping behaviors and health problems of elementary school children and to compare the level of these three variables according to size of city of residence and to identify the relationship among the three variables. Method: Data were collected by questionnaire from 465 5th and 6th grade elementary school children living in Daegu and North Kyungsang Province. Data were collected between December 1 and 20, 2003 and analyzed using the SPSS program with means, standard deviation, t-test, and Pearson's correlation coefficients. Results: The stress level was significantly higher in children who lived in the big city. The coping behavior score was not significantly different according to size of city, nor was there a difference in the health problems according to size of city. There was a positive correlation among stress level, coping behaviors and health problems. Conclusion: In general, the stress level was significantly different but coping behavior scores and health problem scores were not significantly different according to size of city. Also the elementary school children used more passive coping behavior than active coping behavior. Therefore, strategies to develop active coping behaviors for these children are needed.
PDF

Design and Implementation of Efficient Storage and Retrieval Technology of Traffic Big Data (교통 빅데이터의 효율적 저장 및 검색 기술의 설계와 구현)

Kim, Ki-su;Yi, Jae-Jin;Kim, Hong-Hoi;Jang, Yo-lim;Hahm, Yu-Kun
- The Journal of Bigdata
- /
- v.4 no.2
- /
- pp.207-220
- /
- 2019
Recent developments in information and communication technology has enabled the deployment of sensor based data to provide real-time services. In Korea, The Korea Transportation Safety Authority is collecting driving information of all commercial vehicles through a fitted digital tachograph (DTG). This information gathered using DTG can be utilized in various ways in the field of transportation. Notably in autonomous driving, the real-time analysis of this information can be used to prevent or respond to dangerous driving behavior. However, there is a limit to processing a large amount of data at a level suitable for real-time services using a traditional database system. In particular, due to a such technical problem, the processing of large quantity of traffic big data for real-time commercial vehicle operation information analysis has never been attempted in Korea. In order to solve this problem, this study optimized the new database server system and confirmed that a real-time service is possible. It is expected that the constructed database system will be used to secure base data needed to establish digital twin and autonomous driving environments.
PDF

Traffic Volume Dependent Displacement Estimation Model for Gwangan Bridge Using Monitoring Big Data (교량 모니터링 빅데이터를 이용한 광안대교의 교통량 의존 변위 추정 모델)

Park, Ji Hyun;Shin, Sung Woo;Kim, Soo Yong
- KSCE Journal of Civil and Environmental Engineering Research
- /
- v.38 no.2
- /
- pp.183-191
- /
- 2018
In this study a traffic volume dependent displacement estimation model for Gwangan Bridge was developed using bridge monitoring big data. Traffic volume data for four different vehicle types and the vertical displacement data in the central position of the Gwangan Bridge were used to develop and validate the estimation model. Two statistical estimation models were developed using multiple regression analysis (MRA) and principal component analysis (PCA). Estimation performance of those two models were compared with actual values. The results show that both the MRA and the PCA based models are successfully estimating the vertical displacement of Gwangan Bridge. Based on the results, it is concluded that the developed model can effectively be used to predict the traffic volume dependent displacement behavior of Gwangan Bridge.
https://doi.org/10.12652/Ksce.2018.38.2.0183 인용 PDF KSCI

Analysis Method for Speeding Risk Exposure using Mobility Trajectory Big Data (대용량 모빌리티 궤적 자료를 이용한 과속 위험노출도 분석 방법론)

Lee, Soongbong;Chang, Hyunho;Kang, Taeseok
- Journal of the Society of Disaster Information
- /
- v.17 no.3
- /
- pp.655-666
- /
- 2021
Purpose: This study is to develop a method for measuring dynamic speeding risks using vehicle trajectory big data and to demonstrate the feasibility of the devised speeding index. Method: The speed behaviors of vehicles were analysed in microscopic space and time using individual vehicle trajectories, and then the boundary condition of speeding (i.e., boundary speed) was determined from the standpoint of crash risk. A novel index for measuring the risk exposure of speeding was developed in microscopic space and time with the boundary speed. Result: A validation study was conducted with vehicle-GPS trajectory big data and ground-truth vehicle crash data. As a result of the analysis, it turned out that the index of speeding-risk exposure has a strong explanatory power (R²=0.7) for motorway traffic accidents. This directly indicates that speeding behaviors should be analysed at a microscopic spatiotemporal dimension. Conclusion: The spatial and temporal evolution of vehicle velocity is very variable. It is, hence, expected that the method presented in this study could be efficaciously employed to analyse the causal factors of traffic accidents and the crash risk exposure in microscopic space using mobility trajectory data.
https://doi.org/10.15683/kosdi.2021.9.30.655 인용 PDF KSCI

Analysis of public opinion in the 20th presidential election using YouTube data (유튜브 데이터를 활용한 20대 대선 여론분석)

Kang, Eunkyung;Yang, Seonuk;Kwon, Jiyoon;Yang, Sung-Byung
- Journal of Intelligence and Information Systems
- /
- v.28 no.3
- /
- pp.161-183
- /
- 2022
Opinion polls have become a powerful means for election campaigns and one of the most important subjects in the media in that they predict the actual election results and influence people's voting behavior. However, the more active the polls, the more often they fail to properly reflect the voters' minds in measuring the effectiveness of election campaigns, such as repeatedly conducting polls on the likelihood of winning or support rather than verifying the pledges and policies of candidates. Even if the poor predictions of the election results of the polls have undermined the authority of the press, people cannot easily let go of their interest in polls because there is no clear alternative to answer the instinctive question of which candidate will ultimately win. In this regard, we attempt to retrospectively grasp public opinion on the 20th presidential election by applying the 'YouTube Analysis' function of Sometrend, which provides an environment for discovering insights through online big data. Through this study, it is confirmed that a result close to the actual public opinion (or opinion poll results) can be easily derived with simple YouTube data results, and a high-performance public opinion prediction model can be built.
https://doi.org/10.13088/jiis.2022.28.3.161 인용 PDF KSCI

A Study on the Effects of Big Five Personality Factors on Career Behavior (성격 5요인이 진로행동에 미치는 영향)

Lee, Gil-Hwan;Lee, Deog-Ro;Park, Sang-Seok
- Management & Information Systems Review
- /
- v.31 no.4
- /
- pp.397-432
- /
- 2012
The purpose of this study is Big Five personality factors of personality type on career behavior, that is, entry to the unsafe first job in Korean university students. To accomplish this research objective, this study collected data from 500 students in three universities and two colleges located in the Chungcheong region. Among 350 questionnaires, 342 copies were used for a final analysis. In order to analyze the survey data, statistical package program SPSS/WIN 18.0 was utilized and statistical techniques such as basic statistical analysis, factor analysis, Cronbach's alpha, correlation analysis, and multiple regression analysis were used. The results obtained in this study can be summarized as follows. The extraversion personality and agreeableness personality of college students has a significant effect on the entry to the unsafe first job in the positive direction. On the other hand, the neurotic personality and openness to experience has a significant effect on the entry to the unsafe first job in the negative direction. However, the integrity personality of college students was not statistically significant. Finally, based on the empirical results we suggested several theoretical and practical implications.
PDF

Design and Implementation of the Farm-level Data Acquisition System for the Behavior Analysis of Livestocks (가축의 행동 분석을 위한 농장 수준의 데이터 수집 시스템 설계와 구현)

Park, Gi-Cheol;Han, Su-Young
- Journal of Software Assessment and Valuation
- /
- v.17 no.2
- /
- pp.117-124
- /
- 2021
Livestock behavioral analysis is a factor that has a great influence on livestock health management and agricultural productivity increase. However, most digital devices introduced for behavioral analysis of livestock do not provide raw data and also provide limited analysis results. Such a closed system makes it more difficult to integrate data and build big data, which are essential for the introduction of advanced IT technologies. Therefore, it is necessary to supply farm-scale data collection devices that can be easily used at low cost. This study presents a data collection system for analyzing the behavior of livestock. The system consists of a number of miniature computing units that operate wirelessly, and collects livestock body temperature and acceleration data, location information, and livestock environment data. In addition, this study presents an algorithm for estimating the behavior of livestock based on the collected acceleration data. For the experiment, a system was built in a Korean cattle farm in Icheon, Gyeonggi-do, and data were collected for 20 Korean cattle, and based on this, the empirical and analysis results were presented.
https://doi.org/10.29056/jsav.2021.12.13 인용

Search Result 274, Processing Time 0.024 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)