• Title/Summary/Keyword: social media data

Search Result 1,244, Processing Time 0.023 seconds

Personalized Clothing and Food Recommendation System Based on Emotions and Weather (감정과 날씨에 따른 개인 맞춤형 옷 및 음식 추천 시스템)

  • Ugli, Sadriddinov Ilkhomjon Rovshan;Park, Doo-Soon
    • KIPS Transactions on Software and Data Engineering
    • /
    • v.11 no.11
    • /
    • pp.447-454
    • /
    • 2022
  • In the era of the 4th industrial revolution, we are living in a flood of information. It is very difficult and complicated to find the information people need in such an environment. Therefore, in the flood of information, a recommendation system is essential. Among these recommendation systems, many studies have been conducted on each recommendation system for movies, music, food, and clothes. To date, most personalized recommendation systems have recommended clothes, books, or movies by checking individual tendencies such as age, genre, region, and gender. Future generations will want to be recommended clothes, books, and movies at once by checking age, genre, region, and gender. In this paper, we propose a recommendation system that recommends personalized clothes and food at once according to the user's emotions and weather. We obtained user data from Twitter of social media and analyzed this data as user's basic emotion according to Paul Eckman's theory. The basic emotions obtained in this way were converted into colors by applying Hayashi's Quantification Method III, and these colors were expressed as recommended clothes colors. Also, the type of clothing is recommended using the weather information of the visualcrossing.com API. In addition, various foods are recommended according to the contents of comfort food according to emotions.

Generating Sponsored Blog Texts through Fine-Tuning of Korean LLMs (한국어 언어모델 파인튜닝을 통한 협찬 블로그 텍스트 생성)

  • Bo Kyeong Kim;Jae Yeon Byun;Kyung-Ae Cha
    • Journal of Korea Society of Industrial Information Systems
    • /
    • v.29 no.3
    • /
    • pp.1-12
    • /
    • 2024
  • In this paper, we fine-tuned KoAlpaca, a large-scale Korean language model, and implemented a blog text generation system utilizing it. Blogs on social media platforms are widely used as a marketing tool for businesses. We constructed training data of positive reviews through emotion analysis and refinement of collected sponsored blog texts and applied QLoRA for the lightweight training of KoAlpaca. QLoRA is a fine-tuning approach that significantly reduces the memory usage required for training, with experiments in an environment with a parameter size of 12.8B showing up to a 58.8% decrease in memory usage compared to LoRA. To evaluate the generative performance of the fine-tuned model, texts generated from 100 inputs not included in the training data produced on average more than twice the number of words compared to the pre-trained model, with texts of positive sentiment also appearing more than twice as often. In a survey conducted for qualitative evaluation of generative performance, responses indicated that the fine-tuned model's generated outputs were more relevant to the given topics on average 77.5% of the time. This demonstrates that the positive review generation language model for sponsored content in this paper can enhance the efficiency of time management for content creation and ensure consistent marketing effects. However, to reduce the generation of content that deviates from the category of positive reviews due to elements of the pre-trained model, we plan to proceed with fine-tuning using the augmentation of training data.

A Study on the Analysis of Park User Experiences in Phase 1 and 2 Korea's New Towns with Blog Text Data (블로그 텍스트 데이터를 활용한 1, 2기 신도시 공원의 이용자 경험 분석 연구)

  • Sim, Jooyoung;Lee, Minsoo;Choi, Hyeyoung
    • Journal of the Korean Institute of Landscape Architecture
    • /
    • v.52 no.3
    • /
    • pp.89-102
    • /
    • 2024
  • This study aims to examine the characteristics of the user experience of New Town neighborhood parks and explore issues that diversify the experience of the parks. In order to quantitatively analyze a large amount of park visitors' experiences, text-based Naver blog reviews were collected and analyzed. Among the Phase 1 and 2 New Towns, the parks with the highest user experience postings were selected for each city as the target of analysis. Blog text data was collected from May 20, 2003, to May 31, 2022, and analysis was conducted targeting Ilsan Lake Park, Bundang Yuldong Park, Gwanggyo Lake Park, and Dongtan Lake Park. The findings revealed that all four parks were used for everyday relaxation and recreation. Second, the analysis underscores park's diverse user groups. Third, the programs for parks nearby were also related to park usage. Fourth, the words within the top 20 rankings represented distinctive park elements or content/programs specific to each park. Lastly, the results of the network analysis delineated four overarching types of park users and the networks of four park user types appeared differently depending on the park. This study provides two implications. First, in addition to the naturalistic characteristics, the differentiation of each park's unique facilities and programs greatly improves public awareness and enriches the individual park experience. Second, if analysis of the context surrounding the park based on spatial information is performed in addition to text analysis, the accuracy of interpretation of text data analysis results could be improved. The results of this study can be used in the planning and designing of parks and greenspaces in the Phase 3 New Towns currently in progress.

A Checklist to Improve the Fairness in AI Financial Service: Focused on the AI-based Credit Scoring Service (인공지능 기반 금융서비스의 공정성 확보를 위한 체크리스트 제안: 인공지능 기반 개인신용평가를 중심으로)

  • Kim, HaYeong;Heo, JeongYun;Kwon, Hochang
    • Journal of Intelligence and Information Systems
    • /
    • v.28 no.3
    • /
    • pp.259-278
    • /
    • 2022
  • With the spread of Artificial Intelligence (AI), various AI-based services are expanding in the financial sector such as service recommendation, automated customer response, fraud detection system(FDS), credit scoring services, etc. At the same time, problems related to reliability and unexpected social controversy are also occurring due to the nature of data-based machine learning. The need Based on this background, this study aimed to contribute to improving trust in AI-based financial services by proposing a checklist to secure fairness in AI-based credit scoring services which directly affects consumers' financial life. Among the key elements of trustworthy AI like transparency, safety, accountability, and fairness, fairness was selected as the subject of the study so that everyone could enjoy the benefits of automated algorithms from the perspective of inclusive finance without social discrimination. We divided the entire fairness related operation process into three areas like data, algorithms, and user areas through literature research. For each area, we constructed four detailed considerations for evaluation resulting in 12 checklists. The relative importance and priority of the categories were evaluated through the analytic hierarchy process (AHP). We use three different groups: financial field workers, artificial intelligence field workers, and general users which represent entire financial stakeholders. According to the importance of each stakeholder, three groups were classified and analyzed, and from a practical perspective, specific checks such as feasibility verification for using learning data and non-financial information and monitoring new inflow data were identified. Moreover, financial consumers in general were found to be highly considerate of the accuracy of result analysis and bias checks. We expect this result could contribute to the design and operation of fair AI-based financial services.

Professional Baseball Viewing Culture Survey According to Corona 19 using Social Network Big Data (소셜네트워크 빅데이터를 활용한 코로나 19에 따른 프로야구 관람문화조사)

  • Kim, Gi-Tak
    • Journal of Korea Entertainment Industry Association
    • /
    • v.14 no.6
    • /
    • pp.139-150
    • /
    • 2020
  • The data processing of this study focuses on the textom and social media words about three areas: 'Corona 19 and professional baseball', 'Corona 19 and professional baseball', and 'Corona 19 and professional sports' The data was collected and refined in a web environment and then processed in batch, and the Ucinet6 program was used to visualize it. Specifically, the web environment was collected using Naver, Daum, and Google's channels, and was summarized into 30 words through expert meetings among the extracted words and used in the final study. 30 extracted words were visualized through a matrix, and a CONCOR analysis was performed to identify clusters of similarity and commonality of words. As a result of analysis, the clusters related to Corona 19 and Pro Baseball were composed of one central cluster and five peripheral clusters, and it was found that the contents related to the opening of professional baseball according to the corona 19 wave were mainly searched. The cluster related to Corona 19 and unrelated to professional baseball consisted of one central cluster and five peripheral clusters, and it was found that the keyword of the position of professional baseball related to the professional baseball game according to Corona 19 was mainly searched. Corona 19 and the cluster related to professional sports consisted of one central cluster and five peripheral clusters, and it was found that the keywords related to the start of professional sports according to the aftermath of Corona 19 were mainly searched.

A study of Artificial Intelligence (AI) Speaker's Development Process in Terms of Social Constructivism: Focused on the Products and Periodic Co-revolution Process (인공지능(AI) 스피커에 대한 사회구성 차원의 발달과정 연구: 제품과 시기별 공진화 과정을 중심으로)

  • Cha, Hyeon-ju;Kweon, Sang-hee
    • Journal of Internet Computing and Services
    • /
    • v.22 no.1
    • /
    • pp.109-135
    • /
    • 2021
  • his study classified the development process of artificial intelligence (AI) speakers through analysis of the news text of artificial intelligence (AI) speakers shown in traditional news reports, and identified the characteristics of each product by period. The theoretical background used in the analysis are news frames and topic frames. As analysis methods, topic modeling and semantic network analysis using the LDA method were used. The research method was a content analysis method. From 2014 to 2019, 2710 news related to AI speakers were first collected, and secondly, topic frames were analyzed using Nodexl algorithm. The result of this study is that, first, the trend of topic frames by AI speaker provider type was different according to the characteristics of the four operators (communication service provider, online platform, OS provider, and IT device manufacturer). Specifically, online platform operators (Google, Naver, Amazon, Kakao) appeared as a frame that uses AI speakers as'search or input devices'. On the other hand, telecommunications operators (SKT, KT) showed prominent frames for IPTV, which is the parent company's flagship business, and 'auxiliary device' of the telecommunication business. Furthermore, the frame of "personalization of products and voice service" was remarkable for OS operators (MS, Apple), and the frame for IT device manufacturers (Samsung) was "Internet of Things (IoT) Integrated Intelligence System". The econd, result id that the trend of the topic frame by AI speaker development period (by year) showed a tendency to develop around AI technology in the first phase (2014-2016), and in the second phase (2017-2018), the social relationship between AI technology and users It was related to interaction, and in the third phase (2019), there was a trend of shifting from AI technology-centered to user-centered. As a result of QAP analysis, it was found that news frames by business operator and development period in AI speaker development are socially constituted by determinants of media discourse. The implication of this study was that the evolution of AI speakers was found by the characteristics of the parent company and the process of co-evolution due to interactions between users by business operator and development period. The implications of this study are that the results of this study are important indicators for predicting the future prospects of AI speakers and presenting directions accordingly.

Evaluation of Preference by Bukhansan Dulegil Course Using Sentiment Analysis of Blog Data (블로그 데이터 감성분석을 통한 북한산둘레길 구간별 선호도 평가)

  • Lee, Sung-Hee;Son, Yong-Hoon
    • Journal of the Korean Institute of Landscape Architecture
    • /
    • v.49 no.3
    • /
    • pp.1-10
    • /
    • 2021
  • This study aimed to evaluate preferences of Bukhansan dulegil using sentiment analysis, a natural language processing technique, to derive preferred and non-preferred factors. Therefore, we collected blog articles written in 2019 and produced sentimental scores by the derivation of positive and negative words in the texts for 21 dulegil courses. Then, content analysis was conducted to determine which factors led visitors to prefer or dislike each course. In blogs written about Bukhansan dulegil, positive words appeared in approximately 73% of the content, and the percentage of positive documents was significantly higher than that of negative documents for each course. Through this, it can be seen that visitors generally had positive sentiments toward Bukhansan dulegil. Nevertheless, according to the sentiment score analysis, all 21 dulegil courses belonged to both the preferred and non-preferred courses. Among courses, visitors preferred less difficult courses, in which they could walk without a burden, and in which various landscape elements (visual, auditory, olfactory, etc.) were harmonious yet distinct. Furthermore, they preferred courses with various landscapes and landscape sequences. Additionally, visitors appreciated the presence of viewpoints, such as observation decks, as a significant factor and preferred courses with excellent accessibility and information provisions, such as information boards. Conversely, the dissatisfaction with the dulegil courses was due to noise caused by adjacent roads, excessive urban areas, and the inequality or difficulty of the course which was primarily attributed to insufficient information on the landscape or section of the course. The results of this study can serve not only serve as a guide in national parks but also in the management of nearby forest green areas to formulate a plan to repair and improve dulegil. Further, the sentiment analysis used in this study is meaningful in that it can continuously monitor actual users' responses towards natural areas. However, since it was evaluated based on a predefined sentiment dictionary, continuous updates are needed. Additionally, since there is a tendency to share positive content rather than negative views due to the nature of social media, it is necessary to compare and review the results of analysis, such as with on-site surveys.

Assessing and Mapping the Aesthetic Value of Bukhansan National Park Using Geotagged Images (지오태그 이미지를 활용한 북한산국립공원의 경관미 평가 및 맵핑)

  • Kim, Jee-Young;Son, Yong-Hoon
    • Journal of the Korean Institute of Landscape Architecture
    • /
    • v.49 no.4
    • /
    • pp.64-73
    • /
    • 2021
  • The purpose of this study is to present a method to assess the landscape aesthetic value of Bukhansan National Park using geotagged images that have been shared on social media sites. The method presented in this study consisted mainly of collecting geotagged image data, identifying landscape images, and analyzing the cumulative visibility by applying a target probability index. Ramblr is an application that supports outdoor activities with many users in Korea, from which a total of 110,954 geotagged images for Bukhansan National Park were collected and used to assess the landscape aesthetics. The collected geotagged images were interpreted using the Google Vision API, and were subsequently were divided into 11 landscape image types and 9 non-landscape image types through cluster analysis. As a result of analyzing the landscape types of Bukhansan National Park based on the extracted landscape images, landscape types related to topographical characteristics, such as peaks and mountain ranges, accounted for the largest portion, and forest landscapes, foliage landscapes, and waterscapes were also commonly found as major landscape types. In the derived landscape aesthetic value map, the higher the elevation and slope, the higher the overall landscape aesthetic value, according to the proportion and characteristics of these major landscape types. However, high landscape aesthetic values were also confirmed in some areas of lowlands with gentle slopes. In addition, the Bukhansan area was evaluated to have higher landscape aesthetics than the Dobongsan area. Despite the high elevation and slope, the Dobongsan area had a relatively low landscape aesthetic value. This shows that the aesthetic value of the landscape is strongly related not only to the physical environment but also to the recreational activities of visitors who are viewing the scenery. In this way, the landscape aesthetics assessment using the cumulative visibility of geotagged images is expected to be useful for planning and managing the landscape of Bukhansan National Park in the future, through allowing the geographical understanding of the landscape values based on people's perceptions and the identification of the regional deviations.

Current Barriers of Obesity Management of Children Using Community Child Care Centers and Potential Possibility of Utilizing Mobile Phones: A Qualitative Study for Children and Caregivers (지역아동센터 이용 어린이의 비만관리의 한계점과 모바일폰의 잠재적인 활용 가능성: 어린이와 보호자 대상의 질적 연구)

  • Lee, Bo Young;Park, Mi-Young;Kim, Kirang;Shim, Jea Eun;Hwang, Ji-Yun
    • Korean Journal of Community Nutrition
    • /
    • v.25 no.3
    • /
    • pp.189-203
    • /
    • 2020
  • Objectives: This study was performed to identify the current barriers of obesity management for children using Community Child Care Centers and their caregivers (parents and teachers working in the Centers). Further, this study explored the possibility of utilizing a mobile phone application for tailored obesity prevention and management programs to overcome the current difficulties associated with children's obesity management. Methods: The qualitative data were collected through in-depth interviews with 20 obese and overweight children or children who wanted to participate in this study using Community Child Care Centers, 12 teachers working at the Centers, and a focus group interview with five parents of children using the Centers. Data were analyzed with a thematic approach categorizing themes and sub-themes based on the transcripts. Results: The current barriers of obesity management of obese and overweight children using Community Child Care Centers were lack of self-directed motivation regarding obesity management (chronic obesity-induced lifestyles and reduced self-confidence due to stigma) and lack of support from households and Community Child Care Centers (latchkey child, inconsistency in dietary guidance between the Center and household, repetitive pressure to eat, and absence of regular nutrition education). Mobile phone applications may have potential to overcome the current barriers by providing handy and interesting obesity management based on visual media (real-time tracking of lifestyles using behavior records and social support using gamification), environmental support (supplementation of parental care and network-based education between the Community Child Care Center and household), and individualized intervention (encouragement of tailored and gradual changes in eating habits and tailored goal setting). It is predicted that the real-time mobile phone program will provide information for improving nutritional knowledge and behavioral skills as well as lead to sustainable children's coping strategies regarding obesity management. In addition, it is expected that environmental factors may be improved by network-based education between the Community Child Care Centers and households using the characteristics of mobile phones, which are free from space and time constraints. Conclusions: The tailored education program for children using Community Child Care Centers based on mobile phones may prevent and reduce childhood obesity by overcoming the current barriers of obesity management for children, providing environmental and individualized support to promote healthy lifestyles and quality of life in the future.

Development and Effectiveness of a Smoking Preventive Program for Elementary Students (초등학생을 위한 흡연예방 프로그램의 개발 및 효과에 관한 연구)

  • Lee, Eun-Hye;Kim, Il-Ok
    • The Journal of Korean Academic Society of Nursing Education
    • /
    • v.9 no.2
    • /
    • pp.264-275
    • /
    • 2003
  • The purpose of this study were to develop a smoking preventive education program for elementary students and evaluate it's effectiveness. This study was a quasi experimental study under the nonequivalent control group with pretest-posttest design. The subjects of this study were 62 who are attending elementary school(31 for each group), 2 different district elementary school. The subjects were matched by grade, similar in anti-smoking educational background of smoking, as well as their residence and income level of their families. The instruments used in this study was 18 criterion referenced test items modeled by Dick & Carey that were developed by researchers for evaluating the subjects' knowledge and attitude about smoking. A pretest was administered a week before treatment The program given to the experimental group is composed of the texts explaining the poisonous substances in tobacco, social and cultural harmfulness of smoking to the body and psychology, indirect smoking, smoking of pregnant women, motives of smoking, refusal skills of smoking; and for the subjects' understanding and the better results of study - pictures, role play, discussion, text through computer based multi-media, puzzle searching for hidden pictures, cross-word puzzle, and finally compensation. The data were collected for 50 days form mid- September to the end of October in the year of 2000, composed of formative evaluation, pre-test and summative evaluation via 2 sessions. Accordingly, the collected data were analysed by t-test, paired t-test, repeated measure ANOVA by the SAS program. This research summarize the findings as follows; 1. There was a significant difference in knowledge between the experimental group(after 1 wks t=10.4680, p=.0001; after 4 wks t= 9.310, p=.0001) and control group(after 1 wks t=0.0420, p= .9669; after 4 wks t= -0.378 p=.7079) in between the results of 1 and 4 week after education in summative evaluation (F=27.45, P=.0001). 2. There was non statistical significant difference in attitude between the experimental group (after 1 wks t=1.2292, p=0.2286 ; after 4 wks t=1.330, p=0.1935) and control group (after 1 wks t=0.1819, p=0.8569 ; after 4 wks t=0.2970, p=0.7685) in between the results of 1 and 4 week after education in summative evaluation(F=0.71, P=0.494). To sum up, the statistics of conclusive analysis evaluative for the children under school age of the 'knowledge acquisition' about smoking harmfulness. On the other hand, as there was already sound attitude about smoking, the evaluation of attitude was non significant difference between control group and experimental group, just there was partially significant difference.

  • PDF