• Title/Summary/Keyword: big data convergence (빅데이터 융합)


Implementation of the Unborrowed Book Recommendation System for Public Libraries: Based on Daegu D Library (공공도서관 미대출 도서 추천시스템 구현 : 대구 D도서관을 중심으로)

  • Jin, Min-Ha;Jeong, Seung-Yeon;Cho, Eun-Ji;Lee, Myoung-Hun;Kim, Keun-Wook
    • Journal of Digital Convergence / v.19 no.5 / pp.175-186 / 2021
  • The roles and functions of domestic public libraries are diversifying, but biased book lending has caused various internal problems. In addition, with the 4th Industrial Revolution, public libraries have introduced book recommendation systems that focus on popular books, which limits the variety of books users can access. To enhance the satisfaction of public library users, this study implemented an unborrowed book recommendation system, limiting its spatial scope to Duryu Library in Daegu. The system was built from loan records (213,093 cases), user information (35,561 people), and other data, using cluster analysis, topic modeling, and a content-based filtering recommendation algorithm. A satisfaction survey of actual users was then conducted to assess the feasibility and implications of the system. The majority of users reported high satisfaction, and satisfaction was relatively high in groups defined by specific gender, age, occupation, and usual reading habits. The results are expected to help alleviate problems such as biased book lending and reduced operational efficiency in public libraries. Limitations of the study are also presented.
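The content-based filtering step mentioned in this abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the keyword lists per book, the TF-IDF weighting, and the rule that "unborrowed" means a loan count of zero are all assumptions made for illustration, and the cluster analysis and topic modeling stages are left out.

```python
import math
from collections import Counter


def tfidf_vectors(docs):
    """Return one TF-IDF vector (term -> weight) per tokenized document."""
    n = len(docs)
    df = Counter(term for doc in docs for term in set(doc))
    vectors = []
    for doc in docs:
        tf = Counter(doc)
        vectors.append({
            term: (count / len(doc)) * (math.log(n / df[term]) + 1.0)
            for term, count in tf.items()
        })
    return vectors


def cosine(a, b):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(w * b.get(t, 0.0) for t, w in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def recommend_unborrowed(books, user_history, loan_counts, top_n=3):
    """Rank never-borrowed books by similarity to the user's loan history.

    books: title -> list of keywords (hypothetical input format)
    user_history: titles this user has borrowed
    loan_counts: title -> total number of loans in the library
    """
    titles = list(books)
    vecs = dict(zip(titles, tfidf_vectors([books[t] for t in titles])))
    # The user profile is the sum of the vectors of the borrowed books.
    profile = Counter()
    for title in user_history:
        profile.update(vecs.get(title, {}))
    # Assumption: a book counts as "unborrowed" if nobody has ever loaned it.
    candidates = [t for t in titles if loan_counts.get(t, 0) == 0]
    ranked = sorted(candidates, key=lambda t: cosine(profile, vecs[t]), reverse=True)
    return ranked[:top_n]
```

Restricting the candidate set to books with no loans is what separates this sketch from a popularity-driven recommender: popular titles are never suggested, whatever their similarity score.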

MF sampler: Sampling method for improving the performance of a video based fashion retrieval model (MF sampler: 동영상 기반 패션 검색 모델의 성능 향상을 위한 샘플링 방법)

  • Baek, Sanghun;Park, Jonghyuk
    • Journal of Intelligence and Information Systems / v.28 no.4 / pp.329-346 / 2022
  • As the market for short-form videos on social media (Instagram, TikTok, YouTube) has grown, research using them has become active in the artificial intelligence field. A representative research area is Video to Shop, which detects fashion products in videos and searches for matching product images. Such video-based models extract product features using convolution operations. However, because computational resources are limited, extracting features from every frame of a video is practically impossible. For this reason, existing studies have improved model performance by sampling only a subset of the frames or by developing sampling methods that exploit the subject's characteristics. Existing Video to Shop studies sample frames either randomly or at even intervals. However, these sampling methods degrade the performance of the fashion product search model because they also sample noise frames in which no product appears. This paper therefore proposes the MF (Missing Fashion items on frame) sampler, a sampling method that removes noise frames and improves the performance of the search model. The MF sampler mitigates the resource limitation through a keyframe mechanism, and it improves search performance by removing noise frames with a noise detection model. Experiments confirmed that the proposed method improves the model's performance and makes model training more effective.
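The contrast between even-interval sampling and noise-aware sampling can be sketched as follows. This is only an illustration under stated assumptions: `has_fashion_item` is a hypothetical stand-in for the paper's noise detection model, and the keyframe mechanism is not reproduced here.

```python
from typing import Callable, List, Sequence, TypeVar

Frame = TypeVar("Frame")


def uniform_sample(frames: Sequence[Frame], k: int) -> List[Frame]:
    """Baseline: pick up to k frames at even intervals."""
    if k <= 0 or not frames:
        return []
    k = min(k, len(frames))
    step = len(frames) / k
    return [frames[int(i * step)] for i in range(k)]


def noise_filtered_sample(
    frames: Sequence[Frame],
    k: int,
    has_fashion_item: Callable[[Frame], bool],
) -> List[Frame]:
    """Drop frames flagged as noise, then sample evenly from the rest.

    has_fashion_item is a hypothetical predicate; in practice it would
    wrap a trained detector that reports whether a product is visible.
    """
    kept = [frame for frame in frames if has_fashion_item(frame)]
    return uniform_sample(kept, k)
```

Filtering before sampling means the frame budget `k` is spent only on frames that can contribute product features, which is the effect the abstract attributes to noise frame removal.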

Analyzing TripAdvisor application reviews to enable smart tourism : focusing on topic modeling (스마트 관광 활성화를 위한 트립어드바이저 애플리케이션 리뷰 분석 : 토픽 모델링을 중심으로)

  • YuNa Lee;MuMoungCho Han;SeonYeong Yu;MeeQi Siow;Mijin Noh;YangSok Kim
    • Smart Media Journal / v.12 no.8 / pp.9-17 / 2023
  • Advances in information and communication technology and the spread of smart devices have changed the form of tourism, giving rise to the concept of smart tourism. Research on smart tourism has been conducted in various areas such as policy implementation and surveys, but research on application reviews is lacking. This study collects TripAdvisor application reviews from the Google Play Store and uses Latent Dirichlet Allocation (LDA) topic modeling to identify how the application is used and how satisfied users are. The analysis yields four topics, two positive and two negative. Users were satisfied with the application's recommendation system, but were dissatisfied when the filters they set during a search were not applied or when reviews were not published after the application was updated. We suggest that more categories be added to the application to offer users different experiences. In addition, user satisfaction is expected to improve if problems within the application, including the filter function, are identified, the application environment is checked, and errors that occur during use are resolved.

Estimation of Bridge Vehicle Loading using CCTV images and Deep Learning (CCTV 영상과 딥러닝을 이용한 교량통행 차량하중 추정)

  • Suk-Kyoung Bae;Wooyoung Jeong;Soohyun Choi;Byunghyun Kim;Soojin Cho
    • Journal of the Korea institute for structural maintenance and inspection / v.28 no.3 / pp.10-18 / 2024
  • Vehicle loading is one of the main causes of bridge deterioration. Although WiM (Weigh in Motion) can be used to measure vehicle loading on a bridge, it has the disadvantage of high installation and maintenance costs because it is a contact-type system. In this study, a non-contact method is proposed to estimate the vehicle loading history of bridges using deep learning and CCTV images. The proposed method recognizes the vehicle type using an object detection deep learning model and estimates the vehicle loading from a load-based vehicle type classification table, which was developed using the empty-vehicle weights of major domestic vehicle models. Faster R-CNN, an object detection deep learning model, was trained using vehicle images labeled according to the classification table. The performance of the model was verified using CCTV images from actual bridges. Finally, the vehicle loading history of an actual bridge was obtained for a specific period by continuously estimating the vehicle loadings on the bridge with the proposed method.
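The table-lookup step that turns detected vehicle types into a loading history can be sketched as below. The class names and weights are hypothetical placeholders, not the values in the paper's classification table, and the detections are assumed to come from an upstream object detector such as the Faster R-CNN model named in the abstract.

```python
from typing import Dict, Iterable, List, Tuple

# Hypothetical empty-vehicle weights in tonnes; NOT the paper's table.
CLASS_WEIGHT_T: Dict[str, float] = {"car": 1.5, "bus": 12.0, "truck": 10.0}


def loading_history(
    detections: Iterable[Tuple[float, List[str]]],
    weights: Dict[str, float] = CLASS_WEIGHT_T,
) -> List[Tuple[float, float]]:
    """Map per-frame detections to (timestamp, estimated total load).

    detections: (timestamp in seconds, vehicle class labels seen on the
    bridge in that frame), as an object detector might report them.
    """
    return [
        (timestamp, sum(weights[label] for label in labels))
        for timestamp, labels in detections
    ]
```

For example, `loading_history([(0.0, ["car", "bus"])])` sums the two table entries for that frame. Because the table holds empty-vehicle weights, the estimate is a lower bound for loaded vehicles.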

Analysis of media trends related to spent nuclear fuel treatment technology using text mining techniques (텍스트마이닝 기법을 활용한 사용후핵연료 건식처리기술 관련 언론 동향 분석)

  • Jeong, Ji-Song;Kim, Ho-Dong
    • Journal of Intelligence and Information Systems / v.27 no.2 / pp.33-54 / 2021
  • With the fourth industrial revolution and the arrival of the New Normal era brought on by COVID-19, non-contact technologies such as artificial intelligence and big data research have become increasingly important. Convergence research is being conducted in earnest to keep up with these trends, but few studies in the nuclear field have used artificial intelligence and big data technologies such as natural language processing and text mining. This study was conducted to confirm the applicability of data science analysis techniques to nuclear research. Furthermore, identifying trends in the perception of spent nuclear fuel is critical for setting the direction of nuclear industry policy and for responding in advance to changes in industrial policy. For these reasons, this study conducted a media trend analysis of pyroprocessing, a spent nuclear fuel treatment technology. We objectively analyzed changes in media perception of spent nuclear fuel dry treatment technology by applying text mining techniques. Naver web news articles containing the keywords "Pyroprocessing" and "Sodium Cooled Reactor" were collected with Python code to identify changes in perception over time. The analysis period ran from 2007, when the first article was published, to 2020, and a detailed, multi-layered analysis of the text data was carried out using word clouds based on frequency analysis, TF-IDF, and degree centrality. Keyword frequency analysis showed that media perception of spent nuclear fuel dry treatment technology changed in the mid-2010s, influenced by the Gyeongju earthquake in 2016 and the new government's energy conversion policy in 2017.
Trend analysis was therefore conducted on the basis of this period, and word frequencies, TF-IDF, degree centrality values, and semantic network graphs were derived. The analysis shows that before the 2010s, media perception of spent nuclear fuel dry treatment technology was diplomatic and positive. Over time, however, the frequency of keywords such as "safety", "reexamination", "disposal", and "disassembly" increased, indicating that the sustainability of the technology is being seriously questioned. Social awareness also changed as the standing of spent nuclear fuel dry treatment technology, previously regarded as a political and diplomatic technology, became ambiguous following changes in domestic policy. This means that domestic policy changes, such as nuclear power policy, have a greater impact on media perception than issues concerning "spent nuclear fuel processing technology" itself. This appears to be because nuclear policy is more widely discussed and more familiar to the public than spent nuclear fuel. Therefore, to improve social awareness of spent nuclear fuel processing technology, it would be necessary to provide sufficient information about it, and linking it to nuclear policy issues would also help. In addition, the study highlighted the importance of social science research on nuclear power. The social sciences need to be applied widely to nuclear engineering, and we confirmed that the nuclear industry can be sustainable if national policy changes are taken into account. However, this study is limited in that it applied big data analysis methods only to a narrow research area, "Pyroprocessing," a spent nuclear fuel dry processing technology. Furthermore, the cause of the change in social perception could not be clearly established, and only news articles were analyzed to gauge social perception.
If these limitations are addressed in a future media trend analysis study on nuclear power, more reliable results are expected, and they could be used efficiently in nuclear policy research. The development of non-contact technologies such as artificial intelligence and big data research is accelerating with the arrival of the New Normal era caused by COVID-19. Convergence research is being conducted in earnest in various fields to follow these trends, but few studies in the nuclear field have used artificial intelligence and big data technologies such as natural language processing and text mining. The academic significance of this study is that it confirmed the applicability of data science analysis techniques to nuclear research. Furthermore, because current government energy policies such as nuclear power plant reductions have prompted a re-evaluation of spent fuel treatment technology research, keyword analysis in this field can help guide future research. It is important to consider outside views, not just the safety technology and engineering integrity of nuclear power, and to reconsider whether it is appropriate to discuss nuclear engineering technology only internally. In addition, if multidisciplinary research on nuclear power is carried out, reasonable alternatives can be prepared to sustain the nuclear industry.
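The degree centrality measure named in this abstract can be illustrated on a keyword co-occurrence network. The sketch below assumes that two keywords are linked when they appear in the same article and that centrality is normalized by the number of other nodes; the abstract does not state how the authors built their network, so both choices are assumptions.

```python
from itertools import combinations
from typing import Dict, Iterable, List, Set


def degree_centrality(docs: Iterable[List[str]]) -> Dict[str, float]:
    """Normalized degree of each keyword in a co-occurrence network.

    docs: one keyword list per article. Two keywords are connected if
    they occur together in at least one article (an assumed rule).
    """
    neighbors: Dict[str, Set[str]] = {}
    for doc in docs:
        for a, b in combinations(sorted(set(doc)), 2):
            neighbors.setdefault(a, set()).add(b)
            neighbors.setdefault(b, set()).add(a)
    n = len(neighbors)
    if n < 2:
        return {}
    # Divide by the maximum possible number of neighbors, n - 1.
    return {term: len(linked) / (n - 1) for term, linked in neighbors.items()}
```

Computing this separately for the articles before and after a chosen year would show which keywords, for example "safety" or "disposal", moved toward the center of the network over time.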

Building a Korean Sentiment Lexicon Using Collective Intelligence (집단지성을 이용한 한글 감성어 사전 구축)

  • An, Jungkook;Kim, Hee-Woong
    • Journal of Intelligence and Information Systems / v.21 no.2 / pp.49-67 / 2015
  • The emergence of big data and social media has ushered in a data "big bang". Social networking services are widely used around the world and have become one of the major communication tools for all ages. Over the last decade, as online social networking sites have become increasingly popular, companies have tended to focus on advanced social media analysis for their marketing strategies. Beyond social media analysis, companies are mainly concerned about the spread of negative opinions on social networking sites such as Facebook and Twitter, as well as on e-commerce sites. Online word of mouth (WOM), such as product ratings, product reviews, and product recommendations, is very influential, and negative opinions have a significant impact on product sales. This trend has drawn researchers' attention to natural language processing tasks such as sentiment analysis. Sentiment analysis, also referred to as opinion mining, is the process of identifying the polarity of subjective information and has been applied in various research and practical fields. However, Korean (Hangul) poses obstacles for natural language processing because it is an agglutinative language with rich morphology. As a result, Korean natural language processing resources such as sentiment lexicons are scarce, which significantly limits researchers and practitioners who are considering sentiment analysis. Our study builds a Korean sentiment lexicon using collective intelligence and provides an API (Application Programming Interface) service to open and share the sentiment lexicon data with the public (www.openhangul.com). For pre-processing, we created a Korean lexicon database with over 517,178 words and classified them into sentiment and non-sentiment words.
To classify them, we first identified stop words, which are likely to play a negative role in sentiment analysis, and excluded them from sentiment scoring. In general, sentiment words are nouns, adjectives, verbs, and adverbs, as they can carry positive, neutral, or negative sentiment. Non-sentiment words, on the other hand, are interjections, determiners, numerals, postpositions, and the like, as they generally carry no sentiment. To build a reliable sentiment lexicon, we adopted the concept of collective intelligence as a model for crowdsourcing. In addition, the concept of folksonomy was applied in the taxonomy process to support collective intelligence. To compensate for an inherent weakness of folksonomy, we adopted majority rule by building a voting system. Participants, as voters, were offered three options to choose from (positive, negative, and neutral), and the voting was conducted on one of the largest social networking sites for college students in Korea. More than 35,000 votes have been cast by college students in Korea, and we keep the voting system open by maintaining the project as a perpetual study. Moreover, any change in the sentiment score of a word can be an important observation because it lets us track temporal changes in Korean as a natural language. Our study also offers a RESTful, JSON-based API service through a web platform to make the lexicon easier to use for researchers, companies, and developers. Finally, our study makes important contributions to both research and practice. In terms of research, our Korean sentiment lexicon serves as an important resource for Korean natural language processing. In terms of practice, practitioners such as managers and marketers can implement sentiment analysis effectively by using the Korean sentiment lexicon we built.
Moreover, our study sheds new light on the value of folksonomy by combining it with collective intelligence, and we expect it to give a new direction and a new start to the development of Korean natural language processing.
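The majority-rule voting described in this abstract can be sketched as follows. The minimum vote count and the rule that ties leave a word unlabeled are assumptions added for illustration; the abstract only states that voters chose among positive, negative, and neutral and that majority rule was applied.

```python
from collections import Counter
from typing import Dict, List, Optional


def majority_label(votes: List[str], min_votes: int = 3) -> Optional[str]:
    """Return the winning label, or None if the result is not decisive.

    min_votes and the tie rule are hypothetical safeguards, not details
    taken from the study.
    """
    if len(votes) < min_votes:
        return None
    ranked = Counter(votes).most_common()
    if len(ranked) > 1 and ranked[0][1] == ranked[1][1]:
        return None  # tie between the two most frequent labels
    return ranked[0][0]


def build_lexicon(ballots: Dict[str, List[str]]) -> Dict[str, str]:
    """Keep only the words whose votes produce a clear majority label."""
    lexicon = {}
    for word, votes in ballots.items():
        label = majority_label(votes)
        if label is not None:
            lexicon[word] = label
    return lexicon
```

Because the labels are recomputed from the ballots, rerunning `build_lexicon` as new votes arrive would also reflect the shifts in word sentiment over time that the abstract mentions.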

Analyzing the Trend of False·Exaggerated Advertisement Keywords Using Text-mining Methodology (1990-2019) (텍스트마이닝 기법을 활용한 허위·과장광고 관련 기사의 트렌드 분석(1990-2019))

  • Kim, Do-Hee;Kim, Min-Jeong
    • The Journal of the Korea Contents Association / v.21 no.4 / pp.38-49 / 2021
  • This study analyzed trends in the term 'false and exaggerated advertisement' in 5,141 newspaper articles from 1990 to 2019 using text mining methodology. First, we identified the most frequent keywords related to false and exaggerated advertisements through frequency analysis of all the articles and examined the context linking the extracted keywords. Next, to examine how false and exaggerated advertisements have changed, we performed the frequency analysis separately for each 10-year period and identified the tendency of the keywords that became issues by comparing the number of academic papers on the top keywords of each year. Finally, we identified trends in false and exaggerated advertisements from the detailed keywords within each topic using topic modeling. The results confirmed that topics that became issues at a specific time were extracted as frequent keywords, and that keyword trends by period changed in connection with social and environmental factors. This study is meaningful in that it helps consumers spend wisely by building background knowledge about unfair advertising. Furthermore, the core keyword extraction is expected to convey the true purpose of advertising and its implications to companies and employees who commit misconduct.

A Simulation Study on Image Quality of Virtual Monochromatic Image using Dual-energy Method (이중에너지 방법을 이용한 가상 단색 영상의 화질 시뮬레이션 연구)

  • Son, Ki-Hong;Lee, Soo-Yeul;Kim, Dae-Hong;Chung, Myung-Ae
    • Journal of the Korean Society of Radiology / v.16 no.5 / pp.553-558 / 2022
  • This simulation study evaluated the image quality of virtual monochromatic (VM) images of blood vessels in comparison with monochromatic images. Dual-energy images were obtained from the linear attenuation coefficients of five materials at 50 keV (low energy) and 80 keV (high energy). A weighting factor is required to synthesize the VM image, and liver and bone were used as basis materials to obtain it. VM images were synthesized at energies ranging from 30 keV to 100 keV. Image quality was evaluated by contrast-to-noise ratio (CNR) and noise, with calcium and contrast medium set as signals and blood as background. According to the results, the energies with the maximum CNR were 50 keV and 60 keV for calcium and contrast medium, respectively. The energies showing the minimum noise were 70 keV, 70 keV, and 60 keV for calcium, iodine contrast medium, and blood, respectively. The VM image can contribute to improved diagnostic performance in CT examinations because it can produce an image at the optimal energy that minimizes noise and maximizes CNR.
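The two quantities in this abstract, a weighted synthesis of low- and high-energy images and the CNR used to score it, can be sketched as below. Both formulas are common textbook forms assumed here for illustration: the abstract does not give the paper's exact weighting scheme or CNR definition, and the function names are hypothetical.

```python
from statistics import mean, pstdev
from typing import List, Sequence


def virtual_mono(low: Sequence[float], high: Sequence[float], w: float) -> List[float]:
    """Blend low- and high-energy pixel values with weighting factor w.

    Assumed form: VM = w * low + (1 - w) * high, applied pixel by pixel.
    """
    return [w * lo + (1.0 - w) * hi for lo, hi in zip(low, high)]


def cnr(signal: Sequence[float], background: Sequence[float]) -> float:
    """Contrast-to-noise ratio of a signal region against a background.

    Assumed definition: |mean(signal) - mean(background)| divided by the
    standard deviation of the background region.
    """
    return abs(mean(signal) - mean(background)) / pstdev(background)
```

Sweeping `w` (or, equivalently, the target energy) and keeping the value with the highest `cnr` for a calcium or contrast-medium region against blood mirrors the kind of search for an optimal energy that the abstract reports.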

A Study on the Possibility of Pancreas Detection through Extraction of Effective Atomic Number using a Simulation such as Dual-energy CT (이중에너지 CT와 같은 시뮬레이션을 이용한 유효원자번호 추출을 통한 췌장 검출 가능성 연구)

  • Son, Ki-Hong;Lee, Soo-Yeul;Chung, Myung-Ae;Kim, Dae-Hong
    • Journal of the Korean Society of Radiology / v.16 no.5 / pp.537-543 / 2022
  • The purpose of this simulation study was to evaluate the possibility of pancreas detection through effective atomic number information using dual-energy computed tomography (CT). The effective atomic numbers of 10 tissue-equivalent materials were estimated through stoichiometric calibration, using the HU values of the 10 materials at low energy (80 kV) and high energy (140 kV). Based on this method, the effective atomic number image of the tissue-equivalent materials was extracted through an iterative algorithm. According to the results, the attenuation ratio as a function of effective atomic number was fitted with an R² value of 0.9999, and the effective atomic numbers of pancreas, water, liver, blood, spongiosa, and cortical bone were overall within 1% of the theoretical values. Conventional pancreatic cancer examinations use a contrast medium, which carries a risk of side effects. Extracting the effective atomic number using dual-energy CT without contrast enhancement could address this problem and contribute to accurate and safe examinations. Building on this study, future research will address the detection of pancreatic cancer using its HU values in clinical images.

AI Art Creation Case Study for AI Film & Video Content (AI 영화영상콘텐츠를 위한 AI 예술창작 사례연구)

  • Jeon, Byoungwon
    • The Journal of the Convergence on Culture Technology / v.7 no.2 / pp.85-95 / 2021
  • Currently, we stand between computers as creative tools and computers as creators. A new genre of movies, which can be called a post-cinema situation, is emerging. This paper aims to diagnose the possibility of the emergence of AI cinema. To confirm this possibility, a case study examined whether artificial intelligence can create the story, narrative, image, and sound that are necessary conditions for film creation. First, we reviewed visual creation by the AI painting algorithms Obvious, GAN, and CAN. Second, AI music has already entered the distribution stage in the market in cooperation with humans. Third, AI can already complete drama scripts, and automatic scenario creation programs using big data are also gaining popularity. We thus confirmed that the requirements of filmmaking can be met with AI algorithms. From the perspective of Manovich's 'AI Genre Convention', web documentaries and desktop documentaries, which are typical post-cinema trends, can be regarded as the representative genres to be expected of AI cinema. The conditions for AI, web documentaries, and desktop documentaries to exist are the same. This article suggests a new path for the media of the 4th Industrial Revolution era through research on AI as a creator of post-cinema.