A Design of Similar Video Recommendation System using Extracted Words in Big Data Cluster (빅데이터 클러스터에서의 추출된 형태소를 이용한 유사 동영상 추천 시스템 설계)

  • Lee, Hyun-Sup;Kim, Jindeog
    • Journal of the Korea Institute of Information and Communication Engineering / v.24 no.2 / pp.172-178 / 2020
  • To recommend content, companies generally use collaborative filtering, which takes into account both user preferences and video (item) similarities. Such services are primarily intended to improve user convenience by leveraging personal preferences such as search keywords and viewing time. Videos are also ranked by the keywords assigned to them. However, analyzing video similarity with a limited set of keywords has its limits, and the problem becomes serious when the assigned keywords do not properly reflect the item. This paper proposes a system that identifies the characteristics of a video automatically, without human intervention, and then analyzes similarities between videos and makes recommendations. The proposed system analyzes similarity by taking into account all words (keywords) with distinct meanings extracted from the training videos; because of the large scale of the data and operations, the processing is handled by a big data cluster.
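As a rough illustration of word-based video similarity, the sketch below compares videos by the cosine similarity of their extracted word counts. The abstract does not state which similarity measure or cluster framework the authors use, so cosine similarity, the function names, and the toy data are all assumptions made for this example; it runs on a single machine rather than a big data cluster.

```python
import math
from collections import Counter


def cosine_similarity(words_a, words_b):
    """Cosine similarity between two bags of extracted words."""
    counts_a, counts_b = Counter(words_a), Counter(words_b)
    shared = counts_a.keys() & counts_b.keys()
    dot = sum(counts_a[w] * counts_b[w] for w in shared)
    norm_a = math.sqrt(sum(c * c for c in counts_a.values()))
    norm_b = math.sqrt(sum(c * c for c in counts_b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # an empty bag of words is similar to nothing
    return dot / (norm_a * norm_b)


def rank_similar(target, videos):
    """Return (video_id, score) pairs for all other videos, best first."""
    scores = [
        (vid, cosine_similarity(videos[target], words))
        for vid, words in videos.items()
        if vid != target
    ]
    return sorted(scores, key=lambda pair: pair[1], reverse=True)


# Hypothetical extracted words per video (illustrative data only).
videos = {
    "v1": ["python", "tutorial", "loops", "python"],
    "v2": ["python", "tutorial", "functions"],
    "v3": ["cooking", "pasta", "recipe"],
}
ranking = rank_similar("v1", videos)
```

Here `ranking` lists `v2` ahead of `v3` for the target `v1`, because `v1` and `v2` share words while `v1` and `v3` share none.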

PDF Version 1.4-1.6 Password Cracking in CUDA GPU Environment (PDF 버전 1.4-1.6의 CUDA GPU 환경에서 암호 해독 최적 구현)

  • Hyun Jun, Kim;Si Woo, Eum;Hwa Jeong, Seo
    • KIPS Transactions on Computer and Communication Systems / v.12 no.2 / pp.69-76 / 2023
  • Hundreds of thousands of passwords are lost or forgotten every year, making the necessary information unavailable to legitimate owners or authorized law enforcement personnel. Recovering such a password requires a password cracking tool. Using GPUs instead of CPUs for password cracking makes it possible to process quickly the large amount of computation required during recovery. This paper optimizes password cracking on GPUs using CUDA, focusing on PDF versions 1.4-1.6, currently the most widely used. The techniques used include eliminating unnecessary operations in the MD5 algorithm, implementing 32-bit word integration for the RC4 algorithm, and using shared memory. In addition, autotuning was used to search for the number of blocks and threads, which affect performance. As a result, we achieved throughput of 31,460 kp/s (kilo passwords per second) and 66,351 kp/s on the RTX 3060 and RTX 3090, respectively, at a block size of 65,536 and a thread size of 96, improving throughput by 22.5% and 15.2%, respectively, compared with hashcat, the existing cracking tool with the highest throughput.
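For readers unfamiliar with RC4, one of the primitives the abstract names, the following is a plain byte-wise reference implementation. It is only a sketch of the textbook algorithm: it is not the paper's CUDA kernel and does not include the 32-bit word integration, MD5 changes, or shared-memory use described above.

```python
def rc4(key: bytes, data: bytes) -> bytes:
    """Encrypt or decrypt data with RC4 (the operation is symmetric)."""
    # Key-scheduling algorithm: permute the state array using the key.
    state = list(range(256))
    j = 0
    for i in range(256):
        j = (j + state[i] + key[i % len(key)]) % 256
        state[i], state[j] = state[j], state[i]

    # Pseudo-random generation: XOR each data byte with a keystream byte.
    i = j = 0
    out = bytearray()
    for byte in data:
        i = (i + 1) % 256
        j = (j + state[i]) % 256
        state[i], state[j] = state[j], state[i]
        out.append(byte ^ state[(state[i] + state[j]) % 256])
    return bytes(out)


ciphertext = rc4(b"Key", b"Plaintext")
recovered = rc4(b"Key", ciphertext)
```

Because encryption and decryption are the same operation, applying `rc4` twice with the same key returns the original bytes. A password cracker repeats this kind of computation for every candidate password, which is why per-byte operations are a natural target for GPU optimization.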

Keyword Analysis of Arboretums and Botanical Gardens Using Social Big Data

  • Shin, Hyun-Tak;Kim, Sang-Jun;Sung, Jung-Won
    • Journal of People, Plants, and Environment / v.23 no.2 / pp.233-243 / 2020
  • This study collected social big data, which is used in various fields, from the past 9 years and analyzed the patterns of major keywords related to arboretums and botanical gardens, to serve as basic data for establishing future operational strategies for arboretums and botanical gardens. A total of 6,245,278 cases of data were collected: 4,250,583 from blogs (68.1%), 1,843,677 from online cafes (29.5%), and 151,018 from knowledge search engines (2.4%). After refining for valid data, 1,223,162 cases were selected for analysis. We used the big data program Textom to derive keywords for arboretums and botanical gardens through text mining analysis. As a result, we identified keywords such as 'travel', 'picnic', 'children', 'festival', 'experience', 'Garden of Morning Calm', 'program', 'recreation forest', 'healing', and 'museum'. The keyword analysis showed that keywords such as 'healing', 'tree', 'experience', 'garden', and 'Garden of Morning Calm' received high public interest. We conducted word cloud analysis by extracting high-frequency keywords from a total of 6,245,278 titles on social media. The results showed that arboretums and botanical gardens were perceived as spaces for relaxation and leisure, as in 'travel', 'picnic', and 'recreation', and that people had high interest in educational aspects, as in 'experience' and 'field trip'. The demand for rest and leisure space, education, and things to see and enjoy in arboretums and botanical gardens has increased compared with the past. Therefore, future operational strategies need differentiation and specialization in areas such as plant collection, exhibition planning, and programs.

A Study on the Consumer Perception of Metaverse Before and After COVID-19 through Big Data Analysis (빅데이터 분석을 통한 코로나 이전과 이후 메타버스에 대한 소비자의 인식에 관한 연구)

  • Park, Sung-Woo;Park, Jun-Ho;Ryu, Ki-Hwan
    • The Journal of the Convergence on Culture Technology / v.8 no.6 / pp.287-294 / 2022
  • The purpose of this study is to examine consumers' perceptions of the "metaverse," a newly spotlighted technology, through big data analysis, as the non-face-to-face society continues after the outbreak of COVID-19. The study conducted a big data analysis using text mining to compare consumers' perceptions of the metaverse before and after COVID-19. The top 30 keywords were extracted after data cleansing, and on this basis the relationships among keywords were visualized through network analysis and CONCOR analysis. The analysis confirmed that the non-face-to-face society continued and that the metaverse emerged as a trend. Before COVID-19, the metaverse was associated mainly with text-based data such as SNS, as a part of life logging; afterward, attention shifted to virtual reality spaces, with many platforms created and industries expanding. A limitation of this study is that, because the data was collected from the search frequency of portal sites, where anonymity is guaranteed, demographic characteristics were not reflected in the data.

A Study on Research Trends in Literacy Education through a Key word Network Analysis (키워드 네트워크 분석을 통한 리터러시 교육 연구 동향)

  • Lee, Woo-Jin;Baek, Hye-Jin
    • Journal of Digital Convergence / v.20 no.5 / pp.53-59 / 2022
  • The purpose of this study is to examine the factors related to learning through an analysis of domestic research trends in literacy and to present a direction for literacy education. Research papers from 1993 to February 2022 were collected using RISS. 'Literacy' and 'Education' were used as search keywords, and 200 papers were selected for analysis. Keyword network analysis showed that 118 keywords appeared at least three times out of a total of 810 keywords. The keywords with the highest frequency were, in order, 'digital literacy', 'media literacy', and 'elementary school'. Based on the analysis results, the following directions are suggested. First, an online teaching and learning resource platform should be established and linked with education policy. Second, literacy competencies should be defined and ways to improve them sought. Third, a digital-based convergence education model should be developed. This study is meaningful in that it analyzed the most recent literacy studies and suggested a direction for literacy education.
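A minimal sketch of how a keyword network of this kind could be built: count how many papers each keyword appears in, keep those at or above a frequency threshold, and count how often the kept keywords co-occur in the same paper. The abstract does not name the tool or the edge definition the authors used, so the co-occurrence weighting, the function name `keyword_network`, and the toy data are assumptions for illustration.

```python
from collections import Counter
from itertools import combinations


def keyword_network(papers, min_freq=3):
    """Return (keyword frequencies, co-occurrence edge weights).

    papers   -- list of keyword lists, one list per paper
    min_freq -- keep only keywords appearing in at least this many papers
    """
    freq = Counter(kw for keywords in papers for kw in set(keywords))
    kept = {kw for kw, count in freq.items() if count >= min_freq}

    edges = Counter()
    for keywords in papers:
        present = sorted(set(keywords) & kept)
        # Each unordered pair of kept keywords in one paper adds weight 1.
        for a, b in combinations(present, 2):
            edges[(a, b)] += 1
    return freq, edges


# Hypothetical keyword lists (illustrative data only).
papers = [
    ["digital literacy", "elementary school"],
    ["digital literacy", "media literacy"],
    ["digital literacy", "media literacy", "elementary school"],
    ["media literacy", "curriculum"],
]
freq, edges = keyword_network(papers, min_freq=2)
```

With `min_freq=3`, the threshold mirrors the "at least three times" cutoff reported in the abstract; the toy data above uses 2 only because it contains four papers.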

Conceptualization of IT Humanities through Keyword Topic Modeling (주제어 토픽모델링을 통한 IT 인문학 개념의 정립)

  • Youngmi Choi;Namje Park
    • Journal of The Korean Association of Information Education / v.26 no.5 / pp.467-480 / 2022
  • This paper explores research trends in order to conceptualize IT humanities. Drawing on domestic and international references that focus on the possibility of integrating digital technology and the humanities, the authors examined the beginning, background, and relevant concepts of IT humanities to establish its meaning and research trends. In addition, using the search word "IT humanities," the authors analyzed network topics of the keywords retrieved from 1,566 KCI and 64 SCI journal articles published since 2001. In previous studies, the concept of IT humanities has tended to be associated with competencies for considering various fields of IT through the lens of the humanities. The topic modeling revealed four groups: fields to be integrated with IT humanities, methods of implementation, connections with literature or culture, and creations of IT humanities. Rather than instrumentalizing one discipline or merging both under the stance of either IT or the humanities, it is imperative to work collaboratively to generate a new viewpoint through mutual respect between disciplines.

A Heuristic Method of In-situ Drought Using Mass Media Information

  • Lee, Jiwan;Kim, Seong-Joon
    • Proceedings of the Korea Water Resources Association Conference / 2020.06a / pp.168-168 / 2020
  • This study evaluates the characteristics of drought-related big data published in South Korea by developing a crawler. Drought-related articles posted over 5 years (2013 ~ 2017) were collected from the Korean internet search engine 'NAVER', which covers 13 main and 81 local daily newspapers. Over the 5-year period, a total of 40,219 news articles containing the word 'drought' were found using the crawler. To filter out figurative uses of the word that are common in South Korea, such as a goal drought in soccer, a money drought in economics, and a policy drought in politics, quality control was applied and 47.8 % of the articles were filtered out. The remaining 20,999 (52.2 %) drought news articles were classified into four categories: water deficit (WD), water security and support (WSS), economic damage and impact (EDI), and environmental and sanitation impact (ESI), with 27, 15, 13, and 18 drought-related keywords, respectively. WD, WSS, EDI, and ESI accounted for 41.4 %, 34.5 %, 14.8 %, and 9.3 %, respectively. Drought articles were posted most often in June 2015 and June 2017, with 22.7 % (15,097) and 15.9 % (10,619), respectively. The drought news articles were compared spatiotemporally with the calculated SPI (Standardized Precipitation Index) and RDI (Reservoir Drought Index). They were grouped by the administrative boundaries of 8 main cities and 9 provinces in South Korea, because drought response operates at the local government level. Space-time clustering between the news articles (WD, WSS, EDI, and ESI) and the indices (SPI and RDI) was examined to see how strongly they correlate with each other. Spatiotemporal cluster detection was applied using SaTScan software (Kulldorff, 2015). Retrospective and prospective cluster analyses were conducted for past and present periods to understand how intense the clusters are. News articles in the WD, WSS, and EDI categories formed strong clusters in provinces, and ESI articles in cities.

A study on the current status of DIY clothing products related to fabric using text mining (텍스트마이닝을 활용한 패브릭 관련 DIY 의류 상품 현황 연구)

  • Eun-Hye Lee;Ha-Eun Lee;Jeong-Wook Choi
    • Journal of the Korea Fashion and Costume Design Association / v.25 no.2 / pp.111-122 / 2023
  • This study aims to collect Big Data related to DIY clothing, analyze the results on a year-by-year basis, and understand consumers' perceptions and the current status of DIY clothing. The reference period for evaluating DIY clothing trends was set from 2012 to 2022. The data was collected and analyzed using Textom, a Big Data solution program certified as Good Software by the Telecommunications Technology Association (TTA). For the analysis of fabric-related DIY products, the keyword was set to "DIY clothing", and the "Espresso K" module was used for data cleansing after collection. Data was collected on a year-by-year basis, generating a total of 11 lists, and the collected data was analyzed by period. The findings are as follows. The total number of keywords collected over a period of ten years on the search engines "Naver" and "Google" between January 1, 2012 and December 31, 2022 was 16,315, and the data by period shows a continuous upward trend. In addition, a keyword analysis was conducted using TF-IDF (Term Frequency-Inverse Document Frequency), a statistical measure that reflects the importance of a word within the data, together with N-gram analysis, which examines the relationships between words. These results made it possible to evaluate the popularity and growth of DIY clothing products in conjunction with the evolving social environment, as well as consumers' desire to explore DIY trends. This study is therefore valuable in that it provides preliminary data for DIY clothing research by analyzing the current status of DIY products and, furthermore, contributes to the development and production of DIY clothing.
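The two measures named in the abstract can be sketched in a few lines. The example below uses the plain textbook form of TF-IDF (term frequency times log(N / document frequency)) and adjacent-word bigrams as the N-gram; the exact weighting and N-gram settings used by Textom are not given in the abstract, so these formulas, the function names, and the toy documents are assumptions.

```python
import math
from collections import Counter


def tf_idf(docs):
    """Return one {term: score} dict per document.

    docs -- list of token lists; score = tf * log(N / df)
    """
    n_docs = len(docs)
    doc_freq = Counter(term for doc in docs for term in set(doc))
    scores = []
    for doc in docs:
        counts = Counter(doc)
        scores.append({
            term: (count / len(doc)) * math.log(n_docs / doc_freq[term])
            for term, count in counts.items()
        })
    return scores


def bigrams(tokens):
    """Return adjacent word pairs (N-grams with N = 2)."""
    return list(zip(tokens, tokens[1:]))


# Hypothetical tokenized search results (illustrative data only).
docs = [
    ["diy", "clothing", "fabric"],
    ["diy", "clothing", "pattern"],
    ["diy", "sewing", "kit"],
]
scores = tf_idf(docs)
pairs = bigrams(docs[0])
```

A term that appears in every document, such as "diy" here, scores zero because it does not distinguish one document from another, while a term unique to one document, such as "fabric", scores highest in that document.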

A Keyphrase Extraction Model for Each Conference or Journal (학술대회 및 저널별 기술 핵심구 추출 모델)

  • Jeong, Hyun Ji;Jang, Gwangseon;Kim, Tae Hyun;Sin, Donggu
    • Proceedings of the Korean Institute of Information and Communication Sciences Conference / 2022.10a / pp.81-83 / 2022
  • Understanding research trends is necessary for selecting research topics and exploring related work. Most researchers search for representative keywords of the domains or technologies they are interested in to understand research trends. However, some conferences in the artificial intelligence and data mining fields now publish hundreds to thousands of papers each year, which makes it difficult for researchers to follow research trends in those domains. In this paper, we propose an automatic technology keyphrase extraction method that helps researchers understand the research trends of each conference or journal. Keyphrase extraction, which extracts important terms or phrases from a text, is a fundamental technology for natural language processing tasks such as summarization and search. Previous keyphrase extraction methods based on pretrained language models extract keyphrases from long texts, so their performance degrades on short texts such as paper titles. We propose a technology keyphrase extraction model that is robust on short texts and considers the importance of each word.

A Systematic Review on the Use of SBAR in Communication for Domestic Nursing Students (국내 간호대학생의 SBAR를 이용한 의사소통에 대한 체계적 고찰)

  • Mi-Jin Lee;Hwa-Young Kim
    • The Journal of the Convergence on Culture Technology / v.10 no.1 / pp.171-182 / 2024
  • The purpose of this study was to review the evidence on the effects of SBAR on communication among domestic nursing students. Four databases (RISS, KISS, DBpia, and KCI) were searched for articles published until June 2023. The search keywords included 'nursing students,' 'nursing,' 'SBAR,' and 'communication.' Of the 57 papers retrieved, seventeen studies were selected for data analysis. The studies evaluated outcomes including communication clarity, communication competence, communication confidence, critical thinking ability, self-efficacy, reporting confidence, clinical judgment, and communication satisfaction; most studies reported positive effects, although some results were not statistically significant. Accordingly, we intend to analyze the characteristics and contents of communication interventions using SBAR, evaluate their effectiveness, and use them as evidence for future follow-up research on communication interventions using SBAR.