• Title/Summary/Keyword: word clustering (단어 군집화)

A Statistical Analysis of the Causes of Marine Incidents occurring during Berthing (정박 중 발생한 준해양사고 원인에 대한 통계 분석 연구)

  • Roh, Boem-Seok;Kang, Suk-Young
    • Journal of Navigation and Port Research / v.45 no.3 / pp.95-101 / 2021
  • Under Heinrich's law, marine incidents are very important for preventing accidents. However, marine incident data are mainly qualitative and are used to prevent similar accidents through case sharing rather than statistical analysis, as can be seen in the marine incident data posted by the Korea Maritime Safety Tribunal. This study therefore derived quantitative results by analyzing the causes of marine incidents during berthing using several methods of statistical analysis. To this end, marine incident data from various shipping companies were collected and reclassified for ease of analysis. The main keywords were derived in a primary analysis using text mining. Only meaningful words were then selected through verification by an expert group, and time series and cluster analyses were performed to predict marine incidents that may occur during berthing. Although an expert group was still required during the analysis, the study confirmed that quantitative analysis of marine incidents is feasible and can be used to provide information on causes and accident prevention.

A Study on the Development of Visual Arts Convergence Education Model with the Formless Concept (비정형 개념에 따른 시각예술 융합교육 모형 개발)

  • Cho, Hyun Geun
    • Korea Science and Art Forum / v.37 no.2 / pp.275-292 / 2019
  • This study began from the need for new and diverse approaches in a design process that has grown accustomed to imitation, such as following a set way of drawing an image. I therefore studied a convergence of the humanities and visual arts based on an understanding of, and a conceptual approach to, the formless. The purpose of this study is to develop formless languages and to organize practical courses that enable deeper research and design expression, covering the theoretical approaches and explanations of outcomes required before and after the process of practicing with the formless. The method is to draw detailed items from selected words through a review of prior research, research on works and authors, and practice teaching. As a result, I proposed a formless language in which horizontality (in the spatial positioning system), pulse (in the separation of space and time), entropy (in the structural order of the system), and base materialism (in the limitation of matter) serve as the operating mechanisms and parent items of the formless. These elements are related to shape, size, shading, color, texture, space, and structure as the visual elements of form, and they carry various adjectival meanings as subordinate concepts. I then presented basic design education materials that enable understanding and expressing the formless language across the overall process of formless visual art (theoretical approach, practice course, presentation, etc.). Based on these results, I hope these materials will be used as educational content that helps learners express and understand new and different kinds of beauty, as a means of revealing social identity, and as a reference for research on formless visual arts.

Analysis of Changes in Restaurant Attributes According to the Spread of Infectious Diseases: Application of Text Mining Techniques (감염병 확산에 따른 레스토랑 선택속성 변화 분석: 텍스트마이닝 기법 적용)

  • Joonil Yoo;Eunji Lee;Chulmo Koo
    • Information Systems Review / v.25 no.4 / pp.89-112 / 2023
  • In March 2020, COVID-19 was declared a pandemic and various quarantine measures were taken. Accordingly, many changes have occurred in the tourism and hospitality industries. In particular, quarantine guidelines such as the introduction of non-face-to-face services and social distancing were implemented in the restaurant industry. For decades, research on restaurant attributes has emphasized the importance of three attributes: atmosphere, service quality, and food quality. Nevertheless, to the best of our knowledge, research on restaurant attributes that considers the COVID-19 situation is insufficient. To address this gap, this study took an exploratory approach to classifying new restaurant attributes based on an understanding of environmental changes. The unit of analysis was 31,115 online reviews registered on Naverplace for 475 general restaurants located in Euljiro, Seoul. We classified restaurant attributes by clustering the words in the online reviews using TF-IDF and LDA topic modeling techniques. As a result, "prevention of infectious diseases" was derived as a new restaurant attribute in the context of COVID-19, alongside atmosphere, service quality, and food quality. This study has academic significance in that it extends the literature on restaurant attributes by classifying the three attributes presented in prior research and further presenting new attributes. Moreover, the results lead to practical recommendations that consider both the operational aspects of restaurants and policy implications.

The Effect of the Number of Phoneme Clusters on Speech Recognition (음성 인식에서 음소 클러스터 수의 효과)

  • Lee, Chang-Young
    • The Journal of the Korea institute of electronic communication sciences / v.9 no.11 / pp.1221-1226 / 2014
  • In an effort to improve the efficiency of speech recognition, we investigate the effect of the number of phoneme clusters. For this purpose, codebooks with varying numbers of phoneme clusters are prepared by a modified k-means clustering algorithm. The subsequent processing uses fuzzy vector quantization (FVQ) and a hidden Markov model (HMM) for the speech recognition test. The results show two distinct regimes. For a large number of phoneme clusters, recognition performance is roughly independent of that number. For a small number of phoneme clusters, however, the recognition error rate increases nonlinearly as the number decreases. Numerical calculation indicates that this nonlinear regime might be modeled by a power law function. The results also show that about 166 phoneme clusters would be optimal for recognizing 300 isolated words. This amounts to roughly 3 variations per phoneme.
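
A power law model of the kind mentioned in the abstract above can be sketched as follows. This is only an illustration: the functional form E = a * N^(-b), the fitting method (least squares in log-log space), and the data points are all assumptions. The data are synthetic, generated from a known power law, and are not results from the paper.

```python
import math


def fit_power_law(xs, ys):
    """Fit y = a * x**(-b) by linear regression on (log x, log y)."""
    lx = [math.log(x) for x in xs]
    ly = [math.log(y) for y in ys]
    n = len(xs)
    mx, my = sum(lx) / n, sum(ly) / n
    slope = (sum((u - mx) * (v - my) for u, v in zip(lx, ly))
             / sum((u - mx) ** 2 for u in lx))
    intercept = my - slope * mx
    return math.exp(intercept), -slope


# Synthetic data (assumption): error rates generated from E = 2.0 * N**(-0.5).
clusters = [10, 20, 40, 80]
errors = [2.0 * n ** -0.5 for n in clusters]

a, b = fit_power_law(clusters, errors)
print(f"a = {a:.3f}, b = {b:.3f}")
```

On a log-log plot a power law appears as a straight line, which is why an ordinary linear fit is enough to recover the two parameters in this sketch.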

Question Answering Optimization via Temporal Representation and Data Augmentation of Dynamic Memory Networks (동적 메모리 네트워크의 시간 표현과 데이터 확장을 통한 질의응답 최적화)

  • Han, Dong-Sig;Lee, Chung-Yeon;Zhang, Byoung-Tak
    • Journal of KIISE / v.44 no.1 / pp.51-56 / 2017
  • Research on solving question answering (QA) problems with artificial intelligence models is in a period of methodological transition, and one such architecture, the dynamic memory network (DMN), is drawing attention for two key attributes: its attention mechanism defined by neural network operations and its modular architecture imitating human cognitive processes during QA. In this paper, we increased the accuracy of the inferred answers by adopting an automatic data augmentation method to compensate for the limited amount of training data and by improving the model's ability to perceive time. The experimental results showed that on the 1K-bAbI tasks, the modified DMN achieves 89.21% accuracy and passes twelve tasks, which is 13.58% higher in accuracy, with four more tasks passed, than one implementation of DMN. Additionally, the DMN's word embedding vectors form strong clusters after training. Moreover, the number of episodic passes and the number of supporting facts show a direct correlation, which affects performance significantly.

Analysis of trends in domestic research on addiction using text mining and CONCOR (텍스트마이닝과 CONCOR을 활용한 중독 관련 국내 연구 동향 분석)

  • Sol-Ji Lee;Ki-Hyok Youn
    • Journal of Internet of Things and Convergence / v.9 no.6 / pp.99-110 / 2023
  • This study analyzed 817 articles published in Korean professional journals over the three years from 2020 to 2022, using text mining techniques to identify trends in addiction research in Korea and explore directions for its development. The results are as follows. First, in the top-keyword analysis, terms from online addiction studies such as smartphones, games, the Internet, gambling, and relationship addiction were prominent. Second, the TF-IDF analysis showed that many studies related to behavioral addiction, such as smartphone, game, Internet, and work addiction, have been conducted over the past three years; in particular, there are many studies on problems such as smartphone, game, and Internet use that have not yet been clinically diagnosed as addictions. This agrees with the result of the word frequency analysis and can be interpreted to mean that recent studies have addressed a markedly more diverse range of addiction problems. Third, the 2-gram analysis shows that words corresponding mainly to behavioral addiction, such as smartphones, games, and the Internet, appear side by side with the keyword "addiction", and among them, words paired with smartphones are mentioned particularly often in research papers. Fourth, the CONCOR analysis yielded five clusters: studies on universal addiction issues such as alcohol use disorders and the Internet, studies of recovery from drug and gambling addiction, studies on mobile device and media addiction, studies on the latest trends related to behavioral addiction, and other addiction issues. Finally, based on these results, a direction for future addiction-related research is suggested.
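
The 2-gram step described in the abstract above amounts to counting adjacent word pairs and looking at which words sit next to a target keyword. The following is a minimal sketch of that idea; the three sample titles are hypothetical and the whitespace tokenization is an assumption, not the paper's preprocessing.

```python
from collections import Counter


def bigrams(tokens):
    """Return the list of adjacent word pairs (2-grams) in a token list."""
    return list(zip(tokens, tokens[1:]))


# Hypothetical sample titles, used only to illustrate the counting.
titles = [
    "smartphone addiction among adolescents",
    "game addiction and smartphone addiction",
    "internet addiction recovery program",
]

# Count every 2-gram across all titles.
counts = Counter(bg for title in titles for bg in bigrams(title.split()))

# Words that appear immediately before the keyword "addiction".
partners = Counter(
    {first: n for (first, second), n in counts.items() if second == "addiction"}
)
print(partners.most_common())
```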

Analysis method of patent document to Forecast Patent Registration (특허 등록 예측을 위한 특허 문서 분석 방법)

  • Koo, Jung-Min;Park, Sang-Sung;Shin, Young-Geun;Jung, Won-Kyo;Jang, Dong-Sik
    • Journal of the Korea Academia-Industrial cooperation Society / v.11 no.4 / pp.1458-1467 / 2010
  • Recently, imitation and infringement of intellectual property rights have come to be recognized as impediments to a nation's industrial growth. To prevent the huge losses that result from these impediments, many researchers are studying the protection and efficient management of intellectual property in various ways. In particular, predicting patent registration is a very important part of protecting and asserting intellectual property rights. In this study, we propose a patent document analysis method that uses text mining to predict whether a patent will be registered or rejected. First, the proposed method builds a database from the word frequencies of rejected patent documents. Comparing this database with other patent documents then yields a similarity value between each patent document and the database. We used k-means, a partitioning clustering algorithm, to select the criterion value for patent rejection. As a result, we concluded that patents similar to rejected patents have a strong possibility of rejection. The experimental data were U.S. patent documents on Bluetooth technology, solar battery technology, and display technology.
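
The pipeline described in the abstract above (word-frequency database, similarity value per document, k-means to pick a criterion value) can be sketched roughly as follows. Several details are assumptions, since the abstract does not specify them: cosine similarity as the similarity measure, whitespace tokenization, k = 2 for the clustering, and the midpoint between the two centroids as the criterion value. All document texts are hypothetical toy data.

```python
from collections import Counter
import math


def word_freq(text):
    """Word-frequency Counter from lower-cased, whitespace-split text."""
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity between two word-frequency Counters."""
    dot = sum(a[w] * b[w] for w in a.keys() & b.keys())
    norm = (math.sqrt(sum(v * v for v in a.values()))
            * math.sqrt(sum(v * v for v in b.values())))
    return dot / norm if norm else 0.0


def two_means_threshold(values, iters=50):
    """1-D k-means with k=2; returns the midpoint between the two centroids."""
    lo, hi = min(values), max(values)
    for _ in range(iters):
        low = [v for v in values if abs(v - lo) <= abs(v - hi)]
        high = [v for v in values if abs(v - lo) > abs(v - hi)]
        if not high:  # all values fell into one cluster
            break
        new_lo, new_hi = sum(low) / len(low), sum(high) / len(high)
        if (new_lo, new_hi) == (lo, hi):  # centroids stopped moving
            break
        lo, hi = new_lo, new_hi
    return (lo + hi) / 2


# Hypothetical rejected documents that form the word-frequency database.
rejected = [
    "wireless signal transmitter antenna",
    "wireless antenna signal receiver",
]
database = sum((word_freq(t) for t in rejected), Counter())

# Hypothetical candidate documents to compare against the database.
candidates = {
    "A": "wireless signal antenna module",
    "B": "solar cell panel coating",
    "C": "antenna receiver wireless signal",
    "D": "display panel pixel driver",
}
similarity = {k: cosine(word_freq(t), database) for k, t in candidates.items()}

# Criterion value from clustering, then flag documents above it.
threshold = two_means_threshold(list(similarity.values()))
flagged = sorted(k for k, s in similarity.items() if s > threshold)
print(flagged)
```

In this toy run, the candidates that share vocabulary with the rejected documents land in the high-similarity cluster and are flagged as likely rejections.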

A Method for Detecting Event-Location based on Similar Keyword Extraction in Tweet Text (트윗 텍스트의 유사 키워드 추출을 통한 이벤트 지역 탐지 기법)

  • Yim, Junyeob;Ha, Hyunsoo;Hwang, Byung-Yeon
    • Spatial Information Research / v.23 no.5 / pp.1-7 / 2015
  • Twitter propagates and diffuses information faster than other SNS. Consequently, much research on detecting real-time events using Twitter is in progress. A Twitter real-time event detection system treats every Twitter user as a sensor and analyzes the tweets they write in order to detect events. Such research has already obtained good results but has run into limits because of several problems. In particular, many existing studies trace an event's location using GPS coordinates. However, this approach has a definite limitation, given users' current reluctance to make personal location information public. This paper therefore proposes a method that traces location information from the text of tweets without using the location information provided by Twitter. Associated words were grouped using keywords extracted from the tweet text. In the experiment, the algorithm detected both where events occurred and whether they had actually occurred. Furthermore, the experiment demonstrated the need for the proposed method by showing faster detection than other existing media.

A Study on the Social Perception of Jiu-Jitsu Using Big data Analysis (빅데이터 분석을 활용한 주짓수의 사회적 인식 연구)

  • Kun-hee Kim
    • The Journal of the Convergence on Culture Technology / v.10 no.3 / pp.209-217 / 2024
  • The purpose of this study is to explore development plans by analyzing social interest in and perceptions of jiu-jitsu using big data analysis. Network analysis, centrality analysis, and CONCOR analysis were conducted on data collected from major domestic portal sites over the last 10 years. First, 'judo' was found to be the most important related word in the network analysis, and 'judo' was also an important word in the degree centrality analysis. In the closeness centrality analysis, 'defender' was the most important word, and 'sports' was the most important word in betweenness centrality. Finally, the CONCOR analysis produced four clusters (related sports and marketing, jiu-jitsu competitions, belt test, supplies and expenses). The study concludes as follows. First, words such as 'judo', 'exercise', 'competition', 'dobok', 'gym', and 'graduation' should be actively used to promote jiu-jitsu. Second, it is necessary to share information on training costs through various routes, to build common awareness of the graduation process and method, and to develop safety products and create a safe training culture. Third, it is necessary to find ways to continuously increase the influx of new trainees by consistently attracting competitions.

A Study on Analysis of consumer perception of YouTube advertising using text mining (텍스트 마이닝을 활용한 Youtube 광고에 대한 소비자 인식 분석)

  • Eum, Seong-Won
    • Management & Information Systems Review / v.39 no.2 / pp.181-193 / 2020
  • This study analyzes consumer perception using text mining, a topic of recent interest. We analyzed consumers' perception of the Samsung Galaxy by analyzing consumer reviews of Samsung Galaxy YouTube ads. For the analysis, 1,819 consumer reviews of YouTube ads were extracted. Through data pre-processing, keywords about the advertisements were classified and extracted as nouns, adjectives, and adverbs. Frequency analysis and sentiment analysis were then performed. Finally, clustering was performed through CONCOR. The findings are summarized as follows. First, the most frequently mentioned words were Galaxy Note (n = 217), Good (n = 135), Pen (n = 40), and Function (n = 29). From the keywords "Galaxy Note", "Good", "Pen", and "Features", it can be judged that consumers see the functional aspects of Samsung mobile phone products as good and perceive the Note pen positively. In addition, the recognition of "Samsung Pay", "Innovation", "Design", and "iPhone" shows that Samsung's mobile phones are highly regarded for their innovative design and for the functional aspects of Samsung Pay. Second, regarding the sentiment analysis of the YouTube advertising, the share of positive sentiment (75.95%) was higher than that of negative sentiment (24.05%). This means that consumers perceive Samsung Galaxy mobile phones positively. In the sentiment keyword analysis, the positive keywords extracted were "good", "good", "innovative", "highest", "fast", "pretty", etc., and the negative keywords extracted were "frightening", "I want to cry", "discomfort", "sorry", "no", etc. The implication of this study is as follows: most existing studies of consumer perception of advertisements have relied on quantitative analysis methods. In this study, we departed from quantitative research methods for advertising and attempted to analyze consumer perception through qualitative research. This is expected to have a great influence on future research, and I am sure that it will be a starting point for consumer awareness research through qualitative research.