Search | Korea Science

Semantic Visualization of Dynamic Topic Modeling (다이내믹 토픽 모델링의 의미적 시각화 방법론)

Yeon, Jinwook;Boo, Hyunkyung;Kim, Namgyu
- Journal of Intelligence and Information Systems, v.28 no.1, pp.131-154, 2022
Advances in information and communication technology have spurred active research on unstructured data analysis. Topic modeling, in particular, is a representative technique for discovering core topics in massive text data. Early topic modeling studies focused only on topic discovery. As the field matured, studies began to examine how topics change over time, and interest has grown in dynamic topic modeling, which handles changes in the keywords that constitute a topic. Dynamic topic modeling identifies major topics from the data of the initial period and tracks the change and flow of topics by using the topic information of the previous period to derive topics in subsequent periods. However, its results are very difficult to understand and interpret: traditional dynamic topic modeling simply reveals changes in keywords and their rankings, which is insufficient to show how the meaning of a topic has changed. Therefore, in this study, we propose a method that visualizes topics by period while reflecting the meaning of the keywords in each topic, together with a method for intuitively interpreting changes in topics and the relationships among topics. The visualization proceeds in three steps. In the first step, dynamic topic modeling is applied to the text data to derive the top keywords of each period and their weights. In the second step, we obtain vectors for the top keywords of each topic from a pre-trained word embedding model, perform dimension reduction on the extracted vectors, and then form a semantic vector for each topic as the weighted sum of its keyword vectors, using each keyword's topic weight. In the third step, we visualize the semantic vector of each topic using matplotlib and analyze the relationships among the topics based on the visualized result. Topic change is interpreted as follows: from the result of dynamic topic modeling, we identify the top 5 rising keywords and the top 5 descending keywords for each period. Many existing topic visualization studies visualize the keywords of each topic, whereas our approach attempts to visualize each topic itself. To evaluate the practical applicability of the proposed methodology, we performed an experiment on 1,847 abstracts of artificial intelligence-related papers, divided into three periods (2016-2017, 2018-2019, 2020-2021). We selected seven topics based on the coherence score and used a pre-trained Word2vec word embedding model trained on Wikipedia. Based on the proposed methodology, we generated a semantic vector for each topic and, by reflecting the meaning of the keywords, visualized and interpreted the topics by period. Through these experiments, we confirmed that the rise and fall of a keyword's topic weight is useful for interpreting the semantic change of the corresponding topic and for grasping the relationships among topics. In this study, to overcome the limitations of dynamic topic modeling results, we used word embedding and dimension reduction techniques to visualize topics by period. The results are meaningful in that they broaden the scope of topic understanding through the visualization of dynamic topic modeling results. In addition, the study makes an academic contribution by laying the foundation for follow-up studies that use various word embeddings and dimensionality reduction techniques to improve the performance of the proposed methodology.
https://doi.org/10.13088/jiis.2022.28.1.131
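The second step of the abstract above (keyword vectors, dimension reduction, weighted sum) can be illustrated with a minimal sketch. The embeddings and weights below are made-up values, the abstract does not name its dimension-reduction technique (PCA via SVD is assumed here), and normalizing the weights before summing is also an assumption. The matplotlib plotting step is omitted.

```python
import numpy as np

def reduce_to_2d(vectors: np.ndarray) -> np.ndarray:
    """Project row vectors onto their first two principal components (PCA via SVD).

    PCA is an assumption; the abstract only says "dimension reduction".
    """
    centered = vectors - vectors.mean(axis=0)
    _, _, vt = np.linalg.svd(centered, full_matrices=False)
    return centered @ vt[:2].T

def topic_semantic_vector(keyword_vectors: np.ndarray, weights: np.ndarray) -> np.ndarray:
    """Weighted sum of keyword vectors; weights are normalized to sum to 1 (assumption)."""
    normalized = weights / weights.sum()
    return normalized @ keyword_vectors

# Hypothetical 4-dimensional embeddings for the three top keywords of one topic.
embeddings = np.array([
    [0.9, 0.1, 0.0, 0.2],
    [0.8, 0.2, 0.1, 0.1],
    [0.1, 0.9, 0.7, 0.0],
])
# Hypothetical topic weights of those keywords in one period.
topic_weights = np.array([0.5, 0.3, 0.2])

reduced = reduce_to_2d(embeddings)                        # shape (3, 2)
semantic = topic_semantic_vector(reduced, topic_weights)  # shape (2,)
print(semantic)
```

Repeating this per topic and per period yields one 2D point per topic per period, which is what a scatter plot would then display.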

A Study on Evaluating the Possibility of Monitoring Ships of CAS500-1 Images Based on YOLO Algorithm: A Case Study of a Busan New Port and an Oakland Port in California (YOLO 알고리즘 기반 국토위성영상의 선박 모니터링 가능성 평가 연구: 부산 신항과 캘리포니아 오클랜드항을 대상으로)

Park, Sangchul;Park, Yeongbin;Jang, Soyeong;Kim, Tae-Ho
- Korean Journal of Remote Sensing, v.38 no.6_1, pp.1463-1478, 2022
Maritime transport accounts for 99.7% of the exports and imports of the Republic of Korea; therefore, developing a vessel monitoring system for efficient operation is of significant interest. Several studies have focused on tracking and monitoring vessel movements based on automatic identification system (AIS) data; however, ships without AIS can be monitored and tracked only to a limited extent. High-resolution optical satellite images can provide the missing layer of information in AIS-based monitoring systems because they can identify non-AIS vessels and small ships over a wide area. Therefore, it is necessary to investigate vessel monitoring and small vessel classification systems using high-resolution optical satellite images. This study examined the possibility of developing ship monitoring systems using Compact Advanced Satellite 500-1 (CAS500-1) satellite images by first training a deep learning model on satellite image data and then performing detection in other images. To determine the effectiveness of the proposed method, the training data was acquired from ships in the Yellow Sea and its major ports, and the detection model was built using the You Only Look Once (YOLO) algorithm. The ship detection performance was evaluated for a domestic and an international port. The detection results for ships in the anchorage and berth areas were compared with the ship classification information obtained using AIS, and accuracies of 85.5% and 70% were achieved using the domestic and international classification models, respectively. The results indicate that high-resolution satellite images can be used to monitor moored ships. The developed approach can potentially be used in vessel tracking and monitoring systems at major ports around the world if the accuracy of the detection model is improved through continuous construction of training data.
https://doi.org/10.7780/kjrs.2022.38.6.1.35

An Ontology-based Generation of Operating Procedures for Boiler Shutdown : Knowledge Representation and Application to Operator Training (온톨로지 기반의 보일러 셧다운 절차 생성 : 지식표현 및 훈련시나리오 활용)

Park, Myeongnam;Kim, Tae-Ok;Lee, Bongwoo;Shin, Dongil
- Journal of the Korean Institute of Gas, v.21 no.4, pp.47-61, 2017
The usefulness of an operator safety training model for large plants depends on the versatility and accuracy of its operating procedures, obtained through detailed analysis of the various types of risk associated with operation, and on the systematic representation of knowledge. In this study, we consider an artificial intelligence planning method for generating operating procedures; classify them into general actions, actions, and technical terms of the operator; and define a knowledge representation ontology that takes into account the sharing and reuse of knowledge. To expand the general operations, we apply a Hierarchical Task Network (HTN). Actual boiler plant case studies are classified according to operating conditions, states, and operating objectives between the units, and general emergency shutdown procedures are generated to confirm the applicability of the proposed method. These results, based on systematic knowledge representation, can be readily applied to general plant operating procedures and operator safety training scenarios, and will be used for the automatic generation of safety training scenarios.
https://doi.org/10.7842/kigas.2017.21.4.47

Retrieval of Land Surface Temperature Using Landsat 8 Images with Deep Neural Networks (Landsat 8 영상을 이용한 심층신경망 기반의 지표면온도 산출)

Kim, Seoyeon;Lee, Soo-Jin;Lee, Yang-Won
- Korean Journal of Remote Sensing, v.36 no.3, pp.487-501, 2020
As a viable option for retrieving LST (Land Surface Temperature), this paper presents a DNN (Deep Neural Network) based approach using 148 Landsat 8 images of South Korea. Because the brightness temperature and emissivity for band 10 (approx. 11 µm wavelength) of Landsat 8 are derived by combining physics-based equations and empirical coefficients, they include uncertainties that depend on regional conditions such as meteorology, climate, topography, and vegetation. To overcome this, we used several land surface variables, such as NDVI (Normalized Difference Vegetation Index), land cover types, and topographic factors (elevation, slope, aspect, and ruggedness), as well as the T₀ calculated from the brightness temperature and emissivity. We optimized four seasonal DNN models using these input variables and in-situ observations from ASOS (Automated Synoptic Observing System) to retrieve the LST, which is an advance over the existing method of bias correction using a linear equation. The validation statistics from 1,728 matchups during 2013-2019 showed good performance, with CC = 0.910-0.917 and RMSE = 3.245-3.365℃, especially for spring and fall. Our DNN models also produced a stable LST for all types of land cover. Future work using big data from Landsat 5/7/8 with additional land surface variables will be necessary for a more reliable retrieval of LST from high-resolution satellite images.
https://doi.org/10.7780/kjrs.2020.36.3.8
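The two validation statistics quoted in the abstract above, CC (correlation coefficient) and RMSE, can be computed from matched pairs of observed and retrieved temperatures as in the following sketch. The temperature values are made up for illustration, and Pearson correlation is assumed for CC because the abstract does not define it.

```python
import numpy as np

def correlation_coefficient(observed, retrieved) -> float:
    """Pearson correlation between in-situ and retrieved values (assumed meaning of CC)."""
    return float(np.corrcoef(observed, retrieved)[0, 1])

def rmse(observed, retrieved) -> float:
    """Root-mean-square error between in-situ and retrieved values."""
    obs = np.asarray(observed, dtype=float)
    ret = np.asarray(retrieved, dtype=float)
    return float(np.sqrt(np.mean((ret - obs) ** 2)))

# Hypothetical matchups in degrees Celsius (not data from the paper).
observed_lst = [10.0, 15.0, 20.0, 25.0]
retrieved_lst = [12.0, 14.0, 21.0, 27.0]

cc_value = correlation_coefficient(observed_lst, retrieved_lst)
rmse_value = rmse(observed_lst, retrieved_lst)
print(f"CC={cc_value:.3f}, RMSE={rmse_value:.3f}")
```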

Multi-modal Emotion Recognition using Semi-supervised Learning and Multiple Neural Networks in the Wild (준 지도학습과 여러 개의 딥 뉴럴 네트워크를 사용한 멀티 모달 기반 감정 인식 알고리즘)

Kim, Dae Ha;Song, Byung Cheol
- Journal of Broadcast Engineering, v.23 no.3, pp.351-360, 2018
Human emotion recognition is a research topic that receives continuous attention in the computer vision and artificial intelligence domains. This paper proposes a method for classifying human emotions in a wild environment through multiple neural networks based on multi-modal signals consisting of image, landmark, and audio. The proposed method has the following features. First, the learning performance of the image-based network is greatly improved by employing both multi-task learning and semi-supervised learning that exploits the spatio-temporal characteristics of videos. Second, a model for converting one-dimensional (1D) facial landmark information into two-dimensional (2D) images is newly proposed, and a CNN-LSTM network based on this model is proposed for better emotion recognition. Third, based on the observation that audio signals are often very effective for specific emotions, we propose an audio deep learning mechanism that is robust for those emotions. Finally, so-called emotion adaptive fusion is applied to enable synergy among the multiple networks. The proposed network improves emotion classification performance by appropriately integrating existing supervised learning and semi-supervised learning networks. In the fifth attempt on the given test set of the EmotiW2017 challenge, the proposed method achieved a classification accuracy of 57.12%.
https://doi.org/10.5909/JBE.2018.23.3.351

Exploratory Research on Automating the Analysis of Scientific Argumentation Using Machine Learning (머신 러닝을 활용한 과학 논변 구성 요소 코딩 자동화 가능성 탐색 연구)

Lee, Gyeong-Geon;Ha, Heesoo;Hong, Hun-Gi;Kim, Heui-Baik
- Journal of The Korean Association For Science Education, v.38 no.2, pp.219-234, 2018
In this study, we explored the possibility of automating the analysis of the elements of scientific argument in the context of a Korean classroom. To gather training data, we collected 990 sentences from science education journals that illustrate the results of coding elements of argumentation according to Toulmin's argumentation structure framework. We extracted 483 sentences as a test data set from transcripts of students' discourse in scientific argumentation activities. The words and morphemes of each argument were analyzed using the Python 'KoNLPy' package and the 'Kkma' module for Korean Natural Language Processing. After constructing the 'argument-morpheme:class' matrix for the 1,473 sentences, five machine learning techniques were applied to generate predictive models relating each sentence to the element of argument to which it corresponded. The accuracy of the predictive models was investigated by comparing their output with the researchers' pre-coding and measuring the degree of agreement. The predictive model generated by the k-nearest neighbor algorithm (KNN) demonstrated the highest degree of agreement [54.04% (κ = 0.22)] when machine learning was performed on the morphemes of each sentence. The KNN model exhibited higher agreement [55.07% (κ = 0.24)] when the coding results of the previous sentence were added to the prediction process. The results thus indicate the importance of considering the context of discourse by reflecting the codes of previous sentences in the analysis. The results are significant in that they show the possibility of automating the analysis of students' argumentation activities in Korean by applying machine learning.
https://doi.org/10.14697/jkase.2018.38.2.219
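The abstract above reports agreement as a percentage together with κ. Assuming κ denotes Cohen's kappa between the researchers' codes and the model's predictions (the abstract does not say which kappa was used), both quantities can be computed as in this sketch. The labels are illustrative and are not data from the study.

```python
from collections import Counter

def percent_agreement(codes_a, codes_b) -> float:
    """Fraction of items on which the two coders assign the same label."""
    return sum(a == b for a, b in zip(codes_a, codes_b)) / len(codes_a)

def cohens_kappa(codes_a, codes_b) -> float:
    """Cohen's kappa: observed agreement corrected for chance agreement."""
    n = len(codes_a)
    p_observed = percent_agreement(codes_a, codes_b)
    counts_a, counts_b = Counter(codes_a), Counter(codes_b)
    labels = counts_a.keys() | counts_b.keys()
    # Chance agreement: sum over labels of the product of marginal proportions.
    p_expected = sum(counts_a[k] * counts_b[k] for k in labels) / (n * n)
    return (p_observed - p_expected) / (1 - p_expected)

# Hypothetical codes for six sentences (Toulmin-style element names).
researcher = ["claim", "data", "claim", "warrant", "data", "claim"]
model = ["claim", "data", "data", "warrant", "claim", "claim"]

agreement = percent_agreement(researcher, model)
kappa = cohens_kappa(researcher, model)
print(f"agreement={agreement:.2%}, kappa={kappa:.2f}")
```

Because kappa discounts agreement expected by chance, a raw agreement in the mid-50% range can correspond to a much lower kappa, as in the figures the abstract reports.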

Back Analysis of Field Measurements Around the Tunnel with the Application of Genetic Algorithms (유전자 알고리즘을 이용한 터널 현장 계측 결과의 역해석)

Kim Sun-Myung;Yoon Ji-Sun;Jun Duk-Chan;Yoon Sang-Gil
- Journal of the Korean Geotechnical Society, v.20 no.7, pp.69-78, 2004
In this study, a back analysis program was developed by applying the genetic algorithm, an artificial intelligence technique, to the direct method. The optimization process, which influences the efficiency of the direct method, was carried out with the genetic algorithm. To verify the validity of the program, back analysis was executed under the condition that the displacement computed by forward analysis for a given rock mass model equals the displacement measured at the tunnel section. The usefulness of the program was confirmed by comparing the relative errors of a back analysis carried out under the same rock mass conditions as the analysis model of Gens et al. (1987), a past back analysis case. We estimated the total displacement caused by tunnelling from the crown settlement and convergence measured at the working faces of three tunnel sites on the Kyungbu Express railway. The data measured at the working face were used as input to the back analysis after a confidence test. From the results of the back analysis, we identified the tendency of tunnel behavior by comparing the deformation characteristics obtained from measurement at the working face with those obtained by back analysis. The usefulness and applicability of the back analysis program developed in this study were also verified.
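The direct method described above searches for model parameters whose forward-analysis displacement matches the measured displacement. A toy sketch of that idea with a simple genetic algorithm follows. The forward model (displacement inversely proportional to a single modulus), the parameter bounds, and the GA settings are all hypothetical stand-ins, not the program or rock mass model from the paper.

```python
import random

def forward_displacement(modulus: float, load: float = 1000.0) -> float:
    """Toy forward analysis (assumption): displacement is load divided by stiffness."""
    return load / modulus

def back_analyze(measured: float, bounds=(1.0, 100.0), pop_size: int = 40,
                 generations: int = 60, seed: int = 0) -> float:
    """Estimate the modulus whose computed displacement best matches the measurement."""
    rng = random.Random(seed)
    low, high = bounds

    def error(modulus: float) -> float:
        # Squared misfit between computed and measured displacement.
        return (forward_displacement(modulus) - measured) ** 2

    population = [rng.uniform(low, high) for _ in range(pop_size)]
    for _ in range(generations):
        population.sort(key=error)
        elites = population[: pop_size // 2]           # selection: keep the best half
        children = []
        while len(elites) + len(children) < pop_size:
            parent_a, parent_b = rng.sample(elites, 2)
            child = (parent_a + parent_b) / 2          # crossover: average two parents
            child += rng.gauss(0.0, 1.0)               # mutation: small Gaussian step
            children.append(min(high, max(low, child)))
        population = elites + children
    return min(population, key=error)

# Synthetic "measurement" generated from a known modulus, mirroring the validation
# idea of checking that back analysis recovers the forward-analysis input.
measured_displacement = forward_displacement(25.0)
estimated_modulus = back_analyze(measured_displacement)
relative_error = abs(estimated_modulus - 25.0) / 25.0
print(estimated_modulus, relative_error)
```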

5G Mobile Communications: 4th Industrial Aorta (5G 이동통신: 4차 산업 대동맥)

Kim, Jeong Su;Lee, Moon Ho
- The Journal of the Convergence on Culture Technology, v.4 no.1, pp.337-351, 2018
This paper discusses 5G IoT, augmented reality, cloud computing, big data, and future autonomous vehicle technology, and presents the use of 5G at the Pyeongchang Winter Olympic Games and in the Jeju Smart City model. 5G is the aorta of the fourth industrial revolution because it is that revolution's core infrastructure. For the era of AI, autonomous vehicles, VR/AR, and the Internet of Things (IoT) to take off, data must be transmitted several times faster and more securely than before. For example, sending a stop signal to a remote autonomous vehicle over LTE takes a hundredth of a second. That seems fairly fast, but at 100 km/h safety cannot be guaranteed, because the car moves 30 cm before it responds. 5G offers speeds of more than 20 gigabits per second (Gbps), about 40 times faster than current LTE. Theoretically, the vehicle can be stopped within 1 cm. 5G not only connects 1 million IoT devices within a radius of 1 kilometer, but also has a latency of less than 0.001 s. Steve Mollenkopf, chief executive officer of Qualcomm, the world's largest maker of smartphone chips, said, "5G is a key element and innovative technology that will connect the future." With 5G commercialization, an economic effect of 12 trillion dollars and the creation of 22 million new jobs can be expected in 2035.
https://doi.org/10.17703/JCCT.2018.4.1.337
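The latency example in the abstract above is plain arithmetic: the distance a vehicle covers while a signal is in transit is speed multiplied by delay. The short sketch below reproduces it, using only the speed and delays stated in the abstract. The function name is illustrative.

```python
def distance_during_delay_cm(speed_kmh: float, delay_s: float) -> float:
    """Distance in centimetres travelled at a constant speed during a signal delay."""
    speed_m_per_s = speed_kmh * 1000 / 3600
    return speed_m_per_s * delay_s * 100

lte_cm = distance_during_delay_cm(100, 0.01)     # about 27.8 cm
five_g_cm = distance_during_delay_cm(100, 0.001)  # about 2.8 cm
print(f"LTE: {lte_cm:.1f} cm, 5G: {five_g_cm:.1f} cm")
```

At 100 km/h a delay of 0.01 s gives roughly 28 cm, which the abstract rounds to 30 cm. A delay of exactly 0.001 s gives roughly 2.8 cm, so the 1 cm figure appears to assume a delay of about 0.36 ms, which is within the stated bound of less than 0.001 s.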

Evaluation of Applicability of RGB Image Using Support Vector Machine Regression for Estimation of Leaf Chlorophyll Content of Onion and Garlic (양파 마늘의 잎 엽록소 함량 추정을 위한 SVM 회귀 활용 RGB 영상 적용성 평가)

Lee, Dong-ho;Jeong, Chan-hee;Go, Seung-hwan;Park, Jong-hwa
- Korean Journal of Remote Sensing, v.37 no.6_1, pp.1669-1683, 2021
AI intelligent agriculture and digital agriculture are important for agricultural science. Leaf chlorophyll content (LCC) is one of the most important indicators of the growth status of vegetable crops. In this study, support vector machine (SVM) regression models were built for onions and garlic using an unmanned aerial vehicle-based RGB camera and a multispectral (MSP) sensor, and the applicability of the RGB camera to LCC estimation was reviewed by comparing it with the MSP sensor. The RGB-based LCC model showed lower results than the MSP-based LCC model, with an average R² of 0.09, RMSE of 18.66, and nRMSE of 3.46%. However, the difference in accuracy between the two sensors was not large, and the accuracy did not drop significantly compared with previous studies using various sensors and algorithms. In addition, the RGB-based LCC model reflects the field LCC trend well when compared with the measured values, but it tends to underestimate at high chlorophyll concentrations. Considering the economic feasibility and versatility of the RGB camera, the applicability of LCC estimation with RGB was confirmed. The results of this study are expected to be useful in digital agriculture as an AI intelligent agriculture technology that applies the convergence of artificial intelligence and big data.
https://doi.org/10.7780/kjrs.2021.37.6.1.15

An Interpretable Log Anomaly System Using Bayesian Probability and Closed Sequence Pattern Mining (베이지안 확률 및 폐쇄 순차패턴 마이닝 방식을 이용한 설명가능한 로그 이상탐지 시스템)

Yun, Jiyoung;Shin, Gun-Yoon;Kim, Dong-Wook;Kim, Sang-Soo;Han, Myung-Mook
- Journal of Internet Computing and Services, v.22 no.2, pp.77-87, 2021
With the development of the Internet and personal computers, various and complex attacks have begun to emerge. As attacks become more complex, signature-based detection becomes difficult, which has led to research on behavior-based log anomaly detection. Recent work uses deep learning to learn the order of log events and shows good performance. Despite this, it does not provide any explanation for its predictions. The lack of explanation can make it difficult to find contamination of the data or vulnerabilities in the model itself. As a result, users lose trust in the model. To address this problem, this work proposes an explainable log anomaly detection system. In this study, log parsing is performed first. Afterward, sequential rules are extracted using Bayesian posterior probability. As a result, a rule set of the form "If condition then results, posterior probability" is extracted. If a sample matches the rule set, it is normal; otherwise, it is an anomaly. We use HDFS datasets for the experiment and obtain an F1 score of 92.7% on the test dataset.
https://doi.org/10.7472/jksii.2021.22.2.77
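The "if condition then result, probability" rule set and the match-or-anomaly decision described above can be illustrated with a deliberately simplified sketch. It is not the paper's method: first-order conditional frequencies of consecutive log events stand in for the Bayesian posterior probability and closed sequential pattern mining, and the event names, threshold, and labels are hypothetical.

```python
from collections import Counter, defaultdict

def extract_rules(normal_sequences, min_probability=0.2):
    """Build rules {(condition, result): probability} from normal event sequences.

    The probability is the relative frequency of `result` directly following
    `condition`, a simplification assumed here for illustration.
    """
    following = defaultdict(Counter)
    for sequence in normal_sequences:
        for condition, result in zip(sequence, sequence[1:]):
            following[condition][result] += 1
    rules = {}
    for condition, result_counts in following.items():
        total = sum(result_counts.values())
        for result, count in result_counts.items():
            probability = count / total
            if probability >= min_probability:
                rules[(condition, result)] = probability
    return rules

def is_anomaly(sequence, rules) -> bool:
    """A sequence is anomalous if any consecutive event pair matches no rule."""
    return any(pair not in rules for pair in zip(sequence, sequence[1:]))

def f1_score(y_true, y_pred) -> float:
    """F1 for the positive (anomaly = 1) class."""
    tp = sum(t == 1 and p == 1 for t, p in zip(y_true, y_pred))
    fp = sum(t == 0 and p == 1 for t, p in zip(y_true, y_pred))
    fn = sum(t == 1 and p == 0 for t, p in zip(y_true, y_pred))
    if tp == 0:
        return 0.0
    precision, recall = tp / (tp + fp), tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)

# Hypothetical parsed log event sequences from normal operation.
normal_logs = [
    ["open", "write", "close"],
    ["open", "read", "close"],
    ["open", "write", "close"],
]
rules = extract_rules(normal_logs)
normal_flag = is_anomaly(["open", "write", "close"], rules)  # matches every rule
anomaly_flag = is_anomaly(["open", "close"], rules)          # ("open", "close") is unseen
example_f1 = f1_score([0, 1, 1, 0], [0, 1, 0, 0])
print(rules, normal_flag, anomaly_flag, example_f1)
```

The rule that fails to match identifies which transition triggered the alert, which is the kind of explanation a deep sequence model does not provide directly.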
