• Title/Summary/Keyword: topic modelling analysis

Search Result 39, Processing Time 0.024 seconds

International Research on Geotechnical Risk & Landslide Hazards (지반공학적 재해 및 산사태 위험도 분석에 관한 연구)

  • Yoon, Gil-Lim;Yoon, Yeo-Won;Kim, Hong-Yeon
    • Proceedings of the Korean Geotechical Society Conference
    • /
    • 2009.03a
    • /
    • pp.444-455
    • /
    • 2009
  • Great concerns on geotechnical risk & hazard assessment have been increased due to human and economic damage by natural disasters with recent global climate changes. In this paper, geotechnical problems in particular, landslides which is interested in European countries and North America, were mainly discussed. For these, 18 key topics on geotechnical risk and hazards which had been discussed at the LARAM 2008 workshop in Italy were analyzed after grouping by subjects. Main topic contents consisted of applications such as field measurement, early warning systems, uncertainty analysis of parameters using radar, optical data and statistical theory and so on. And the problems related to analysis of vulnerability and deformation due to earthquakes, investigation of gas zone using seismic reflection data in a landslide area, risk quantification and hazard assessment of landslide movements and multi-dimensional analysis for stability of complex slopes were attracted. Also, there were studies on risk matters of cultural heritage, the blockglide of clayey ground, simulations of debris flows based on GIS, quantification of the failure processes of rock slopes, a meshless method for 3D crack modelling, and finally risk assessment for cryological processes due to global warming.

  • PDF

Analysis of articles on water quality accidents in the water distribution networks using big data topic modelling and sentiment analysis (빅데이터 토픽모델링과 감성분석을 활용한 물공급과정에서의 수질사고 기사 분석)

  • Hong, Sung-Jin;Yoo, Do-Guen
    • Journal of Korea Water Resources Association
    • /
    • v.55 no.spc1
    • /
    • pp.1235-1249
    • /
    • 2022
  • This study applied the web crawling technique for extracting big data news on water quality accidents in the water supply system and presented the algorithm in a procedural way to obtain accurate water quality accident news. In addition, in the case of a large-scale water quality accident, development patterns such as accident recognition, accident spread, accident response, and accident resolution appear according to the occurrence of an accident. That is, the analysis of the development of water quality accidents through key keywords and sentiment analysis for each stage was carried out in detail based on case studies, and the meanings were analyzed and derived. The proposed methodology was applied to the larval accident period of Incheon Metropolitan City in 2020 and analyzed. As a result, in a situation where the disclosure of information that directly affects consumers, such as water quality accidents, is restricted, the tone of news articles and media reports about water quality accidents with long-term damage in the event of an accident and the degree of consumer pride clearly change over time. could check This suggests the need to prepare consumer-centered policies to increase consumer positivity, although rapid restoration of facilities is very important for the development of water quality accidents from the supplier's point of view.

Detection of Depression Trends in Literary Cyber Writers Using Sentiment Analysis and Machine Learning

  • Faiza Nasir;Haseeb Ahmad;CM Nadeem Faisal;Qaisar Abbas;Mubarak Albathan;Ayyaz Hussain
    • International Journal of Computer Science & Network Security
    • /
    • v.23 no.3
    • /
    • pp.67-80
    • /
    • 2023
  • Rice is an important food crop for most of the population in Nowadays, psychologists consider social media an important tool to examine mental disorders. Among these disorders, depression is one of the most common yet least cured disease Since abundant of writers having extensive followers express their feelings on social media and depression is significantly increasing, thus, exploring the literary text shared on social media may provide multidimensional features of depressive behaviors: (1) Background: Several studies observed that depressive data contains certain language styles and self-expressing pronouns, but current study provides the evidence that posts appearing with self-expressing pronouns and depressive language styles contain high emotional temperatures. Therefore, the main objective of this study is to examine the literary cyber writers' posts for discovering the symptomatic signs of depression. For this purpose, our research emphases on extracting the data from writers' public social media pages, blogs, and communities; (3) Results: To examine the emotional temperatures and sentences usage between depressive and not depressive groups, we employed the SentiStrength algorithm as a psycholinguistic method, TF-IDF and N-Gram for ranked phrases extraction, and Latent Dirichlet Allocation for topic modelling of the extracted phrases. The results unearth the strong connection between depression and negative emotional temperatures in writer's posts. Moreover, we used Naïve Bayes, Support Vector Machines, Random Forest, and Decision Tree algorithms to validate the classification of depressive and not depressive in terms of sentences, phrases and topics. The results reveal that comparing with others, Support Vectors Machines algorithm validates the classification while attaining highest 79% f-score; (4) Conclusions: Experimental results show that the proposed system outperformed for detection of depression trends in literary cyber writers using sentiment analysis.

Study of Analysis for Autonomous Vehicle Collision Using Text Embedding (텍스트 임베딩을 이용한 자율주행자동차 교통사고 분석에 관한 연구)

  • Park, Sangmin;Lee, Hwanpil;So, Jaehyun(Jason);Yun, Ilsoo
    • The Journal of The Korea Institute of Intelligent Transport Systems
    • /
    • v.20 no.1
    • /
    • pp.160-173
    • /
    • 2021
  • Recently, research on the development of autonomous vehicles has increased worldwide. Moreover, a means to identify and analyze the characteristics of traffic accidents of autonomous vehicles is needed. Accordingly, traffic accident data of autonomous vehicles are being collected in California, USA. This research examined the characteristics of traffic accidents of autonomous vehicles. Primarily, traffic accident data for autonomous vehicles were analyzed, and the text data used text-embedding techniques to derive major keywords and four topics. The methodology of this study is expected to be used in the analysis of traffic accidents in autonomous vehicles.

Investigating Topics of Incivility Related to COVID-19 on Twitter: Analysis of Targets and Keywords of Hate Speech (트위터에서의 COVID-19와 관련된 반시민성 주제 탐색: 혐오 대상 및 키워드 분석)

  • Kim, Kyuli;Oh, Chanhee;Zhu, Yongjun
    • Journal of the Korean Society for information Management
    • /
    • v.39 no.1
    • /
    • pp.331-350
    • /
    • 2022
  • This study aims to understand topics of incivility related to COVID-19 from analyzing Twitter posts including COVID-19-related hate speech. To achieve the goal, a total of 63,802 tweets that were created between December 1st, 2019, and August 31st, 2021, covering three targets of hate speech including region and public facilities, groups of people, and religion were analyzed. Frequency analysis, dynamic topic modeling, and keyword co-occurrence network analysis were used to explore topics and keywords. 1) Results of frequency analysis revealed that hate against regions and public facilities showed a relatively increasing trend while hate against specific groups of people and religion showed a relatively decreasing trend. 2) Results of dynamic topic modeling analysis showed keywords of each of the three targets of hate speech. Keywords of the region and public facilities included "Daegu, Gyeongbuk local hate", "interregional hate", and "public facility hate"; groups of people included "China hate", "virus spreaders", and "outdoor activity sanctions"; and religion included "Shincheonji", "Christianity", "religious infection", "refusal of quarantine", and "places visited by confirmed cases". 3) Similarly, results of keyword co-occurrence network analysis revealed keywords of three targets: region and public facilities (Corona, Daegu, confirmed cases, Shincheonji, Gyeongbuk, region); specific groups of people (Coronavirus, Wuhan pneumonia, Wuhan, China, Chinese, People, Entry, Banned); and religion (Corona, Church, Daegu, confirmed cases, infection). This study attempted to grasp the public's anti-citizenship public opinion related to COVID-19 by identifying domestic COVID-19 hate targets and keywords using social media. In particular, it is meaningful to grasp public opinion on incivility topics and hate emotions expressed on social media using data mining techniques for hate-related to COVID-19, which has not been attempted in previous studies. In addition, the results of this study suggest practical implications in that they can be based on basic data for contributing to the establishment of systems and policies for cultural communication measures in preparation for the post-COVID-19 era.

Current Status and Agenda for Regional Central Library Social Minority Service (국내 지역대표도서관 소수자서비스의 현황과 과제)

  • Chul Jung
    • Journal of Korean Library and Information Science Society
    • /
    • v.53 no.4
    • /
    • pp.233-266
    • /
    • 2022
  • The purpose of this study is to derive and propose agenda to improve the quality of minority services provided by regional cental libraries at the present time when information gap is deepening. First, text mining and topic modeling were conducted on 144 studies in the field of library and information science that dealt with minorities, and the discussions surrounding minorities in the domestic library world were examined in detail. Next, the current status of services for minorities in Regional central libraries were examined in detail, and tasks requiring discussion were sought in planning and operation of services for minorities in Regional central libraries. To this end, interviews were conducted with practitioners, in charge of services for minorities at Regional central libraries. Specifically, 1) awareness of minorities by practitioners, 2) current status of minority services, and 3) responsibility and role of Regional central libraries for planning and operating minority services and necessary support were analyzed. Based on the analysis results, the following tasks were derived. 1) Recategorization of minority groups, 2) Establishment of reference resource, 3) Reinforcement of education, and 4) Cooperation support between regional representative libraries and local public libraries were derived and suggested.

Prediction of Customer Satisfaction Using RFE-SHAP Feature Selection Method (RFE-SHAP을 활용한 온라인 리뷰를 통한 고객 만족도 예측)

  • Olga Chernyaeva;Taeho Hong
    • Journal of Intelligence and Information Systems
    • /
    • v.29 no.4
    • /
    • pp.325-345
    • /
    • 2023
  • In the rapidly evolving domain of e-commerce, our study presents a cohesive approach to enhance customer satisfaction prediction from online reviews, aligning methodological innovation with practical insights. We integrate the RFE-SHAP feature selection with LDA topic modeling to streamline predictive analytics in e-commerce. This integration facilitates the identification of key features-specifically, narrowing down from an initial set of 28 to an optimal subset of 14 features for the Random Forest algorithm. Our approach strategically mitigates the common issue of overfitting in models with an excess of features, leading to an improved accuracy rate of 84% in our Random Forest model. Central to our analysis is the understanding that certain aspects in review content, such as quality, fit, and durability, play a pivotal role in influencing customer satisfaction, especially in the clothing sector. We delve into explaining how each of these selected features impacts customer satisfaction, providing a comprehensive view of the elements most appreciated by customers. Our research makes significant contributions in two key areas. First, it enhances predictive modeling within the realm of e-commerce analytics by introducing a streamlined, feature-centric approach. This refinement in methodology not only bolsters the accuracy of customer satisfaction predictions but also sets a new standard for handling feature selection in predictive models. Second, the study provides actionable insights for e-commerce platforms, especially those in the clothing sector. By highlighting which aspects of customer reviews-like quality, fit, and durability-most influence satisfaction, we offer a strategic direction for businesses to tailor their products and services.

Efficient Data Management for Hull Condition Assessment

  • Jaramillo, David;Cabos, Christian;Renard, Philippe
    • International Journal of CAD/CAM
    • /
    • v.6 no.1
    • /
    • pp.9-17
    • /
    • 2006
  • Performing inspections for Hull Condition Monitoring and Assessment as stipulated in IACS unified requirements and IMO's Condition Assessment Scheme (CAS) IMO Resolution MEPC.94(46), 2001, Condition Assessment Scheme, IMO Resolution MEPC.111(50), 2003, Amendments to regulation 13G, addition of new regulation 13H involves a huge amount of measurement data to be collected, processed, analysed and maintained. Information to be recorded consists of thickness measurements and visual assessment of coating and cracks. The amount of data and increasing requirements with respect to condition assessment demand efficient computer support. Currently, due to the lack of standardization for this kind of data, the thickness measurements are recorded manually on ship drawings or tables. In this form, handling of the measurements is tedious and error-prone and assessment is difficult. Data reporting and analysis takes a long time, leading to some repairs being performed only at the next docking of the ship or making an additional docking necessary. The recently started ED funded project CAS addresses this topic and develops-as a first step-a data model for Hull Condition Monitoring and Assessment (HCMA) based on XML-technology. The model includes simple geometry representation to facilitate a graphically supported data collection as well as an easy visualisation of the measurement results. In order to ensure compatibility with the current way of working, the content of the data model is strictly confined to the requirements of the measurement process. Appropriate data interfaces to classification software will enable rapid assessment by the classification societies, thus improving the process in terms of time and cost savings. In particular, decision-making can be done while the ship is still in the dock for maintenance.

Analysis of Global Entrepreneurship Trends Due to COVID-19: Focusing on Crunchbase (Covid-19에 따른 글로벌 창업 트렌드 분석: Crunchbase를 중심으로)

  • Shinho Kim;Youngjung Geum
    • Asia-Pacific Journal of Business Venturing and Entrepreneurship
    • /
    • v.18 no.3
    • /
    • pp.141-156
    • /
    • 2023
  • Due to the unprecedented worldwide pandemic of the new Covid-19 infection, business trends of companies have changed significantly. Therefore, it is strongly required to monitor the rapid changes of innovation trends to design and plan future businesses. Since the pandemic, many studies have attempted to analyze business changes, but they are limited to specific industries and are insufficient in terms of data objectivity. In response, this study aims to analyze business trends after Covid-19 using Crunchbase, a global startup data. The data is collected and preprocessed every two years from 2018 to 2021 to compare the business trends. To capture the major trends, a network analysis is conducted for the industry groups and industry information based on the co-occurrence. To analyze the minor trends, LDA-based topic modelling and word2vec-based clustering is used. As a result, e-commerce, education, delivery, game and entertainment industries are promising based on their technological advances, showing extension and diversification of industry boundaries as well as digitalization and servitization of business contents. This study is expected to help venture capitalists and entrepreneurs to understand the rapid changes under the impact of Covid-19 and to make right decisions for the future.

  • PDF