• Title/Summary/Keyword: text mining technique

Search Result 222, Processing Time 0.03 seconds

Development of Safety Evaluation Scenarios for Autonomous Vehicle Tests Using 5-Layer Format(Case of the Community Road) (5-레이어 포맷을 이용한 자율주행자동차 실험 시나리오 개발(커뮤니티부 도로를 중심으로))

  • Park, Sangmin;So, Jaehyun(Jason);Ko, Hangeom;Jeong, Harim;Yun, Ilsoo
    • The Journal of The Korea Institute of Intelligent Transport Systems
    • /
    • v.18 no.2
    • /
    • pp.114-128
    • /
    • 2019
  • Recently, the interest in the safety of autonomous vehicles has globally been increasing. Also, there is controversy over the reliability and safety about autonomous vehicle. In Korea, the K-City which is a test-bed for testing autonomous vehicles has been constructing. There is a need for test scenarios for autonomous vehicle test in terms of safety. The purpose of this study is to develop the evaluation scenario for autonomous vehicle at community roads in K-City by using crash data collected by the Korea National Police Agency and a text-mining technique. As a result, 24 scenarios were developed in order to test autonomous vehicle in community roads. Finally, the logical and concrete scenario forms were derived based on the Pegasus 5-layer format.

A Study on the Efficient Countermeasures of Military in Accordance with Changing Security Environments (4차 산업혁명에 따른 군사보안 발전방안 연구)

  • Kim, Doo Hwan;Park, Ho Jeong
    • Convergence Security Journal
    • /
    • v.20 no.4
    • /
    • pp.47-59
    • /
    • 2020
  • The Army, which is dreaming of a military leap forward through the fourth industrial revolution, needs to also consider the side effects and adverse functions of the fourth industrial revolution. In particular, this study conducted an analysis of whether it was consistent with the global technological trend of normal 'military security'. This paper focuses on the countermeasures that could result from 4th industrial revolution by utilizing the text-mining technique and social network technique of big data. 1. Active promotion of a convergence program with private, public, militaryand industrial, academic, and solidarity, 2. Information Sharing for International Cooperation and Cooperation in Cyber security, 3. Military Innovation and Military Unsymmetric Cyber security innovation, 4.The Establishment of Military Security Convergence Interface Management System in accordance with the Fourth Industrial Revolution, 5. Cooperation in the transition from technology engineering to social technology, 6. Establishing a military security governance system in the military, 7. Specifying confidential military digital data We look forward to providing useful information so that the results of this study can help develop the military and enhance military confidentiality.

A Web Text Mining Technique using Semantic Relations based on WordNet and Text Corpus (WordNet과 텍스트 코퍼스에 기반한 의미 관계를 활용한 웹 텍스트 조사 기법)

  • Lee, Ho-Suk;Kim, Yung-Taek
    • Proceedings of the Korean Information Science Society Conference
    • /
    • 2007.06c
    • /
    • pp.181-184
    • /
    • 2007
  • 본 논문은 문장 분석에 의하여 의미 관계를 생성하고 의미 네트워크에 의하여 유사한 의미 관계를 고려하는 의미 중심의 웹 텍스트 검색 기법에 대하여 논의한다. 기존의 웹 텍스트 검색은 단어만을 혹은 의미 관계만을 고려한 검색이었다고 할 수 있다. 그러나 문장 분석에 의한 의미 관계의 생성과 의미 네트워크에 의한 유사한 의미 관계의 고려는 기존의 단어 중심 혹은 의미 관계 중심의 검색 한계를 넘어서 유사한 의미 관계를 고려한 좀 더 포괄적이고 계층적인 검색을 가능하게 할 것으로 생각된다.

  • PDF

Pilot Experiment for Named Entity Recognition of Construction-related Organizations from Unstructured Text Data

  • Baek, Seungwon;Han, Seung H.;Jung, Wooyong;Kim, Yuri
    • International conference on construction engineering and project management
    • /
    • 2022.06a
    • /
    • pp.847-854
    • /
    • 2022
  • The aim of this study is to develop a Named Entity Recognition (NER) model to automatically identify construction-related organizations from news articles. This study collected news articles using web crawling technique and construction-related organizations were labeled within a total of 1,000 news articles. The Bidirectional Encoder Representations from Transformers (BERT) model was used to recognize clients, constructors, consultants, engineers, and others. As a pilot experiment of this study, the best average F1 score of NER was 0.692. The result of this study is expected to contribute to the establishment of international business strategies by collecting timely information and analyzing it automatically.

  • PDF

Comparative Analysis of Happiness and Unhappiness using Topic Modeling: Korea, U.S., U.K., and Brazil (토픽모델링을 이용한 국가간 행복과 불행 토픽 비교 분석 : 한국, 미국, 영국, 브라질)

  • Lee, So-Hyun;Lee, Yun-Kyung;Song, Eui-ryung;Kim, Hee-Woong
    • Knowledge Management Research
    • /
    • v.18 no.3
    • /
    • pp.101-124
    • /
    • 2017
  • Recently, 'happiness' has become a major issue of national level, exceeding the matter of personal issue. Especially, Korea has actually increased its GDP by focusing on the economic growth for decades, and now it has achieved the economic/technical development as an IT power. However, Korean people's satisfaction with life called 'happiness index' is moving back every year. Even though there have been continuous efforts to enhance the national happiness by mentioning it as an essential issue in the national level, there are not many researches related to it. This study drew measures to enhance happiness by extracting happiness factors and unhappiness factors of Korea through social network service. Especially, it aims to analyze, compare, and apply happiness factors and unhappiness factors of three countries such as the US, UK, and Brazil with higher happiness indexes than Korea. For this, through the topic modeling of text mining technique, postings including keywords about happiness and unhappiness were collected/analyzed from Twitter of Korea, the US, UK, and Brazil. The significance of this study is to discuss measures to increase happiness and to decrease unhappiness by mining/analyzing the actual public opinions about happiness and unhappiness in four countries like Korea, the US, UK, and Brazil by using the topic modeling. Through this, the quality of life of Korean people could be improved by suggesting measures to enhance happiness and to decrease unhappiness in the level of individual, family, society, and government.

Social Media Bigdata Analysis Based on Information Security Keyword Using Text Mining (텍스트마이닝을 활용한 정보보호 키워드 기반 소셜미디어 빅데이터 분석)

  • Chung, JinMyeong;Park, YoungHo
    • Journal of Korea Society of Industrial Information Systems
    • /
    • v.27 no.5
    • /
    • pp.37-48
    • /
    • 2022
  • With development of Digital Technology, social issues are communicated through digital-based platform such as SNS and form public opinion. This study attempted to analyze big data from Twitter, a world-renowned social network service, and find out the public opinion. After collecting Twitter data based on 14 keywords for 1 year in 2021, analyzed the term-frequency and relationship among keyword documents with pearson correlation coefficient using Data-mining Technology. Furthermore, the 6 main topics that on the center of information security field in 2021 were derived through topic modeling using the LDA(Latent Dirichlet Allocation) technique. These results are expected to be used as basic data especially finding key agenda when establishing strategies for the next step related industries or establishing government policies.

Data Empowered Insights for Sustainability of Korean MNEs

  • PARK, Young-Eun
    • The Journal of Asian Finance, Economics and Business
    • /
    • v.6 no.3
    • /
    • pp.173-183
    • /
    • 2019
  • This study aims to utilize big data contents of news and social media for developing a corporate strategy of multinational enterprises and their global decision-making through the data mining technique, especially text mining. In this paper, the data of 2 news media (BBC and CNN) and 2 social media (Facebook and Twitter) were collected for the three global leading Korean companies (Samsung, Hyundai Motor Company, and LG) from April, 2018 to April, 2019. The findings of this paper have shown that traditional news media and also modern social media have become devastating tools to extract global trends or phenomena for businesses. Moreover, this presents that a company can adopt a two-track strategy through two different types of media by deriving the key issues or trends from news media channels and also grasping consumers' sentiments, preference or issues of interest such as battery or design from social media. In addition, analyzing the texts of those media and understanding the association rules greatly contribute to the comparison between two different types of media channels to see the difference. Lastly, this provides meaningful and valuable data empowered insights to find a future direction comprehensively and develop a global strategy for sustainability of business.

Comparison of Online Shopping Mall BEST 100 using Exploratory Data Analysis (탐색적 자료 분석(EDA) 기법을 활용한 국내 11개 대표 온라인 쇼핑몰 BEST 100 비교)

  • Kang, Jicheon;Kang, Juyoung
    • The Journal of Bigdata
    • /
    • v.3 no.1
    • /
    • pp.1-12
    • /
    • 2018
  • Since the beginning of the first online shopping mall, BEST 100 is being provided as the core of all shopping mall websites. BEST 100 is greatly important because consumers can identify popular products at a glance. However, there are only studies using sales outcome indicators, and prior studies using BEST 100 are insignificant. Therefore, this study selected 11 online shopping malls and compared their main characteristics. As a research method, exploratory data analysis technique (EDA) was used by crawling the BEST 100 components of each shopping mall website, such as product name, price, and free shipping check. As a result, the total average price of 11 shopping malls was 72,891.41 won. Sales texts were classified into 8 categories by text mining. The most common category was the fashion part, but it is significant that the setting of the category analyzed the marketing text, not the product attribute. This study has implications for understanding the current online market flow and suggesting future directions by using EDA.

A Case Study on Characteristics of Gender and Major in Career Preparation of University Students from Low-income Families: Application of Text Frequency Analysis and Association Rules (저소득층 대학생들의 진로준비과정에서의 성별·전공별 특성에 대한 사례연구: 텍스트 빈도분석과 연관분석의 적용)

  • Lee, Jihye;Lee, Shinhye
    • Journal of Digital Convergence
    • /
    • v.16 no.12
    • /
    • pp.61-69
    • /
    • 2018
  • This study aims to understand and to infer the implications from the career preparation experiences of low-income university students in the context of high youth unemployment rate and the polarization of the social classes. For this purpose, we selected 13 university students who received scholarship from the S scholarship foundation and conducted analysis using text mining techniques based on the six-time interviews. According to the results, university students seem to be influenced by home environment and income level when recalling previous academic experience or designing career during the interview process. Also, these differences were found to have different characteristics according to gender and major. This study is meaningful in that the qualitative research data is analyzed by applying the text mining technique in a convergent way. As a result, the college life and career preparation of low-income university students were explored through the frequency and relation of words.

What has Korea told in the WTO? : An analysis on the Ministerial Conference Statements (WTO에서 한국은 무슨 말을 해왔나?: 각료회의 대표발언문 분석을 중심으로)

  • Jeong-meen Suh
    • Korea Trade Review
    • /
    • v.48 no.1
    • /
    • pp.29-53
    • /
    • 2023
  • This study analyzes the statements made by representatives of member countries at the WTO Ministerial Conference (MC), the highest decision-making body of the WTO, to examine the position and attitude that Korea has shown at the WTO during the last 27 years. After constructing text dataset by extracting about 1,800 statement documents made by member countries from the WTO document database, the text mining technique is applied to figure out the characteristics of Korea's statements compared to other member countries. Through formal characteristics such as the number of remarks and length of speech, basic attitudes such as continuity of Korea's interest in the WTO and the level of interest in the WTO are measured. In terms of substantive characteristics, the topics in the statements of Korea are categorized through the LDA topic model, and the keywords of Korea for each session are analyzed through comparative analysis with statements by other member countries.