• Title/Summary/Keyword: big data collection (빅데이터 수집)

Search Result 1,010, Processing Time 0.022 seconds

A Named Entity Recognition Model in Criminal Investigation Domain using Pretrained Language Model (사전학습 언어모델을 활용한 범죄수사 도메인 개체명 인식)

  • Kim, Hee-Dou;Lim, Heuiseok
    • Journal of the Korea Convergence Society / v.13 no.2 / pp.13-20 / 2022
  • This study develops a named entity recognition (NER) model specialized for the criminal investigation domain using deep learning techniques. We propose a system that automatically extracts and categorizes crime-related information from text data such as criminal judgments and investigation documents, so that it can support future data-driven crime analysis for prevention and investigation. For this study, criminal investigation domain text was collected and the required entity types were newly defined from the perspective of criminal analysis. The proposed model, which applies KoELECTRA, a pre-trained language model that has recently shown high performance in natural language processing, achieves a micro-average (micro avg) F1-score of 98% and a macro-average (macro avg) F1-score of 95% on the 9 main categories of the crime-domain NER experiment data, and a micro avg F1-score of 98% and a macro avg F1-score of 62% on the 56 subcategories. The proposed model is analyzed from the perspective of future improvement and utilization.
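
The gap between the micro and macro averages reported for the 56 subcategories is typical when a few labels dominate the data. The following is a minimal sketch of how the two averages are computed, assuming simple per-token single-label classification; the abstract does not state the exact evaluation protocol, and the function name `f1_scores` and the toy labels are hypothetical.

```python
from collections import Counter


def f1_scores(gold, pred):
    """Return (micro_f1, macro_f1) for two equal-length label sequences."""
    labels = sorted(set(gold) | set(pred))
    tp, fp, fn = Counter(), Counter(), Counter()
    for g, p in zip(gold, pred):
        if g == p:
            tp[g] += 1
        else:
            fp[p] += 1  # predicted label p where it was wrong
            fn[g] += 1  # missed the gold label g

    def f1(true_pos, false_pos, false_neg):
        denom = 2 * true_pos + false_pos + false_neg
        return 2 * true_pos / denom if denom else 0.0

    # Micro: pool the counts over all labels, then compute one F1.
    micro = f1(sum(tp.values()), sum(fp.values()), sum(fn.values()))
    # Macro: compute F1 per label, then take the unweighted mean.
    macro = sum(f1(tp[l], fp[l], fn[l]) for l in labels) / len(labels)
    return micro, macro


# Hypothetical toy data: one frequent label and one rare label.
gold = ["PERSON"] * 98 + ["WEAPON"] * 2
pred = ["PERSON"] * 98 + ["PERSON", "WEAPON"]
micro_f1, macro_f1 = f1_scores(gold, pred)
print(f"micro={micro_f1:.3f} macro={macro_f1:.3f}")
```

A single error on the rare label barely moves the micro average but pulls the macro average down sharply, because every label counts equally in the macro mean.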

Analysis of global trends on smart manufacturing technology using topic modeling (토픽모델링을 활용한 주요국의 스마트제조 기술 동향 분석)

  • Oh, Yoonhwan;Moon, HyungBin
    • Journal of Korea Society of Industrial Information Systems / v.27 no.4 / pp.65-79 / 2022
  • This study identified smart manufacturing technologies using patents and topic modeling, and compared technology development trends in countries such as the United States, Japan, Germany, China, and South Korea. For this purpose, the study collected patents in the United States and Europe between 1991 and 2020, processed the patent abstracts, and identified topics by applying a latent Dirichlet allocation model to the data. As a result, technologies related to smart manufacturing were divided into seven categories. At the global level, the proportion of patents in 'data processing system' and 'thermal/fluid management' technologies was found to be increasing. Considering that South Korea is relatively competitive in thermal/fluid management technologies related to smart manufacturing, promoting smart manufacturing in the heavy and chemical industries would be a successful strategy for South Korea. This study is significant in that it overcomes the limitations of quantitative technology level evaluation and proposes a new methodology that applies text mining.

A study of changes in user experience and service evaluation - Topic modeling of Netflix app reviews (사용자 경험과 서비스 평가의 변화에 관한 연구 - 넷플릭스 앱 리뷰 토픽 모델링을 통해)

  • Seon Yeong Yu;Mi Jin Noh;Yang Sok Kim;Mu Moung Cho Han
    • Smart Media Journal / v.12 no.6 / pp.27-34 / 2023
  • As Netflix usage has increased due to the COVID-19 pandemic, users' experience with the service has also increased. This study therefore conducts a topic modeling analysis of Netflix review data to explore the changes in Netflix user experience and service before and after the COVID-19 pandemic. We collected Netflix app review data from the Google Play Store using the Google Play Scraper library, and used topic modeling to examine keyword differences between app reviews before and after the pandemic. The analysis revealed four main topics: Netflix app features, Netflix content, Netflix service usage, and Netflix overall reviews. After the pandemic, when user experience increased, users tended to use more diverse and detailed keywords in their reviews. By using Netflix review data to analyze users' opinions, this study shows the changes in user experience of Netflix services before and after the pandemic, which can be used as a guide to strengthen competitiveness in the OTT market.

The Development of a Web-based Realtime Monitoring System for Facility Energy Uses in Forging Processes (단조공정에서 설비 에너지 사용에 대한 웹 기반 실시간 모니터링 시스템 개발)

  • Hwang, Hyun-suk;Seo, Young-won;Kim, Tae-yeon
    • Journal of Internet Computing and Services / v.19 no.1 / pp.87-95 / 2018
  • Due to global warming and rising energy costs around the world, interest in energy saving and efficiency has increased. In particular, forging factories need methods to save energy and increase productivity because they consume large amounts of energy. To address this problem, we propose a system that covers the collection, monitoring, and analysis processes and monitors the energy use of each facility in real time based on IoT devices. The system consists of worksheet management, facility/energy management, real-time monitoring, history search, and data analysis, and connects with the existing ERP/MES systems in manufacturing factories. The energy monitoring process presents the energy use collected from IoT devices connected to the gas meter and wattmeter installed at each facility. The system provides changes in energy use, usage fees, energy conversion, and greenhouse gas information in real time on the Web and mobile devices. The system will be enhanced with energy saving technology by analyzing the accumulated big data on energy use. We can also propose a method to increase productivity by integrating the system with digitalized worksheets and optimized models for the production process.

A Study on Disaster Safety Management Policy Using the 4th Industrial Revolution and ICBMS (4차 산업혁명과 ICBMS를 활용한 재난안전관리에 관한 연구)

  • Kang, Heau-Jo
    • Journal of Digital Contents Society / v.18 no.6 / pp.1213-1216 / 2017
  • Recently, as climate change has increased the uncertainty of the disaster environment, the effects of disasters have grown larger because disasters increasingly combine, diversify in type, and cause secondary damage. In this paper, we apply ICBMS, through intelligent information technology and big data analysis, to all processes of disaster safety management in order to minimize human, social, economic, and environmental damage from accidents or disasters. We examine disaster safety management 4.0 in terms of prevention by control technology; preparation by expanding education and training so that responses are learned by the body; response by advanced unmanned disaster response technology; restoration by creating a local community environmental ecosystem; and investigation and analysis by intelligent information technology. In addition, technical limitations and problems in the 4th industrial revolution and the application of big data are analyzed, and alternatives and strategies to overcome them are suggested.

A reviews on the social network analysis using R (R을 이용한 사회연결망 분석에 대한 고찰)

  • Choi, Kyoungho;Yoo, Jin Ah
    • Journal of the Korea Convergence Society / v.6 no.1 / pp.77-83 / 2015
  • Although social network analysis (SNA) has been used in various fields, especially social science fields such as politics, journalism, and public administration, as well as in natural science, there are few studies that introduce the analysis tools. Performing SNA requires collecting data fit for the purpose, deriving statistical values, and visualizing results with an analysis tool, but studies that explain these steps systematically are still insufficient. In this study, we therefore introduce the analytic process, from data input to interpretation, with empirical data using the R program, which is free, in order to help researchers who plan to use SNA. The empirical data in this study are citation data from domestic scientific journals on food, supplied by the citation index DB of Korean scientific journals. As a study methodology, SNA is a new paradigm that can substitute for existing research methods as well as complement statistical analysis. Therefore, this study should contribute to the vitalization of SNA.

A Study on the Roles of Academic Library for Supporting Class and Learning Activities in Korea (대학도서관의 수업·학습 활동 지원 역할에 관한 연구)

  • Lee, Yong-Jae;Lee, Ji-Wook
    • Journal of Korean Library and Information Science Society / v.50 no.4 / pp.359-379 / 2019
  • This study aims to suggest ways to reinforce academic libraries' support for users' class and learning activities. For this purpose, the study collected the development plans of academic libraries in Korea and analysed their plans for supporting class and learning activities. The results show that most libraries emphasized 'expansion of learning material' and included it in their development plans. The next most common action plans were, in order, 'expansion of reading education and reading programs', 'expansion of electronic materials', and 'expansion of characterized materials'. This study suggests 'user-centered collection development and expansion of learning materials', 'activation of library services making use of big data', and 'enlargement of engagement services for handicapped and foreign students' as ways to strengthen the services of academic libraries that support users' class and learning activities.

Design and Implementation of Self-installing Agricultural Automation System for Remote Monitoring and Control Based on LPWA Technology (저전력 장거리 무선통신기술(LPWA) 기반 원격감시 및 제어가 가능한 자가설치형 농업 자동화 시스템 설계 및 구현)

  • Baek, JaeGu;Lee, Hyung-Woo
    • Journal of Internet of Things and Convergence / v.3 no.1 / pp.13-19 / 2017
  • In this paper, we designed and implemented Thing Connected-Green, a self-installing agricultural automation system capable of remote monitoring and control based on Low Power Wide Area (LPWA) communication technology. Farming requires water, sunlight, soil, fertilizer, temperature control, and so on, and these elements can be remotely monitored and controlled using an automated system. With this system, it is possible to build an agricultural automation system that can be optimized for the kind of plant and the cultivation environment, from plastic greenhouses (vinyl houses) to flower gardens. The information gathered from the sensors is stored on the server through the gateway, and the optimal cultivation environment can be set and operated from a smartphone based on the accumulated big data.

A Study on Tourism Resource Strategy of Film Location using Social Bigdata based on SNS Trend Analysis of Jeonju Area (소셜 빅데이터를 활용한 영화촬영지 관광자원화 방안 -전주 지역의 관광체험 SNS 동향 분석을 토대로-)

  • Park, Ji-Yeong;Kim, Geon;Kim, Chan-Young;Oh, Hyo-Jung
    • The Journal of the Korea Contents Association / v.16 no.11 / pp.477-487 / 2016
  • In 1995, the filming location of a drama became famous, which increased the number of tourists to that area. Since then, many local governments have tried to host filming in their regions to create potential tourist attractions. In the same vein, Jeonju has attempted to host the International Film Festival and to set up the Jeonju Film Commission and Jeonju Cinema Complex. However, although the city already has rich infrastructure for making films, it has hardly tried to use its filming locations as tourist attractions. This study suggests four ways of using filming locations as tourist attractions to stimulate Jeonju's economy and improve its cultural image. We first collect social big data related to tourists at filming locations and tourist attractions in Jeonju from Twitter, which is the most representative SNS, and then perform frequency and trend analysis. We also investigate the major factors behind visits to tourist attractions based on content analysis of tweet mentions.

An Exploratory Study of Happiness and Unhappiness Among Koreans based on Text Mining Techniques (텍스트마이닝 기법을 활용한 한국인의 행복과 불행 탐색연구)

  • Park, Sanghyeon;Do, Kanghyuk;Kim, Hakyeong;Park, Gaeun;Yun, Jinhyeok;Kim, Kyungil
    • The Journal of the Korea Contents Association / v.18 no.7 / pp.10-27 / 2018
  • The purpose of this study is to explore the meaning of happiness and unhappiness in Korean society through text mining analysis. Words similar to the keywords (happiness/unhappiness) are extracted from an online news portal using the Word2Vec and TF-IDF methods. We also use the K-LIWC dictionary to perform sentiment analysis of words associated with happiness and unhappiness. In the TF-IDF analysis, happiness and unhappiness are highly related to social factors and the social issues of the year. In the Word2Vec analysis, 'Hope' was similar to happiness across six years. In the K-LIWC analysis, 'money/financial issues', 'school', and 'communication' are highly related to happiness and unhappiness. In addition, 'physical condition and symptom' is highly related to unhappiness. Implications, limitations, and suggestions for future research are also discussed.
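
The TF-IDF weighting mentioned in this abstract can be illustrated with a minimal sketch. It assumes the plain textbook form (relative term frequency times the log of inverse document frequency); the abstract does not say which variant or library the authors used, and the function name `tf_idf` and the toy documents are hypothetical.

```python
import math
from collections import Counter


def tf_idf(docs):
    """docs: list of token lists. Returns one {term: weight} dict per document."""
    n_docs = len(docs)
    # Document frequency: in how many documents each term appears.
    doc_freq = Counter(term for doc in docs for term in set(doc))
    weights = []
    for doc in docs:
        counts = Counter(doc)
        total = len(doc)
        weights.append({
            term: (count / total) * math.log(n_docs / doc_freq[term])
            for term, count in counts.items()
        })
    return weights


# Hypothetical toy corpus standing in for tokenized news articles.
docs = [
    ["happiness", "family", "hope"],
    ["happiness", "money", "school"],
    ["happiness", "hope", "health"],
]
scores = tf_idf(docs)
for i, doc_scores in enumerate(scores):
    top = max(doc_scores, key=doc_scores.get)
    print(i, top, round(doc_scores[top], 3))
```

A term that occurs in every document, such as the search keyword itself, gets a weight of zero, so the highest-weighted terms are the ones that distinguish one document (or one year of articles) from the rest.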