• 제목/요약/키워드: Big Data Trend Analysis

Search Result 330, Processing Time 0.03 seconds

A study on the electric railway load pattern analysis and building database program (전기철도 부하특성 분석 및 데이터베이스 구축)

  • Jeon, Yong-Joo;Kim, Chi-Tae;Lee, Gi-Chun;Lee, Sung-Uk
    • Proceedings of the KSR Conference
    • /
    • 2006.11b
    • /
    • pp.719-722
    • /
    • 2006
  • At present, In korea one of big characteristics in electricity power market is unique seller but in the near future competitions are expected in the market. Another big trend is development of IT technology. Through IT, remote inspection for power usage are possible. So huge power consumer like KORAIL it is necessary to investigate power consumption pattern. This paper presents load consumption pattern for representative substation and billing system database program. Base on the substation annual power usage data, the characteristic of the substation power consumption are investigated and effective electrical billing system are compared each other. The database program was properly designed to examine the billings.

  • PDF

Research trends in statistics for domestic and international journal using paper abstract data (초록데이터를 활용한 국내외 통계학 분야 연구동향)

  • Yang, Jong-Hoon;Kwak, Il-Youp
    • The Korean Journal of Applied Statistics
    • /
    • v.34 no.2
    • /
    • pp.267-278
    • /
    • 2021
  • As time goes by, the amount of data is increasing regardless of government, business, domestic or overseas. Accordingly, research on big data is increasing in academia. Statistics is one of the major disciplines of big data research, and it will be interesting to understand the research trend of statistics through big data in the growing number of papers in statistics. In this study, we analyzed what studies are being conducted through abstract data of statistical papers in Korea and abroad. Research trends in domestic and international were analyzed through the frequency of keyword data of the papers, and the relationship between the keywords was visualized through the Word Embedding method. In addition to the keywords selected by the authors, words that are importantly used in statistical papers selected through Textrank were also visualized. Lastly, 10 topics were investigated by applying the LDA technique to the abstract data. Through the analysis of each topic, we investigated which research topics are frequently studied and which words are used importantly.

Twitter Issue Tracking System by Topic Modeling Techniques (토픽 모델링을 이용한 트위터 이슈 트래킹 시스템)

  • Bae, Jung-Hwan;Han, Nam-Gi;Song, Min
    • Journal of Intelligence and Information Systems
    • /
    • v.20 no.2
    • /
    • pp.109-122
    • /
    • 2014
  • People are nowadays creating a tremendous amount of data on Social Network Service (SNS). In particular, the incorporation of SNS into mobile devices has resulted in massive amounts of data generation, thereby greatly influencing society. This is an unmatched phenomenon in history, and now we live in the Age of Big Data. SNS Data is defined as a condition of Big Data where the amount of data (volume), data input and output speeds (velocity), and the variety of data types (variety) are satisfied. If someone intends to discover the trend of an issue in SNS Big Data, this information can be used as a new important source for the creation of new values because this information covers the whole of society. In this study, a Twitter Issue Tracking System (TITS) is designed and established to meet the needs of analyzing SNS Big Data. TITS extracts issues from Twitter texts and visualizes them on the web. The proposed system provides the following four functions: (1) Provide the topic keyword set that corresponds to daily ranking; (2) Visualize the daily time series graph of a topic for the duration of a month; (3) Provide the importance of a topic through a treemap based on the score system and frequency; (4) Visualize the daily time-series graph of keywords by searching the keyword; The present study analyzes the Big Data generated by SNS in real time. SNS Big Data analysis requires various natural language processing techniques, including the removal of stop words, and noun extraction for processing various unrefined forms of unstructured data. In addition, such analysis requires the latest big data technology to process rapidly a large amount of real-time data, such as the Hadoop distributed system or NoSQL, which is an alternative to relational database. We built TITS based on Hadoop to optimize the processing of big data because Hadoop is designed to scale up from single node computing to thousands of machines. Furthermore, we use MongoDB, which is classified as a NoSQL database. In addition, MongoDB is an open source platform, document-oriented database that provides high performance, high availability, and automatic scaling. Unlike existing relational database, there are no schema or tables with MongoDB, and its most important goal is that of data accessibility and data processing performance. In the Age of Big Data, the visualization of Big Data is more attractive to the Big Data community because it helps analysts to examine such data easily and clearly. Therefore, TITS uses the d3.js library as a visualization tool. This library is designed for the purpose of creating Data Driven Documents that bind document object model (DOM) and any data; the interaction between data is easy and useful for managing real-time data stream with smooth animation. In addition, TITS uses a bootstrap made of pre-configured plug-in style sheets and JavaScript libraries to build a web system. The TITS Graphical User Interface (GUI) is designed using these libraries, and it is capable of detecting issues on Twitter in an easy and intuitive manner. The proposed work demonstrates the superiority of our issue detection techniques by matching detected issues with corresponding online news articles. The contributions of the present study are threefold. First, we suggest an alternative approach to real-time big data analysis, which has become an extremely important issue. Second, we apply a topic modeling technique that is used in various research areas, including Library and Information Science (LIS). Based on this, we can confirm the utility of storytelling and time series analysis. Third, we develop a web-based system, and make the system available for the real-time discovery of topics. The present study conducted experiments with nearly 150 million tweets in Korea during March 2013.

Analysis of Overseas Research Trends Related to Artificial Intelligence (AI) in Elementary, Middle and High School Education (초·중·고 교육분야의 인공지능(AI) 관련 해외 연구동향 분석)

  • Jung, Young-Joo;Kim, Hea-Jin
    • Journal of Korean Library and Information Science Society
    • /
    • v.52 no.3
    • /
    • pp.313-334
    • /
    • 2021
  • This study aimed to analyze AI research trends related to elementary, middle, and high school education. To this end, the related literature was collected from the SCOPUS database and the publication period of the collected literature was from 1974 to March 2021, with 154 journal papers and 571 conference papers. Research trends were analyzed based on the co-occurrences analysis technique of 4,521 words of author keyword and index keyword included in these papers. As a result of the analysis, big data, data mining, data science and deep learning were found as the latest research trends with machine learning and there was a difference between elementary, middle and high school education. It can be seen that elementary school had a lot of robot-related research, middle school had a lot of game and data-related research, and high school had various and in-depth research. In discussion, we mapped the top 50 words common to elementary, middle, and high schools with the 'Artificial Intelligence Basics' curriculum of Korean Government and '5 Big Ideas' of the United States Government so that AI research can be viewed at a glance.

An Open Source Mobile Cloud Service: Geo-spatial Image Filtering Tools Using R (오픈소스 모바일 클라우드 서비스: R 기반 공간영상정보 필터링 사례)

  • Kang, Sanggoo;Lee, Kiwon
    • Spatial Information Research
    • /
    • v.22 no.5
    • /
    • pp.1-8
    • /
    • 2014
  • Globally, mobile, cloud computing or big data are the recent marketable key terms. These trend technologies or paradigm in the ICT (Information Communication Technology) fields exert large influence on the most application fields including geo-spatial applications. Among them, cloud computing, though the early stage in Korea now, plays a important role as a platform for other trend technologies uses. Especially, mobile cloud, an integrated platform with mobile device and cloud computing can be considered as a good solution to overcome well known limitations of mobile applications and to provide more information processing functionalities to mobile users. This work is a case study to design and implement the mobile application system for geo-spatial image filtering processing operated on mobile cloud platform built using OpenStack and various open sources. Filtering processing is carried out using R environment, recently being recognized as one of big data analysis technologies. This approach is expected to be an element linking geo-spatial information for new service model development and the geo-spatial analysis service development using R.

A Study on the Strategies for Activating the Vegan Fashion Brand in the Meaning Out - Based on an Instagram Hashtag Analysis - (미닝아웃 시대의 비건 패션 브랜드 활성화 전략 연구 - 인스타그램 해시태그 분석을 중심으로 -)

  • Kyunghee Jung;Soojeong Bae
    • Journal of Fashion Business
    • /
    • v.27 no.3
    • /
    • pp.132-149
    • /
    • 2023
  • This study aims to analyze Instagram hashtags based on big data to investigate changes in consumer trends and perceptions of vegan fashion, and to derive strategies for revitalizing vegan fashion brands based on derived results. Among social media, Instagram was selected as a collection channel, and Instagram hashtags for 'Vegan Fashion' were collected from July 1, 2021 to December 31, 2021. After conducting semantic network analysis with the Ucinet 6 program based on the collected data, the CONCOR analysis on vegan fashion showed the following four clusters: 'Veganism practiced with fashion', 'Bag type of vegan fashion brand', 'Sharing vegan fashion', and 'Diversification of eco-friendly products'. Analysis results showed that the Instagram hashtag for vegan fashion confirmed the MZ generation's increased interest in vegan fashion and their thoughts to recommend and share frequently used items or brand products to people around them. CONCOR analysis of vegan fashion brands showed the following four groups: 'Differentiating the material of vegan bags', 'Eco-friendly products of vegan fashion brands', 'Interest in vegan shoes', and 'Donation campaign of vegan fashion brands'. CONCOR analysis on Meaningout showed the following four clusters: 'MZ Generation's Meaningout Start-up', 'Recommendation Platform for Skin Products', 'Value Consumption Trend for Eco-friendly Clothing', and 'Interest in Eco-friendly Packaging'. The results of this study on vegan fashion, a practical eco-friendly movement that can require changes in social responsibility and perception as issues that directly affect animals, the environment, and humans, are expected to provide basic data to help domestic vegan fashion brands develop marketing strategies.

A Korean nationwide investigation of the national trend of complex regional pain syndrome vis-à-vis age-structural transformations

  • Lee, Joon-Ho;Park, Suyeon;Kim, Jae Heon
    • The Korean Journal of Pain
    • /
    • v.34 no.3
    • /
    • pp.322-331
    • /
    • 2021
  • Background: The present study employed National Health Insurance Data to explore complex regional pain syndrome (CRPS) updated epidemiology in a Korean context. Methods: A CRPS cohort for the period 2009-2016 was created based on Korean Standard Classification of Diseases codes alongside the national registry. The general CRPS incidence rate and the yearly incidence rate trend for every CRPS type were respectively the primary and secondary outcomes. Among the analyzed risk factors were age, sex, region, and hospital level for the yearly trend of the incidence rate for every CRPS. Statistical analysis was performed via the chi-square test and the linear and logistic linear regression tests. Results: Over the research period, the number of registered patients was 122,210. The general CRPS incidence rate was 15.83 per 100,000, with 19.5 for type 1 and 12.1 for type 2. The condition exhibited a declining trend according to its overall occurrence, particularly in the case of type 2 (P < 0.001). On the other hand, registration was more pervasive among type 1 compared to type 2 patients (61.7% vs. 38.3%), while both types affected female individuals to a greater extent. Regarding age, individuals older than 60 years of age were associated with the highest prevalence in both types, regardless of sex (P < 0.001). Conclusions: CRPS displayed an overall incidence of 15.83 per 100,000 in Korea and a declining trend for every age group which showed a negative association with the aging shift phenomenon.

The Study on Data Governance Research Trends Based on Text Mining: Based on the publication of Korean academic journals from 2009 to 2021 (텍스트 마이닝을 활용한 데이터 거버넌스 연구 동향 분석: 2009년~2021년 국내 학술지 논문을 중심으로)

  • Jeong, Sun-Kyeong
    • Journal of Digital Convergence
    • /
    • v.20 no.4
    • /
    • pp.133-145
    • /
    • 2022
  • As a result of the study, the poorest keywords were information, big data, management, policy, government, law, and smart. In addition, as a result of network analysis, related research was being conducted on topics such as data industry policy, data governance performance, defense, governance, and data public. The four topics derived through topic modeling were "DG policy," "DG platform," "DG in laws," and "DG implementation," of which research related to "DG platform" showed an increasing trend, and "DG implementation" tended to shrink. This study comprehensively summarized data governance-related studies. Data governance needs to expand research areas from various perspectives and related fields such as data management and data integration policies at the organizational level, and related technologies. In the future, we can expand the analysis targets for overseas data governance and expect follow-up studies on research directions and policy directions in industries that require data-based future industries such as Industry 4.0, artificial intelligence, and Metaverse.

Secure Authentication Protocol in Hadoop Distributed File System based on Hash Chain (해쉬 체인 기반의 안전한 하둡 분산 파일 시스템 인증 프로토콜)

  • Jeong, So Won;Kim, Kee Sung;Jeong, Ik Rae
    • Journal of the Korea Institute of Information Security & Cryptology
    • /
    • v.23 no.5
    • /
    • pp.831-847
    • /
    • 2013
  • The various types of data are being created in large quantities resulting from the spread of social media and the mobile popularization. Many companies want to obtain valuable business information through the analysis of these large data. As a result, it is a trend to integrate the big data technologies into the company work. Especially, Hadoop is regarded as the most representative big data technology due to its terabytes of storage capacity, inexpensive construction cost, and fast data processing speed. However, the authentication token system of Hadoop Distributed File System(HDFS) for the user authentication is currently vulnerable to the replay attack and the datanode hacking attack. This can cause that the company secrets or the personal information of customers on HDFS are exposed. In this paper, we analyze the possible security threats to HDFS when tokens or datanodes are exposed to the attackers. Finally, we propose the secure authentication protocol in HDFS based on hash chain.

Analysis of the Trends of Construction Technology Development based on Big Data - Focused on Construction Patents in Relation to the 4th Industrial Revolution ICT Technologies - (빅데이터 기반의 건설기술 개발 트렌드 분석에 관한 연구 - 4차 산업혁명 ICT 기술 관련 건설특허를 중심으로 -)

  • Han, Jae Hoon;Kim, Han Soo
    • Korean Journal of Construction Engineering and Management
    • /
    • v.18 no.5
    • /
    • pp.20-31
    • /
    • 2017
  • As global interests in the 4th Industrial Revolution have recently increased, it becomes critical for the construction industry to pro-actively cope with it. For effective actions, the construction industry needs to make active use of 4th Industrial Revolution technologies based on the up-to-date understanding of the trends of construction technology development employing the 4th Industrial Revolution technologies. The objective of the study is to investigate and identify key trends of ICT construction technology development over the last ten years based on Big Data Analytics. The study identifies eleven key trends and discusses that ICT construction technology development has not been as active as expected and software technologies have been less developed compared to hardware technologies.