• Title/Summary/Keyword: Big data analysis tool

Search Result 158, Processing Time 0.03 seconds

A study on the efficient patent search process using big data analysis tool R (빅데이터 분석 도구 R을 활용한 효율적인 특허 검색에 관한 연구)

  • Zhang, Jing-Lun;Jang, Jung-Hwan;Kim, Suk-Ju;Lee, Hyun-Keun;Lee, Chang-Ho
    • Journal of the Korea Safety Management & Science
    • /
    • v.15 no.4
    • /
    • pp.289-294
    • /
    • 2013
  • Due to sudden transition to intellectual society corresponding with fast technology progress, companies and nations need to focus on development and guarantee of intellectual property. The possession of intellectual property has been the important factor of competition power. In this paper we developed the efficient patent search process with big data analysis tool R. This patent search process consists of 5 steps. We result that at first this process obtain the core patent search key words and search the target patents through search formula using the combination of above patent search key words.

Comparison study of SARIMA and ARGO models for in influenza epidemics prediction

  • Jung, Jihoon;Lee, Sangyeol
    • Journal of the Korean Data and Information Science Society
    • /
    • v.27 no.4
    • /
    • pp.1075-1081
    • /
    • 2016
  • The big data analysis has received much attention from the researchers working in various fields because the big data has a great potential in detecting or predicting future events such as epidemic outbreaks and changes in stock prices. Reflecting the current popularity of big data analysis, many authors have proposed methods tracking influenza epidemics based on internet-based information. The recently proposed 'autoregressive model using Google (ARGO) model' (Yang et al., 2015) is one of those influenza tracking models that harness search queries from Google as well as the reports from the Centers for Disease Control (CDC), and appears to outperform the existing method such as 'Google Flu Trends (GFT)'. Although the ARGO predicts well the outbreaks of influenza, this study demonstrates that a classical seasonal autoregressive integrated moving average (SARIMA) model can outperform the ARGO. The SARIMA model incorporates more accurate seasonality of the past influenza activities and takes less input variables into account. Our findings show that the SARIMA model is a functional tool for monitoring influenza epidemics.

Distributed Processing of Big Data Analysis based on R using SparkR (SparkR을 이용한 R 기반 빅데이터 분석의 분산 처리)

  • Ryu, Woo-Seok
    • The Journal of the Korea institute of electronic communication sciences
    • /
    • v.17 no.1
    • /
    • pp.161-166
    • /
    • 2022
  • In this paper, we analyze the problems that occur when performing the big data analysis using R as a data analysis tool, and present the usefulness of the data analysis with SparkR which connects R and Spark to support distributed processing of big data effectively. First, we study the memory allocation problem of R which occurs when loading large amounts of data and performing operations, and the characteristics and programming environment of SparkR. And then, we perform the comparison analysis of the execution performance when linear regression analysis is performed in each environment. As a result of the analysis, it was shown that R can be used for data analysis through SparkR without additional language learning, and the code written in R can be effectively processed distributedly according to the increase in the number of nodes in the cluster.

Extracting of Interest Issues Related to Patient Medical Services for Small and Medium Hospital by SNS Big Data Text Mining and Social Networking (중소병원 환자의료서비스에 관한 관심 이슈 도출을 위한 SNS 빅 데이터 텍스트 마이닝과 사회적 연결망 적용)

  • Hwang, Sang Won
    • Korea Journal of Hospital Management
    • /
    • v.23 no.4
    • /
    • pp.26-39
    • /
    • 2018
  • Purposes: The purpose of this study is to analyze the issue of interest in patient medical service of small and medium hospitals using big data. Methods: The method of this study was implemented by data mining and social network using SNS big data. The analysis tool were extracted key keywords and analyzed correlation by using Textom, Ucinet6 and NetDraw program. Findings: In the results of frequency, the network-centered and closeness centrality analysis, It was shown that the government center is interested in the major explanations and evaluations of the technology, information, security, safety, cost and problems of small and medium hospitals, coping with infections, and actual involvement in bank settlement. And, were extracted care for disabilities such as pediatrics, dentistry, obstetrics and gynecology, dementia, nursing, the elderly, and rehabilitation. Practical Implications: Future studies will be more useful if analyzed the needs of customers for medical services in the metropolitan area and provinces may be different in the small and medium hospitals to be studied, further classification studies.

Analysis on Domestic Franchise Food Tech Interest by using Big Data

  • Hyun Seok Kim;Yang-Ja Bae;Munyeong Yun;Gi-Hwan Ryu
    • International Journal of Internet, Broadcasting and Communication
    • /
    • v.16 no.2
    • /
    • pp.179-184
    • /
    • 2024
  • Franchise are now a red ocean in Food industry and they need to find other options to appeal for their product, the uprising content, food tech. The franchises are working on R&D to help franchisees with the operations. Through this paper, we analyze the franchise interest on food tech and to help find the necessity of development for franchisees who are in needs with hand, not of human, but of technology. Using Textom, a big data analysis tool, "franchise" and "food tech" were selected as keywords, and search frequency information of Naver and Daum was collected for a year from 01 January, 2023 to 31 December, 2023, and data preprocessing was conducted based on this. For the suitability of the study and more accurate data, data not related to "food tech" was removed through the refining process, and similar keywords were grouped into the same keyword to perform analysis. As a result of the word refining process, a total of 10,049 words were derived, and among them, the top 50 keywords with the highest relevance and search frequency were selected and applied to this study. The top 50 keywords derived through word purification were subjected to TF-IDF analysis, visualization analysis using Ucinet6 and NetDraw programs, network analysis between keywords, and cluster analysis between each keyword through Concor analysis. By using big data analysis, it was found out that franchise do have interest on food tech. "technology", "franchise", "robots" showed many interests and keyword "R&D" showed that franchise are keen on developing food tech to seize competitiveness in Franchise Industry.

A Study of Consumer Perception on Fashion Show Using Big Data Analysis (빅데이터를 활용한 패션쇼에 대한 소비자 인식 연구)

  • Kim, Da Jeong;Lee, Seunghee
    • Journal of Fashion Business
    • /
    • v.23 no.3
    • /
    • pp.85-100
    • /
    • 2019
  • This study examines changes in consumer perceptions of fashion shows, which are critical elements in the apparel industry and a means to represent a brand's image and originality. For this purpose, big data in clothing marketing, text mining, semantic network analysis techniques were applied. This study aims to verify the effectiveness and significance of fashion shows in an effort to give directions for their future utilization. The study was conducted in two major stages. First, data collection with the key word, "fashion shows," was conducted across websites, including Naver and Daum between 2015 and 2018. The data collection period was divided into the first- and second-half periods. Next, Textom 3.0 was utilized for data refinement, text mining, and word clouding. The Ucinet 6.0 and NetDraw, were used for semantic network analysis, degree centrality, CONCOR analysis and also visualization. The level of interest in "models" was found to be the highest among the perception factors related to fashion shows in both periods. In the first-half period, the consumer interests focused on detailed visual stimulants such as model and clothing while in the second-half period, perceptions changed as the value of designers and brands were increasingly recognized over time. The findings of this study can be utilized as a tool to evaluate fashion shows, the apparel industry sectors, and the marketing methods. Additionally, it can also be used as a theoretical framework for big data analysis and as a basis of strategies and research in industrial developments.

Recommendation of tourist attractions based on Preferences using big data

  • KIM HYUN SEOK;Gi-hwan Ryu;kim im yeo-reum
    • International Journal of Advanced Culture Technology
    • /
    • v.11 no.3
    • /
    • pp.327-331
    • /
    • 2023
  • This paper proposes a tourist destination recommendation application that combines a chatbot and a recommendation system. The data to be entered into the chatbot was through big data on social media. Through TEXTOM, a total of 22,701 data were collected over a one-year period from January 2022 to January 2023. Non-terms that interfere with analysis were removed through the data purification process. Using refined data, network visualization and CONCOR analysis were used to identify the information users want to obtain about travel to Jeju Island, and categories for each cluster were organized. The content was intuitively organized so that even those who approached it for the first time could easily use it, reducing the difficulty of operating the application. In this paper, users can select their own preferences and receive information. In addition, a tool called a chatbot allows users to focus more on the process of acquiring information by gaining a sense of reality while operating the application. This suggests an application that can reach the purpose of the curator by affecting the user's desire to visit tourist attractions.

Keyword Analysis of Data Technology Using Big Data Technique (빅데이터 기법을 활용한 Data Technology의 키워드 분석)

  • Park, Sung-Uk
    • Journal of Korea Technology Innovation Society
    • /
    • v.22 no.2
    • /
    • pp.265-281
    • /
    • 2019
  • With the advent of the Internet-based economy, the dramatic changes in consumption patterns have been witnessed during the last decades. The seminal change has led by Data Technology, the integrated platform of mobile, online, offline and artificial intelligence, which remained unchallenged. In this paper, I use data analysis tool (TexTom) in order to articulate the definitfite notion of data technology from Internet sources. The data source is collected for last three years (November 2015 ~ November 2018) from Google and Naver. And I have derived several key keywords related to 'Data Technology'. As a result, it was found that the key keyword technologies of Big Data, O2O (Offline-to-Online), AI, IoT (Internet of things), and cloud computing are related to Data Technology. The results of this study can be used as useful information that can be referred to when the Data Technology age comes.

A Study on Big Data Based Method of Patient Care Analysis (빅데이터 기반 환자 간병 방법 분석 연구)

  • Park, Ji-Hun;Hwang, Seung-Yeon;Yun, Bum-Sik;Choe, Su-Gil;Lee, Don-Hee;Kim, Jeong-Joon;Moon, Jin-Yong;Park, Kyung-won
    • The Journal of the Institute of Internet, Broadcasting and Communication
    • /
    • v.20 no.3
    • /
    • pp.163-170
    • /
    • 2020
  • With the development of information and communication technologies, the growing volume of data is increasing exponentially, raising interest in big data. As technologies related to big data have developed, big data is being collected, stored, processed, analyzed, and utilized in many fields. Big data analytics in the health care sector, in particular, is receiving much attention because they can also have a huge social and economic impact. It is predicted that it will be able to use Big Data technology to analyze patients' diagnostic data and reduce the amount of money that is spent on simple hospital care. Therefore, in this thesis, patient data is analyzed to present to patients who are unable to go to the hospital or caregivers who do not have medical expertise with close care guidelines. First, the collected patient data is stored in HDFS and the data is processed and classified using R, a big data processing and analysis tool, in the Hadoop environment. Visualize to a web server using R Shiny, which is used to implement various functions of R on the web.

Big data text mining analysis to identify non-face-to-face education problems (비대면 교육 문제점 파악을 위한 빅데이터 텍스트 마이닝 분석)

  • Park, Sung Jae;Hwang, Ug-Sun
    • Korean Educational Research Journal
    • /
    • v.43 no.1
    • /
    • pp.1-27
    • /
    • 2022
  • As the COVID-19 virus became prevalent worldwide, non-face-to-face contact was implemented in various ways, and the education system also began to draw much attention due to rapid non-face-to-face contact. The purpose of this study is to analyze the direction of non-face-to-face education in line with the continuously changing educational environment to date. In this study, data were visualized using Textom and Ucinet6 analysis tool programs to collect social network big data with various opinions. As a result of the study, keywords related to "COVID-19" were dominant, and keywords with high frequency such as "article" and "news" existed. As a result of the analysis, various issues related to non-face-to-face education, such as network failures and security issues, were identified. After the analysis, the direction of the non-face-to-face education system was studied according to the growth of the education market and changes in the educational environment. In addition, there is a need to strengthen security and feedback on teaching methods in non-face-to-face education analyzed using big data.