• Title/Summary/Keyword: Unstructured data analysis

Search Result 426, Processing Time 0.029 seconds

Analyzing the Relevancy of Policy by Abnormal Pattern Analysis : Focused on the Case of S-City's e-Card for Child Meal Support (이상 패턴 분석을 통한 정책의 적합성 분석 연구 : S 시의 아동 급식 전자 카드 사례를 중심으로)

  • Jeon, Jongshik;Kwon, Ohbyung
    • Journal of Information Technology Services
    • /
    • v.17 no.1
    • /
    • pp.135-153
    • /
    • 2018
  • E-Card Service for Child Nutrition Program is one of the main public policy services nowadays. In case of inconvenience during the use of the e-cards, it is recommended to cooperate with related organizations in order to promptly handle and provide guidance, and thoroughly manage child feeding service such as hygiene, nutrition and kindness etc. To do so, it is very important to provide food service that meets local actual conditions and children's needs in a cost effective manner for the underage who are worried about the poorly-fed by understanding the pattern of child feeding e-card service. Hence. this paper aims to investigate how child feeding e-card service efficiently provides meals according to the local situation and children's needs through big data analysis and to propose a method of identifying welfare conditions according to the purpose of service with actual application examples. The results suggest that, first of all, this study is able to judge appropriateness of public institution's policy in a timely and repetitive manner through non-standard data analysis such as Naver News and transaction data. Secondly, this paper proposes a multi-layered analysis framework, which performs online open data analysis to detect policy issues, visualizes retrieval and preprocessing of real data, and performs abnormal pattern recognition. These will be worthy of reference to other similar projects.

A Qualitative Study on Breast Cancer Survivors' Experiences (유방암 생존자의 질병 극복 경험)

  • Yun, Mira;Song, Misoon
    • Perspectives in Nursing Science
    • /
    • v.10 no.1
    • /
    • pp.41-51
    • /
    • 2013
  • Purpose: This study was performed to understand the characteristics and the meaning of the illness experience of breast cancer survivors as basic data for the development of an intervention program. Methods: The participants were 25 breast cancer survivors who had completed treatment at a tertiary hospital in Seoul. Data were collected through in-depth and unstructured audio-recorded interviews by the investigator. The participants were asked to describe their illness experience. The data were analyzed according to Giorgi's method for phenomenological analysis. Results: The interview data were organized by theme into 6 categories that emerged from the analysis. The themes were acceptance of the illness, active coping with reality, gaining strength through the support of surrounding people, struggling to overcome a negative mindset, self-reflection, and the pursuit of a meaningful new life. Conclusion: We recommend the development of a survivorship program based on self-reflection, which can engender self-transcendence and spiritual well-being.

  • PDF

A Study on the Cultural and Technical Influence Factor Using Unstructured Data Analysis (비정형 데이터 분석을 이용한 수원 화성의 문화·기술적 영향요인 연구)

  • Park, Eun Soo;Kim, Ji Eun
    • Korea Science and Art Forum
    • /
    • v.20
    • /
    • pp.227-241
    • /
    • 2015
  • As time is rapidly changing, the culture to represent an era is getting more subdivided and complex. Due to cultural diversity, the influence, cause, characteristics which could be understood in individual field centered by space in the past cannot be understood now only by the viewpoint of one field, and it has become difficult to predict and correspond to the change of the future. With the development of information and knowledge delivery system, various cultural contents to form a space are being created and lapsed, but there are a lot of parts which cannot be explained or understood by only one point of view. To inspect these situation, this study is aimed to draw the cultural and technical causes that became the influence with Suwon Hwaseong, a traditional space with historical superiority, analyze the key factors that became the main factor to form the space, and consider the importance of the related factors. Suwon Hwaseong is a new town formed by the order of King Jeongjo. Suwon Hwaseong at that time was a space with the will and effort of many people who dreamed a new era, and it has a meaning of varoius time ans space as historical facts and cultural values as well as the progress and development of scientific technology. The unstructured data technique which is applied as the method of analysis in this study can be said to be a new value judgement and viewpoint in interpreting the space. Therefore, this study is a new trial to provide a frame for multilaterally interpreting the various traditional space and culture of Korea from the past to the present.

An Extraction Method of Sentiment Infromation from Unstructed Big Data on SNS (SNS상의 비정형 빅데이터로부터 감성정보 추출 기법)

  • Back, Bong-Hyun;Ha, Ilkyu;Ahn, ByoungChul
    • Journal of Korea Multimedia Society
    • /
    • v.17 no.6
    • /
    • pp.671-680
    • /
    • 2014
  • Recently, with the remarkable increase of social network services, it is necessary to extract interesting information from lots of data about various individual opinions and preferences on SNS(Social Network Service). The sentiment information can be applied to various fields of society such as politics, public opinions, economics, personal services and entertainments. To extract sentiment information, it is necessary to use processing techniques that store a large amount of SNS data, extract meaningful data from them, and search the sentiment information. This paper proposes an efficient method to extract sentiment information from various unstructured big data on social networks using HDFS(Hadoop Distributed File System) platform and MapReduce functions. In experiments, the proposed method collects and stacks data steadily as the number of data is increased. When the proposed functions are applied to sentiment analysis, the system keeps load balancing and the analysis results are very close to the results of manual work.

Agriculture Big Data Analysis System Based on Korean Market Information

  • Chuluunsaikhan, Tserenpurev;Song, Jin-Hyun;Yoo, Kwan-Hee;Rah, Hyung-Chul;Nasridinov, Aziz
    • Journal of Multimedia Information System
    • /
    • v.6 no.4
    • /
    • pp.217-224
    • /
    • 2019
  • As the world's population grows, how to maintain the food supply is becoming a bigger problem. Now and in the future, big data will play a major role in decision making in the agriculture industry. The challenge is how to obtain valuable information to help us make future decisions. Big data helps us to see history clearer, to obtain hidden values, and make the right decisions for the government and farmers. To contribute to solving this challenge, we developed the Agriculture Big Data Analysis System. The system consists of agricultural big data collection, big data analysis, and big data visualization. First, we collected structured data like price, climate, yield, etc., and unstructured data, such as news, blogs, TV programs, etc. Using the data that we collected, we implement prediction algorithms like ARIMA, Decision Tree, LDA, and LSTM to show the results in data visualizations.

A Study on FIFA Partner Adidas of 2022 Qatar World Cup Using Big Data Analysis

  • Kyung-Won, Byun
    • International Journal of Internet, Broadcasting and Communication
    • /
    • v.15 no.1
    • /
    • pp.164-170
    • /
    • 2023
  • The purpose of this study is to analyze the big data of Adidas brand participating in the Qatar World Cup in 2022 as a FIFA partner to understand useful information, semantic connection and context from unstructured data. Therefore, this study collected big data generated during the World Cup from Adidas participating in sponsorship as a FIFA partner for the 2022 Qatar World Cup and collected data from major portal sites to understand its meaning. According to text mining analysis, 'Adidas' was used the most 3,340 times based on the frequency of keyword appearance, followed by 'World Cup', 'Qatar World Cup', 'Soccer', 'Lionel Messi', 'Qatar', 'FIFA', 'Korea', and 'Uniform'. In addition, the TF-IDF rankings were 'Qatar World Cup', 'Soccer', 'Lionel Messi', 'World Cup', 'Uniform', 'Qatar', 'FIFA', 'Ronaldo', 'Korea', and 'Nike'. As a result of semantic network analysis and CONCOR analysis, four groups were formed. First, Cluster A named it 'Qatar World Cup Sponsor' as words such as 'Adidas', 'Nike', 'Qatar World Cup', 'Sponsor', 'Sponsor Company', 'Marketing', 'Nation', 'Launch', 'Official', 'Commemoration' and 'National Team' were formed into groups. Second, B Cluster named it 'Group stage' as words such as 'Qatar', 'Uruguay', 'FIFA' and 'group stage' were formed into groups. Third, C Cluster named it 'Winning' as words such as 'World Cup Winning', 'Champion', 'France', 'Argentina', 'Lionel Messi', 'Advertising' and 'Photograph' formed a group. Fourth, D Cluster named it 'Official Ball' as words such as 'Official Ball', 'World Cup Official Ball', 'Soccer Ball', 'All Times', 'Al Rihla', 'Public', 'Technology' was formed into groups.

Domain Adaptation for Opinion Classification: A Self-Training Approach

  • Yu, Ning
    • Journal of Information Science Theory and Practice
    • /
    • v.1 no.1
    • /
    • pp.10-26
    • /
    • 2013
  • Domain transfer is a widely recognized problem for machine learning algorithms because models built upon one data domain generally do not perform well in another data domain. This is especially a challenge for tasks such as opinion classification, which often has to deal with insufficient quantities of labeled data. This study investigates the feasibility of self-training in dealing with the domain transfer problem in opinion classification via leveraging labeled data in non-target data domain(s) and unlabeled data in the target-domain. Specifically, self-training is evaluated for effectiveness in sparse data situations and feasibility for domain adaptation in opinion classification. Three types of Web content are tested: edited news articles, semi-structured movie reviews, and the informal and unstructured content of the blogosphere. Findings of this study suggest that, when there are limited labeled data, self-training is a promising approach for opinion classification, although the contributions vary across data domains. Significant improvement was demonstrated for the most challenging data domain-the blogosphere-when a domain transfer-based self-training strategy was implemented.

Words Recommendation Algorithm for Similarity Connection based on Data Transmutability (데이터 변형성 기반 유사성 연결을 위한 단어 추천 알고리즘)

  • Kim, Boon-Hee
    • The Journal of the Korea institute of electronic communication sciences
    • /
    • v.8 no.11
    • /
    • pp.1719-1724
    • /
    • 2013
  • Big data which requires a different approach from existing data processing methods, is unstructured data with a variety of features. The features mean the volume of data, the rate of change of the data, the data with a variety of features. Tweets of twitter in only Korea are more than 5 millions per day. So much cheaper data storage and analysis system due to the increasing demand for information, the value of research is increasing. In this paper, the technology required by the deformation characteristics of the data elements as a technology priority-based word-based recommendation algorithm is proposed.

A Study on Distributed Processing of Big Data and User Authentication for Human-friendly Robot Service on Smartphone (인간 친화적 로봇 서비스를 위한 대용량 분산 처리 기술 및 사용자 인증에 관한 연구)

  • Choi, Okkyung;Jung, Wooyeol;Lee, Bong Gyou;Moon, Seungbin
    • Journal of Internet Computing and Services
    • /
    • v.15 no.1
    • /
    • pp.55-61
    • /
    • 2014
  • Various human-friendly robot services have been developed and mobile cloud computing is a real time computing service that allows users to rent IT resources what they want over the internet and has become the new-generation computing paradigm of information society. The enterprises and nations are actively underway of the business process using mobile cloud computing and they are aware of need for implementing mobile cloud computing to their business practice, but it has some week points such as authentication services and distributed processing technologies of big data. Sometimes it is difficult to clarify the objective of cloud computing service. In this study, the vulnerability of authentication services on mobile cloud computing is analyzed and mobile cloud computing model is constructed for efficient and safe business process. We will also be able to study how to process and analyze unstructured data in parallel to this model, so that in the future, providing customized information for individuals may be possible using unstructured data.

Analysis of the Utilization of Mobile Applications by Generation Z using Topic Modeling :Focusing on Users' Essay Data (토픽모델링을 활용한 Z세대의 애플리케이션 효용성에 대한 분석: 이용자의 에세이 데이터를 중심으로)

  • Park, Ju-Yeon;Jeong, Do-Heon
    • Journal of Industrial Convergence
    • /
    • v.20 no.1
    • /
    • pp.43-51
    • /
    • 2022
  • The purpose of this study is to provide basic information necessary for the establishment of mobile service marketing strategies, educational service development, and engineering education for Generation Z by analyzing the utilitization of various applications by Gen Z. To this end, 177 essays on mobile service usage experience were collected, major topics were analyzed using topic modeling, and these were visualized through word cloud analysis. As a result of the study, the main topics were related to 'transportation' such as movement and public transportation, 'personal management' such as schedule management, financial management, food management, 'transaction' such as checkout, meeting, purchase, 'leisure' such as eating out, travel, study, culture. Additionally, words such as time, thought, people, life, bus, information, confirmation, payment, KakaoTalk, and so on were found to have a high of frequency of use. Also, there was found to be a difference between topics by college. This study is meaningful in that it collected essays, which are unstructured data, and analyzed them through topic modeling.