• 제목/요약/키워드: Data Collecting

검색결과 2,211건 처리시간 0.025초

An Efficient Design and Implementation of an MdbULPS in a Cloud-Computing Environment

  • Kim, Myoungjin;Cui, Yun;Lee, Hanku
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • 제9권8호
    • /
    • pp.3182-3202
    • /
    • 2015
  • Flexibly expanding the storage capacity required to process a large amount of rapidly increasing unstructured log data is difficult in a conventional computing environment. In addition, implementing a log processing system providing features that categorize and analyze unstructured log data is extremely difficult. To overcome such limitations, we propose and design a MongoDB-based unstructured log processing system (MdbULPS) for collecting, categorizing, and analyzing log data generated from banks. The proposed system includes a Hadoop-based analysis module for reliable parallel-distributed processing of massive log data. Furthermore, because the Hadoop distributed file system (HDFS) stores data by generating replicas of collected log data in block units, the proposed system offers automatic system recovery against system failures and data loss. Finally, by establishing a distributed database using the NoSQL-based MongoDB, the proposed system provides methods of effectively processing unstructured log data. To evaluate the proposed system, we conducted three different performance tests on a local test bed including twelve nodes: comparing our system with a MySQL-based approach, comparing it with an Hbase-based approach, and changing the chunk size option. From the experiments, we found that our system showed better performance in processing unstructured log data.

주성분 분석을 이용한 지역기반의 날씨의 스트림 데이터 분석 (Stream Data Analysis of the Weather on the Location using Principal Component Analysis)

  • 김상엽;김광덕;배경호;류근호
    • 한국측량학회지
    • /
    • 제28권2호
    • /
    • pp.233-237
    • /
    • 2010
  • The recent advance of sensor networks and ubiquitous techniques allow collecting and analyzing of the data which overcome the limitation imposed by time and space in real-time for making decisions. Also, analysis and prediction of collected data can support useful and necessary information to users. The collected data in sensor networks environment is the stream data which has continuous, unlimited and sequential properties. Because of the continuous, unlimited and large volume properties of stream data, managing stream data is difficult. And the stream data needs dynamic processing method because of the memory constraint and access limitation. Accordingly, we analyze correlation stream data using principal component analysis. And using result of analysis, it helps users for making decisions.

국내 태양에너지 자원 데이터의 신뢰성 분석 (Reliability Analysis of Solar Radiation Resources Data in Korea)

  • 조덕기;윤창열;김광득;강용혁
    • 한국태양에너지학회:학술대회논문집
    • /
    • 한국태양에너지학회 2011년도 춘계학술발표대회 논문집
    • /
    • pp.63-67
    • /
    • 2011
  • KnowledgThe Korea Institute of Energy Research(KIER) has begun collecting horizontal global insolation data since May, 1982 at different locations. Because of a poor reliability of existing data, KIER's new data will be extensively used by the solar system users as well as by research institutes. But the quality of solar insolation data is not always good. This reports on an attempt to identify systematic error in such data using clear-day analysis for data rehabilitation. Clear-day analysis is successful in uncovering solar insolation data of questionable quality. It is not proven that rehabilitation process can improve the quality of data for daily or monthly means, but it is suggested that the method can be used to improve the quality of data for monthly means of several years for use in many applications of solar energy plarming. Earlier studies finding a maximum ETR of about 0.80 are confirmed.

  • PDF

국내 수평면 전일사량 데이터의 정확도 평가에 관한 연구 (A Study on Accuracy Evaluation of Horizontal Global Radiation Data in Korea)

  • 조덕기;전일수;이태규
    • 태양에너지
    • /
    • 제20권1호
    • /
    • pp.31-43
    • /
    • 2000
  • The Korea Institute of Energy Research(KIER) has been collecting horizontal global radiation data since May, 1982 for 16 different locations. KIER's new data is expected to be extensively used by designer and researchers of solar systems in lieu of unreliable old ones. Unfortunately, the quality of the data has not always been properly mentioned. Some of them were taken at temporary field stations where the primary goal of the measurement was quick estimation of local solar radiation. The purpose of this study is to systematically identify errors in such data set using clear-day analysis in an effort to rehabilitate error-ridden old data. Clear-day analysis successfully uncovered solar radiation data that had questionable quality. Even through the rehabilitation process not necessarily improves the quality of data for daily or monthly mean, it can be used to improve the quality of data for monthly means of several years and the processed data can be used in various applications of solar energy with more confidence. A average ETR value of 0.63 obtained in this study is in good agreement with previous results obtained by other researchers.

  • PDF

Big Data Analysis of the Women Who Score Goal Sports Entertainment Program: Focusing on Text Mining and Semantic Network Analysis.

  • Hyun-Myung, Kim;Kyung-Won, Byun
    • International Journal of Internet, Broadcasting and Communication
    • /
    • 제15권1호
    • /
    • pp.222-230
    • /
    • 2023
  • The purpose of this study is to provide basic data on sports entertainment programs by collecting data on unstructured data generated by Naver and Google for SBS entertainment program 'Women Who Score Goal', which began regular broadcast in June 2021, and analyzing public perceptions through data mining, semantic matrix, and CONCOR analysis. Data collection was conducted using Textom, and 27,911 cases of data accumulated for 16 months from June 16, 2021 to October 15, 2022. For the collected data, 80 key keywords related to 'Kick a Goal' were derived through simple frequency and TF-IDF analysis through data mining. Semantic network analysis was conducted to analyze the relationship between the top 80 keywords analyzed through this process. The centrality was derived through the UCINET 6.0 program using NetDraw of UCINET 6.0, understanding the characteristics of the network, and visualizing the connection relationship between keywords to express it clearly. CONCOR analysis was conducted to derive a cluster of words with similar characteristics based on the semantic network. As a result of the analysis, it was analyzed as a 'program' cluster related to the broadcast content of 'Kick a Goal' and a 'Soccer' cluster, a sports event of 'Kick a Goal'. In addition to the scenes about the game of the cast, it was analyzed as an 'Everyday Life' cluster about training and daily life, and a cluster about 'Broadcast Manipulation' that disappointed viewers with manipulation of the game content.

Learning Method for Real-time Crime Prediction Model Utilizing CCTV

  • Bang, Seung-Hwan;Cho, Hyun-Bo
    • 한국컴퓨터정보학회논문지
    • /
    • 제21권5호
    • /
    • pp.91-98
    • /
    • 2016
  • We propose a method to train a model that can predict the probability of a crime being committed. CCTV data by matching criminal events are required to train the crime prediction model. However, collecting CCTV data appropriate for training is difficult. Thus, we collected actual criminal records and converted them to an appropriate format using variables by considering a crime prediction environment and the availability of real-time data collection from CCTV. In addition, we identified new specific crime types according to the characteristics of criminal events and trained and tested the prediction model by applying neural network partial least squares for each crime type. Results show a level of predictive accuracy sufficiently significant to demonstrate the applicability of CCTV to real-time crime prediction.

병원전 의료지도 개선방안 (Improvement Strategies for Prehospital Medical Direction in Korea)

  • 엄태환
    • 한국응급구조학회지
    • /
    • 제11권3호
    • /
    • pp.111-118
    • /
    • 2007
  • Purpose : It was to present strategies on activation of prehospital medical direction in Korea. Methods : This study was conducted by analysing some papers on prehospital medical direction and statistical data from the National Emergency Management Agency. Results : There was no active application of medical direction methods such as Priority Dispatch System, Pre-Arrival Instructions, System Status Management and no data on prehospital medical direction. To estimate direct medical control on emergency patients who were sorted by EMTs in 2006 was only 2.5%. Conclusion : To improve prehospital medical direction, it needed to applicate data collecting & using system and in-direct & direct medical control by medical doctor.

  • PDF

Study on Incident Detection System Using Fuzzy Logic

  • Kim, Intaek;Lee, Eunggi
    • 한국지능시스템학회:학술대회논문집
    • /
    • 한국퍼지및지능시스템학회 1998년도 The Third Asian Fuzzy Systems Symposium
    • /
    • pp.268-271
    • /
    • 1998
  • this paper presents the potential application of fuzzy logic to the automatic incident detection system. While the conventional incident detection algorithms are based on a binary decision process, the algorithm using fuzzy logic can incorporate ambiguity which occurs in determining incidents. Since collecting good amount of data to construct data base for incidents is pretty expensive, a traffic simulator called FRESIM is used to simulate traffic condition in a freeway. Incident data are obtained by changing input parameters of the simulator and the fuzzy algorithm generates fuzzy rule for determining normal and incident traffic conditions. In this paper, various steps are described to test the algorithm and its results are summarized.

  • PDF

태양복사 측정에 의한 주요 도시의 Global Dimming 현상분석 (Analysis of Global Dimming Appearances Using the Solar Radiation Measurement in Korean Major Cities)

  • 조덕기;강용혁
    • 한국신재생에너지학회:학술대회논문집
    • /
    • 한국신재생에너지학회 2008년도 춘계학술대회 논문집
    • /
    • pp.146-149
    • /
    • 2008
  • Since the atmospheric clearness index is main factor for evaluating global-dimming of atmosphere environment, it is necessary to estimate its characteristics all over the major cities in Korea. We have begun collecting clearness index data since 1982 at 16 different cities and considerable effort has been made for constructing a standard value from measured data at each city. The new clearness data for global-dimming analysis will be extensively used by evaluating atmospheric environment as well as by solar PV application system designer or users. From the results, we can conclude that 1) Yearly mean 61.9 % of the atmospheric clearness index was evaluated for clear day all over 2) A significant difference of atmospheric clearness index is observed between 1982-1989 and1990-1997, 1998-2005 through 16 different cities in Korea.

  • PDF

농촌계획지원용 지역자원평가시스템 구축(IV) - 사례지역 적용연구 - (Resources Evaluation System for Rural Planning Purposes(IV) - Application Study to the Case Areas -)

  • 최수명;한경수;황한철
    • 한국농공학회:학술대회논문집
    • /
    • 한국농공학회 1998년도 학술발표회 발표논문집
    • /
    • pp.198-203
    • /
    • 1998
  • This study, a sub-one of comprehensive research works titled under “Rural Resources Evaluation System”, tried to verify utility/applicability of the developed model system through the case study works on 3 sample villages, Backya, Uyan and Suyu, representing the lowland, upland and seashore villages respectively. From the various surveying and collecting works including the official/statistical data collection, map analysis, insitu investigation, field survey and written material review, the original data set were obtained and manipulated into final input data for resources grading. After then, by the automatized calculation procedure of “Rural Resources Evaluation System”, score results for resources evaluation were finally produced with the total maximum score being 1,000. Through comparing works among score results of 3 case villages and between score results and areal characteristics of each case village, the applicability of the system developed in this study was well confirmed.

  • PDF