• 제목/요약/키워드: Data Collecting

검색결과 2,247건 처리시간 0.027초

PAPG: Private Aggregation Scheme based on Privacy-preserving Gene in Wireless Sensor Networks

  • Zeng, Weini;Chen, Peng;Chen, Hairong;He, Shiming
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • 제10권9호
    • /
    • pp.4442-4466
    • /
    • 2016
  • This paper proposes a privacy-preserving aggregation scheme based on the designed P-Gene (PAPG) for sensor networks. The P-Gene is constructed using the designed erasable data-hiding technique. In this P-Gene, each sensory data item may be hidden by the collecting sensor node, thereby protecting the privacy of this data item. Thereafter, the hidden data can be directly reported to the cluster head that aggregates the data. The aggregation result can then be recovered from the hidden data in the cluster head. The designed P-Genes can protect the privacy of each data item without additional data exchange or encryption. Given the flexible generation of the P-Genes, the proposed PAPG scheme adapts to dynamically changing reporting nodes. Apart from its favorable resistance to data loss, the extensive analyses and simulations demonstrate how the PAPG scheme efficiently preserves privacy while consuming less communication and computational overheads.

남성 음성 triphone DB 구축에 관한 연구 (Dialogic Male Voice Triphone DB Construction)

  • 김유진;백상훈;한민수;정재호
    • 한국음향학회지
    • /
    • 제15권2호
    • /
    • pp.61-71
    • /
    • 1996
  • 본 논문에서는 음성합성을 위한 대화체(회화체) 음성의 triphone 단위 데이터베이스 구축에 대하여 보고한다. 특히 본 연구는 방송 매체를 이용하여 대화체 음성을 수집하고 3차에 걸친 대화체 표기(transcription)작업을 거쳐 triphone 단위의 분할 및 음성기호층 단계의 레이블링을 목표로 진행되었다. 수집된 총 10시간 방송분량중 6시간 분량을 데이터베이스 구축에 사용하였으며, 나머지 4시간은 예비 분으로 수집되었다. 낭독체 음성 데이터베이스 구축과는 여러 면에서 다른, 대화체 음성 데이터베이스 구축을 위한 음성 데이터 수집에서부터 triphone 단위 레이블링까지의 과정을 본 논문에서 기술하고, 보다 체계적이고 일관성있는 대화체 음성 데이터베이스 구축을 위해 필요한 계획 및 요구 사항에 대해서 논하고자 한다.

  • PDF

An Efficient Design and Implementation of an MdbULPS in a Cloud-Computing Environment

  • Kim, Myoungjin;Cui, Yun;Lee, Hanku
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • 제9권8호
    • /
    • pp.3182-3202
    • /
    • 2015
  • Flexibly expanding the storage capacity required to process a large amount of rapidly increasing unstructured log data is difficult in a conventional computing environment. In addition, implementing a log processing system providing features that categorize and analyze unstructured log data is extremely difficult. To overcome such limitations, we propose and design a MongoDB-based unstructured log processing system (MdbULPS) for collecting, categorizing, and analyzing log data generated from banks. The proposed system includes a Hadoop-based analysis module for reliable parallel-distributed processing of massive log data. Furthermore, because the Hadoop distributed file system (HDFS) stores data by generating replicas of collected log data in block units, the proposed system offers automatic system recovery against system failures and data loss. Finally, by establishing a distributed database using the NoSQL-based MongoDB, the proposed system provides methods of effectively processing unstructured log data. To evaluate the proposed system, we conducted three different performance tests on a local test bed including twelve nodes: comparing our system with a MySQL-based approach, comparing it with an Hbase-based approach, and changing the chunk size option. From the experiments, we found that our system showed better performance in processing unstructured log data.

주성분 분석을 이용한 지역기반의 날씨의 스트림 데이터 분석 (Stream Data Analysis of the Weather on the Location using Principal Component Analysis)

  • 김상엽;김광덕;배경호;류근호
    • 한국측량학회지
    • /
    • 제28권2호
    • /
    • pp.233-237
    • /
    • 2010
  • The recent advance of sensor networks and ubiquitous techniques allow collecting and analyzing of the data which overcome the limitation imposed by time and space in real-time for making decisions. Also, analysis and prediction of collected data can support useful and necessary information to users. The collected data in sensor networks environment is the stream data which has continuous, unlimited and sequential properties. Because of the continuous, unlimited and large volume properties of stream data, managing stream data is difficult. And the stream data needs dynamic processing method because of the memory constraint and access limitation. Accordingly, we analyze correlation stream data using principal component analysis. And using result of analysis, it helps users for making decisions.

국내 태양에너지 자원 데이터의 신뢰성 분석 (Reliability Analysis of Solar Radiation Resources Data in Korea)

  • 조덕기;윤창열;김광득;강용혁
    • 한국태양에너지학회:학술대회논문집
    • /
    • 한국태양에너지학회 2011년도 춘계학술발표대회 논문집
    • /
    • pp.63-67
    • /
    • 2011
  • KnowledgThe Korea Institute of Energy Research(KIER) has begun collecting horizontal global insolation data since May, 1982 at different locations. Because of a poor reliability of existing data, KIER's new data will be extensively used by the solar system users as well as by research institutes. But the quality of solar insolation data is not always good. This reports on an attempt to identify systematic error in such data using clear-day analysis for data rehabilitation. Clear-day analysis is successful in uncovering solar insolation data of questionable quality. It is not proven that rehabilitation process can improve the quality of data for daily or monthly means, but it is suggested that the method can be used to improve the quality of data for monthly means of several years for use in many applications of solar energy plarming. Earlier studies finding a maximum ETR of about 0.80 are confirmed.

  • PDF

국내 수평면 전일사량 데이터의 정확도 평가에 관한 연구 (A Study on Accuracy Evaluation of Horizontal Global Radiation Data in Korea)

  • 조덕기;전일수;이태규
    • 태양에너지
    • /
    • 제20권1호
    • /
    • pp.31-43
    • /
    • 2000
  • The Korea Institute of Energy Research(KIER) has been collecting horizontal global radiation data since May, 1982 for 16 different locations. KIER's new data is expected to be extensively used by designer and researchers of solar systems in lieu of unreliable old ones. Unfortunately, the quality of the data has not always been properly mentioned. Some of them were taken at temporary field stations where the primary goal of the measurement was quick estimation of local solar radiation. The purpose of this study is to systematically identify errors in such data set using clear-day analysis in an effort to rehabilitate error-ridden old data. Clear-day analysis successfully uncovered solar radiation data that had questionable quality. Even through the rehabilitation process not necessarily improves the quality of data for daily or monthly mean, it can be used to improve the quality of data for monthly means of several years and the processed data can be used in various applications of solar energy with more confidence. A average ETR value of 0.63 obtained in this study is in good agreement with previous results obtained by other researchers.

  • PDF

Big Data Analysis of the Women Who Score Goal Sports Entertainment Program: Focusing on Text Mining and Semantic Network Analysis.

  • Hyun-Myung, Kim;Kyung-Won, Byun
    • International Journal of Internet, Broadcasting and Communication
    • /
    • 제15권1호
    • /
    • pp.222-230
    • /
    • 2023
  • The purpose of this study is to provide basic data on sports entertainment programs by collecting data on unstructured data generated by Naver and Google for SBS entertainment program 'Women Who Score Goal', which began regular broadcast in June 2021, and analyzing public perceptions through data mining, semantic matrix, and CONCOR analysis. Data collection was conducted using Textom, and 27,911 cases of data accumulated for 16 months from June 16, 2021 to October 15, 2022. For the collected data, 80 key keywords related to 'Kick a Goal' were derived through simple frequency and TF-IDF analysis through data mining. Semantic network analysis was conducted to analyze the relationship between the top 80 keywords analyzed through this process. The centrality was derived through the UCINET 6.0 program using NetDraw of UCINET 6.0, understanding the characteristics of the network, and visualizing the connection relationship between keywords to express it clearly. CONCOR analysis was conducted to derive a cluster of words with similar characteristics based on the semantic network. As a result of the analysis, it was analyzed as a 'program' cluster related to the broadcast content of 'Kick a Goal' and a 'Soccer' cluster, a sports event of 'Kick a Goal'. In addition to the scenes about the game of the cast, it was analyzed as an 'Everyday Life' cluster about training and daily life, and a cluster about 'Broadcast Manipulation' that disappointed viewers with manipulation of the game content.

Learning Method for Real-time Crime Prediction Model Utilizing CCTV

  • Bang, Seung-Hwan;Cho, Hyun-Bo
    • 한국컴퓨터정보학회논문지
    • /
    • 제21권5호
    • /
    • pp.91-98
    • /
    • 2016
  • We propose a method to train a model that can predict the probability of a crime being committed. CCTV data by matching criminal events are required to train the crime prediction model. However, collecting CCTV data appropriate for training is difficult. Thus, we collected actual criminal records and converted them to an appropriate format using variables by considering a crime prediction environment and the availability of real-time data collection from CCTV. In addition, we identified new specific crime types according to the characteristics of criminal events and trained and tested the prediction model by applying neural network partial least squares for each crime type. Results show a level of predictive accuracy sufficiently significant to demonstrate the applicability of CCTV to real-time crime prediction.

병원전 의료지도 개선방안 (Improvement Strategies for Prehospital Medical Direction in Korea)

  • 엄태환
    • 한국응급구조학회지
    • /
    • 제11권3호
    • /
    • pp.111-118
    • /
    • 2007
  • Purpose : It was to present strategies on activation of prehospital medical direction in Korea. Methods : This study was conducted by analysing some papers on prehospital medical direction and statistical data from the National Emergency Management Agency. Results : There was no active application of medical direction methods such as Priority Dispatch System, Pre-Arrival Instructions, System Status Management and no data on prehospital medical direction. To estimate direct medical control on emergency patients who were sorted by EMTs in 2006 was only 2.5%. Conclusion : To improve prehospital medical direction, it needed to applicate data collecting & using system and in-direct & direct medical control by medical doctor.

  • PDF

Study on Incident Detection System Using Fuzzy Logic

  • Kim, Intaek;Lee, Eunggi
    • 한국지능시스템학회:학술대회논문집
    • /
    • 한국퍼지및지능시스템학회 1998년도 The Third Asian Fuzzy Systems Symposium
    • /
    • pp.268-271
    • /
    • 1998
  • this paper presents the potential application of fuzzy logic to the automatic incident detection system. While the conventional incident detection algorithms are based on a binary decision process, the algorithm using fuzzy logic can incorporate ambiguity which occurs in determining incidents. Since collecting good amount of data to construct data base for incidents is pretty expensive, a traffic simulator called FRESIM is used to simulate traffic condition in a freeway. Incident data are obtained by changing input parameters of the simulator and the fuzzy algorithm generates fuzzy rule for determining normal and incident traffic conditions. In this paper, various steps are described to test the algorithm and its results are summarized.

  • PDF