• Title/Summary/Keyword: Data Paper

Search Result 56,265, Processing Time 0.07 seconds

Contour Plot to Explore the Structure of Categorical Data

  • Kim, Hyun Chul;Huh, Moon Yul;Chung, Hee Suk
    • Communications for Statistical Applications and Methods
    • /
    • v.10 no.2
    • /
    • pp.371-385
    • /
    • 2003
  • In this paper, contour plot is considered as a method to explore the structure of categorical data. For this purpose, the paper suggests a method to sort two-way contingency table with respect to the expected marginals. It is found that the suggested plot provides us with valuable information for the underlying data structure. Firstly, we can investigate independency between the categories by examining the differences of expected frequency contours and observed frequency contours. With the plot, we can also visually investigate the existence of outliers inherent in the data. These properties of the suggested contour plot will be demonstrated by several sets of real data.

Improvement of Control Performance by Data Fusion of Sensors

  • Na, Seung-You;Shin, Dae-Jung
    • International Journal of Fuzzy Logic and Intelligent Systems
    • /
    • v.4 no.1
    • /
    • pp.63-69
    • /
    • 2004
  • In this paper, we propose a general framework for sensor data fusion applied to control systems. Since many kinds of disturbances are introduced to a control system, it is necessary to rely on multisensor data fusion to improve control performance in spite of the disturbances. Multisensor data fusion for a control system is considered a sequence of making decisions for a combination of sensor data to make a proper control input in uncertain conditions of disturbance effects on sensors. The proposed method is applied to a typical control system of a flexible link system in which reduction of oscillation is obtained using a photo sensor at the tip of the link. But the control performance depends heavily on the environmental light conditions. To overcome the light disturbance difficulties, an accelerometer is used in addition to the existing photo sensor. Improvement of control performance is possible by utilizing multisensor data fusion for various output responses to show the feasibility of the proposed method in this paper.

Bayesian pooling for contingency tables from small areas

  • Jo, Aejung;Kim, Dal Ho
    • Journal of the Korean Data and Information Science Society
    • /
    • v.27 no.6
    • /
    • pp.1621-1629
    • /
    • 2016
  • This paper studies Bayesian pooling for analysis of categorical data from small areas. Many surveys consist of categorical data collected on a contingency table in each area. Statistical inference for small areas requires considerable care because the subpopulation sample sizes are usually very small. Typically we use the hierarchical Bayesian model for pooling subpopulation data. However, the customary hierarchical Bayesian models may specify more exchangeability than warranted. We, therefore, investigate the effects of pooling in hierarchical Bayesian modeling for the contingency table from small areas. In specific, this paper focuses on the methods of direct or indirect pooling of categorical data collected on a contingency table in each area through Dirichlet priors. We compare the pooling effects of hierarchical Bayesian models by fitting the simulated data. The analysis is carried out using Markov chain Monte Carlo methods.

A Spatial Structural Query Language-G/SQL

  • Fang, Yu;Chu, Fang;Xinming, Tang
    • Proceedings of the KSRS Conference
    • /
    • 2002.10a
    • /
    • pp.860-879
    • /
    • 2002
  • Traditionally, Geographical Information Systems can only process spatial data in a procedure-oriented way, and the data can't be treated integrally. This method limits the development of spatial data applications. A new and promising method to solve this problem is the spatial structural query language, which extends SQL and provides integrated accessing to spatial data. In this paper, the theory of spatial structural query language is discussed, and a new geographical data model based on the concepts and data model in OGIS is introduced. According to this model, we implemented a spatial structural query language G/SQL. Through the studies of the 9-Intersection Model, G/SQL provides a set of topological relational predicates and spatial functions for GIS application development. We have successfully developed a Web-based GIS system-WebGIS-using G/SQL. Experiences show that the spatial operators G/SQL offered are complete and easy-to-use. The BNF representation of G/SQL syntax is included in this paper.

  • PDF

A Study on the Data Visualization for Real Time Power System Operation (실시간 전력계통 운영을 위한 데이터 시각화에 관한 연구)

  • Chog, Yoon-Sung;Joung, Jinyoung
    • The Transactions of The Korean Institute of Electrical Engineers
    • /
    • v.62 no.10
    • /
    • pp.1361-1367
    • /
    • 2013
  • This paper describes and suggests the data visualization for real time power system operation based on energy management system. Because real time power system operation performs analysis of the vast amount of on-line data, the operators need intuitive data visualization to find out useful information in the big data. Especially, in emergency situation, the data visualization is able to assist the operators in handling the crisis quickly and efficiently. Therefore, this paper aims to improve displays of output of real time power system operation by visualizing on-line big data. Through this study, we can develop improved visualization technique for real time power system operation, which has highly readable displays of output and intuitive information.

Issues and Empirical Results for Improving Text Classification

  • Ko, Young-Joong;Seo, Jung-Yun
    • Journal of Computing Science and Engineering
    • /
    • v.5 no.2
    • /
    • pp.150-160
    • /
    • 2011
  • Automatic text classification has a long history and many studies have been conducted in this field. In particular, many machine learning algorithms and information retrieval techniques have been applied to text classification tasks. Even though much technical progress has been made in text classification, there is still room for improvement in text classification. In this paper, we will discuss remaining issues in improving text classification. In this paper, three improvement issues are presented including automatic training data generation, noisy data treatment and term weighting and indexing, and four actual studies and their empirical results for those issues are introduced. First, the semi-supervised learning technique is applied to text classification to efficiently create training data. For effective noisy data treatment, a noisy data reduction method and a robust text classifier from noisy data are developed as a solution. Finally, the term weighting and indexing technique is revised by reflecting the importance of sentences into term weight calculation using summarization techniques.

A Content-based Pocket Switched Networks Routing Scheme for Mobile Data Offloading (모바일 데이터 오프로딩을 위한 콘텐츠 기반 Pocket 교환 네트워크 라우팅 기법)

  • Cabacas, Regin;Park, Hong-keun;Lee, Kisong;Ra, In-ho
    • Proceedings of the Korea Contents Association Conference
    • /
    • 2015.05a
    • /
    • pp.33-34
    • /
    • 2015
  • Continuous improvements of network infrastructures and mobile data offloading strategies are among the solutions of cellular providers to cope with the increase in mobile data demand. These options requires a lot of cost and time to implement. Few researches have been conducted to assess the applicability of Pocket Switched Network (PSN) to support mobile data offloading. In this paper, we present a PSN mobile data-offloading scheme that utilizes mobile users with available connectivity to deliver content-aware data to other mobile users. This paper also aims to evaluate the applicability and feasibility of PSN routing schemes to improve the current strategies in mobile data offloading. The simulation results show admirable results in terms of message delivery and latency.

  • PDF

Development of Pattern Classifying System for cDNA-Chip Image Data Analysis

  • Kim, Dae-Wook;Park, Chang-Hyun;Sim, Kwee-Bo
    • 제어로봇시스템학회:학술대회논문집
    • /
    • 2005.06a
    • /
    • pp.838-841
    • /
    • 2005
  • DNA Chip is able to show DNA-Data that includes diseases of sample to User by using complementary characters of DNA. So this paper studied Neural Network algorithm for Image data processing of DNA-chip. DNA chip outputs image data of colors and intensities of lights when some sample DNA is putted on DNA-chip, and we can classify pattern of these image data on user pc environment through artificial neural network and some of image processing algorithms. Ultimate aim is developing of pattern classifying algorithm, simulating this algorithm and so getting information of one's diseases through applying this algorithm. Namely, this paper study artificial neural network algorithm for classifying pattern of image data that is obtained from DNA-chip. And, by using histogram, gradient edge, ANN and learning algorithm, we can analyze and classifying pattern of this DNA-chip image data. so we are able to monitor, and simulating this algorithm.

  • PDF

Automatic Generation of Machine Readable Context Annotations for SPARQL Results

  • Choi, Ji-Woong
    • Journal of the Korea Society of Computer and Information
    • /
    • v.21 no.10
    • /
    • pp.1-10
    • /
    • 2016
  • In this paper, we propose an approach to generate machine readable context annotations for SPARQL Results. According to W3C Recommendations, the retrieved data from RDF or OWL data sources are represented in tabular form, in which each cell's data is described by only type and value. The simple query result form is generally useful, but it is not sufficient to explain the semantics of the data in query results. To explain the meaning of the data, appropriate annotations must be added to the query results. In this paper, we generate the annotations from the basic graph patterns in user's queries. We could also manipulate the original queries to complete the annotations. The generated annotations are represented using the RDFa syntax in our study. The RDFa expressions in HTML are machine-understandable. We believe that our work will improve the trustworthiness of query results and contribute to distribute the data to meet the vision of the Semantic Web.

DEVELOPMENT OF ON-LINE DATA VISUALIZATION PROGRAM AND ITS APPLICATION (온-라인 데이터 가시화 프로그램의 개발과 그 적용)

  • Kang, S.H.;Kim, B.S.
    • 한국전산유체공학회:학술대회논문집
    • /
    • 2008.03a
    • /
    • pp.290-296
    • /
    • 2008
  • In this paper development of on-line data visualization program is described and some examples of data postprocessing are shown. The program is written in JAVA language and runs as a JAVA applet on the web browser such as Internet Explorer or Firefox. Remote users can use the program to visualize and analyze their own flow data by accessing the program server through the internet and loading data files in proper formats from their local computers. This paper describes briefly about algorithms for data visualization, structure and available functions of the program, and web sever system. The mechanism of how the JAVA applet can access and process local data files and relevant coding techniques are explained as well. Also explained is what is required for the remote users and client computers to access the program on-line. Some visualization examples performed on a local computer are illustrated by accessing the server remotely.

  • PDF