• Title/Summary/Keyword: Type/format Inference

Enhanced Regular Expression as a DGL for Generation of Synthetic Big Data

  • Kai, Cheng; Keisuke, Abe
    • Journal of Information Processing Systems, v.19 no.1, pp.1-16, 2023
  • Synthetic data generation is widely used for performance evaluation and functional testing of data-intensive applications, as well as in various areas of data analytics, such as privacy-preserving data publishing (PPDP) and statistical disclosure limitation/control. A significant amount of research has been conducted on tools and languages for data generation. However, existing tools and languages have been developed for specific purposes and are unsuitable for other domains. In this article, we propose a regular expression-based data generation language (DGL) for flexible big data generation. To achieve a general-purpose and powerful DGL, we enhanced standard regular expressions to support the data domain, type/format inference, sequence and random generation, probability distributions, and resource reference. To efficiently implement the proposed language, we propose caching techniques for both the intermediate and database queries. We evaluated the proposed improvement experimentally.

A Design of Weather Ontology for Intelligent Weather Service (지능형 기상 서비스를 위한 기상 온톨로지의 설계)

  • Jung, Eui-Hyun
    • Journal of the Korea Society of Computer and Information, v.13 no.4, pp.185-193, 2008
  • Despite the rapid development of IT-related meteorology and services, users still have to check weather information manually, as before, because traditional weather information retrieval is pull-based and relies on human interpretation. Furthermore, automatic machine-driven weather information processing has long been neglected, although intelligent weather information processing is expected to be very useful for daily life and ubiquitous computing. In this paper, we discuss the design of a GRIB-based ontology that enables smart weather information processing. GRIB is a general-purpose weather data format, used worldwide and approved by the World Meteorological Organization. With the designed ontology and an inference system containing the Jess engine, several intelligent weather applications have been implemented and tested to verify the value of machine-driven weather information processing.
