• 제목/요약/키워드: SQL similarity analysis

검색결과 3건 처리시간 0.015초

A Table Integration Technique Using Query Similarity Analysis

  • Choi, Go-Bong;Woo, Yong-Tae
    • 한국컴퓨터정보학회논문지
    • /
    • 제24권3호
    • /
    • pp.105-112
    • /
    • 2019
  • In this paper, we propose a technique to analyze similarity between SQL queries and to assist integrating similar tables. First, the table information was extracted from the SQL queries through the query structure analyzer, and the similarity between the tables was measured using the Jacquard index technique. Then, similar table clusters are generated through hierarchical cluster analysis method and the co-occurence probability of the table used in the query is calculated. The possibility of integrating similar tables is classified by using the possibility of co-occurence of similarity table and table, and classifying them into an integrable cluster, a cluster requiring expert review, and a cluster with low integration possibility. This technique analyzes the SQL query in practice and analyse the possibility of table integration independent of the existing business, so that the existing schema can be effectively reconstructed without interruption of work or additional cost.

Genomic data Analysis System using GenoSync based on SQL in Distributed Environment

  • Seine Jang;Seok-Jae Moon
    • International journal of advanced smart convergence
    • /
    • 제13권3호
    • /
    • pp.150-155
    • /
    • 2024
  • Genomic data plays a transformative role in medicine, biology, and forensic science, offering insights that drive advancements in clinical diagnosis, personalized medicine, and crime scene investigation. Despite its potential, the integration and analysis of diverse genomic datasets remain challenging due to compatibility issues and the specialized nature of existing tools. This paper presents the GenomeSync system, designed to overcome these limitations by utilizing the Hadoop framework for large-scale data handling and integration. GenomeSync enhances data accessibility and analysis through SQL-based search capabilities and machine learning techniques, facilitating the identification of genetic traits and the resolution of forensic cases. By pre-processing DNA profiles from crime scenes, the system calculates similarity scores to identify and aggregate related genomic data, enabling accurate prediction models and personalized treatment recommendations. GenomeSync offers greater flexibility and scalability, supporting complex analytical needs across industries. Its robust cloud-based infrastructure ensures data integrity and high performance, positioning GenomeSync as a crucial tool for reliable, data-driven decision-making in the genomic era.

『태평혜민화제국방(太平惠民和劑局方)』과 『소문선명론방(素問宣明論方)』과 『난실비장(蘭室秘藏)』의 방제구성 비교 (Analysis of Prescriptions from Taepyeonghyeminhwajegukbang, Somunsunmyungronbang and Nansilbijang)

  • 오월환;김기욱;이병욱;김은하
    • 대한한의학원전학회지
    • /
    • 제27권4호
    • /
    • pp.121-131
    • /
    • 2014
  • Objectives : The objectives of this study is to compare the differences in medical herbs combination of Taepyeonghyeminhwajegukbang, Somunsunmyungronbang and Nansilbijang. Methods : This study was proceeded by using Access 2007 and SQL Server and prescriptions of which herbal configuration could be indicated by weight unit were analysed from 765 prescriptions of Taepyeonghyeminhwajegukbang, 350 prescriptions of Somunsunmyungronbang and 277 prescriptions of Nansilbijang. Results : If more than 80% similarity, it is considered similar prescription. Only 6 Prescriptions of Nansilbijang similar to Prescriptions of Taepyeonghyeminhwajegukbang. And 20 Prescriptions of Somunsunmyungronbang similar to Prescriptions of Taepyeonghyeminhwajegukbang. Conclusions : Configuration patterns of Somunsunmyungronbang and Nansilbijang are different from Taepyeonghyeminhwajegukbang. So We can prove that theory of Somunsunmyungronbang and Nansilbijang are different from Taepyeonghyeminhwajegukbang.