• Title/Summary/Keyword: Database and statistics

Search Result 433, Processing Time 0.024 seconds

An Efficient Algorithm for Mining Frequent Sequences In Spatiotemporal Data

  • Vhan Vu Thi Hong;Chi Cheong-Hee;Ryu Keun-Ho
    • 한국공간정보시스템학회:학술대회논문집
    • /
    • 2005.11a
    • /
    • pp.61-66
    • /
    • 2005
  • Spatiotemporal data mining represents the confluence of several fields including spatiotemporal databases, machine loaming, statistics, geographic visualization, and information theory. Exploration of spatial data mining and temporal data mining has received much attention independently in knowledge discovery in databases and data mining research community. In this paper, we introduce an algorithm Max_MOP for discovering moving sequences in mobile environment. Max_MOP mines only maximal frequent moving patterns. We exploit the characteristic of the problem domain, which is the spatiotemporal proximity between activities, to partition the spatiotemporal space. The task of finding moving sequences is to consider all temporally ordered combination of associations, which requires an intensive computation. However, exploiting the spatiotemporal proximity characteristic makes this task more cornputationally feasible. Our proposed technique is applicable to location-based services such as traffic service, tourist service, and location-aware advertising service.

  • PDF

Estimation of a Nationwide Statistics of Hernia Operation Applying Data Mining Technique to the National Health Insurance Database (데이터마이닝 기법을 이용한 건강보험공단의 수술 통계량 근사치 추정 -허니아 수술을 중심으로-)

  • Kang, Sung-Hong;Seo, Seok-Kyung;Yang, Yeong-Ja;Lee, Ae-Kyung;Bae, Jong-Myon
    • Journal of Preventive Medicine and Public Health
    • /
    • v.39 no.5
    • /
    • pp.433-437
    • /
    • 2006
  • Objectives: The aim of this study is to develop a methodology for estimating a nationwide statistic for hernia operations with using the claim database of the Korea Health Insurance Cooperation (KHIC). Methods: According to the insurance claim procedures, the claim database was divided into the electronic data interchange database (EDI_DB) and the sheet database (Paper_DB). Although the EDI_DB has operation and management codes showing the facts and kinds of operations, the Paper_DB doesn't. Using the hernia matched management code in the EDI_DB, the cases of hernia surgery were extracted. For drawing the potential cases from the Paper_DB, which doesn't have the code, the predictive model was developed using the data mining technique called SEMMA. The claim sheets of the cases that showed a predictive probability of an operation over the threshold, as was decided by the ROC curve, were identified in order to get the positive predictive value as an index of usefulness for the predictive model. Results: Of the claim databases in 2004, 14,386 cases had hernia related management codes with using the EDI system. For fitting the models with applying the data mining technique, logistic regression was chosen rather than the neural network method or the decision tree method. From the Paper_DB, 1,019 cases were extracted as potential cases. Direct review of the sheets of the extracted cases showed that the positive predictive value was 95.3%. Conclusions: The results suggested that applying the data mining technique to the claim database in the KHIC for estimating the nationwide surgical statistics would be useful from the aspect of execution and cost-effectiveness.

Prevention Meteorological Database Information for the Assessment of Natural Disaster (자연재해 평가를 위한 방재기상 DB 정보)

  • Choi, Hyo-Jin;Park, Jong-Kil;Jung, Woo-Sik
    • 한국방재학회:학술대회논문집
    • /
    • 2007.02a
    • /
    • pp.315-318
    • /
    • 2007
  • In order to reduce the amount of damage from natural disasters, we needs prevention meteorological database classified into the cause of disaster, damage elements etc. For this, we have analyzed four data, such as Statistical yearbook of calamities issued by the National Emergency Management Agency and Annual Climatological Report issued by the Korea Meteorological Administration and Recently 10 years for natural disaster damage and Statistics Yearbook from the Ministry of Government Administration and Human affairs. Through the analysis of disaster data, we have selected input variables, such as causes and elements, occurrence frequencies, vulnerable areas of natural disaster, etc. In order to reduce damage from natural disaster, the prevention activities and forecasting based on meteorological parameters and damage datas are required. In addition, it is necessary to process meteorological information for disaster prevention activities. Through these procedure, we have established the foundation of database about natural disasters. This database will be used to assess the natural disasters and build risk model and natural disasters mitigation plan.

  • PDF

Esophageal Cancer in Korea: Epidemiology and Treatment Patterns

  • Park, Seong Yong;Kim, Dae Joon
    • Journal of Chest Surgery
    • /
    • v.54 no.6
    • /
    • pp.454-459
    • /
    • 2021
  • According to statistics from 2017, esophageal cancer is the fifteenth most common cancer and the eleventh most common cause of cancer-related death in Korea. The most common pathology is esophageal squamous cell carcinoma. Moreover, the incidence of esophageal cancer has been gradually decreasing in Korea, and the percentage of early-stage cases has gradually increased to the point that it is higher than that of other countries. The 5-year relative survival rate has improved over time. Approximately 800 esophagectomy procedures are performed annually. Using a cut-off number of 21 cases per 2 years to define high-volume centers, it was found that 70% of esophagectomies were performed by a few high-volume centers. Unfortunately, there is no nationwide registry or database on esophageal cancer and esophagectomy in Korea. Efforts to establish a nationwide database on esophageal cancer and esophagectomy should be made.

Contents and Patent Map Analysis on the Internet Sites for Statistical Information

  • Cho, Kwang-Hyun;Park, Hee-Chang
    • Journal of the Korean Data and Information Science Society
    • /
    • v.17 no.2
    • /
    • pp.411-420
    • /
    • 2006
  • There are many statistical information sites as the use of internet is increased quickly in recent years. In this paper, we explore and analyze internet sites for statistical information such as statistical survey system, education, database, and terminology. And then we classify these sites to apply statistical information to some particular spheres easily. Also, we analyze the patent map for domestic patents of statistical information. In so doing, the result of this study aims at enhancing our understanding of internet sites for statistical information.

  • PDF

Construction of Integrated Agricultural Statistical System Architecture for Effective Policy (농업정책 실효성 증대를 위한 농업통계시스템 아키텍처 구축)

  • Lee, Min-Soo;Chae, Young-Chan;Hong, Hee-Yeon;Kim, Sang-Ho;Kim, Jeong-Seop
    • Journal of Korean Society of Rural Planning
    • /
    • v.11 no.4 s.29
    • /
    • pp.75-91
    • /
    • 2005
  • This study designs an integrated data architecture to systematically manage the agricultural statistics database. Managing the agricultural statistics is important since it provides data for policies and decision making for agribusinesses. Ministry of Agriculture and the National Statistical Office collect the basic agricultural statistic data which provides the basis of logical decision making and agricultural policies. However, the agricultural statistic data has not well been used. The data has not been consistently collected nor managed. The raw data has not been organized nor processed to meet various demands. The needs has been arisen for a consistent agricultural statistics system to increase the relevance, accessibility, and efficiency of data for various users. There are massive amount of data accumulated over a long time period. Introducing the new system and reorganizing the data will bear large risks. A systematic method is required to reduce the risks in planing, building, and maintaining the database without hindering administration. This study provides a design of the agricultural statistics system architecture based on the user requirement analysis (URA) and similar systems abroad. We have also build a prototype to check the implementability of the system design.

The Development of a System for Product Search Using a Sensibility and Configuration Database on Designing Men's Jackets (신사복 재킷디자인의 감성 및 형상 데이터베이스를 이용한 제품검색 시스템 개발에 관한 연구)

  • Park, Yun-A
    • Journal of the Korean Home Economics Association
    • /
    • v.44 no.4 s.218
    • /
    • pp.133-144
    • /
    • 2006
  • The contemporary period is called "the age of sensibility" in which each individual consumer seeks to have her or his own products. Businesses are in need of design developments with an emphasis on customer sensitivity, and at the same time consumers must understand their own sensitivity to acquire information on designs that suit them. This research established a sensitivity and configuration database on designing men's jackets using the sensitivity engineering approach to clothing design information. The user interface was created on the Internet. Sixty-seven sensitivity terms of vocabulary appropriate for the assessment of men's jacket design were selected, and the different designs were classified into six items and 24 categories. Thirty men's jackets with different designs were produced for sensory testing and the results were analyzed in accordance with general linear I statistics. A sensitivity database was established for each category. My-sql, PHP, Java Script, and Html were used for the configuration database work. The configuration of items/categories, with the most appropriate sensitivity database information assigned to the selected sensitivity vocabulary, was programmed for display on the computer screen. The sensitivity vocabulary of a customer's choice for each factor was selected for the program to run, while the category and product configuration of the men's jacket most suitable for the search was displayed based on the user interface.

A visual query database system for the Sample Research DB of the National Health Insurance Service (국민건강보험공단의 표본연구DB를 위한 비주얼 쿼리 데이터베이스 시스템 개발 연구)

  • Cho, Sang-Hoon;Kim, HeeChan;Kang, Gunseog
    • The Korean Journal of Applied Statistics
    • /
    • v.30 no.1
    • /
    • pp.13-24
    • /
    • 2017
  • The Sample Cohort DB supplied by the National Health Insurance Service is a valuable resource for statistical studies as well as for health and medical studies. It takes significant time and effort to extract data from this Cohort DB having a large size. As such, we introduce a database system, conveniently called the National Health Insurance Service Cohort DB Extract Tool (NICE Tool), which supports several useful operations for effectively and efficiently managing the Cohort DB. For example, researchers can extract variables and cases related with study by simply clicking a computer mouse without any prior knowledge regarding SAS DATA step or SQL. We expect that NICE Tool will facilitate the faster extraction of data and eventually lead to the active use of the Cohort DB for research purposes.

HyperDB - A High Performance Data Analysis System Based on Grid Computing Technology

  • Kim, Tae-Kyung;Na, Jong-Hwa;Chon, Wan-Sup
    • Journal of the Korean Data and Information Science Society
    • /
    • v.18 no.1
    • /
    • pp.161-174
    • /
    • 2007
  • In this paper, we propose a high performance database cluster system called HyperDB to process OLAP queries efficiently. HyperDB is a virtual database system running on top of internet-connected PCs; the PCs are used for their own purpose at ordinary times, but they are able to participate in the database cluster system at non-office hours. We propose fully logical replication technique and optimal parallel intra-query routing technique for extensibility and performance. Experiment for TPC-R benchmark shows significant performance upgrade compared with conventional approaches.

  • PDF

The historical development of online systems (온라인 시스템의 역사적 발전)

  • 사공철
    • Journal of the Korean Society for information Management
    • /
    • v.11 no.2
    • /
    • pp.111-130
    • /
    • 1994
  • Comprehensive and historical development including important technological components of online information systems is described. Growth of database and online information service identified in the worldwide statistics is analyzed. Development and present state of database and online services in Korea are outlined. Based on the above historical conditions the future trend of online system is presented.

  • PDF