• Title/Summary/Keyword: Query Model

Search Result 563, Processing Time 0.029 seconds

Design and Implementation of Tag Coupling-based Boolean Query Matching System for Ranked Search Result (태그결합을 이용한 불리언 검색에서 순위화된 검색결과를 제공하기 위한 시스템 설계 및 구현)

  • Kim, Yong;Joo, Won-Kyun
    • Journal of the Korean Society for information Management
    • /
    • v.29 no.4
    • /
    • pp.101-121
    • /
    • 2012
  • Since IR systems which adopt only Boolean IR model can not provide ranked search result, users have to conduct time-consuming checking process for huge result sets one by one. This study proposes a method to provide search results ranked by using coupling information between tags instead of index weight information in Boolean IR model. Because document queries are used instead of general user queries in the proposed method, key tags used as queries in a relevant document are extracted. A variety of groups of Boolean queries based on tag couplings are created in the process of extracting queries. Ranked search result can be extracted through the process of matching conducted with differential information among the query groups and tag significance information. To prove the usability of the proposed method, the experiment was conducted to find research trend analysis information on selected research information. Aslo, the service based on the proposed methods was provided to get user feedback for a year. The result showed high user satisfaction.

The Scalability and the Strategy for EMR Database Encryption Techniques

  • Shin, David;Sahama, Tony;Kim, Steve Jung-Tae;Kim, Ji-Hong
    • Journal of information and communication convergence engineering
    • /
    • v.9 no.5
    • /
    • pp.577-582
    • /
    • 2011
  • EMR(Electronic Medical Record) is an emerging technology that is highly-blended between non-IT and IT area. One of methodology to link non-IT and IT area is to construct databases. Nowadays, it supports before and after-treatment for patients and should satisfy all stakeholders such as practitioners, nurses, researchers, administrators and financial department and so on. In accordance with the database maintenance, DAS (Data as Service) model is one solution for outsourcing. However, there are some scalability and strategy issues when we need to plan to use DAS model properly. We constructed three kinds of databases such as plain-text, MS built-in encryption which is in-house model and custom AES (Advanced Encryption Standard) - DAS model scaling from 5K to 2560K records. To perform custom AES-DAS better, we also devised Bucket Index using Bloom Filter. The simulation showed the response times arithmetically increased in the beginning but after a certain threshold, exponentially increased in the end. In conclusion, if the database model is close to in-house model, then vendor technology is a good way to perform and get query response times in a consistent manner. If the model is DAS model, it is easy to outsource the database, however, some technique like Bucket Index enhances its utilization. To get faster query response times, designing database such as consideration of the field type is also important. This study suggests cloud computing would be a next DAS model to satisfy the scalability and the security issues.

Calibration of Portable Particulate Mattere-Monitoring Device using Web Query and Machine Learning

  • Loh, Byoung Gook;Choi, Gi Heung
    • Safety and Health at Work
    • /
    • v.10 no.4
    • /
    • pp.452-460
    • /
    • 2019
  • Background: Monitoring and control of PM2.5 are being recognized as key to address health issues attributed to PM2.5. Availability of low-cost PM2.5 sensors made it possible to introduce a number of portable PM2.5 monitors based on light scattering to the consumer market at an affordable price. Accuracy of light scatteringe-based PM2.5 monitors significantly depends on the method of calibration. Static calibration curve is used as the most popular calibration method for low-cost PM2.5 sensors particularly because of ease of application. Drawback in this approach is, however, the lack of accuracy. Methods: This study discussed the calibration of a low-cost PM2.5-monitoring device (PMD) to improve the accuracy and reliability for practical use. The proposed method is based on construction of the PM2.5 sensor network using Message Queuing Telemetry Transport (MQTT) protocol and web query of reference measurement data available at government-authorized PM monitoring station (GAMS) in the republic of Korea. Four machine learning (ML) algorithms such as support vector machine, k-nearest neighbors, random forest, and extreme gradient boosting were used as regression models to calibrate the PMD measurements of PM2.5. Performance of each ML algorithm was evaluated using stratified K-fold cross-validation, and a linear regression model was used as a reference. Results: Based on the performance of ML algorithms used, regression of the output of the PMD to PM2.5 concentrations data available from the GAMS through web query was effective. The extreme gradient boosting algorithm showed the best performance with a mean coefficient of determination (R2) of 0.78 and standard error of 5.0 ㎍/㎥, corresponding to 8% increase in R2 and 12% decrease in root mean square error in comparison with the linear regression model. Minimum 100 hours of calibration period was found required to calibrate the PMD to its full capacity. Calibration method proposed poses a limitation on the location of the PMD being in the vicinity of the GAMS. As the number of the PMD participating in the sensor network increases, however, calibrated PMDs can be used as reference devices to nearby PMDs that require calibration, forming a calibration chain through MQTT protocol. Conclusions: Calibration of a low-cost PMD, which is based on construction of PM2.5 sensor network using MQTT protocol and web query of reference measurement data available at a GAMS, significantly improves the accuracy and reliability of a PMD, thereby making practical use of the low-cost PMD possible.

Adaptive User Profile for Information Retrieval from the Web

  • Srinil, Phaitoon;Pinngern, Ouen
    • 제어로봇시스템학회:학술대회논문집
    • /
    • 2003.10a
    • /
    • pp.1986-1989
    • /
    • 2003
  • This paper proposes the information retrieval improvement for the Web using the structure and hyperlinks of HTML documents along with user profile. The method bases on the rationale that terms appearing in different structure of documents may have different significance in identifying the documents. The method partitions the occurrence of terms in a document collection into six classes according to the tags in which particular terms occurred (such as Title, H1-H6 and Anchor). We use genetic algorithm to determine class importance values and expand user query. We also use this value in similarity computation and update user profile. Then a genetic algorithm is used again to select some terms from user profile to expand the original query. Lastly, the search engine uses the expanded query for searching and the results of the search engine are scored by similarity values between each result and the user profile. Vector space model is used and the weighting schemes of traditional information retrieval were extended to include class importance values. The tested results show that precision is up to 81.5%.

  • PDF

An Object Manager Interface for Object-Oriented Query Processing (객체지향 질의처리를 위한 객체관리기 인터페이스)

  • Lee, Yon-Sik;Chon, Byoung-Sil;Ryu, Keun-Ho
    • The Transactions of the Korea Information Processing Society
    • /
    • v.2 no.1
    • /
    • pp.1-11
    • /
    • 1995
  • Object-oriented database systems represent the complex data model of real- world and manage the complex data. An object manager handles the manipulation and access of all objects in object-oriented database management systems. In this paper, we not only propose the design rules of an object manager interface(OMI) which is specified the calling function and the semantics of an object manager for object-oriented query processing, but also implement the OMI based on the rules. The OMI implemented supports the independently developing environment of query processing and object management modules, and can be applicable to various applications, and also provides the efficient access method to users.

  • PDF

Design of Moving Object Query Processing Based on UDF (UDF 기반 이동객체 질의 처리 설계 및 구현)

  • Yoo, Kihyun;Yang, Pyoung Woo;Nam, Kwang Woo
    • KIPS Transactions on Software and Data Engineering
    • /
    • v.6 no.2
    • /
    • pp.85-90
    • /
    • 2017
  • Various mobile devices are spreading in recent developments in mobile computing environments. Especially the popularity of mobile devices equipped with GPS has become widespread, and various application services utilizing location information are born. In this paper, we propose a system model for storing and managing the trajectory of moving objects, which is the set of location information of moving objects acquired in continuous time, and the UDF (User-Defined Functions) based trajectory index method which can quickly query the large data set of moving object and the Pre-Materialized table method. Then we compare and evaluate the performance of each method through experiments. Experimental results show that the Pre-Materialized table method is about 1.2 times faster than the UDF based trajectory index method on execution time.

Structured Information Modeling and Query Method for SMIL Documents (SMIL 문서의 구조 정보 모델 및 검색)

  • 류은숙;이기호;이규철
    • Journal of Korea Multimedia Society
    • /
    • v.7 no.3
    • /
    • pp.293-307
    • /
    • 2004
  • The SMIL(Synchronized Multimedia Integration Language) documents are represented as logical structure information, spatial layout structure information, temporal synchronization structure information and hyperlink structure information, according as the structural characteristics of SMIL documents based on XML. This paper proposes the effective modeling and query method for the multi -structure information of inherent SMIL documents. In particular, we present the object-oriented modeling by using UML class diagram in order to represent the objects classes for the structured information of SMIL documents, and the hierarchical structure and the relationships for the objects classes. In addition, the objects classes definition is specified in compliance with SQL3 for database standard language. We also propose the access method and the query representation for hierarchical structure in order to retrieve efficiently the structural objects of SMIL documents.

  • PDF

Genealogy-based Indexing Technique for XML Documents (XML문서를 위한 족보 기반 인덱싱 기법)

  • 이월영;용환승
    • Journal of KIISE:Databases
    • /
    • v.31 no.1
    • /
    • pp.72-81
    • /
    • 2004
  • Theses days, a number of data over the Internet are represented using XML because of a virtue of XML. In proportion to the increase of XML data, query processing techniques are required that support quickly and efficiently the diverse queries to search the useful information on XML documents. But, up to now, the researches handling queries for XML data are methodologies focusing on how to process regular path expressions. Therefore, we have developed a new genealogy-based indexing technique to solve various queries such as not only regular path expression but also simple path expression, path expression referencing other elements, and so on. Also, we have applied this technique on object-relational model and evaluated the performance for many documents and various query types. The result shows improved performance in comparison with the other storage techniques.

On Regularity of Daily Distribution of Queries in Search Engine (검색엔진에서 일간질의 어분포의 정상성에 관한 연구)

  • Park, Sang-Gue;Lee, Chan-Kyu;Yoon, Kyung-Hyun;Kim, Seong-Hee;Lee, Jun-Ho
    • Journal of the Korean Society for information Management
    • /
    • v.24 no.4
    • /
    • pp.255-265
    • /
    • 2007
  • In this paper we analyzed regularity of daily patterns of distribution of Queries coming from internet search engine. And then, we proposed a Pareto distribution and Zipf law for identifying the query distribution and applied them to daily queries on the search engine during 2 week. We found that there is some evidence that Pareto and Zipf laws can be applied to evaluate the regularity of daily patterns of distribution of queries in search engine. Those results can be used to provide a better understanding of the social interests and trends using the query distribution patterns.

A Framework for Developing Distributed Application with Web-Tier Object Modeling (웹계층 오브젝트 모델링을 통한 분산 애플리케이션 개발 프레임웍크)

  • Cheon, Sang-Ho;Kwon, Ki-Hyeon;Choi, Hyung-Jin
    • The KIPS Transactions:PartD
    • /
    • v.11D no.5
    • /
    • pp.1143-1148
    • /
    • 2004
  • To develop multi-tier model or distributed architecture based distributed application needs to consider various aspects such as division of role between web-designer and software developer, defining entity and its usage, database connection and transaction processing etc. This paper presents DONSL(Data Server of Non SQL-Query) architecture that provides solution to above aspects through web-tier object modeling. This is the architecture that guarantees the transaction processing and performance between web-tier and DBMS through simplified usage of query logic property. This new conceptual framework also solves enterprise site implementation problems simplifying tier, and removing DAO(Data Access Object) and entity.