• Title/Summary/Keyword: Part-to-Part Indexing

Search Result 108, Processing Time 0.035 seconds

A PageRank based Data Indexing Method for Designing Natural Language Interface to CRM Databases (분석 CRM 실무자의 자연어 질의 처리를 위한 기업 데이터베이스 구성요소 인덱싱 방법론)

  • Park, Sung-Hyuk;Hwang, Kyeong-Seo;Lee, Dong-Won
    • CRM연구
    • /
    • v.2 no.2
    • /
    • pp.53-70
    • /
    • 2009
  • Understanding consumer behavior based on the analysis of the customer data is one essential part of analytic CRM. To do this, the analytic skills for data extraction and data processing are required to users. As a user has various kinds of questions for the consumer data analysis, the user should use database language such as SQL. However, for the firm's user, to generate SQL statements is not easy because the accuracy of the query result is hugely influenced by the knowledge of work-site operation and the firm's database. This paper proposes a natural language based database search framework finding relevant database elements. Specifically, we describe how our TableRank method can understand the user's natural query language and provide proper relations and attributes of data records to the user. Through several experiments, it is supported that the TableRank provides accurate database elements related to the user's natural query. We also show that the close distance among relations in the database represents the high data connectivity which guarantees matching with a search query from a user.

  • PDF

An Efficient Split Algorithm to Minimize the Overlap between Node Index Spaces in a Multi-dimensional Indexing Scheme M-tree (다차원 색인구조 M-트리에서 노드 색인 공간의 중첩을 최소화하기 위한 효율적인 분할 알고리즘)

  • Im Sang-hyuk;Ku Kyong-I;Kim Ki-chang;Kim Yoo-Sung
    • The KIPS Transactions:PartD
    • /
    • v.12D no.2 s.98
    • /
    • pp.233-246
    • /
    • 2005
  • To enhance the user response time of content-based retrieval service for multimedia information, several multi-dimensional index schemes have been proposed. M-tree, a well-known multidimensional index scheme is of metric space access method, and is based on the distance between objects in the metric space. However, since the overlap between index spaces of nodes might enlarge the number of nodes of M-tree accessed for query processing, the user response time for content-based multimedia information retrieval grows longer. In this paper, we propose a node split algorithm which is able to reduce the sire of overlap between index spaces of nodes in M-tree. In the proposed scheme, we choose a virtual center point as the routing object and entry redistribution as the postprocessing after node split in order to reduce the radius of index space of a node, and finally in order to reduce the overlap between the index spaces of routing nodes. From the experimental results, we can see the proposed split algorithm reduce the overlap between index space of nodes and finally enhance the user response time for similarity-based query processing.

SOM-Based $R^{*}-Tree$ for Similarity Retrieval (자기 조직화 맵 기반 유사 검색 시스템)

  • O, Chang-Yun;Im, Dong-Ju;O, Gun-Seok;Bae, Sang-Hyeon
    • The KIPS Transactions:PartD
    • /
    • v.8D no.5
    • /
    • pp.507-512
    • /
    • 2001
  • Feature-based similarity has become an important research issue in multimedia database systems. The features of multimedia data are useful for discriminating between multimedia objects. the performance of conventional multidimensional data structures tends to deteriorate as the number of dimensions of feature vectors increase. The $R^{*}-Tree$ is the most successful variant of the R-Tree. In this paper, we propose a SOM-based $R^{*}-Tree$ as a new indexing method for high-dimensional feature vectors. The SOM-based $R^{*}-Tree$ combines SOM and $R^{*}-Tree$ to achieve search performance more scalable to high-dimensionalties. Self-Organizingf Maps (SOMs) provide mapping from high-dimensional feature vectors onto a two-dimensional space. The map is called a topological feature map, and preserves the mutual relationships (similarity) in the feature spaces of input data, clustering mutually similar feature vectors in neighboring nodes. Each node of the topological feature map holds a codebook vector. We experimentally compare the retrieval time cost of a SOM-based $R^{*}-Tree$ with of an SOM and $R^{*}-Tree$ using color feature vectors extracted from 40,000 images. The results show that the SOM-based $R^{*}-Tree$ outperform both the SOM and $R^{*}-Tree$ due to reduction of the number of nodes to build $R^{*}-Tree$ and retrieval time cost.

  • PDF

A correlation analysis between state variables of rainfall-runoff model and hydrometeorological variables (강우-유출 모형의 상태변수와 수문기상변량과의 상관성 분석)

  • Shim, Eunjeung;Uranchimeg, Sumiya;Lee, Yearin;Moon, Young-Il;Lee, Joo-Heon;Kwon, Hyun-Han
    • Journal of Korea Water Resources Association
    • /
    • v.54 no.12
    • /
    • pp.1295-1304
    • /
    • 2021
  • For the efficient use and management of water resources, a reliable rainfall-runoff analysis is necessary. Still, continuous hydrological data and rainfall-runoff data are insufficient to secure through measurements and models. In particular, as part of the reasonable improvement of a rainfall-runoff model in the case of an ungauged watershed, regionalization is being used to transfer the parameters necessary for the model application to the ungauged watershed. In this study, the GR4J model was selected, and the SCEM-UA method was used to optimize parameters. The rainfall-runoff model for the analysis of the correlation between watershed characteristics and parameters obtained through the model was regionalized by the Copula function, and rainfall-runoff analysis with the regionalized parameters was performed on the ungauged watershed. In the process, the intermediate state variables of the rainfall-runoff model were extracted, and the correlation analysis between water level and the ground water level was investigated. Furthermore, in the process of rainfall-runoff analysis, the Standardized State variable Drought Index (SSDI) was calculated by calculating and indexing the state variables of the GR4J model. and the calculated SSDI was compared with the standardized Precipitation index (SPI), and the hydrological suitability evaluation of the drought index was performed to confirm the possibility of drought monitoring and application in the ungauged watershed.

Study for the Deficiency and Excessiveness Diagnosis in the Front Point by Elastic State (모혈(募穴)의 탄력(彈力) 상태(狀態) 측정(測定)에 의한 허실(虛實) 진단(診斷) 연구(硏究))

  • Na, Chang-Su;Yoon, Yeo-Choong;Park, Hyun-Cheal;Lee, Dong-Kyu;Choi, Chan-Hern;Jang, Kyung-Sun;So, Cheal-Ho
    • Journal of Acupuncture Research
    • /
    • v.17 no.1
    • /
    • pp.27-41
    • /
    • 2000
  • The meridian system is the most essential and basic connecting structure that maintains the vital activities of viscera and bowels by connecting them with each part of body's surface. Doctors can understand the healthy condition, and the region and deficiency-excessiveness of disease by observing the condition of Qi flowing. Deficiency and excessiveness could be differentiated by various symptoms expressed in meridian system. Especially there could be several clues like pain, heat-cold, protuberance-depression, change of color and shine in the line of channel leads to the judgment of deficiency-excessiveness The diagnosis of deficiency and excessiveness can be generalized by quantification of elastic status in skin surface along the meridian system. By comparing data from measurement of elastic condition with those from traditional deficiency and excessiveness, it could be utilized for the development of oriental medicine. All biological activities in the human body are based on meridian system according to the oriental medicine. Also the meridian system is viewed as basic and essential structure connecting internal viscera and each part of body. The areas of expressed channel phenomena are muscle to bone, muscle to muscle and bone to bone. These areas are called depression where meridian system is present and any changing state on those points can be measured. It could be difficult in diagnosing the reaction of meridian system because doctor can depend on his own judgment. Therefore, it is necessary to quantify and indexate channel reactions. To quantify the channel reactions, specially manufactured instrument was used to quantify the protuberance and depression to differentiate the deficiency and excessiveness. The results follow as below; 1. The elastic index measurement by the equipment proved a pattern of agreement showing the values that ranged within standard deviation 0.05kgf/cm throughout the experiment except few cases' measurement in CV-17. 2. To evaluate the state of deficiency & excessiveness of elastic index measurements in frontal point, elastic index measurements in the front paint were compared to the elastic index measured surrounding the point within 2.5 cm. Such result of indexing procedure was closely matched to the concept of palpitation. 3. If the elastic index values in the surrounding front point closely located to the elastic index values in the front point, the judgement on the state of deficiency and excessiveness was delayed. Otherwise, it was judged as deficiency or excessiveness. 4. Out of total 12 cases of comparing the elastic index values to the elastic index values in the surrounding front point, Three to nine front points were judged as either in the state of deficiency or excessiveness. 5. Among the nine front points judged as either in the state of deficiency or excessiveness, Four cases were matched to the electric index measured by EAV that evaluating the internal organs by five different phases. If more clinical cases are accumulated, it is expected to systematically theorize and improve the concept of deficiency and excessiveness in the internal organs using the front point.

  • PDF

Measurement Invariance of Journal Selection Criteria between Researchers in Library and Information Science and Social Science (문헌정보학 및 사회과학 분야 연구자의 학술지 선정요인에 대한 측정 동일성 검증)

  • Lee, Jongwook;Park, Jungkyu;Yang, Kiduk;Oh, Dong-Geun
    • Journal of Korean Library and Information Science Society
    • /
    • v.52 no.2
    • /
    • pp.235-252
    • /
    • 2021
  • As part of effort to develop the strategies of internationalization of social science academic journals in South Korea, this study attempts to verify the measurement invariance of journal selection criteria across the groups of library and information science researchers and social science researchers. The authors collected 146 survey responses from researchers who have published at least one paper in SSCI/Scopus-indexed social science journals between 2014 and 2016. As a result of the study, it was found that the configural and partial metric invariance of the journal selection criteria held across the two groups, implying that the model of journal selection criteria is appropriate to use in the field of social science as well as library and information science. Additionally, the authors investigated the perceptions of journal selection criteria indicators in the two groups, and it was shown that researchers in both groups considered peer review and indexing in major databases important. The findings of this study could be useful for publishers or academic societies to develop improvement strategies of their journals.

Rule Discovery and Matching for Forecasting Stock Prices (주가 예측을 위한 규칙 탐사 및 매칭)

  • Ha, You-Min;Kim, Sang-Wook;Won, Jung-Im;Park, Sang-Hyun;Yoon, Jee-Hee
    • Journal of KIISE:Databases
    • /
    • v.34 no.3
    • /
    • pp.179-192
    • /
    • 2007
  • This paper addresses an approach that recommends investment types for stock investors by discovering useful rules from past changing patterns of stock prices in databases. First, we define a new rule model for recommending stock investment types. For a frequent pattern of stock prices, if its subsequent stock prices are matched to a condition of an investor, the model recommends a corresponding investment type for this stock. The frequent pattern is regarded as a rule head, and the subsequent part a rule body. We observed that the conditions on rule bodies are quite different depending on dispositions of investors while rule heads are independent of characteristics of investors in most cases. With this observation, we propose a new method that discovers and stores only the rule heads rather than the whole rules in a rule discovery process. This allows investors to define various conditions on rule bodies flexibly, and also improves the performance of a rule discovery process by reducing the number of rules. For efficient discovery and matching of rules, we propose methods for discovering frequent patterns, constructing a frequent pattern base, and indexing them. We also suggest a method that finds the rules matched to a query issued by an investor from a frequent pattern base, and a method that recommends an investment type using the rules. Finally, we verify the superiority of our approach via various experiments using real-life stock data.

WordNet-Based Category Utility Approach for Author Name Disambiguation (저자명 모호성 해결을 위한 개념망 기반 카테고리 유틸리티)

  • Kim, Je-Min;Park, Young-Tack
    • The KIPS Transactions:PartB
    • /
    • v.16B no.3
    • /
    • pp.225-232
    • /
    • 2009
  • Author name disambiguation is essential for improving performance of document indexing, retrieval, and web search. Author name disambiguation resolves the conflict when multiple authors share the same name label. This paper introduces a novel approach which exploits ontologies and WordNet-based category utility for author name disambiguation. Our method utilizes author knowledge in the form of populated ontology that uses various types of properties: titles, abstracts and co-authors of papers and authors' affiliation. Author ontology has been constructed in the artificial intelligence and semantic web areas semi-automatically using OWL API and heuristics. Author name disambiguation determines the correct author from various candidate authors in the populated author ontology. Candidate authors are evaluated using proposed WordNet-based category utility to resolve disambiguation. Category utility is a tradeoff between intra-class similarity and inter-class dissimilarity of author instances, where author instances are described in terms of attribute-value pairs. WordNet-based category utility has been proposed to exploit concept information in WordNet for semantic analysis for disambiguation. Experiments using the WordNet-based category utility increase the number of disambiguation by about 10% compared with that of category utility, and increase the overall amount of accuracy by around 98%.