• Title/Summary/Keyword: Case Retrieval

Query Formulation for Heuristic Retrieval in Obfuscated and Translated Partially Derived Text

  • Kumar, Aarti;Das, Sujoy
    • Journal of Information Science Theory and Practice / v.3 no.1 / pp.24-39 / 2015
  • Pre-retrieval query formulation is an important step in identifying local text reuse. Local reuse with high obfuscation, paraphrasing, and translation makes it challenging to find the reused text in a document. In this paper, three pre-retrieval query formulation strategies for heuristic retrieval of low-obfuscated, high-obfuscated, and translated text are studied. The strategies are query formulation using (a) proper nouns; (b) unique words (hapax); and (c) the most frequent words. Whereas hapax keywords proved slightly more efficient for low and high obfuscation and simulated paraphrasing, initial results indicate that the simple strategy of query formulation using proper nouns gives promising results and may prove better at reducing the size of the corpus for post-processing when identifying local text reuse in obfuscated and translated text.
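
The three strategies named in this abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names, the capitalization heuristic for proper nouns, and the tiny stopword list are all assumptions made for illustration.

```python
import re
from collections import Counter

# Assumption: a tiny illustrative stopword list, not one taken from the paper.
STOPWORDS = frozenset({"the", "a", "of", "in", "and", "is", "to"})

def tokenize(text):
    """Split text into alphabetic tokens, keeping the original case."""
    return re.findall(r"[A-Za-z]+", text)

def proper_noun_query(text):
    """(a) Heuristic: capitalized tokens that do not start a sentence."""
    query = []
    for sentence in re.split(r"[.!?]\s*", text):
        for tok in tokenize(sentence)[1:]:  # skip the sentence-initial word
            if tok[0].isupper() and tok not in query:
                query.append(tok)
    return query

def hapax_query(text):
    """(b) Words that occur exactly once in the text (hapax legomena)."""
    counts = Counter(t.lower() for t in tokenize(text))
    return [w for w, c in counts.items() if c == 1]

def frequent_query(text, k=3):
    """(c) The k most frequent non-stopword tokens."""
    counts = Counter(
        t.lower() for t in tokenize(text) if t.lower() not in STOPWORDS
    )
    return [w for w, _ in counts.most_common(k)]

text = "The river flows through Paris. Tourists visit Paris and the river in spring."
print(proper_noun_query(text))
print(hapax_query(text))
print(frequent_query(text, k=2))
```

A real system would more likely rely on a part-of-speech tagger for proper nouns; the capitalization rule here only keeps the sketch dependency-free.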

Case-Based Reasoning Cost Estimation Model Using Two-Step Retrieval Method

  • Lee, Hyun-Soo;Seong, Ki-Hoon;Park, Moon-Seo;Ji, Sae-Hyun;Kim, Soo-Young
    • Land and Housing Review / v.1 no.1 / pp.1-7 / 2010
  • The case-based reasoning (CBR) method can help estimators understand the estimation process more clearly. Thus, CBR is widely used as a methodology for cost estimation. In CBR, the quality of case retrieval affects the relevance of the retrieved cases and hence the overall quality of the reminding capability of the CBR system. Thus, it is essential to retrieve relevant past cases to establish a robust CBR system. Case retrieval requires the following tasks to obtain appropriate case(s): indexing, search, and matching (Aamodt and Plaza 1994). However, previous CBR research mostly deals with the matching process, which is limited in the accuracy and efficiency of case retrieval. To address this issue, this research presents a CBR cost model for building projects that has a two-step retrieval process: decision tree and nearest neighbor methods. Specifically, the proposed cost model has indexing, search, and matching modules. Features in the model are divided into shape-based and scale-based attributes. Based on these, a decision tree is established to facilitate the search task, and the nearest neighbor method is used for the matching task. In applying the nearest neighbor method, attribute weights are assigned using GA optimization and similarity is calculated using the principle of distance measuring. The proposed CBR cost model is developed using 174 cases and validated using 12 test cases.
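
The two-step retrieval described in this abstract can be sketched roughly as follows. This is only an illustration under stated assumptions: an exact match on shape-based attributes stands in for the decision tree, the attribute weights are fixed by hand rather than found by GA optimization, and the case data and field names are invented.

```python
import math

def search_step(cases, target, shape_keys):
    """Step 1 (stand-in for the decision tree): keep cases whose
    shape-based attributes match the target exactly."""
    matched = [c for c in cases if all(c[k] == target[k] for k in shape_keys)]
    return matched or cases  # fall back to the whole case base if nothing matches

def similarity(case, target, weights):
    """Step 2: weighted Euclidean distance mapped to a similarity in (0, 1]."""
    dist = math.sqrt(
        sum(w * (case[k] - target[k]) ** 2 for k, w in weights.items())
    )
    return 1.0 / (1.0 + dist)

def retrieve(cases, target, shape_keys, weights, k=1):
    """Filter by shape-based attributes, then rank by weighted similarity."""
    pool = search_step(cases, target, shape_keys)
    ranked = sorted(pool, key=lambda c: similarity(c, target, weights), reverse=True)
    return ranked[:k]

# Invented toy cases; scale-based attributes are assumed to be normalized to [0, 1].
cases = [
    {"id": "A", "structure": "RC", "floor_area": 0.30, "floors": 0.40},
    {"id": "B", "structure": "RC", "floor_area": 0.80, "floors": 0.90},
    {"id": "C", "structure": "steel", "floor_area": 0.32, "floors": 0.41},
]
target = {"structure": "RC", "floor_area": 0.35, "floors": 0.45}
weights = {"floor_area": 0.6, "floors": 0.4}  # hand-picked, not GA-optimized

best = retrieve(cases, target, ["structure"], weights)
print(best[0]["id"])
```

Filtering first keeps the nearest neighbor step from comparing the target against structurally unrelated cases, which appears to be the motivation for separating search from matching.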

Case-Based Reasoning Method Using Case Data Base of Tall Buildings in Korea (국내 초고층 건물의 사례 데이터베이스를 이용한 사례기반추론기법)

  • Song, Hwa-Cheol;Park, Soo-Yong;Kim, Soo-Hwan
    • Journal of Korean Association for Spatial Structures / v.7 no.6 / pp.75-82 / 2007
  • In this study, a design-support system intended to assist engineers in the schematic phase of structural design is developed using a case database that contains design information on tall buildings in Korea. A case-based reasoning method that uses the case database is proposed. An inductive retrieval module is presented that selects the structural system, in the initial stage, from the design information in the case database of 47 tall buildings. A nearest-neighbor retrieval method for selecting similar design cases is also introduced.

Fixture Planning Using Case-Based Reasoning (사례기반 추론방법을 이용한 치공구의 선정)

  • 현상필;이홍희
    • Journal of Korean Society of Industrial and Systems Engineering / v.22 no.51 / pp.129-138 / 1999
  • The aim of this research is to develop an automated fixture planning system for prismatic parts using case-based reasoning (CBR). CBR is a problem-solving paradigm that uses the similarity between a new problem and old cases to solve the new problem. This research applies CBR to fixture planning. A case consists of information about the part, the fixture components, and the fixing method for the part. The basic procedure is retrieval and adaptation of a case, and this research presents a retrieval method that selects the case most similar to the new situation. The retrieval step is divided into index matching and aggregated matching. Adaptation is accomplished by modification, which transforms the selected case into a solution for the input part using the specified CBR algorithm. Through this procedure, the fixture components and the fixing method are determined for a new part.

Financial Forecasting System using Data Editing Technique and Case-based Reasoning (자료편집기법과 사례기반추론을 이용한 재무예측시스템)

  • Kim, Gyeong-Jae
    • Proceedings of the Korean Institute of Intelligent Systems Conference / 2007.11a / pp.283-286 / 2007
  • This paper proposes a genetic algorithm (GA) approach to instance selection in case-based reasoning (CBR) for predicting the Korea Stock Price Index (KOSPI). CBR has been widely used in various areas because of its convenience and strength in complex problem solving. Nonetheless, compared to other machine learning techniques, CBR has been criticized for its low prediction accuracy. Generally, to obtain successful results from CBR, effective retrieval of useful prior cases for the given problem is essential. However, designing a good matching and retrieval mechanism for CBR systems is still a controversial research issue. In this paper, the GA simultaneously optimizes the feature weights and the selection of relevant instances to achieve good matching and retrieval in a CBR system. This study applies the proposed model to stock market analysis. Experimental results show that the GA approach is a promising method for instance selection in CBR.

Construction of Efficient Semantic Net and Component Retrieval in Case-Based Reuse (Case 기반 재사용에서 효율적인 의미망의 구축과 컴포넌트 검색)

  • Han Jung-Soo
    • The Journal of the Korea Contents Association / v.6 no.3 / pp.20-27 / 2006
  • In this paper, we construct a semantic net that efficiently supports retrieval and reuse of object-oriented source code. The initial relevance values of the semantic net are built using a thesaurus so as to represent the object-oriented inheritance relationships between nodes. We also address the weak points of the spreading activation method, which is used to activate the nodes and links of the semantic net and to propagate activation values. We thus propose a method that improves retrieval time while preserving the quality of spreading activation.
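
The abstract does not say how the authors modified spreading activation, so the following Python sketch shows only a generic version of the technique on a small semantic net. The decay factor, pruning threshold, step limit, and node names are assumptions chosen for illustration.

```python
def spread_activation(graph, sources, decay=0.5, threshold=0.15, max_steps=3):
    """Generic spreading activation.

    graph:   {node: {neighbor: link_weight}}
    sources: {node: initial_activation}
    Signals weaker than `threshold` are dropped, which bounds how far
    activation travels (and therefore the retrieval time).
    """
    activation = dict(sources)
    frontier = dict(sources)
    for _ in range(max_steps):
        next_frontier = {}
        for node, value in frontier.items():
            for neighbor, weight in graph.get(node, {}).items():
                passed = value * weight * decay
                if passed < threshold:
                    continue  # prune weak signals
                next_frontier[neighbor] = next_frontier.get(neighbor, 0.0) + passed
        if not next_frontier:
            break
        for node, value in next_frontier.items():
            activation[node] = activation.get(node, 0.0) + value
        frontier = next_frontier
    return activation

# Hypothetical component names linked by inheritance-style relevance weights.
graph = {
    "List": {"Stack": 0.8, "Queue": 0.8},
    "Stack": {"Deque": 0.5},
    "Queue": {"Deque": 0.5},
}
result = spread_activation(graph, {"List": 1.0})
print(sorted(result, key=result.get, reverse=True))
```

Raising the threshold trades recall for speed: with the default of 0.15 the signal never reaches `Deque`, while a lower threshold such as 0.05 lets it through.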

Information Retrieval in Construction Hazard Identification (건설 위험 식별을 위한 정보 검색)

  • Kim, Hyun-Soo;Lee, Hyun-Soo;Park, Moon-Seo;Hwang, Sung-Joo
    • Korean Journal of Construction Engineering and Management / v.12 no.2 / pp.53-63 / 2011
  • The repeated occurrence of similar accidents is one of the most prominent features of construction disasters. Similar accident cases provide direct information for finding the risks of scheduled activities and planning safety countermeasures. Researchers have developed many systems to retrieve and use past accident cases. However, these systems are limited in that they require too many retrievals to obtain results that reflect construction site conditions, do not reflect the characteristics of safety planning steps, or both. To overcome these limitations, this study proposes an accident case retrieval system that searches for similar accident cases and supports safety planning using information retrieval and building information modeling (BIM). The retrieval system extracts BIM objects and composes a query set by combining the BIM objects with a site information DB. It then compares the query set against a DB of past accident cases and seeks the most similar case. The results are provided to safety managers. Based on the results of this study, safety managers can reduce excessive query generation. Furthermore, they can more easily recognize the risks of a construction site by obtaining the coordinates of objects where similar accidents occurred.

Cooperative Case-based Reasoning Using Approximate Query Answering (근사질의 응답기능을 이용한 협동적 사례기반추론)

  • 김진백
    • The Journal of Information Systems / v.8 no.1 / pp.27-44 / 1999
  • Case-Based Reasoning (CBR) offers a new approach for developing knowledge-based systems. CBR has several research issues, which can be divided into two categories: (1) static issues and (2) dynamic issues. The static issues relate to the case representation scheme and the case data model, that is, they focus on the casebase, which is a repository of cases. The dynamic issues, on the other hand, relate to the case retrieval procedure and the problem-solving process, i.e., the case adaptation phase. This research focuses on the retrieval procedure. Traditional query processing accepts precisely specified queries and provides only exact answers, thus requiring users to fully understand the problem domain and the casebase schema, and returning limited or even null information if the exact answer is not available. To remedy this restriction, extending the classical notion of query answering to approximate query answering (AQA) has been explored. AQA can be achieved by neighborhood query answering or associative query answering. In this paper, the neighborhood query answering technique is used for AQA. To reinforce the CBR process, a new retrieval procedure (cooperative CBR) using neighborhood query answering is proposed. Neighborhood query answering relaxes the query scope to enlarge the search range, or relaxes the answer scope to include additional information. Computer Aided Process Planning (CAPP) is selected as the application domain for testing cooperative CBR. CAPP is essential for achieving CIM: it is the bridge between CAD and CAM and translates design information into manufacturing instructions. The test shows that the relaxation technique improves the problem-solving ability of cooperative CBR.
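
To make the idea of relaxing a query scope concrete, here is a minimal Python sketch that contrasts an exact query with a neighborhood query that widens a numeric range step by step. It is a simplified illustration, not the paper's procedure: the function names, the casebase fields, and the fixed-step relaxation rule are assumptions.

```python
def exact_query(casebase, attribute, value):
    """Traditional query: return only cases that match the value exactly."""
    return [c for c in casebase if c[attribute] == value]

def relaxed_query(casebase, attribute, value, step, max_radius):
    """Neighborhood query: widen the accepted range around `value`
    until at least one case is found or `max_radius` is exceeded.
    Returns (hits, radius_used)."""
    radius = 0
    while radius <= max_radius:
        hits = [c for c in casebase if abs(c[attribute] - value) <= radius]
        if hits:
            return hits, radius
        radius += step
    return [], max_radius

# Hypothetical process-planning cases indexed by hole diameter in millimetres.
casebase = [
    {"id": "P1", "diameter_mm": 8},
    {"id": "P2", "diameter_mm": 12},
    {"id": "P3", "diameter_mm": 20},
]

print(exact_query(casebase, "diameter_mm", 10))
hits, radius = relaxed_query(casebase, "diameter_mm", 10, step=1, max_radius=5)
print([c["id"] for c in hits], radius)
```

The exact query returns nothing for a 10 mm hole, whereas the relaxed query returns the nearest stored plans together with the radius at which they were found, so the caller can judge how approximate the answer is.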

Improving Real-Time Efficiency of Case Retrieving Process for Case-Based Reasoning

  • Park, Yoon-Joo
    • Asia Pacific Journal of Information Systems / v.25 no.4 / pp.626-641 / 2015
  • Conventional case-based reasoning (CBR) does not perform efficiently for high-volume datasets because of case retrieval time. To overcome this problem, previous research suggested clustering a case base into several small groups and retrieving neighbors within a corresponding group to a target case. However, this approach generally produces less accurate predictive performance than the conventional CBR. This paper proposes a new case-based reasoning method called the clustering-merging CBR (CM-CBR). The CM-CBR method dynamically indexes a search pool to retrieve neighbors considering the distance between a target case and the centroid of a corresponding cluster. This method is applied to three real-life medical datasets. Results show that the proposed CM-CBR method produces similar or better predictive performance than the conventional CBR and clustering-CBR methods in numerous cases with significantly less computational cost.
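
The abstract does not give the exact merging rule of CM-CBR, so the Python sketch below only illustrates the general idea of building a search pool from centroid distances. The `ratio` rule (merge every cluster whose centroid is within `ratio` times the nearest centroid distance), the function names, and the toy points are assumptions, not the published method.

```python
import math

def centroid(points):
    """Mean point of a cluster given as a list of equal-length tuples."""
    dims = len(points[0])
    return tuple(sum(p[d] for p in points) / len(points) for d in range(dims))

def build_pool(clusters, target, ratio=1.5):
    """Assumed merging rule: pool every cluster whose centroid lies within
    `ratio` times the distance to the nearest centroid."""
    dists = [math.dist(centroid(c), target) for c in clusters]
    nearest = min(dists)
    return [p for c, d in zip(clusters, dists) if d <= ratio * nearest for p in c]

def nearest_neighbors(pool, target, k=1):
    """Plain k-nearest-neighbor search restricted to the pool."""
    return sorted(pool, key=lambda p: math.dist(p, target))[:k]

# Toy pre-clustered case base (invented 2-D feature vectors).
clusters = [
    [(0, 0), (0, 2)],
    [(3, 0), (3, 2)],
    [(10, 10), (10, 12)],
]
target = (1.6, 0.5)

pool = build_pool(clusters, target)
print(len(pool), nearest_neighbors(pool, target))
```

With `ratio=1.0` the sketch reduces to plain clustering-CBR (search only the nearest cluster); larger ratios pull in neighboring clusters for targets that sit near a cluster boundary, at the cost of a larger pool.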

Object-Based Image Search Using Color and Texture Homogeneous Regions (유사한 색상과 질감영역을 이용한 객체기반 영상검색)

  • 유헌우;장동식;서광규
    • Journal of Institute of Control, Robotics and Systems / v.8 no.6 / pp.455-461 / 2002
  • An object-based image retrieval method is addressed. A new image segmentation algorithm and a method for comparing segmented objects are proposed. For image segmentation, color and texture features are extracted from each pixel in the image. These features are used as inputs to a VQ (Vector Quantization) clustering method, which yields objects that are homogeneous in terms of color and texture. In this procedure, colors are quantized into a few dominant colors for simple representation and efficient retrieval. For retrieval, two comparison schemes are proposed: comparing one query object with multiple objects of a database image, and comparing multiple query objects with multiple objects of a database image. For fast retrieval, dominant object colors are key-indexed in the database.