• Title/Summary/Keyword: Law Retrieval System

Search Result 12, Processing Time 0.028 seconds

Developing and Evaluating an Ontology-based Legal Retrieval System (온톨로지 기반 법률 검색시스템의 구축 및 평가에 관한 연구)

  • Chang, In-Ho
    • Journal of the Korean Society for Library and Information Science
    • /
    • v.45 no.2
    • /
    • pp.345-366
    • /
    • 2011
  • The law affects our daily lives, and hence, constitutes a crucial information resource. However, electronic access to legal information using keyword-based retrieval systems appears to provide users with limited satisfaction. There are many factors behind this inadequacy. First, the discrepancies between formal legal terms and their counterparts in common language are quite large. Second, the situation is further confounded by frequent abbreviations in legal terms. Third, even though there is a constant deluge of legal information, users' needs have evolved to demand more Q and A type searches. All of these factors make the existing retrieval systems inefficient and ineffective. This article suggests an ontology-based system as a means to deal with such difficulties. To that end, a legal retrieval system(experimental system), built on the basis of a newly-constructed law ontology, was tested against a keyword-based legal retrieval system(existing one), yielding data on their relative effectiveness in retrieval and user satisfaction.

A Study on the Implementation of Law Information Retrieval System (법령 정보검색 시스템 구현에 관한 연구)

  • Min, Jae-Hong;Cho, Pyung-Dong;Yang, Jin-Hyuk;Park, Pyung-Koo;Chung, In-Jeong
    • The Transactions of the Korea Information Processing Society
    • /
    • v.7 no.11S
    • /
    • pp.3702-3713
    • /
    • 2000
  • Telecommunications standards have two different types of regulations: one is a law. enacted by government which all telecommunications related industries must observe. The other is a recommendatory standards. formulated by either government agency or some standardization organizations. Observation of these standards is not obligatory. However, technical standards are strict laws and ordinances based on common judgement and various conditions for evaluation of levels and limits. This paper deals with enhancing productivity of enactment and revision of technical standards. Through database of above related information we secure information continuity and public property of cyber space for the public. In this paper. we also classify recent data within the website in and out of the country offering four different methods of information retrieval and management system. The four retrieval methods suggested in this paper are itemized keyword retrieval. hierarchical retrieval, regulatory keyword retrieval and chronological keyword retrieval. These various retrieval methods provide the public with information of enactment and amendment of laws and regulations in the cyber space. thereby guarantees the sharing of information. Finally the important feature of the information retrieval system implemented in this paper is the online updating capability of law and regulations through the internet.

  • PDF

A Study on the Effectiveness of Information Retrieval (정보검색효율에 관한 연구)

  • Yoon Koo-ho
    • Journal of the Korean Society for Library and Information Science
    • /
    • v.8
    • /
    • pp.73-101
    • /
    • 1981
  • Retrieval effectiveness is the principal criterion for measuring the performance of an information retrieval system. The effectiveness of a retrieval system depends primarily on the extent to which it can retrieve wanted documents without retrieving unwanted ones. So, ultimately, effectiveness is a function of the relevant and nonrelevant documents retrieved. Consequently, 'relevance' of information to the user's request has become one of the most fundamental concept encountered in the theory of information retrieval. Although there is at present no consensus as to how this notion should be defined, relevance has been widely used as a meaningful quantity and an adequate criterion for measures of the evaluation of retrieval effectiveness. The recall and precision among various parameters based on the 'two-by-two' table (or, contingency table) were major considerations in this paper, because it is assumed that recall and precision are sufficient for the measurement of effectiveness. Accordingly, different concepts of 'relevance' and 'pertinence' of documents to user requests and their proper usages were investigated even though the two terms have unfortunately been used rather loosely in the literature. In addition, a number of variables affecting the recall and precision values were discussed. Some conclusions derived from this study are as follows: Any notion of retrieval effectiveness is based on 'relevance' which itself is extremely difficult to define. Recall and precision are valuable concepts in the study of any information retrieval system. They are, however, not the only criteria by which a system may be judged. The recall-precision curve represents the average performance of any given system, and this may vary quite considerably in particular situations. Therefore, it is possible to some extent to vary the indexing policy, the indexing policy, the indexing language, or the search methodology to improve the performance of the system in terms of recall and precision. The 'inverse relationship' between average recall and precision could be accepted as the 'fundamental law of retrieval', and it should certainly be used as an aid to evaluation. Finally, there is a limit to the performance(in terms of effectiveness) achievable by an information retrieval system. That is : "Perfect retrieval is impossible."

  • PDF

A Development of Ontology-Based Law Retrieval System: Focused on Railroad R&D Projects (온톨로지 기반 법령 검색시스템의 개발: 철도·교통 분야 연구개발사업을 중심으로)

  • Won, Min-Jae;Kim, Dong-He;Jung, Hae-Min;Lee, Sang Keun;Hong, June Seok;Kim, Wooju
    • The Journal of Society for e-Business Studies
    • /
    • v.20 no.4
    • /
    • pp.209-225
    • /
    • 2015
  • Research and development projects in railroad domain are different from those in other domains in terms of their close relationship with laws. Some cases are reported that new technologies from R&D projects could not be industrialized because of relevant laws restricting them. This problem comes from the fact that researchers don't know exactly what laws can affect the result of R&D projects. To deal with this problem, we suggest a model for law retrieval system that can be used by researchers of railroad R&D projects to find related legislation. Input of this system is a research plan describing the main contents of projects. After laws related to the R&D project is provided with their rankings, which are assigned by scores we developed. A ranking of a law means its order of priority to be checked. By using this system, researchers can search the laws that may affect R&D projects throughout all the stages of project cycle. So, using our system model, researchers can get a list of laws to be considered before the project they participate ends. As a result, they can adjust their project direction by checking the law list, avoiding their elaborate projects being useless.

An Efficient Boolean Query Processing in Information Retrieval (효율적인 부울 질의 연산에 관한 연구)

  • 채승기;남영광;박현주
    • Journal of the Korean Society for information Management
    • /
    • v.13 no.1
    • /
    • pp.173-185
    • /
    • 1996
  • In this paper, we propose four optimizing methods for effectively processing queries in the Booleam information retrieval system ; (i) the short-circuit evaluation scheme used for optimizing logical expressions in programming lan-guages is applied to Boolean queries.(II) use the difference of the number of index word frequencies appearing in the related documents. (IIi) reduce the number of operators in the queries by applying the distribution law in the set theory. (iv) evaluate only once for the repeated expressions in the query. These methods have been implemented and tested in KRISTAL-II system on the UNIX workstation environment.

  • PDF

Building Hybrid Stop-Words Technique with Normalization for Pre-Processing Arabic Text

  • Atwan, Jaffar
    • International Journal of Computer Science & Network Security
    • /
    • v.22 no.7
    • /
    • pp.65-74
    • /
    • 2022
  • In natural language processing, commonly used words such as prepositions are referred to as stop-words; they have no inherent meaning and are therefore ignored in indexing and retrieval tasks. The removal of stop-words from Arabic text has a significant impact in terms of reducing the size of a cor- pus text, which leads to an improvement in the effectiveness and performance of Arabic-language processing systems. This study investigated the effectiveness of applying a stop-word lists elimination with normalization as a preprocessing step. The idea was to merge statistical method with the linguistic method to attain the best efficacy, and comparing the effects of this two-pronged approach in reducing corpus size for Ara- bic natural language processing systems. Three stop-word lists were considered: an Arabic Text Lookup Stop-list, Frequency- based Stop-list using Zipf's law, and Combined Stop-list. An experiment was conducted using a selected file from the Arabic Newswire data set. In the experiment, the size of the cor- pus was compared after removing the words contained in each list. The results showed that the best reduction in size was achieved by using the Combined Stop-list with normalization, with a word count reduction of 452930 and a compression rate of 30%.

Term Mapping Methodology between Everyday Words and Legal Terms for Law Information Search System (법령정보 검색을 위한 생활용어와 법률용어 간의 대응관계 탐색 방법론)

  • Kim, Ji Hyun;Lee, Jong-Seo;Lee, Myungjin;Kim, Wooju;Hong, June Seok
    • Journal of Intelligence and Information Systems
    • /
    • v.18 no.3
    • /
    • pp.137-152
    • /
    • 2012
  • In the generation of Web 2.0, as many users start to make lots of web contents called user created contents by themselves, the World Wide Web is overflowing by countless information. Therefore, it becomes the key to find out meaningful information among lots of resources. Nowadays, the information retrieval is the most important thing throughout the whole field and several types of search services are developed and widely used in various fields to retrieve information that user really wants. Especially, the legal information search is one of the indispensable services in order to provide people with their convenience through searching the law necessary to their present situation as a channel getting knowledge about it. The Office of Legislation in Korea provides the Korean Law Information portal service to search the law information such as legislation, administrative rule, and judicial precedent from 2009, so people can conveniently find information related to the law. However, this service has limitation because the recent technology for search engine basically returns documents depending on whether the query is included in it or not as a search result. Therefore, it is really difficult to retrieve information related the law for general users who are not familiar with legal terms in the search engine using simple matching of keywords in spite of those kinds of efforts of the Office of Legislation in Korea, because there is a huge divergence between everyday words and legal terms which are especially from Chinese words. Generally, people try to access the law information using everyday words, so they have a difficulty to get the result that they exactly want. In this paper, we propose a term mapping methodology between everyday words and legal terms for general users who don't have sufficient background about legal terms, and we develop a search service that can provide the search results of law information from everyday words. This will be able to search the law information accurately without the knowledge of legal terminology. In other words, our research goal is to make a law information search system that general users are able to retrieval the law information with everyday words. First, this paper takes advantage of tags of internet blogs using the concept for collective intelligence to find out the term mapping relationship between everyday words and legal terms. In order to achieve our goal, we collect tags related to an everyday word from web blog posts. Generally, people add a non-hierarchical keyword or term like a synonym, especially called tag, in order to describe, classify, and manage their posts when they make any post in the internet blog. Second, the collected tags are clustered through the cluster analysis method, K-means. Then, we find a mapping relationship between an everyday word and a legal term using our estimation measure to select the fittest one that can match with an everyday word. Selected legal terms are given the definite relationship, and the relations between everyday words and legal terms are described using SKOS that is an ontology to describe the knowledge related to thesauri, classification schemes, taxonomies, and subject-heading. Thus, based on proposed mapping and searching methodologies, our legal information search system finds out a legal term mapped with user query and retrieves law information using a matched legal term, if users try to retrieve law information using an everyday word. Therefore, from our research, users can get exact results even if they do not have the knowledge related to legal terms. As a result of our research, we expect that general users who don't have professional legal background can conveniently and efficiently retrieve the legal information using everyday words.

Development of a Concept Network Useful for Specialized Search Engines (전문검색엔진을 위한 개념망의 개발)

  • 주정은;구상회
    • Journal of Information Technology Applications and Management
    • /
    • v.10 no.2
    • /
    • pp.33-41
    • /
    • 2003
  • It is not easy to find desired information in the world wide web. In this research, we introduce a notion of concept network that is useful in finding information if it is used in search engines that are specialized in domains such as medicine, law or engineering. The concept network that we propose is a network in which nodes represent significant concepts in the domain, and links represent relationships between the concepts. We may use the concept network constructor as a preprocessor to speci-alized search engines. When user enters a target word to find information, our system generates and displays a concept network in which nodes are con-cepts that are closely related with the target word. By reviewing the network, user may confirm that the target word is properly selected for his intention, otherwise he may replace the target word with better ones discovered in the network. In this research, we propose a detailed method to construct concept net-work, implemented a prototypical system that constructs concept networks, and illustrate its usefulness by demonstrating a practical case.

  • PDF

A Comparative Analysis on the Research Products of Each Other Between Korea and Japan - With an Emphasis on the Social Fields - (사회영역에 있어서 한일간 지식정보의 생산과 흐름 - 사회.교육.행정.법률을 중심으로-)

  • 최정태
    • Journal of Korean Library and Information Science Society
    • /
    • v.33 no.2
    • /
    • pp.1-24
    • /
    • 2002
  • This study intends to analyze materials which Korea and Japan have investigated about each other during last In years(1901-2000). To the end, we collected monographs and constructed DBs('Korea-Japan Information Retrieval System'). Using it, this study analyzed a characteristic of the publication period, subjects, and producers from a bibliographical point of view. In particular, this study concentrated upon the subject of sociology, education, public administration, and law fields.

  • PDF

Uncertainties of SO2 Vertical Column Density Retrieval from Ground-based Hyper-spectral UV Sensor Based on Direct Sun Measurement Geometry (지상관측 기반 태양 직달광 관측장비의 초분광 자외센서로부터 이산화황 연직칼럼농도의 불확실성 분석 연구)

  • Kang, Hyeongwoo;Park, Junsung;Yang, Jiwon;Choi, Wonei;Kim, Daewon;Lee, Hanlim
    • Korean Journal of Remote Sensing
    • /
    • v.35 no.2
    • /
    • pp.289-298
    • /
    • 2019
  • In this present study, the effects of Signal to Noise Ratio (SNR), Full Width Half Maximum (FWHM), Aerosol Optical Depth (AOD), $O_3$ Vertical Column Density ($O_3$ VCD), and Solar Zenith Angle (SZA) on the accuracy of sulfur dioxide Vertical Column Density ($SO_2$ VCD) retrieval have been quantified using the Differential Optical Absorption Spectroscopy (DOAS) method with the ground-based direct-sun synthetic radiances. The synthetic radiances produced based on the Beer-Lambert-Bouguer law without consideration of the diffuse effect. In the SNR condition of 650 (1300) with FWHM = 0.6 nm, AOD = 0.2, $O_3$ VCD = 300 DU, and $SZA=30^{\circ}$, the Absolute Percentage Difference (APD) between the true $SO_2$ VCD values and those retrieved ranges from 80% (28%) to 16% (5%) for the $SO_2$ VCD of $8.1{\times}10^{15}$ and $2.7{\times}10^{16}molecules\;cm^{-2}$, respectively. For an FWHM of 0.2 nm (1.0 nm) with the $SO_2$ VCD values equal to or greater than $2.7{\times}10^{16}molecules\;cm^{-2}$, the APD ranges from 6.4% (29%) to 6.2% (10%). Additionally, when FWHM, SZA, AOD, and $O_3$ VCD values increase, APDs tend to be large. On the other hand, SNR values increase, APDs are found to decrease. Eventually, it is revealed that the effects of FWHM and SZA on $SO_2$ VCD retrieval accuracy are larger than those of $O_3$ VCD and AOD. The SZA effects on the reduction of $SO_2$ VCD retrieval accuracy is found to be dominant over the that of FWHM for the condition of $SO_2$ VCD larger than $2.7{\times}10^{16}molecules\;cm^{-2}$.