• Title/Summary/Keyword: Retrieval systems

Term Mapping Methodology between Everyday Words and Legal Terms for Law Information Search System (법령정보 검색을 위한 생활용어와 법률용어 간의 대응관계 탐색 방법론)

  • Kim, Ji Hyun;Lee, Jong-Seo;Lee, Myungjin;Kim, Wooju;Hong, June Seok
    • Journal of Intelligence and Information Systems / v.18 no.3 / pp.137-152 / 2012
  • In the Web 2.0 era, users produce large amounts of user-created content, and the World Wide Web is overflowing with information. Finding meaningful information among these resources has therefore become a key problem. Information retrieval is now important across many fields, and several types of search services have been developed and are widely used to retrieve the information users actually want. Legal information search in particular is an indispensable service, because it lets people find the law that applies to their present situation. Since 2009 the Office of Legislation in Korea has provided the Korean Law Information portal for searching legislation, administrative rules, and judicial precedents, so people can conveniently find law-related information. However, this service is limited because current search engine technology basically returns documents according to whether they contain the query. Despite the efforts of the Office of Legislation, it is therefore difficult for general users who are not familiar with legal terms to retrieve law information with a search engine based on simple keyword matching, because there is a large gap between everyday words and legal terms, especially those derived from Chinese words. People generally try to access law information using everyday words, so they have difficulty getting exactly the results they want. In this paper, we propose a term mapping methodology between everyday words and legal terms for general users who lack sufficient background in legal terminology, and we develop a search service that returns law information for queries in everyday words. This makes it possible to search law information accurately without knowledge of legal terminology. In other words, our research goal is a law information search system with which general users can retrieve law information using everyday words. First, drawing on the concept of collective intelligence, we use the tags of internet blogs to find the mapping relationship between everyday words and legal terms. To do so, we collect the tags related to an everyday word from blog posts. When people write a blog post, they generally add non-hierarchical keywords or terms, called tags, which often act like synonyms, to describe, classify, and manage the post. Second, the collected tags are clustered using K-means cluster analysis. We then find the mapping between an everyday word and a legal term by using our estimation measure to select the legal term that best matches the everyday word. The selected legal terms are given a definite relationship, and the relations between everyday words and legal terms are described using SKOS, an ontology for describing knowledge related to thesauri, classification schemes, taxonomies, and subject headings. Based on the proposed mapping and search methodologies, when a user submits an everyday word, our legal information search system finds the legal term mapped to the query and retrieves law information using that term. Users can therefore get accurate results even without knowledge of legal terms. As a result of our research, we expect that general users without a professional legal background will be able to retrieve legal information conveniently and efficiently using everyday words.
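
The mapping step described in this abstract can be illustrated with a minimal sketch. It assumes a plain co-occurrence count as a stand-in for the paper's estimation measure, which the abstract does not specify, and it omits the K-means clustering and SKOS steps. The function name `map_to_legal_term` and the sample posts and terms are hypothetical.

```python
from collections import Counter

def map_to_legal_term(posts, everyday_word, legal_terms):
    """Pick the legal term that most often shares a blog post with everyday_word.

    Assumption: raw co-occurrence counts replace the paper's own
    estimation measure, which is not given in the abstract.
    """
    counts = Counter()
    for tags in posts:
        if everyday_word in tags:
            # Count only tags that are known legal terms.
            counts.update(tag for tag in tags if tag in legal_terms)
    if not counts:
        return None  # no legal term co-occurs with this word
    return counts.most_common(1)[0][0]

# Hypothetical tag lists, one list per blog post.
posts = [
    ["firing", "unfair dismissal", "job"],
    ["firing", "unfair dismissal", "severance pay"],
    ["firing", "unfair dismissal"],
    ["rent", "lease"],
]
legal_terms = {"unfair dismissal", "severance pay", "lease"}

best = map_to_legal_term(posts, "firing", legal_terms)
```

In a fuller pipeline the returned term would be the one recorded as the mapped concept for the everyday word and then used as the actual search query.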

User Perspective Website Clustering for Site Portfolio Construction (사이트 포트폴리오 구성을 위한 사용자 관점의 웹사이트 클러스터링)

  • Kim, Mingyu;Kim, Namgyu
    • Journal of Internet Computing and Services / v.16 no.3 / pp.59-69 / 2015
  • Many users visit websites every day to perform information retrieval, shopping, and community activities. On the other hand, there is intense competition among sites which attempt to profit from the Internet users. Thus, the owners or marketing officers of each site try to design a variety of marketing strategies including cooperation with other sites. Through such cooperation, a site can share customers' information, mileage points, and hyperlinks with other sites. To create effective cooperation, it is crucial to choose an appropriate partner site that may have many potential customers. Unfortunately, it is exceedingly difficult to identify such an appropriate partner among the vast number of sites. In this paper, therefore, we devise a new methodology for recommending appropriate partner sites to each site. For this purpose, we perform site clustering from the perspective of visitors' similarities, and then identify a group of sites that has a number of common customers. We then analyze the potential for the practical use of the proposed methodology through its application to approximately 140 million actual site browsing histories.
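
As a rough illustration of comparing sites by their visitors, the sketch below scores site pairs with Jaccard similarity over visitor sets and recommends the closest site. This is a simplification chosen for brevity: the abstract describes clustering but does not name the similarity measure, and `recommend_partner` and the sample data are hypothetical.

```python
def jaccard(a, b):
    """Jaccard similarity of two visitor sets (0.0 when both are empty)."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0

def recommend_partner(site, visitors_by_site):
    """Return the other site whose visitors overlap most with site's."""
    others = [s for s in visitors_by_site if s != site]
    if not others:
        return None
    return max(
        others,
        key=lambda s: jaccard(visitors_by_site[site], visitors_by_site[s]),
    )

# Hypothetical browsing histories reduced to sets of visitor IDs.
visitors_by_site = {
    "shop": {"u1", "u2", "u3"},
    "news": {"u2", "u3", "u4"},
    "games": {"u5"},
}

partner = recommend_partner("shop", visitors_by_site)
```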

Discussions on the Accessibility of School Library DLS Catalogue Records - Focused on Literary Collections - (학교도서관 DLS 목록의 자료 접근성에 대한 논의 - 문학 분야 장서를 중심으로 -)

  • Kang, Bong-Suk;Jung, Youngmi
    • Journal of Korean Library and Information Science Society / v.50 no.4 / pp.539-559 / 2019
  • One of the fundamental roles of libraries is to provide users with efficient and easy retrieval of materials. Various discussions have taken place at home and abroad on improving the accessibility of materials by category, user, and collection, and at their center is the issue of improving classification and cataloging systems. However, few studies address the data accessibility of the DLS catalog, which is the central tool for accessing materials in domestic school libraries. This study was prompted by school library users' complaints about the difficulty of searching for and accessing books, especially literature. It is an exploratory study that attempts to identify problems by examining the causes of these difficulties from various angles. For this study, we surveyed and analyzed the current status of school library collections, data registration in the school library support system DLS, the subject accessibility of the catalog records produced through it, and the perceptions and opinions of school library professionals. The results show that school library collections are highly concentrated in the literature field and that the catalog's bibliographic records are insufficient to provide efficient access to these collections. In addition, the DLS search function was found to be somewhat inadequate to compensate for this. Surveys of school librarians and librarians also confirmed this problem, and the majority named rich subject indexing and the assignment of search keywords as ways to improve access to materials in school library catalogs. As discussion of this subject continues, plans for improving access to school library materials will become more concrete through future user studies and new challenges for bookshelf classification.

Automatic Clustering of Same-Name Authors Using Full-text of Articles (논문 원문을 이용한 동명 저자 자동 군집화)

  • Kang, In-Su;Jung, Han-Min;Lee, Seung-Woo;Kim, Pyung;Goo, Hee-Kwan;Lee, Mi-Kyung;Goo, Nam-Ang;Sung, Won-Kyung
    • Proceedings of the Korea Contents Association Conference / 2006.11a / pp.652-656 / 2006
  • Bibliographic information retrieval systems require bibliographic data such as authors, organizations, and sources of publication to be uniquely identified using keys. In particular, when authors are represented only by their names, users bear the burden of manually distinguishing different authors who share the same name. Previous approaches to the same-name author problem rely on bibliographic data such as co-author information and article titles. However, these methods cannot handle single-author articles or articles whose titles share no common terms. To complement the previous methods, this study introduces a classification-based approach that uses the similarity between the full texts of articles. Experiments on recent domestic proceedings showed that the proposed method has the potential to supplement the previous metadata-based approaches.
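
One common way to measure similarity between full texts is cosine similarity over word counts. The sketch below assumes that measure and a fixed decision threshold. Neither is confirmed by the abstract, and the function names, the threshold value, and the sample texts are hypothetical.

```python
import math
from collections import Counter

def cosine_similarity(text_a, text_b):
    """Cosine similarity between bag-of-words vectors of two texts."""
    a = Counter(text_a.lower().split())
    b = Counter(text_b.lower().split())
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(
        sum(v * v for v in b.values())
    )
    return dot / norm if norm else 0.0

def same_author(text_a, text_b, threshold=0.5):
    """Assumed decision rule: similar enough full texts imply one author."""
    return cosine_similarity(text_a, text_b) >= threshold

# Hypothetical full texts of three articles by authors sharing one name.
paper_1 = "ontology reasoning semantic web ontology"
paper_2 = "semantic web ontology alignment"
paper_3 = "sewage pipe flow"
```

Because this decision uses only the article body, it still applies to single-author articles and to articles whose titles share no terms, the two cases the abstract says metadata-based methods miss.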

A Study on Sewage Characteristics in Hanam City (하남시 오수발생특성에 대한 연구)

  • Choi, Gye-Woon;Hyun, Ji-Hwan;Lee, Ho-Sun
    • Proceedings of the Korea Water Resources Association Conference / 2005.05b / pp.1317-1322 / 2005
  • Determining the appropriate scale of sewage treatment facilities, as in sewer design, site development projects, and sewer rehabilitation projects, requires accurate unit values for water consumption and sewage generation. In Korea, however, a lack of basic data on these unit values and the low reliability of the available data make it difficult to determine appropriate values. From this perspective, cities undergoing site development need surveys of sewage generation patterns and volumes that account for factors such as city size, location, climate, and lifestyle. The surveyed unit values can be used not only to size sewage treatment facilities but also for infiltration/inflow analysis and for predicting the outcomes of sewer rehabilitation. In this study, we selected representative districts of Hanam City, where site development and sewer rehabilitation projects are currently under way, subdivided them into survey areas, and analyzed the sewage generation characteristics of each area. In Hanam City, 97% of the total area is natural or productive green space, and the remaining 3% is divided into general residential and general commercial zones. Because the city plan designates no industrial zones, most of Hanam City will continue to consist of green space and residential/commercial zones. Reflecting these characteristics, the survey areas were classified, excluding industrial zones, into general residential, dense residential, and commercial areas. For each class, sewage generation patterns and sewage concentrations were surveyed to analyze sewage generation characteristics, and unit values were derived by linking the results to population counts for each survey area. From these data we estimated the sewage conversion ratio and the nighttime domestic sewage ratio required for infiltration/inflow analysis. Through comparison with the sewage generation characteristics observed after future site development and sewer rehabilitation, the results are expected to serve as reliable data on sewage generation in Hanam City.

Customized Configuration with Template and Options (맞춤구성을 위한 템플릿과 Option 기반의 추론)

  • 이현정;이재규
    • Journal of Intelligence and Information Systems / v.8 no.1 / pp.119-139 / 2002
  • In electronic catalogs, each item is represented as an independent unit, while the parts of an item can be composed to provide a higher level of functionality. Thus, search over this kind of product database is limited to retrieving the most similar standard commodities. However, many industrial products require optional parts to be configured to fulfill the required specifications. Since there are many paths to the required specifications, we need a search system that works through the configuration process. In this system, we adopt a two-phase approach. The first phase finds the most similar template, and the second phase adjusts the template specifications toward the required set of specifications using the Constraint and Rule Satisfaction Problem approach. There is no guarantee that the most similar template leads to the most desirable configuration. The search system therefore needs backtracking capability, so that the search can stop at a satisfactory local optimum. This framework is applied to the configuration of computers and peripherals. Template-based reasoning is basically the same as case-based reasoning. The required set of specifications is represented as a list of criteria and matched against product specifications to find the closest ones. To measure the distance, we develop a thesaurus of values, which can identify the meaning of numbers, symbols, and words. With this configuration, the performance of the search-by-configuration algorithm is evaluated in terms of feasibility and admissibility.

Elicitation of Collective Intelligence by Fuzzy Relational Methodology (퍼지관계 이론에 의한 집단지성의 도출)

  • Joo, Young-Do
    • Journal of Intelligence and Information Systems / v.17 no.1 / pp.17-35 / 2011
  • Collective intelligence is commons-based production arising from the collaboration and competition of many peer individuals. In other words, it is the aggregation of individual intelligence that leads to the wisdom of crowds. Recently, the use of collective intelligence has become an emerging research area, since it has been adopted as an important principle of Web 2.0, which aims at openness, sharing, and participation. This paper introduces an approach that seeks collective intelligence by recognizing the relations and interactions among individual participants. It describes a methodology well suited to evaluating individual intelligence, with information retrieval and classification as the application field. The research investigates how to derive and represent such cognitive intelligence from individuals by applying fuzzy relational theory to personal construct theory and the knowledge grid technique. Crucial to this research is implementing formally and processing interpretively the cognitive knowledge of participants who form mutual relations and social interactions. What is needed is a technique for analyzing the structure of cognitive intelligence in the form of a Hasse diagram, which is an instantiation of this perceptive intelligence of human beings. The search for collective intelligence requires a theory of similarity to deal with two underlying problems: clustering social subgroups of individuals by identifying individual intelligence and the commonality among them, and then eliciting collective intelligence by aggregating the congruence or sharing among all participants of the entire group. Unlike standard approaches to similarity based on statistical techniques, the method presented employs a theory of fuzzy relational products, with the related computational procedures, to cover issues of similarity and dissimilarity.

Multi-Dimensional Keyword Search and Analysis of Hotel Review Data Using Multi-Dimensional Text Cubes (다차원 텍스트 큐브를 이용한 호텔 리뷰 데이터의 다차원 키워드 검색 및 분석)

  • Kim, Namsoo;Lee, Suan;Jo, Sunhwa;Kim, Jinho
    • Journal of Information Technology and Architecture / v.11 no.1 / pp.63-73 / 2014
  • With the advance of the WWW, unstructured data including text are attracting more and more interest from users. Because such unstructured data created by WWW users represent their subjective opinions, appropriate analysis can yield very useful information such as users' personal tastes or perspectives. In this paper, we provide various efficient analyses of unstructured text documents by taking advantage of OLAP (On-Line Analytical Processing) multidimensional cube technology. OLAP cubes have been widely used for multidimensional analysis of structured data such as simple alphabetic and numeric data, but they have not been used for unstructured data consisting of long texts. To provide multidimensional analysis of unstructured text data, the Text Cube model has recently been proposed. It incorporates term frequency and the inverted index, which play key roles in information retrieval, as measures for searching and analyzing text databases. The primary goal of this paper is to apply the text cube model to a real data set from an Internet site for sharing hotel information and to provide multidimensional analysis of users' hotel reviews written as text. To achieve this goal, we first build text cubes for the hotel review data. Using the text cubes, we design and implement a system that provides multidimensional keyword search features for searching and analyzing review texts along various dimensions. This system can help users easily obtain valuable summary information reflecting guests' subjective opinions. Furthermore, this paper evaluates the proposed system through various experiments, which demonstrate its effectiveness.
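
The idea of storing term frequency in an inverted index keyed by cube cells can be sketched as follows. This is a toy under stated assumptions, not the Text Cube model itself: the two dimensions (city, star rating), the function names, and the sample reviews are hypothetical.

```python
from collections import defaultdict

def build_text_cube(reviews):
    """Inverted index: term -> {(city, stars) cell: term frequency}."""
    cube = defaultdict(lambda: defaultdict(int))
    for city, stars, text in reviews:
        for term in text.lower().split():
            cube[term][(city, stars)] += 1
    return cube

def keyword_by_city(cube, term):
    """Roll up the star-rating dimension: total frequency of term per city."""
    totals = defaultdict(int)
    for (city, _stars), tf in cube.get(term, {}).items():
        totals[city] += tf
    return dict(totals)

# Hypothetical hotel reviews as (city, star rating, review text).
reviews = [
    ("Seoul", 5, "clean room friendly staff"),
    ("Seoul", 3, "noisy room"),
    ("Busan", 4, "clean beach view"),
]

cube = build_text_cube(reviews)
room_by_city = keyword_by_city(cube, "room")
```

Keeping the cell key as a tuple of dimension values means a roll-up is just a sum over the dimensions being dropped, which is what `keyword_by_city` does for the star rating.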

An Exploration on Food Waste Management of Local Governments (전국 지방자치단체의 음식물쓰레기 관리 분석)

  • Oh, Jeongik;Lee, Hyunjeong
    • Journal of Korean Society of Environmental Engineers / v.38 no.3 / pp.101-109 / 2016
  • This research explores food waste management across local governments. In particular, public administration of food waste, food waste management (from generation to disposal), and civil complaints within each jurisdiction are examined. To this end, a self-administered questionnaire survey was conducted among civil officers in charge of food waste management, and all collected responses were statistically analyzed. The main results were as follows: public spending on food waste management was slightly larger in metropolises than in provincial cities, and households (in housing) were identified as the largest source of food waste. Regular collection of food waste by trucks was the most common transport method adopted by local governments, and resource recovery for compost/fertilizer production was widely used. Also, most respondents agreed that current food waste handling practices need to be replaced with more advanced technology that converts waste into energy or fuel. Further, civil complaints on food waste management fell largely into three groups: food waste handling, civil service, and food waste retrieval. The findings therefore indicate that developing and applying no-food-waste or waste-to-resource systems would be effective in housing estates, where large amounts of food waste are generated and disposed of.

Effective Picture Search in Lifelog Management Systems using Bluetooth Devices (라이프로그 관리 시스템에서 블루투스 장치를 이용한 효과적인 사진 검색 방법)

  • Chung, Eun-Ho;Lee, Ki-Yong;Kim, Myoung-Ho
    • Journal of KIISE:Computing Practices and Letters / v.16 no.4 / pp.383-391 / 2010
  • A lifelog management system provides users with services to store, manage, and search their lifelogs. This paper proposes a fully automatic method for collecting real-world social contacts and a lifelog search engine that uses the collected social contact information as keywords. Short-range wireless network devices in mobile phones are used to detect the social contacts of their users. A human-Bluetooth relationship matrix is built from the frequency with which a person and a Bluetooth device are observed at the same time. Results show that 90% of human-Bluetooth relationships can be correctly acquired when only 20% of the full social contact information for the observation times is used in the calculation. We propose a lifelog search engine that takes human names as keywords and, using the vector model of information retrieval, compares two vectors: a row of the human-Bluetooth matrix and the vector of Bluetooth devices scanned when a lifelog was created. This search engine returns more lifelogs than an existing text-matching search engine and, unlike existing search engines, ranks the results.
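
The vector comparison described in this abstract can be sketched with cosine similarity, the usual measure in the vector space model. The abstract does not state which measure the authors use, so treat that as an assumption. The device order, matrix row, and lifelog names are hypothetical.

```python
import math

def cosine(u, v):
    """Cosine similarity of two equal-length numeric vectors."""
    dot = sum(x * y for x, y in zip(u, v))
    norm = math.sqrt(sum(x * x for x in u)) * math.sqrt(sum(y * y for y in v))
    return dot / norm if norm else 0.0

def search_lifelogs(person_row, lifelogs):
    """Rank lifelogs by similarity to a person's matrix row, best first.

    Lifelogs with zero similarity are dropped from the result.
    """
    scored = [(cosine(person_row, scan), name) for name, scan in lifelogs.items()]
    return [name for score, name in sorted(scored, reverse=True) if score > 0]

# Hypothetical device order: [phone_a, headset_a, phone_b].
# Row of the human-Bluetooth matrix for one person (co-observation frequency).
alice_row = [0.9, 0.6, 0.0]

# Devices scanned (1) or not (0) when each lifelog entry was created.
lifelogs = {
    "photo1": [1, 1, 0],
    "photo2": [1, 0, 1],
    "photo3": [0, 0, 1],
}

ranked = search_lifelogs(alice_row, lifelogs)
```

Here `photo1` ranks above `photo2` because both of the person's devices were in range when it was taken, and `photo3` is dropped because none were.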