• Title/Summary/Keyword: tag-based information retrieval (태그 기반 정보검색)

Search Result 136, Processing Time 0.021 seconds

Implementation of an XML-Based Editor/Transformer for Large Volume of Similar Documents (XML 기반의 대용량 유사 문서 편집기/변환기 구현)

  • 황인준
    • The Journal of Society for e-Business Studies / v.9 no.1 / pp.21-38 / 2004
  • With its recent popularity, the Web is now considered a huge repository of information. Most documents on the Web have been created using HTML (HyperText Markup Language). Even though HTML is simple and easy to learn, it has several features that hinder efficient information retrieval. XML (eXtensible Markup Language) can provide a solution to such problems and, in fact, has already been used in many applications. XML is a standard markup language for exchanging data on the Web. It can describe a document structure freely by defining its DTD, which enables efficient integration and retrieval of data on the Web. In this paper, we propose a versatile and efficient XML document manager. Its features include (i) a form-based XML editor that enables easy creation of new XML documents, (ii) an automatic document converter that transforms HTML documents with similar structure into XML documents, and (iii) a GUI-based DTD editor.

Development and Evaluation of Information Extraction Module for Postal Address Information (우편주소정보 추출모듈 개발 및 평가)

  • Shin, Hyunkyung;Kim, Hyunseok
    • Journal of Creative Information Culture / v.5 no.2 / pp.145-156 / 2019
  • In this study, we developed and evaluated an information extraction module based on named entity recognition. The module was designed to extract postal address information from arbitrary documents without any prior knowledge of the document layout. From a practical standpoint, our approach can be described as a probabilistic n-gram (bi- or tri-gram) method, which generalizes uni-gram based keyword matching. The main difference between our approach and conventional natural language processing methods is that it applies sentence detection, tokenization, and POS tagging recursively rather than applying the models sequentially. Test results on approximately two thousand documents are presented in this paper.
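
The contrast the abstract draws between uni-gram keyword matching and a probabilistic bi-gram method can be illustrated with a small sketch. This is not the authors' module: the coarse token classes, the add-one smoothing, the training examples, and every function name below are assumptions made for illustration only.

```python
import re
from collections import Counter

# Hypothetical street-suffix keywords used by both approaches.
SUFFIXES = {"st", "street", "ave", "avenue", "rd", "road"}
NEXT_SYMBOLS = 5  # NUM, ZIP, SUFFIX, WORD, </s> (assumed symbol inventory)


def token_class(token):
    """Map a token to a coarse class (illustrative classes, not from the paper)."""
    if re.fullmatch(r"\d{5}", token):
        return "ZIP"
    if token.isdigit():
        return "NUM"
    if token.lower().rstrip(".") in SUFFIXES:
        return "SUFFIX"
    return "WORD"


def classes(text):
    """Split on whitespace and commas, then classify each token."""
    return [token_class(t) for t in text.replace(",", " ").split()]


def keyword_match(text):
    """Uni-gram baseline: flag the text if any single token is a keyword."""
    return "SUFFIX" in classes(text)


def train(examples):
    """Count class bigrams and their left contexts over example addresses."""
    context, pairs = Counter(), Counter()
    for text in examples:
        seq = ["<s>"] + classes(text) + ["</s>"]
        context.update(seq[:-1])
        pairs.update(zip(seq, seq[1:]))
    return context, pairs


def bigram_score(model, text):
    """Average add-one-smoothed bigram probability of the class sequence."""
    context, pairs = model
    seq = ["<s>"] + classes(text) + ["</s>"]
    probs = [
        (pairs[(a, b)] + 1) / (context[a] + NEXT_SYMBOLS)
        for a, b in zip(seq, seq[1:])
    ]
    return sum(probs) / len(probs)


model = train(["221 Baker Street London 12345", "10 Main St Springfield 54321"])
address = "42 Elm Road Boston 02134"
not_address = "Please send the road map"
```

Both sample strings contain the keyword "road", so `keyword_match` flags both, while `bigram_score` ranks the address-like string higher because its class sequence resembles the training examples.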

The Design of Customized Board using the Web 2.0 (웹 2.0을 기반으로 한 맞춤형 게시판)

  • Park, Sung-Shin;Kim, Chang-Suk;Kim, Dae-Su
    • Journal of the Korean Institute of Intelligent Systems / v.17 no.6 / pp.773-779 / 2007
  • Internet bulletin boards have been used to exchange ideas and information among Internet users. However, existing Internet bulletin boards cannot satisfy each user's personal view. In this paper, a customized Internet bulletin board based on Web 2.0 is designed. The proposed bulletin board provides each user with personalized information according to settings the user establishes beforehand, so the user can quickly retrieve information of interest. Moreover, a user can generate a personalized bulletin board that automatically collects information of interest. The personalized bulletin board is connected to several Internet bulletin boards through RSS feeds.
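
The idea of a personalized board that collects posts of interest from several boards over RSS can be sketched roughly as follows. The feed content, the keyword-in-title matching rule, and the function names are hypothetical; the abstract does not describe how the actual system matches items to a user's settings.

```python
import xml.etree.ElementTree as ET

# Hypothetical RSS 2.0 feed exported by one bulletin board.
SAMPLE_FEED = """<rss version="2.0">
  <channel>
    <title>Example board</title>
    <item><title>XML editor released</title><link>http://example.com/1</link></item>
    <item><title>Weekend hiking trip</title><link>http://example.com/2</link></item>
  </channel>
</rss>"""


def matching_items(feed_xml, interests):
    """Return (title, link) pairs whose title contains any interest keyword."""
    root = ET.fromstring(feed_xml)
    wanted = [w.lower() for w in interests]
    results = []
    for item in root.iter("item"):
        title = item.findtext("title", default="")
        link = item.findtext("link", default="")
        if any(w in title.lower() for w in wanted):
            results.append((title, link))
    return results


def personalized_board(feeds, interests):
    """Merge the matching items from several board feeds into one list."""
    board = []
    for feed_xml in feeds:
        board.extend(matching_items(feed_xml, interests))
    return board
```

Calling `personalized_board([SAMPLE_FEED], ["xml"])` keeps only the post about the XML editor; a real system would fetch each feed over HTTP and would likely use richer preference settings than title keywords.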

The RFID Object Information Management System Design and Implementation For the RFID-Based Ubiquitous Applications (RFID 기반의 유비쿼터스 응용을 위한 RFID 객체정보 관리 시스템 설계 및 개발)

  • Park, Chan-Hee;Kim, Hak-Soo;Choi, Yun-Ho;Kim, Jong-Jin;Shin, Young-Jae;Kim, Jae-Hyung;Son, Jin-Hyun
    • Proceedings of the Korea Information Processing Society Conference / 2007.05a / pp.1424-1427 / 2007
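  • Recently, as RFID technology has advanced, the application areas of RFID have expanded from distribution and logistics systems to diverse fields such as robots and home network systems. This is because object information can be retrieved quickly and accurately using RFID tags and RFID readers; accordingly, research on effective extraction of object information for systems that use RFID networks has been actively pursued. In this context, this paper designs and develops an effective object information management system for systems that use RFID. It consists of an RFID gateway located outside the RFID network, and an application server, a conversion server, and an object information server that make up the RFID network.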

Design and implementation of low-power tracking device based on IEEE 802.11 (IEEE 802.11 기반 저전력 위치 추적 장치의 설계 및 구현)

  • Son, Sanghyun;Kim, Taewook;Baek, Yunju
    • Journal of the Korea Institute of Information and Communication Engineering / v.18 no.2 / pp.466-474 / 2014
  • As wireless network technology and mobile processor performance have improved, small wireless mobile devices such as smartphones have become widely used. Because mobile devices can use GPS information, services based on location information have increased. However, GPS cannot provide location information indoors or in signal-shaded environments, and tracking systems based on short-range wireless communication require infrastructure. An IEEE 802.11 based tracking system can estimate location using APs, but the tracking device drains battery power heavily. In this paper, we propose an IEEE 802.11 based low-power tracking system. We reduced the power consumed by channel scanning and network connection. For performance evaluation, we designed and implemented a tracking tag device and measured its power consumption. The simulation results confirmed that power consumption was reduced by 46% compared to the standard execution.

A Study on Skimming of News Article for an Efficient Browsing (효과적인 브라우징을 위한 뉴스 기사 요약에 관한 연구)

  • 이주호;정승도;조정원;최병욱
    • Proceedings of the IEEK Conference / 2000.09a / pp.219-222 / 2000
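  • To search the many kinds of video data efficiently, it is effective to analyze the data and first present the user with a summary of the entire video. This paper proposes a method for summarizing news articles that have been segmented into article units and presenting them effectively to the user, without showing each article in full and without distorting its content. Visual summary information is presented in the form of a film strip through anchor frame extraction and representative frame extraction, and the anchor's first line introducing the article is extracted from the closed captions and presented together with the film strip as a summary of the article's content. To extract anchor frames, we select the frames synchronized with the time intervals in which the "앵커:" ("Anchor:") tag appears in the closed captions. Representative frames are selected from frames in which open captions are present and frames synchronized with closed-caption keywords that have high frequency-based weights. Through experiments, we show that the proposed news article summarization system, by providing the user with a text summary based on the article content together with the visual frames, enables a user-centered intelligent summarization service compared with existing systems that provided only a film strip.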
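
The anchor-frame step described in this abstract, selecting frames synchronized with caption intervals that carry the "앵커:" tag, can be sketched as below. The `Caption` record, the timestamp representation in seconds, and the sample captions are assumptions for illustration; the paper's actual data formats are not given in the abstract.

```python
from dataclasses import dataclass

ANCHOR_TAG = "앵커:"  # speaker tag quoted in the abstract ("Anchor:")


@dataclass
class Caption:
    start: float  # seconds from the start of the broadcast (assumed unit)
    end: float
    text: str


def anchor_intervals(captions):
    """Time intervals of the closed captions spoken by the anchor."""
    return [
        (c.start, c.end)
        for c in captions
        if c.text.lstrip().startswith(ANCHOR_TAG)
    ]


def select_anchor_frames(frame_times, captions):
    """Keep the frame timestamps that fall inside any anchor interval."""
    intervals = anchor_intervals(captions)
    return [t for t in frame_times if any(s <= t <= e for s, e in intervals)]


def first_anchor_line(captions):
    """Text of the anchor's first caption, used as the article summary."""
    for c in captions:
        text = c.text.lstrip()
        if text.startswith(ANCHOR_TAG):
            return text[len(ANCHOR_TAG):].strip()
    return None


# Hypothetical captions for one news article.
captions = [
    Caption(0.0, 4.0, "앵커: Tonight's top story."),
    Caption(4.0, 9.0, "기자: Reporting from the scene."),
    Caption(30.0, 33.0, "앵커: In other news."),
]
frame_times = [1.0, 5.0, 31.0, 40.0]
```

With these sample values, `select_anchor_frames(frame_times, captions)` keeps the frames at 1.0 and 31.0 seconds, and `first_anchor_line(captions)` returns the anchor's opening sentence.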

Improved Internet Resource Recommendation Method using FOAF and SNA (FOAF와 SNA를 이용한 개선된 인터넷 자원 추천 방법)

  • Wang, Qing;Sohn, Jong-Soo;Chung, In-Jeong
    • The KIPS Transactions:PartB / v.19B no.3 / pp.165-176 / 2012
  • In recent years, with user-created Internet content increasing rapidly alongside the development of community-based websites, Internet resource recommendation systems have been attracting users' attention. However, most of these systems fail to properly reflect users' characteristics and thus have difficulty recommending appropriate resources. In this paper, we propose an Internet resource recommendation method using FOAF and SNA that fully reflects the characteristics of users. In our method, 1) we extract data about user characteristics and tags using FOAF; 2) we generate graphs representing users, user characteristics, and tags after inserting the data into three matrices and integrating them; 3) we recommend appropriate Internet resources after selecting the common characteristics of the recommended items and Hot tags by analyzing the social network. To verify the proposed method, we implemented it and used it to establish and analyze an experimental social group. Our experiments showed that the more users were added to the social network, the higher the quality of our recommendation results was relative to the item-based recommendation method. Using the idea suggested in this paper, more appropriate resources can be recommended to users while the explosively increasing Internet resources are retrieved effectively.

XML Document Analysis based on Similarity (유사성 기반 XML 문서 분석 기법)

  • Lee, Jung-Won;Lee, Ki-Ho
    • Journal of KIISE:Software and Applications / v.29 no.6 / pp.367-376 / 2002
  • XML allows users to define elements using arbitrary words and organize them in a nested structure. These features of XML offer both challenges and opportunities in information retrieval and document management. In this paper, we propose a new methodology for computing similarity that considers XML semantics: the meanings of the elements and the nested structures of XML documents. Using a thesaurus, we generate extended-element vectors to normalize synonyms, compound words, and abbreviations, and build a similarity matrix from them. We then compute the similarity between XML elements. We also discover and minimize XML structures using automata (NFA, Nondeterministic Finite Automata, and DFA, Deterministic Finite Automata). We compute the similarity between XML structures using the similarity matrix between elements and the minimized XML structures. Our methodology, which considers XML semantics, shows 100% accuracy in identifying the category of real documents from an on-line bookstore.
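
The thesaurus-based normalization of element names mentioned in this abstract can be illustrated with a simplified stand-in: element names are mapped to canonical terms and two documents are compared by the cosine similarity of their element-count vectors. The thesaurus entries, the use of cosine similarity, and all names below are assumptions; the paper's extended-element vectors, similarity matrix, and automata-based structure comparison are not reproduced here.

```python
import math
from collections import Counter

# Hypothetical thesaurus mapping synonyms, compound words, and
# abbreviations to a canonical element name.
THESAURUS = {
    "writer": "author",
    "auth": "author",
    "cost": "price",
    "booktitle": "title",
}


def normalize(name):
    """Lowercase an element name and map it to its canonical term."""
    key = name.lower()
    return THESAURUS.get(key, key)


def element_vector(element_names):
    """Count normalized element names occurring in one document."""
    return Counter(normalize(n) for n in element_names)


def cosine_similarity(a, b):
    """Cosine of the angle between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


# Element names from three hypothetical XML documents.
doc_a = ["title", "author", "price"]
doc_b = ["BookTitle", "Writer", "Cost"]
doc_c = ["track", "artist", "price"]
```

After normalization, `doc_a` and `doc_b` use the same three elements and score close to 1, whereas `doc_a` and `doc_c` share only `price` and score about one third. Without the thesaurus step, the first pair would share no element names at all.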

A Design and Implementation of EPCIS Repository for RFID and Sensor Data (RFID와 센서 데이터 처리를 위한 EPCIS 저장소 설계 및 구현)

  • Hyun, Seung-Ryul;Lee, Sang-Jeong
    • Journal of the Korea Society of Computer and Information / v.15 no.12 / pp.151-162 / 2010
  • To build a ubiquitous computing environment, much research has been conducted on automatic identification, sensor networks, home networks, and related areas. EPCIS (EPC Information Services), proposed by EPCglobal, is a standard for the repository that manages the tag data needed to develop RFID application systems. In this paper, an EPCIS repository is designed and implemented. It supports searching for objects based on both general object recognition and variations in environmental information. Sensor data, which is massive and changes with position, is integrated with RFID data in the system. This makes it possible to manage object recognition together with variations in the USN (Ubiquitous Sensor Network) environment.

A Korean Language Stemmer based on Unsupervised Learning (자율 학습에 의한 실질 형태소와 형식 형태소의 분리)

  • Jo, Se-Hyeong
    • The KIPS Transactions:PartB / v.8B no.6 / pp.675-684 / 2001
  • This paper describes a method for stemming Korean using unsupervised learning from a raw corpus. The technique does not require a lexicon or any language-specific knowledge. Since we use unsupervised learning, the time and effort required for learning are negligible. Unlike heuristic approaches that are theoretically ungrounded, this method is based on widely accepted statistical methods and can therefore be easily extended. The method is currently applied only to Korean, but because it is not language-dependent, it can easily be adapted to other agglutinative languages.
