• Title/Summary/Keyword: Tag-based information retrieval

An Automatic Web Page Classification System Using Meta-Tag (메타 태그를 이용한 자동 웹페이지 분류 시스템)

  • Kim, Sang-Il;Kim, Hwa-Sung
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.38B no.4
    • /
    • pp.291-297
    • /
    • 2013
  • The number of web pages, which carry all kinds of information, has grown drastically with the explosive increase in WWW usage. This has created a need for web page classification, which makes web pages easier to access and lets them be searched by group. Web page classification groups the pages scattered across the web according to document similarity or the keywords the documents contain. It can be applied to areas such as web page search, group search, and e-mail filtering. However, manual classification cannot cope with the enormous number of pages on the web, and automatic classification has an accuracy problem: it cannot distinguish pages written in different forms without classification errors. In this paper, we propose an automatic web page classification system that uses the meta-tags obtainable from web pages in order to solve the problem of inaccurate web page retrieval.
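
The idea in this abstract can be illustrated with a minimal sketch. It reads the `keywords` and `description` meta-tags of a page with Python's standard `html.parser` and picks the category whose vocabulary overlaps most with them. The category table and the overlap scoring are assumptions made for illustration; the abstract does not describe the classifier the paper actually uses.

```python
from html.parser import HTMLParser
import re


class MetaTagParser(HTMLParser):
    """Collects the words found in <meta name="keywords"> and <meta name="description">."""

    def __init__(self):
        super().__init__()
        self.terms = set()

    def handle_starttag(self, tag, attrs):
        if tag != "meta":
            return
        attr = dict(attrs)
        if (attr.get("name") or "").lower() in {"keywords", "description"}:
            content = attr.get("content") or ""
            self.terms.update(re.findall(r"\w+", content.lower()))


# Hypothetical category vocabulary, not taken from the paper.
CATEGORIES = {
    "sports": {"football", "league", "score", "match"},
    "finance": {"stock", "market", "bank", "investment"},
}


def classify(html):
    """Return the category sharing the most terms with the page's meta-tags, or None."""
    parser = MetaTagParser()
    parser.feed(html)
    best, best_overlap = None, 0
    for name, vocab in CATEGORIES.items():
        overlap = len(vocab & parser.terms)
        if overlap > best_overlap:
            best, best_overlap = name, overlap
    return best


page = '<html><head><meta name="keywords" content="stock, market, news"></head></html>'
print(classify(page))
```

A page without usable meta-tags yields `None` here, which is one reason a practical system would likely combine meta-tags with other evidence.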

Construction of Folksonomy-Based Microcontents Using Upper Ontology Modeling (상위온톨로지 모델링을 이용한 폭소노미 기반 마이크로컨텐츠 구축)

  • Lee, Seung-Min
    • Journal of the Korean Society for Information Management
    • /
    • v.28 no.4
    • /
    • pp.161-182
    • /
    • 2011
  • Metadata and folksonomy are the two main approaches to representing, organizing, and retrieving resources in the current information environment. Many researchers have studied ways to combine metadata and folksonomy in order to exploit the strengths of both. This research proposes an approach that uses both metadata and folksonomy to represent resources by means of microcontents. Microcontents in this research is a conceptual structure that reflects the dynamic characteristics of folksonomy and the structure of metadata. By connecting folksonomy with metadata through this microcontents structure, both approaches can maximize their strengths and minimize their weaknesses in representing, organizing, and retrieving resources.

Korean Lexical Disambiguation Based on Statistical Information (통계정보에 기반을 둔 한국어 어휘중의성해소)

  • 박하규;김영택
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.19 no.2
    • /
    • pp.265-275
    • /
    • 1994
  • Lexical disambiguation is one of the most basic tasks in natural language processing applications such as speech recognition/synthesis, information retrieval, and corpus tagging. This paper describes a Korean lexical disambiguation mechanism in which disambiguation is performed on the basis of statistical information collected from corpora. In this mechanism, token tags corresponding to the results of morphological analysis are used instead of part-of-speech tags in order to disambiguate in finer detail. The proposed lexical selection function shows considerably high accuracy because it reflects lexical characteristics of Korean such as the concordance of endings and postpositions. Two disambiguation methods, a unique selection method and a multiple selection method, are provided so that they can be applied appropriately according to the application area.

A Noun Extractor using Connectivity Information (좌우접속정보를 이용한 명사추출기)

  • An, Dong-Un
    • Annual Conference on Human and Language Technology
    • /
    • 1999.10d
    • /
    • pp.173-178
    • /
    • 1999
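  • The noun extractor in this paper is an index-term extractor for information retrieval systems. It extracts nouns from the morphemes obtained through morphological analysis based on left-right connectivity information. The morphological analyzer separates the linguistic knowledge used for analysis from the eojeol (word-phrase) segmentation engine, which makes it easy to modify and extend. The linguistic knowledge is the left-right connectivity information, a matrix that records whether the parts of speech of the morphemes making up an eojeol can connect to each other. The segmentation engine consults a dictionary to split an eojeol into morphemes by longest match, and consults the connectivity information to judge whether the segmentation is correct. The part-of-speech classification of morphemes is based on a standard tag set, extended with syllable information. Recall was poor because, when the analysis produced unregistered words, there was no module to estimate nouns from them.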
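
As a rough sketch of the approach this abstract describes, the code below segments an eojeol (a Korean word-phrase) by longest dictionary match and accepts a split only if adjacent part-of-speech tags are allowed to connect. The toy dictionary, the tag names, the `^`/`$` boundary markers, and the backtracking to shorter matches are all assumptions for illustration, not details from the paper.

```python
# Toy dictionary: surface form -> tag (N = noun, J = postposition, V = verb stem, E = ending).
DICTIONARY = {
    "학교": "N",
    "학": "N",
    "에서": "J",
    "에": "J",
    "가": "V",
    "다": "E",
}

# Connectivity "matrix" stored as a set of allowed (left tag, right tag) pairs.
# "^" marks the start of the eojeol and "$" its end.
CONNECTABLE = {
    ("^", "N"), ("^", "V"),
    ("N", "J"), ("N", "$"),
    ("J", "$"),
    ("V", "E"), ("E", "$"),
}


def segment(eojeol, start=0, prev_tag="^"):
    """Split an eojeol into (morpheme, tag) pairs, trying the longest dictionary match first.

    Returns None when no segmentation satisfies the connectivity constraints,
    for example when the eojeol contains an unregistered word.
    """
    if start == len(eojeol):
        return [] if (prev_tag, "$") in CONNECTABLE else None
    for end in range(len(eojeol), start, -1):  # longest candidate first
        surface = eojeol[start:end]
        tag = DICTIONARY.get(surface)
        if tag is None or (prev_tag, tag) not in CONNECTABLE:
            continue
        rest = segment(eojeol, end, tag)
        if rest is not None:
            return [(surface, tag)] + rest
    return None


def extract_nouns(eojeol):
    """Return the noun morphemes of an eojeol, or an empty list if analysis fails."""
    morphemes = segment(eojeol) or []
    return [surface for surface, tag in morphemes if tag == "N"]


print(extract_nouns("학교에서"))  # noun followed by a postposition
print(extract_nouns("컴퓨터가"))  # "컴퓨터" is not in the toy dictionary
```

The second call returns an empty list because the sketch, like the system in the abstract, has no fallback for unregistered words, which is the source of the low recall reported there.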

Multimedia Contents Recommendation Method using Mood Vector in Social Networks (소셜네트워크에서 분위기 벡터를 이용한 멀티미디어 콘텐츠 추천 방법)

  • Moon, Chang Bae;Lee, Jong Yeol;Kim, Byeong Man
    • Journal of Korea Society of Industrial Information Systems
    • /
    • v.24 no.6
    • /
    • pp.11-24
    • /
    • 2019
  • The tendency of buyers of web information is shifting from cost-effectiveness to cost-satisfaction. The same tendency appears in multimedia content recommendation, where some services offer folksonomy-based recommendation using mood. However, these services do not consider synonyms. Some studies have addressed this problem by defining the 12 moods of the Thayer model as AV (Arousal and Valence) values, but their recommendation performance is lower than that of a keyword-based method at recall level 0.1. In this paper, we propose a method that uses the mood vector of multimedia contents. The method solves the synonym problem while maintaining the same performance as the keyword-based method even at recall level 0.1. For performance analysis, we compare the proposed method with an existing AV-value-based method and a keyword-based method. The results show that the proposed method outperforms the existing methods.

IEEE 802.11-based Power-aware Location Tracking System (저전력을 고려한 IEEE 802.11 기반 위치 추적 시스템)

  • Son, Sang-Hyun;Baik, Jong-Chan;Baek, Yun-Ju
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.37 no.7B
    • /
    • pp.578-585
    • /
    • 2012
  • In an IEEE 802.11-based wireless network environment, a location tracking system using GPS and Wi-Fi is available at no additional cost, and it is useful for many outdoor applications. However, previous systems used general-purpose devices as tags. These are unsuitable for power-aware location tracking because general-purpose devices are more expensive and are not optimized for tracking. The hand-off method of the IEEE 802.11 standard also does not sufficiently consider power consumption. This thesis analyzes previous location tracking systems and proposes a power-aware system. First, we design and implement a tag optimized for location tracking. Next, we propose a low-power hand-off method and a low-power behavior model for the implemented tag. The proposed hand-off method addresses the power problem by using location information, and the behavior model minimizes the tag's power consumption through a power-saving mode and the concept of a duty cycle. To evaluate the proposed methods and the system's performance, we perform simulations and experiments in a real environment, and we calculate the tag's power consumption from the measured current consumption of each operation. In simulation, the proposed behavior model and hand-off method reduced power consumption by about 98% and 59%, respectively, compared with the default behavior model and the standard's hand-off.
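
The abstract says the tag's power consumption is computed from the measured current of each operation and reduced with a duty cycle. A minimal sketch of that kind of calculation follows: average power is the supply voltage times the time-weighted average current over one cycle. The function name, the supply voltage, and all current and duration values are hypothetical and are not measurements from the paper.

```python
def average_power_mw(phases, supply_voltage_v):
    """Average power in milliwatts over one duty cycle.

    phases: list of (current_ma, duration_s) tuples, one per operation.
    """
    total_time = sum(duration for _, duration in phases)
    if total_time <= 0:
        raise ValueError("cycle must have positive duration")
    charge_mas = sum(current * duration for current, duration in phases)
    return supply_voltage_v * charge_mas / total_time


# Hypothetical figures for illustration only.
always_on = [(200.0, 10.0)]                # radio active for the whole 10 s cycle
duty_cycled = [(200.0, 1.0), (2.0, 9.0)]   # 1 s active, 9 s in power-saving mode

p_on = average_power_mw(always_on, 3.0)
p_dc = average_power_mw(duty_cycled, 3.0)
print(f"always on: {p_on:.1f} mW, duty cycled: {p_dc:.1f} mW")
print(f"reduction: {100 * (1 - p_dc / p_on):.1f}%")
```

With a short active phase, the sleep current dominates the average, which is why a duty-cycled behavior model can cut consumption so sharply.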

An Integrated Design of Middleware and EPCIS for RFID and Sensor Data (RFID와 센서 데이터 처리를 위한 미들웨어와 EPCIS 통합 설계)

  • Hyun, Seung-Ryul;Lee, Sang-Jeong
    • Journal of the Korea Society of Computer and Information
    • /
    • v.17 no.1
    • /
    • pp.193-202
    • /
    • 2012
  • RFID tag recognition information and sensor data change continuously and are categorized by location. The two are similar in that both are massive data that change over time. If the two kinds of data are managed together, object recognition can be combined with changes in the environment. If RFID middleware and an EPCIS repository are implemented as one integrated system, the functions of the middleware and the repository can be used at the same time, and recognition information can be retrieved in real time without the step of fetching it from another middleware. In this paper, the system continuously reads information from RFID readers and sensor equipment and stores it in a database, so that general object recognition and object retrieval that depends on changes in environmental information become possible in real time. An ALE-compliant middleware and an EPCIS repository, following the standards proposed by EPCglobal, are designed and implemented to handle RFID and sensor data on the basis of the collected data.

Design and Implementation of a Query Processing Algorithm for Distributed Semistructured Documents Retrieval with Metadata Interface (메타데이타 인터페이스를 이용한 분산된 반구조적 문서 검색을 위한 질의처리 알고리즘 설계 및 구현)

  • Choe Cuija;Nam Young-Kwang
    • Journal of KIISE:Software and Applications
    • /
    • v.32 no.6
    • /
    • pp.554-569
    • /
    • 2005
  • For semistructured distributed documents, it is very difficult to formalize and implement a query processing system because the data lack structure and rules. To retrieve and process heterogeneous semistructured documents precisely, the system must handle multiple mappings such as 1:1, 1:N, and N:1 on an element simultaneously and generate the schema from the distributed documents. In this paper, we propose a query processing algorithm for querying and answering over heterogeneous semistructured data or documents in distributed systems, and implement it with a metadata interface. The algorithm for generating local queries from a global query consists of mapping between global and local nodes, data transformation according to the mapping types, path substitution, and resolution of the heterogeneity among nodes for a global input query using metadata information. The mapping, transformation, and path substitution algorithms between the global schema and the local schemas are implemented in a metadata interface called DBXMI (for Distributed Documents XML Metadata Interface). Nodes that share a name but differ in mapping or meaning are resolved by automatically extracting node identification information from the local schema. The system uses Quilt as its XML query language. An experiment is reported on three different OEM-model semistructured restaurant documents. The prototype system was developed on Windows with Java and the JavaCC compiler.

Efficient Browsing Method based on Metadata of Video Contents (동영상 컨텐츠의 메타데이타에 기반한 효율적인 브라우징 기법)

  • Chun, Soo-Duck;Shin, Jung-Hoon;Lee, Sang-Jun
    • Journal of KIISE:Computing Practices and Letters
    • /
    • v.16 no.5
    • /
    • pp.513-518
    • /
    • 2010
  • The advancement of information technology, along with the proliferation of communication and multimedia, has increased the demand for digital contents. Video data in digital contents such as VOD, NOD, Digital Library, IPTV, and UCC are spreading into more and more application fields. Besides carrying spatial and temporal information in a 3D format, video data are sequential, which makes searching and browsing inefficient because of long turnaround times. In this paper, we suggest ATVC (Authoring Tool for Video Contents) to solve this issue. ATVC is a video editing tool that detects key frames using visual rhythm and inserts metadata such as keywords into key frames via XML tagging. Visual rhythm maps 3D spatial and temporal information to 2D information. It is fast because it obtains pixel information without IDCT, and it can classify edit effects such as cut, wipe, and dissolve. Because the XML data store key frame information and keyword information via XML tags, they enable efficient browsing.
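
To make the visual rhythm idea concrete, here is a minimal sketch that turns a video into a 2D image by sampling one line of pixels per frame, then flags abrupt cuts where adjacent columns differ sharply. Sampling along the frame diagonal, working on decoded grayscale pixels rather than compressed data, and the threshold value are assumptions for illustration; the abstract does not specify how ATVC samples frames or detects effects.

```python
def visual_rhythm(frames):
    """Map a video (list of 2D grayscale frames) to a 2D image.

    Each frame contributes one column: the pixels sampled along its main diagonal.
    Returns a list of columns, one per frame.
    """
    columns = []
    for frame in frames:
        height, width = len(frame), len(frame[0])
        n = min(height, width)
        columns.append([frame[i * height // n][i * width // n] for i in range(n)])
    return columns


def detect_cuts(rhythm, threshold):
    """Return frame indices whose mean absolute change from the previous column exceeds threshold."""
    cuts = []
    for t in range(1, len(rhythm)):
        prev, cur = rhythm[t - 1], rhythm[t]
        diff = sum(abs(a - b) for a, b in zip(prev, cur)) / len(cur)
        if diff > threshold:
            cuts.append(t)
    return cuts


def solid_frame(value, height=4, width=6):
    """Hypothetical test frame filled with a single gray level."""
    return [[value] * width for _ in range(height)]


# Three dark frames followed by three bright frames: one abrupt cut.
video = [solid_frame(10)] * 3 + [solid_frame(200)] * 3
print(detect_cuts(visual_rhythm(video), threshold=50))
```

Gradual effects such as wipes and dissolves would appear as slanted or blended regions in the rhythm image and need more than a single-column difference to classify.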

A Study of Extension of the EJB Deployment Descriptor File with XSchema (XSchema를 이용한 EJB 배치설명파일의 확장 방안 연구)

  • 공재원;심우곤;백인섭
    • Proceedings of the Korean Information Science Society Conference
    • /
    • 2001.10a
    • /
    • pp.400-402
    • /
    • 2001
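  • Components are recognized as a core technology for software reuse, and a large number of components are currently being developed and used [9]. Searching among the many components for one already judged suitable for a particular domain is an essential step [6], and this requires an accurate specification of each component. EJB ver1.1 from Sun, the component model addressed in this paper, describes its deployment descriptor in XML and validates it with a DTD. However, a DTD is limited in the data types it can express, and because a single XML document cannot have multiple DTD files, it is also weak in extensibility. To solve this, we converted the DTD to XSchema. In addition, because the current EJB deployment descriptor lacks ways to express the composition and dependencies of components, we define new tags based on the attributes of component contracts to compensate.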
