• Title/Summary/Keyword: Web Document

Search Result 757, Processing Time 0.025 seconds

Research on Function and Policy for e-Government System using Semantic Technology (전자정부내 의미기반 기술 도입에 따른 기능 및 정책 연구)

  • Go, Gwang-Seop;Jang, Yeong-Cheol;Lee, Chang-Hun
    • 한국디지털정책학회:학술대회논문집
    • /
    • 2007.06a
    • /
    • pp.79-87
    • /
    • 2007
  • This paper aims to offer a solution based on semantic document classification to improve e-Government utilization and efficiency for people using their own information retrieval system and linguistic expression Generally, semantic document classification method is an approach that classifies documents based on the diverse relationships between keywords in a document without fully describing hierarchial concepts between keywords. Our approach considers the deep meanings within the context of the document and radically enhances the information retrieval performance. Concept Weight Document Classification(CoWDC) method, which goes beyond using exist ing keyword and simple thesaurus/ontology methods by fully considering the concept hierarchy of various concepts is proposed, experimented, and evaluated. With the recognition that in order to verify the superiority of the semantic retrieval technology through test results of the CoWDC and efficiently integrate it into the e-Government, creation of a thesaurus, management of the operating system, expansion of the knowledge base and improvements in search service and accuracy at the national level were needed.

  • PDF

Web Document-based Associate Knowledge Extraction Method : Applying to Bioinformatics (웹 도큐먼트 기반 연관 지식 추출 기법 : 생명정보분야에의 적용)

  • 문현정;김교정
    • Journal of Internet Computing and Services
    • /
    • v.2 no.5
    • /
    • pp.9-19
    • /
    • 2001
  • In this paper. we develop associate knowledge extraction method for finding and expanding user preference knowledge automatically from web document database. To reflect user interest or preferences, agent explores and extracts relevant information to central term involving the intent of users from the example documents. To do so, we apply association rule exploration data-mining method to the extraction of the relevant objects in the web documents. Also, to give the weighted-value to the extracted and relevant information, we present associate tag block-based weighting method. We applied to bioinformatics above associate knowledge extraction method to find related keywords.

  • PDF

Design and Implementation of Web-based Software Engineering Tool for Robot (웹 기반 로봇 소프트웨어 공학 도구 설계 및 구현)

  • Hong, Chang-Ho;Park, Hong-Seong
    • Journal of Institute of Control, Robotics and Systems
    • /
    • v.17 no.9
    • /
    • pp.908-915
    • /
    • 2011
  • As the requirement of user for robot functionality, the function and interface for controlling the robot system is more sophisticated and complicated. Accordingly development process of robot is more complicated and it takes much longer time to develop a robot system. Software development using project management tool is more important in software engineering because of the complexity of software, especially robot system. This paper proposes SEED (Software Engineering Equipment for Development), which is a web-based and integrated software engineering tool to provide independent tools for robot software development. SEED includes the document management tool, the software configuration management tool, the software testing tool on developing robot software and provide a functionality of collaborated and remote development due to WEB-based operations.

Design of Web Agent Using User Profile and Automatic Document Categorization (사용자 정보와 자동 문서 분류를 이용한 웹 에이전트의 설계)

  • Lee, Seung-Won;Kwon, Young-Hoon;Ryu, Je;Han, Kwang-Rok
    • Proceedings of the IEEK Conference
    • /
    • 1999.06a
    • /
    • pp.407-410
    • /
    • 1999
  • WWW is an important method for retrieving or providing informations. Not only the amount of information but also it is widely located on the web, it is difficult for users to get or search information. Furthermore, to use search engine is also inconvenient, because it just uses a keyword without concerning a user's interest. At this point, we propose a design of web agent that uses the automatic document categorization system and user's profile concerning with a user's interest, so the agent can actively provide a information.

  • PDF

An Index System using Restrictive Distance (거리 제한을 이용한 색인 시스템)

  • Park, Chan-Ee;Kim, Sang-Bok
    • Journal of the Korea Society of Computer and Information
    • /
    • v.11 no.1 s.39
    • /
    • pp.273-282
    • /
    • 2006
  • In this paper, we propose index method introducing distance concept in word by a method weighting word. This index method is frequent representing an inquiry word and document index and compound noun or more than two adjoin nouns or noun phrase, the farther the distance between these nouns, the fewer selected ratio decreases in index point is the aiming, this choose guide word candidate by existent weight grant method and distance between candidates chose candidate finally in index within 3 sentences. Using in these way I document of 100 kinds of newspaper, scientific treatise, web document and so on, showed the correctness rate resulted of newspaper 92.03% scientific treatise 95% web document 73.33%.

  • PDF

A Corpus Construction System of Consistent Document Categorization and Keyword Extraction (일관성 있는 문서분류 및 키워드 추출을 위한 말뭉치 구축도구)

  • Jeong, Jae-Cheol;Park, So-Young;Chang, Ju-No;Kihl, Tae-Suk
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2010.10a
    • /
    • pp.675-676
    • /
    • 2010
  • As the number of documents rapidly increases in the web environment, the efficient document classification approaches have been required to retrieve the desired information from too many documents. In this paper, we propose a corpus construction tool to annotate document classification information such as category, keywords, and usage to each product description document. The proposed tool can help a human annotator to correctly identify this information by providing the verification step to check the input results of other human annotators. Also, the human annotator can construct the corpus anytime anywhere by using the web-based proposed system.

  • PDF

Combining Multiple Sources of Evidence to Enhance Web Search Performance

  • Yang, Kiduk
    • Journal of Korean Library and Information Science Society
    • /
    • v.45 no.3
    • /
    • pp.5-36
    • /
    • 2014
  • The Web is rich with various sources of information that go beyond the contents of documents, such as hyperlinks and manually classified directories of Web documents such as Yahoo. This research extends past fusion IR studies, which have repeatedly shown that combining multiple sources of evidence (i.e. fusion) can improve retrieval performance, by investigating the effects of combining three distinct retrieval approaches for Web IR: the text-based approach that leverages document texts, the link-based approach that leverages hyperlinks, and the classification-based approach that leverages Yahoo categories. Retrieval results of text-, link-, and classification-based methods were combined using variations of the linear combination formula to produce fusion results, which were compared to individual retrieval results using traditional retrieval evaluation metrics. Fusion results were also examined to ascertain the significance of overlap (i.e. the number of systems that retrieve a document) in fusion. The analysis of results suggests that the solution spaces of text-, link-, and classification-based retrieval methods are diverse enough for fusion to be beneficial while revealing important characteristics of the fusion environment, such as effects of system parameters and relationship between overlap, document ranking and relevance.

Web-based XML Tree Builder for XML Service Program Development (XML 웹 서비스 프로그램 개발을 위한 웹 기반의 XML Tree 생성)

  • Park Young-Soo;Chang Duk-Chul
    • The KIPS Transactions:PartD
    • /
    • v.12D no.5 s.101
    • /
    • pp.765-772
    • /
    • 2005
  • For the excellent U web services, the program developers should create both user's client program and the sewer program Recently, the structural alteration of programs are often caused by infernal and external environment concerned with social trends and preferences as well as user's requirements expectations. So, for reducing the development time and workload of XML web service programs, it is necessary to develop the web program generation tools. This paper suggest how to design and implement the web-based XML Tree builder, which was able to provide the information for promptly and easily generating the XML Schema, XML Stylesheet and XML Document through the XML Development Supporting Server System.

Semantic Topic Selection Method of Document for Classification (문서분류를 위한 의미적 주제선정방법)

  • Ko, kwang-Sup;Kim, Pan-Koo;Lee, Chang-Hoon;Hwang, Myung-Gwon
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.11 no.1
    • /
    • pp.163-172
    • /
    • 2007
  • The web as global network includes text document, video, sound, etc and connects each distributed information using link Through development of web, it accumulates abundant information and the main is text based documents. Most of user use the web to retrieve information what they want. So, numerous researches have progressed to retrieve the text documents using the many methods, such as probability, statistics, vector similarity, Bayesian, and so on. These researches however, could not consider both the subject and the semantics of documents. As a result user have to find by their hand again. Especially, it is more hard to find the korean document because the researches of korean document classification is insufficient. So, to overcome the previous problems, we propose the korean document classification method for semantic retrieval. This method firstly, extracts TF value and RV value of concepts that is included in document, and maps into U-WIN that is korean vocabulary dictionary to select the topic of document. This method is possible to classify the document semantically and showed the efficiency through experiment.

Dynamic recomposition of document category using user intention tree (사용자 의도 트리를 사용한 동적 카테고리 재구성)

  • Kim, Hyo-Lae;Jang, Young-Cheol;Lee, Chang-Hoon
    • The KIPS Transactions:PartB
    • /
    • v.8B no.6
    • /
    • pp.657-668
    • /
    • 2001
  • It is difficult that web documents are classified with exact user intention because existing document classification systems are based on word frequency number using single keyword. To improve this defect, first, we use keyword, a query, domain knowledge. Like explanation based learning, first, query is analyzed with knowledge based information and then structured user intention information is extracted. We use this intention tree in the course of existing word frequency number based document classification as user information and constraints. Thus, we can classify web documents with more exact user intention. In classifying document, structured user intention information is helpful to keep more documents and information which can be lost in the system using single keyword information. Our hybrid approach integrating user intention information with existing statistics and probability method is more efficient to decide direction and range of document category than existing word frequency approach.

  • PDF