• Title/Summary/Keyword: HTML Documents

Search Result 149, Processing Time 0.024 seconds

Storing XML Documents using Oracle8i XDK (Oracle8i XDK를 이용한 XML 문서의 저장)

  • 하상호;이강석;백인천
    • Proceedings of the Korea Multimedia Society Conference
    • /
    • 2000.04a
    • /
    • pp.324-327
    • /
    • 2000
  • XML은 웹 상에서 데이터의 원활한 교환을 위해서 HTML을 보완하여 설계된 차세대 인터넷문서작성용언어이다. XML 문서와 같은 반구조(semistructured) 의 특성을 갖는 데이터를 효과적으로 다루기 위한 새로운 데이터모델과 질의어가 제안되어 오고 있지만, 여기서는 관계형 데이터베이스에 XML 문서를 효과적으로 저장하는 방법에 관해서 논의한다. 먼저, 도서를 표현하는 XML 문서를 위한 DTD를 제시하고, 이 DTD를 관계 테이블로 변환하는 방법을 논의한다. 다음에는 Oracle서 지원하는 XDK를 이용하여 XML문서를 Oracle8i DB에 저장하는 방법에 대해서 논의한다.

  • PDF

A Categorization Model Based On Information Structure of HTML Documents (구조 정보를 이용한 웹 문서 범주화 모형)

  • 조이영;최상희;정영미
    • Proceedings of the Korean Society for Information Management Conference
    • /
    • 2000.08a
    • /
    • pp.147-152
    • /
    • 2000
  • 본 연구는 다양한 웹 문서를 효과적으로 범주화 할 수 있는 모형을 구축하는데 그 목적이 있다. 이를 위해 본 연구에서는 웹 문서가 가지고 있는 구조 정보인 링크(link)와 문서 단계(level)를 활용하여 문서 유형을 식별한 후, 각 유형별로 범주화 과정을 달리 적용하여 범주화 성능을 개선시키는 방법을 고안하였다.

  • PDF

The Development of ORED, a Web-Based Educational System for Operations Research (웹 기반 경영과학 교육 시스템 ORED의 개발)

  • 박순달;임성묵;도승용;이승석;김호동
    • Korean Management Science Review
    • /
    • v.19 no.1
    • /
    • pp.89-106
    • /
    • 2002
  • ORED is a Web-Based educational system for operations research. It consists of operations research theories, help system for theories, cases, application programs and management system. Users can study theories and cases through HTML documents and solve Problems with java applet and servelet programs. The help system provides users with detailed explanations of theories. And the management system provides the administrator with efficient tools necessary for managing the ORED In the Web.

A Character Recognition on Complex Color Documents (복잡한 컬러 문서에 대한 문자인식)

  • 양철용;김갑기;김진욱;김항준
    • Proceedings of the Korea Institute of Convergence Signal Processing
    • /
    • 2000.08a
    • /
    • pp.233-236
    • /
    • 2000
  • 최근 수많은 인쇄된 문서들이 HTML과 같은 디지털 문서로 바뀌고 있으며 이를 자동으로 변환해 주는 문자인식 기술에 대한 관심이 증가하고 있다. 본 논문에서는 그림과 글자가 공존하는 문서에서 자동으로 문자영역을 추출해서 문자를 인식하는 방법을 제안한다. 우선 입력문서는 유사한 칼라로 이루어진 영역들로 나누어진 뒤 휴리스틱 룰에 의해 문자후보 영역과 비 문자 영역으로 나누어진다. 그 다음 이들 문자후보영역들은 문자인식기를 이용하여 문자 혹은 문자의 일부분으로 인식된다. 제안된 방법으로 여러 문서들에 대하여 실험한 결과를 보이며 그 성능을 평가한다.

  • PDF

Judging Translated Web Document & Constructing Bilingual Corpus (웹 번역문서 판별과 병렬 말뭉치 구축)

  • Jee-hyung, Kim;Yill-byung, Lee
    • Proceedings of the Korean Information Science Society Conference
    • /
    • 2004.10a
    • /
    • pp.787-789
    • /
    • 2004
  • People frequently feel the need of a general searching tool that frees from language barrier when they find information through the internet. Therefore, it is necessary to have a multilingual parallel corpus to search with a word that includes a search keyword and has a corresponding word in another language, Multilingual parallel corpus can be built and reused effectively through the several processes which are judgment of the web documents, sentence alignment and word alignment. To build a multilingual parallel corpus, multi-lingual dictionary should be constructed in each language and HTML should be simplified. And by understanding the meaning and the statistics of document structure, judgment on translated web documents will be made and the searched web pages will be aligned in sentence unit.

  • PDF

Adaptive User Profile for Information Retrieval from the Web

  • Srinil, Phaitoon;Pinngern, Ouen
    • 제어로봇시스템학회:학술대회논문집
    • /
    • 2003.10a
    • /
    • pp.1986-1989
    • /
    • 2003
  • This paper proposes the information retrieval improvement for the Web using the structure and hyperlinks of HTML documents along with user profile. The method bases on the rationale that terms appearing in different structure of documents may have different significance in identifying the documents. The method partitions the occurrence of terms in a document collection into six classes according to the tags in which particular terms occurred (such as Title, H1-H6 and Anchor). We use genetic algorithm to determine class importance values and expand user query. We also use this value in similarity computation and update user profile. Then a genetic algorithm is used again to select some terms from user profile to expand the original query. Lastly, the search engine uses the expanded query for searching and the results of the search engine are scored by similarity values between each result and the user profile. Vector space model is used and the weighting schemes of traditional information retrieval were extended to include class importance values. The tested results show that precision is up to 81.5%.

  • PDF

Cognitive Based Context Aware Reference History Management Tool

  • Punithan, Dharani;McKay, Bob
    • 한국HCI학회:학술대회논문집
    • /
    • 2009.02a
    • /
    • pp.227-231
    • /
    • 2009
  • The aim of the research is to focus on the cognitive principles and to achieve human-level intelligence in referring context based browser history and the Windows history. One of the major problems faced by today's computer users is insufficient and single exclusive context based reference of the browser history and the Windows history. Today we search for the browser history and Windows history in different places even though the context is the same. For e.g., When working on a research paper or preparing a business presentation, a user may require to refer many web sites on the internet and various documents on the local computer. The browser can provide only time based history. The windows document history is also time based and limited to list only few documents. Hence, we propose a tool "Cognitive Based Context Aware Reference History Management Tool" which helps to access the exclusive reference of context and time based history in one place. The tool also proposes to store image history with urls and classifies images of a specific topic accessed in different time, bookmarks management and cross browser history management. These features are very useful as we can access all related documents (doc, docx, ppt, pptx, pdf, txt, and html), web pages, images and bookmarks in one place. The tool uses the cognitive principles like classification and association to achieve the purpose.

  • PDF

A XML/EDI System for Maritime Export Customs Clearance

  • Kim, Hyun S.;Park, Nam K.;Hyung R. Chol
    • Proceedings of the Korea Inteligent Information System Society Conference
    • /
    • 2001.01a
    • /
    • pp.45-49
    • /
    • 2001
  • Korean government and companies have given a lot of their efforts to exchange electronic documents between themselves and their partners. As the results of them. Korean EDI standards were made by Korean EDIFACT Committee and the standards have been used by companies and governmental organization in Korea. However, Korean export customs clearance EDI system is based on VAN(Value Added Network) and one VAN company ha monopolistic right to relay EDI documents to Korean Customs Service. Therefor is leads to a lot of problems such as inconvenient software, expensive transmission fee and the difficulty of connection with the in-house systems of user companies. To solve these problems, a few good solutions and systems have been suggested and one of them is the Internet EDI. we will suggest a new export customs clearance EDI system running on the Web. This system is basically an Internet EDI system, but we have developed this system using XML instead of HTML, XML is a new markup language with merit such as isolating data from style of documents. This system consists of 7 modules, schema/style/template management, XML/EDI document management, XML/EDI transformation, EDI transmission, certification management and log management. Also this system can be used with other traditional EDI systems that have UN/EDIFACT standards. We will discuss the advantages and disadvantages of XML/EDI system for customs clearance. The development of this system will be a leading study for XML/EDI standards in export clearance EDI system.

  • PDF

XML-based Retrieval System for SCORM-based Virtual Learning Contents (SCORM 기반의 XML 학습 컨텐츠 검색 시스템)

  • Choi, Byung-Uk;Song, Mi-Sook;Cho, Jung-Won
    • The Journal of Korean Association of Computer Education
    • /
    • v.6 no.1
    • /
    • pp.9-17
    • /
    • 2003
  • XML(eXtensible Markup Language), next generation internet standard language has the advantage of easy re-use and re-structure in other computing environment because it has the separate data, presentation and structure. In this paper, we implement the efficient retrieval system for the general user by limiting the XML documents on the multimedia learning contents for the virtual education system. The system design is based on SCO Metadata unit defined in SCORM as the proposed virtual education standard. Each XML documents has three indexes - keyword, element and attribute. Also, it makes possible to retrieve data without previous knowledge of the DTD by making the element retrieval screen structure for the user interface. And it gives the user various result screen formats such as XML and HTML by restructuring the retrieval result through XML-QL and XSL, respectively.

  • PDF

A Web Services-based Client OLAP API and Its Application to Cube Browsing (웹 서비스 기반의 클라이언트 OLAP API와 큐브 브라우징에의 응용 사례)

  • Bae, Eun-Ju;Kim, Myung
    • The KIPS Transactions:PartD
    • /
    • v.10D no.1
    • /
    • pp.143-152
    • /
    • 2003
  • XML and Web Services draw a lot of attention as standard technologies for data exchange and integration among heterogeneous platforms XML/A, which supports such technologies, is a SOAP based XML APl that facilitates data exchange between a client application and a data analysis engine through the Internet. The fact that the XML format is used for data exchange makes XML/A to be platform-independent. However. client application developers have to go through a tedious Job of treating the same type of XML documents fur downloading data from the server. Also, an XML query language is needed for extracting data from the XML documents sent by the server. In this paper, we present a high level client OLAP API, called DXML, for the client application developers in the windows environment to easily use the OLAP services of XML/A. XMLMD consists of properties and methods needed for OLAP application development. XMLMD is to XML/A what ADOMD is to OLEDB for OLAP. We also present a web OLAP cube browser that is developed using XMLMD. The browser display's data in various formats such as XML, HTML, Excel, and graph.