• Title/Summary/Keyword: HTML Documents

Search Result 149, Processing Time 0.022 seconds

XML Documents Transcoding using Semantic Annotation (시맨틱 어노테이션을 이용한 XML 문서 트랜스코딩)

  • 이진상;송특섭;손원성;고승규;임순범;최윤철
    • Proceedings of the Korean Information Science Society Conference
    • /
    • 2004.04b
    • /
    • pp.523-525
    • /
    • 2004
  • 기존의 웹 컨텐츠를 휴대폰이나 poA등과 같은 개인용 단말기에 표현하기에는 단말기 성능상의 제약(낮은 CPU 성능, 작은 출력 화면, 입출력 방법의 단순함 등)이 따르게 되므로 컨텐츠 변환의 과정이 필요하게 된다. 트랜스코딩이감 기존의 웹 컨텐츠를 단말기의 환경에 따라 적합한 형태로 변환하는 것을 의미하며, HTML 문서의 레이아웃 정보를 이용하여 변환하는 연구가 다양하게 이루어져 왔다. 본 논문에서는 사용자 의견을 반영한 XML문서의 정확한 트랜스 코딩을 위하여 시맨틱 어노테이션 기법을 제안한다. XML 문서의 트랜스코딩에는 IPTC(International Press Telecommunications Council)에서 정한 NewsML을 기반으로 하였으며, 본 논문에서 제안하는 트랜스코딩 프레임워크는 크게 3단계로 나뉘어 진다. 어노테이션 생성 및 인식, 어노테이션의 구조 정보를 활용한 페이지 생성 및 페이지 앱 구성, 디바이스에 따른 페이지의 변환으로 구성된다. 향후 연구로는 어노테이션과 페이지 생성 기법을 통해 생성된 XML 문서를 CC/PP를 이용하여 poA나 휴대폰 등의 시스템에 적합하게 변환하는 기법 등이 요구된다.

  • PDF

A Design of Active Questionnaire System Based on Web Standards (웹표준 기반 능동형 설문 시스템 설계)

  • Jang, Seon-Ah;Yang, Jae-Gun;Bae, Jae-Hak J.
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2011.04a
    • /
    • pp.777-779
    • /
    • 2011
  • 현재의 컴퓨터 설문 시스템은 지필형 설문에 비해 여러 가지 이점이 있지만 처리절차 또는 규칙들이 시스템에 하드 코딩(Hard Coding)되기 때문에 설문이 변경되면 설문처리 엔진을 수정 또는 재구현해야 한다. 이러한 문제를 해결하고자 기존에 능동문서를 도입한 컴퓨터 설문 시스템인 능동문서 기반의 웹 설문 시스템(WINAD: The Web Interview System with Active Documents)을 구현하였다. 하지만 WINAD 시스템은 응답자의 참여도를 높이거나 응답률을 최대화할 수 있는 플래쉬, 플랙스, 실버라이트 등 인터렉티브한 설문지 작성 환경을 제공하지 않는다. 브라우저 상호호환성과 웹접근성 문제를 발생시키기 때문이다. 본 논문에서는 WINAD 시스템을 모바일 및 다양한 브라우저 환경에서 동작가능 하도록 HTML5 기술을 적용한 웹표준 기반 능동형 설문 시스템을 설계하였다.

A Digital Library Prototype for Access to Diverse Collections (다양한 장서 접근을 위한 디지털 도서관의 프로토타입 구축)

  • Choi Won-Tae
    • Journal of the Korean Society for Library and Information Science
    • /
    • v.32 no.2
    • /
    • pp.295-307
    • /
    • 1998
  • This article is an overview of the digital library project, indicating what roles Koreas diverse digital collections may play. Our digital library prototype has simple architecture, consisting of digital repositories, filters, indexing and searching, and clients. Digital repositories include various types of materials and databases. The role of filters is to recognize a format of a document collection and mark the structural components of each of its documents. We are using a database management system (ORACLE and ConText) supporting user-defined functions and access methods that allows us to easily incorporate new object analysis, structuring, and indexing technology into a repository. Clients can be considered browsers or viewers designed for different document data types, such as image, audio, video, SGML, PDF, and KORMARC. The combination of navigational tools supports a variety of approaches to identifying collections and browsing or searching for individual items. The search interface was implemented using HTML forms and the World Wide Web's CGI mechanism.

  • PDF

Korean Web Content Extraction using Tag Rank Position and Gradient Boosting (태그 서열 위치와 경사 부스팅을 활용한 한국어 웹 본문 추출)

  • Mo, Jonghoon;Yu, Jae-Myung
    • Journal of KIISE
    • /
    • v.44 no.6
    • /
    • pp.581-586
    • /
    • 2017
  • For automatic web scraping, unnecessary components such as menus and advertisements need to be removed from web pages and main contents should be extracted automatically. A content block tends to be located in the middle of a web page. In particular, Korean web documents rarely include metadata and have a complex design; a suitable method of content extraction is therefore needed. Existing content extraction algorithms use the textual and structural features of content blocks because processing visual features requires heavy computation for rendering and image processing. In this paper, we propose a new content extraction method using the tag positions in HTML as a quasi-visual feature. In addition, we develop a tag rank position, a type of tag position not affected by text length, and show that gradient boosting with the tag rank position is a very accurate content extraction method. The result of this paper shows that the content extraction method can be used to collect high-quality text data automatically from various web pages.

A the internet distance education system development of the LINUXBASED subtitle - A the center of textbook design module (리눅스 기반의 인터넷 원격 교육 시스템 개발-교재 설계 모듈을 중심으로)

  • 성평식
    • Journal of the Korea Computer Industry Society
    • /
    • v.2 no.2
    • /
    • pp.141-150
    • /
    • 2001
  • Within a context of rapid technological change and shifting market conditions, the education environment requires new paradigm of education delivery. In accordance of such a technological progress, distance education system, which makes the learning take place at anytime anywhere, overcoming barriers of time, or distance, is emerging as a mainstream of education delivery replacing the convectional one way delivery system from instructor to learners. This paper aims to introduce the development principle and algorithm about Instructional System Desgin(ISD) module, a part of a total solution for distance education services. It was developed on Linux, a free Unix-type operating system. Linux supports so various network protocols, sharing the network resources in a smooth way, that it is able to integrate with other operating system very easily, especially with Windows NT or Windows 2000 servers. In terms of quality, it never falls behind the windows products which are commercially available only. It is a right operating system for the such a school environment that is usually limited in budgets. The development environment of the distance education solution to which ISD module belongs is composing of Apache server for web server, lava bean based on components for ISD module, PHP, server-side scripting language, for HTML documents and MySQL for DBMS.

  • PDF

Certificate-based SSO Protocol Complying with Web Standard (웹 표준을 준수하는 인증서기반 통합 인증 프로토콜)

  • Yun, Jong Pil;Kim, Jonghyun;Lee, Kwangsu
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.20 no.8
    • /
    • pp.1466-1477
    • /
    • 2016
  • Public key infrastructure(PKI), principle technology of the certificate, is a security technology providing functions such as identification, non-repudiation, and anti-forgery of electronic documents on the Internet. Our government and financial organizations use PKI authentication using ActiveX to prevent security accident on the Internet service. However, like ActiveX, plug-in technology is vulnerable to security and inconvenience since it is only serviceable to certain browser. Therefore, the research on HTML5 authentication system has been conducted actively. Recently, domestic bank introduced PKI authentication complying with web standard for the first time. However, it still has inconvenience to register a certification on each website because of same origin policy of web storage. This paper proposes the certificate based SSO protocol that complying with web standard to provide user authentication using certificate on several sites by going around same origin policy and its security proof.

리눅스 기반의 인터넷 원격 교육 시스템 개발 -교재 설계 모듈을 중심으로-

  • 성평식;박춘원
    • Proceedings of the Korea Society for Industrial Systems Conference
    • /
    • 2000.11a
    • /
    • pp.635-645
    • /
    • 2000
  • Within a context of rapid technological change and shifting market conditions, the education environment requires new paradigm of education delivery In accordance of such a technological progress, distance education system, which makes the loaming take place at anytime anywhere, overcoming barriers of time, or distance, is emerging as a mainstream of education delivery replacing the convectional one way delivery system from instructor to learners. This Paper aims to introduce the development principle and algorithm about Instructional System Design(ISD) module, a part of a total solution for distance education services. It was developed on Linux, a free Unix-type operating system. Linux supports so various network protocols, sharing the network resources in a smooth way, that it is able to integrate with other operating system very easily, especially with Windows NT or Windows 2000 servers. In terms of quality, it never falls behind the windows products which are commercially available only, It is a right operating system for the such a school environment that is usually limited in budgets. The development environment of the distance education solution to which ISD module belongs is composing of Apache server for web server, Java bean based on components for ISD module, PHP, server-side scripting language, for HTML documents and MySQL for DBMS.

  • PDF

XLink Semantics in Object Repository for Guaranteeing Referential Integrity (객체지향 정보 저장소에서 참조 무결성 보장을 위한 XLink 시맨틱스)

  • 박희경;박상원;김형주
    • Journal of KIISE:Computing Practices and Letters
    • /
    • v.9 no.6
    • /
    • pp.589-601
    • /
    • 2003
  • XLink is XML Linking Language that defines links between XML documents or XML elements. It uses XML syntax to create structures that can describe links similar to the simple unidirectional hyperlinks of HTML, as well as more sophisticated links. There exist several relationships between resources that linked by XLink. Without supports from XML Repository for these relationship management, there is a huge overhead related to the management of relationships during both application development and maintenance, since the relationships need to be hard-coded directly into the application program itself. In this paper, we propose an XLink supporting system in object repository. We describe the design and implementation of this system on top of XDOM(persistent DOMlink XML repository). To support XLink, our system offers referential relationship semantics, extended link and the expansion of DOM tree for XLink.

A High-Speed Message Processing System for Mobile Web Services (모바일 웹 서비스를 위한 고속 메시지 처리 시스템)

  • Kim, Jin-Il;Kim, Yong-Tae;Park, Gil-Cheol
    • Journal of the Korea Society of Computer and Information
    • /
    • v.13 no.3
    • /
    • pp.161-168
    • /
    • 2008
  • A standard Web server or web application servers require the Web Servlet container to execute SOAP messages. But it requires additional process to make web service modules and need communication port. Also, Recently many web applications is becoming increasingly demand against the dynamic document than the static documnet. But a recent study has found that Apache Web Server always does not show the better performance. The more it have the dynamic documents, rather it can show worse performance. To solve this problem, we propose a new High-Speed Message Processing System, in which the SOAP_MP and the WSDL builder are used. The WSDL builder convert HTML/XML to WSDL files automatically and the SOAP_MP minimize SOAP massage processing time by eliminating the Tomcat Servlet container in the mobile Web Services implementation. We compare and analyze the System, which was proposed by us, with the standard Web Service system.

  • PDF

Design and Implementation of a Document-Oriented and Web-Based Nuclear Design Automation System (문서중심 및 웹기반 핵설계 자동화 시스템의 설계 및 구현)

  • Park, Yong-Soo;Kim, Jong-Kyung
    • The KIPS Transactions:PartD
    • /
    • v.11D no.6
    • /
    • pp.1319-1326
    • /
    • 2004
  • To automate nuclear design works which are time-consuming and man-power intensive, Innovative Design Processor ($IDP^{TM}$) is being developed. Two basic principles of IDP are the document-oriented design and the web-based design. The document-oriented design is that, if the designer writes a design document called active document and feeds it to a special program which has a robust parser, the finai document with complete analysis, table and plots is made automatically. The active documents can be written with ordinary HTML/XML editors or created automatically on the web, which is another framework of IDP. Using the proper mix-up of server side and client side programming under the LAMP (Linux/Apache/MySQL/PHP) environment, the design process on the web is modeled as a design wizard style so that even a novice designer makes the design document easily.