• Title/Summary/Keyword: web pages

Search Result 553, Processing Time 0.03 seconds

Analyzing Coverage and Coverage Overlap of Korean Web Directories (국내 웹 디렉토리들의 커버리지 및 커버리지 중복성 분석)

  • 배희진;이진숙;이준호;박소연
    • Journal of the Korean Society for information Management
    • /
    • v.21 no.1
    • /
    • pp.173-186
    • /
    • 2004
  • This study examines coverage and coverage overlap of the three major Korean web directories, Naver, Yahoo Korea, and Empas. This study also suggests a methodology for collecting and processing web sites provided by these web directories. A method for napping main categories was developed. Each directory provided registered web pages in a slightly different way. Reference links had a significant influence on the coverage of each web directory. The overlap of pages among three directories was quite low, It is expected that this study could contribute to the field of web research by providing insights to how directories provide web pages and suggesting a methodology for the analysis of directory coverage.

WebSES : Web Site Sensibility Evaluation System based on Color Combination (WebSES : 배색을 이용한 웹 사이트 감성 평가 시스템)

  • 유헌우;조경자;홍지영;박수이
    • Science of Emotion and Sensibility
    • /
    • v.7 no.1
    • /
    • pp.51-64
    • /
    • 2004
  • In this paper, we propose a web page retrieval system based on the sensibility evaluation induced by the color combination of web pages. The realized system consist of two modules - the indexing module that automatically extracts and indexes the color information from the web page and the retrieval module that retrieves web pages based on the color combination when sensibility adjective is presented. Also, to verify the system usefulness, we analyzed the ranking of web pages retrieved by the system and by human subjects (non-expels and experts for color web page design) using two statistical methods of correlation and paired-t test. Results by non-experts showed the realized system was suitable for 10 sensibility adjectives among 18 sensibility adjectives, and results by experts showed that the realized system was suitable for 14 sensibility adjectives among 18 sensibility adjectives.

  • PDF

Automatic Generation of Voice Web Pages Based on SALT (SALT 기반 음성 웹 페이지의 자동 생성)

  • Ko, You-Jung;Kim, Yoon-Joong
    • Journal of KIISE:Software and Applications
    • /
    • v.37 no.3
    • /
    • pp.177-184
    • /
    • 2010
  • As a voice browser is introduced, voice dialog application becomes available on the Web environment. The voice dialog application consists of voice Web pages that need to translate the dialog scripts into SALT(Speech Application Language Tags). The current Web pages have been designed for visual. They, however, are potentially capable of using voice dialog. This paper, therefore, proposes an automated voice Web generation method that finds the elements for voice dialog from Web pages based HTML and converts them into SALT. The automatic generation system of a voice Web page consists of a lexical analyzer and a syntactic analyzer that converts a Web page which is described in HTML to voice Web page which is described in HTML+SALT. The converted voice Web page is designed to be able to handle not only the current mouse and keyboard input but also voice dialog.

Efficient Text Identifier for Mobile Web Browser

  • Nomoto, Leonardo Juniti;Kim, Chang-Su
    • Proceedings of the Korean Society of Broadcast Engineers Conference
    • /
    • 2008.11a
    • /
    • pp.75-76
    • /
    • 2008
  • Mobile devices are being widely used to access Internet contents. However, most available web pages are designed for desktop computers and consequently it is inconvenient to browse large web pages on mobile devices with small screen. Text identification is a process to extract texts from the body of a web page, which are then displayed in a comfortable way for reading. In this paper, we propose a text extraction scheme and discuss its implementation.

  • PDF

The impact of inter-host links in crawling important pages early

  • Alam, Hijbul;Ha, Jong-Woo;Sim, Kyu-Sun;Lee, Sang-Keun
    • Proceedings of the Korean Information Science Society Conference
    • /
    • 2010.06c
    • /
    • pp.118-121
    • /
    • 2010
  • The dynamic nature and exponential growth of the World Wide Web remain crawling important pages early still challenging. State-of-the-art crawl scheduling algorithms require huge running time to prioritize web pages during crawling. In this research, we proposed crawl scheduling algorithms that are not only fast but also download important pages early. The algorithms give high importance to some specific pages those have good linkages such as inlinks from different domains or host. The proposed algorithms were experimented on publically available large datasets. The results of experiments showed that propagating more importance to the inter-host links improves the effectiveness of crawl scheduling than the current state-of-the-art crawl scheduling algorithms.

  • PDF

Mobile Web-Access Evaluation and Automatic Translation System (모바일 기반의 웹접근성 평가 및 자동변환 시스템)

  • Kim, Seung-Cheon;Hwang, Ho-Young;Rho, Kwang-Hyun
    • The Journal of the Institute of Internet, Broadcasting and Communication
    • /
    • v.12 no.4
    • /
    • pp.195-200
    • /
    • 2012
  • This paper introduces Web Accessibility to web pages, which is the basic method of obtaining information in modern Internet. Also we explore the system that is capable of examine whether the corresponding web pages are following the accessibility regulation. In order to examine web pages, we need to classify all the objects that web page contains. And the classified objects are to be examined by the standards of Web Accessibility. And also this paper introduces the mobile translation system for web accessibility.

Evaluating the Quality of Basic Life Support Information for Primary Korean-Speaking Individuals on the Internet (국내 인터넷 웹 페이지에 나타난 기본심폐소생술 정보의 질 평가)

  • Kang, Hee Do;Moon, Hyung Jun;Lee, Jung Won;Choi, Jae Hyung;Lee, Dong Wook;Kim, Hyun Su;Kang, In Gu;Kim, Doh Eui;Lee, Hyung Jung;Lee, Han You
    • Health Communication
    • /
    • v.13 no.2
    • /
    • pp.125-132
    • /
    • 2018
  • Purpose: The aim of this study is to investigate the quality of basic life support (BLS) information for primary Korean-speaking individuals on the internet. Methods: Using the $Google^{(C)}$ search engine, we searched for the terms 'CPR', 'cardiopulmonary resuscitation (in Korean)' and 'cardiac arrest (in Korean)'. The accuracy, reliability and accessibility of web pages was evaluated based on the 2015 American heart association(AHA) guidelines for CPR & emergency cardiovascular care, the health on the net foundation code of conduct and Korean web content accessibility guidelines 2.1, respectively. Results: Of the 178 web pages screened, 50 met criteria for inclusion. The overall quality of BLS information was not enough (median 5/7, IQR 4.75-6). 23(36%) pages were created in accordance with 2010 AHA guidelines. Only 24(48%) web pages educated on how to use the automated electrical defibrillator. The attribution and transparency of the reliability of pages was relatively low, 20(40%) and 16(32%). The web accessibility score was relatively high. Conclusion: A small of proportion of internet web pages searched by Google have high quality BLS information for a Korean-speaking population. Web pages based on past guideline were still being searched. The notation of the source of CPR information and the transparency of the author should be improved. The verification and evaluation of the quality of BLS information exposed to the Internet are continuously needed.

Architecture of XRML-based Comparison Shopping Mall and Its Performance on Delivery Cost Estimation (XRML 기반 비교쇼핑몰의 구조와 배송비 산정에 관한 실증분석)

  • Lee Jae Kyu;Kang Juyoung
    • Journal of the Korean Operations Research and Management Science Society
    • /
    • v.30 no.2
    • /
    • pp.185-199
    • /
    • 2005
  • With the growth of internet shopping malls, there is increasing interest in comparison shopping mall. However most comparison sites compare only book prices by collecting simple XML data and do not provide .the exact comparison Including precise shipping costs. Shipping costs vary depending on each customer's address, the delivery method, and the category of selected goods, so rule based system is required in order to calculate exact shipping costs. Therefore, we designed and implemented comparison shopping mall which compares not only book prices but also shipping costs using rule based inference. By adopting the extensible Rule Markup language (XRML) approach, we proposed the methodology of extracting delivery rules from Web pages of each shopping mall. The XRML approach can facilitate nearly automatic rule extraction from Web pages and consistency maintenance between Web pages and rule base. We developed a ConsiderD system which applies our rule acquisition methodology based on XRML. The objective of the ConsiderD system is to compare the exact total cost of books including the delivery cost over Amazon.com, BarnesandNoble.com, and Powells.com. With this prototype, we conducted an experiment to show the potential of automatic rule acquisition from Web pages and illustrate the effect of delivery cost.

An Implementation and Performance Evaluation of Fast Web Crawler with Python

  • Kim, Cheong Ghil
    • Journal of the Semiconductor & Display Technology
    • /
    • v.18 no.3
    • /
    • pp.140-143
    • /
    • 2019
  • The Internet has been expanded constantly and greatly such that we are having vast number of web pages with dynamic changes. Especially, the fast development of wireless communication technology and the wide spread of various smart devices enable information being created at speed and changed anywhere, anytime. In this situation, web crawling, also known as web scraping, which is an organized, automated computer system for systematically navigating web pages residing on the web and for automatically searching and indexing information, has been inevitably used broadly in many fields today. This paper aims to implement a prototype web crawler with Python and to improve the execution speed using threads on multicore CPU. The results of the implementation confirmed the operation with crawling reference web sites and the performance improvement by evaluating the execution speed on the different thread configurations on multicore CPU.