• Title/Summary/Keyword: Korean Web Statistics

Search Result 268, Processing Time 0.024 seconds

Implementation of a Web Robot and Statistics on the Korean Web (웹 로봇 구현 및 한국 웹 통계보고)

  • Kim, Sung-Jin;Lee, Sang-Ho
    • The KIPS Transactions:PartC
    • /
    • v.10C no.4
    • /
    • pp.509-518
    • /
    • 2003
  • A web robot is a program that downloads and stores web pages. Implementation issues for developing web robots have been studied widely and various web statistics are reported in the literature. First, this paper describes the overall architecture of our robot and implementation decisions on several important issues. Second, we show empirical statistics on approximately 74 million Korean web pages. Third, we monitored 1,424 Korean web sites to observe the changes of web pages. We identify what factors of web pages could affect the changes. The factors may be used for the selection of web pages to be updated incrementally.

A Development Study of Tool for Web Log Analysis

  • Choi, Seungbae;Kang, Changwan;Kim, Kyukon;Son, Jongkwan
    • Communications for Statistical Applications and Methods
    • /
    • v.11 no.1
    • /
    • pp.93-106
    • /
    • 2004
  • Recently, many data of various types is gained with development of computer in many fields. Especially, web log data generating in web site furnish beneficial information on an organization. The enterprise's destiny is swayed by according as how these information gaining from the web site utilize. In this paper, for the purpose of obtaining useful information, we present a tool is called WebBizi for web log analysis. This will be helpful to enterprise working the web site.

Geovisualization of Migration Statistics Using Flow Mapping Based on Web GIS (Web GIS 기반 유선도 작성을 통한 인구이동통계의 지리적 시각화)

  • Kim, Kam-Young;Lee, Sang-Il
    • Journal of the Korean Geographical Society
    • /
    • v.47 no.2
    • /
    • pp.268-281
    • /
    • 2012
  • In spite of the usefulness of migration statistics in spatially understanding social processes and identifying social effects of spatial processes, services and analyses of the statistics have been restricted due to the complexity of their data structure. In addition, flow mapping functionality which is a useful method to explore and visualize the migration statistics has yet to be fully represented in modern GIS applications. Given this, the purpose of this research is to demonstrate the possibility of flow mapping and the exploratory spatial analysis of the migration statistics in a Web GIS environment. For this, the characteristics of the statistics were examined from database, GIS, and cartographic perspectives. Then, O-D structure of the migration statistics was converted to spatial data appropriate to f low mapping based on the characteristics. The interface of Web GIS is specialized the migration statistics and provides exploratory visualization by allowing dynamic interactions such as spatial focusing and attribute filtering.

  • PDF

A Clustering Algorithm Considering Structural Relationships of Web Contents

  • Kang Hyuncheol;Han Sang-Tae;Sun Young-Su
    • Communications for Statistical Applications and Methods
    • /
    • v.12 no.1
    • /
    • pp.191-197
    • /
    • 2005
  • Application of data mining techniques to the world wide web, referred to as web mining, has been the focus of several recent researches. With the explosive growth of information sources available on the world wide web, it has become increasingly necessary to track and analyze their usage patterns. In this study, we introduce a process of pre-processing and cluster analysis on web log data and suggest a distance measure considering the structural relationships between web contents. Also, we illustrate some real examples of cluster analysis for web log data and look into practical application of web usage mining for eCRM.

Implementation of a Web-Based Electronic Text for High School's Probability and Statistics Education

  • Choi, Sook-Hee
    • Communications for Statistical Applications and Methods
    • /
    • v.11 no.2
    • /
    • pp.329-343
    • /
    • 2004
  • With advancement of computer and network, world wide web(WWW) as a medium of information communication is generalized in many fields. In educational aspect, applications of WWW as alternative media for class teachings or printed matters are increasing. In this article, we demonstrate a web-based electronic text on the 'probability and statistics' which is one of six fields of mathematics in the 7th curriculum. This text places importance on comprehension of concepts of probability and statistics as an applied science.

Design and Implementation of Public Web Services Analyzer (웹 서비스 분석기의 디자인과 구현)

  • Matai Janarbek
    • Proceedings of the Korean Information Science Society Conference
    • /
    • 2005.11a
    • /
    • pp.241-243
    • /
    • 2005
  • Web services (WS) present a new promising software technology, which provides application-to-application interaction. They are built on the top of existing web protocol and based on open XML standards. Web services are described using WSDL, and the UDDI is a integration directory provide registry of Web Services descriptions. WSDL provides information of Web Services but it is getting more and more important to know more than those provided by WSDL. From WSDL we can not get the information like usage of WS, performance of WS, complexity of WS, usability of WS with other web service. In this paper, we proposed a new method for Web Services so called Public Web Services Analyzer (PWSA). This technique is based on analyzing various public UDDI registries in order to get various kinds of statistics of web services. Those statistics will be used by both web services developers and consumers for finding them suitable services for their needs. PWSA guarantees that it can provide enough information to find right web services for both Web Services Consumers and Web Service Developers.

  • PDF

Implementation of Estimation and Inference on the Web

  • Kang, Heemo;Sim, Songyong
    • Communications for Statistical Applications and Methods
    • /
    • v.7 no.3
    • /
    • pp.913-926
    • /
    • 2000
  • An electronic statistics text on the web is implemented. The introduced text provide interactive instructions on the statistical estimation and inference. As a by-product, we also provide a calculation of quantiles and p-value of t-distribution and standard normal distribution. This program was written in JAVA programming language.

  • PDF

Regression and Correlation Analysis via Dynamic Graphs

  • Kang, Hee Mo;Sim, Songyong
    • Communications for Statistical Applications and Methods
    • /
    • v.10 no.3
    • /
    • pp.695-705
    • /
    • 2003
  • In this article, we propose a regression and correlation analysis via dynamic graphs and implement them in Java Web Start. For the polynomial relations between dependent and independent variables, dynamic graphics are implemented for both polynomial regression and spline estimates for an instant model selection. The results include basic statistics. They are available both as a web-based service and an application.

Consumer behavior prediction using Airbnb web log data (에어비앤비(Airbnb) 웹 로그 데이터를 이용한 고객 행동 예측)

  • An, Hyoin;Choi, Yuri;Oh, Raeeun;Song, Jongwoo
    • The Korean Journal of Applied Statistics
    • /
    • v.32 no.3
    • /
    • pp.391-404
    • /
    • 2019
  • Customers' fixed characteristics have often been used to predict customer behavior. It has recently become possible to track customer web logs as customer activities move from offline to online. It has become possible to collect large amounts of web log data; however, the researchers only focused on organizing the log data or describing the technical characteristics. In this study, we predict the decision-making time until each customer makes the first reservation, using Airbnb customer data provided by the Kaggle website. This data set includes basic customer information such as gender, age, and web logs. We use various methodologies to find the optimal model and compare prediction errors for cases with web log data and without it. We consider six models such as Lasso, SVM, Random Forest, and XGBoost to explore the effectiveness of the web log data. As a result, we choose Random Forest as our optimal model with a misclassification rate of about 20%. In addition, we confirm that using web log data in our study doubles the prediction accuracy in predicting customer behavior compared to not using it.

The Design and Implementation of Web-based Statistical Consulting System

  • Ryu, Jae-Yeol;Lee, Jung-Hoon;Jo, Min-Ji;Kim, Ae-Ji
    • 한국데이터정보과학회:학술대회논문집
    • /
    • 2006.11a
    • /
    • pp.167-180
    • /
    • 2006
  • The statistical survey and analysis is much restricted to time, space and material. The statistical survey and analysis could hardly resume. The statistical survey and analysis is very important to create various and accurate information. The statistical survey and analysis which is not a expert knowledge have many problems in productivity of information, reliability and etc. In this paper, we study the design and Implementation of web-based statistical survey and analysis consulting system which a client meet easily a statistical expert on the web.

  • PDF