• Title/Summary/Keyword: Web Page Analysis

Search Result 154, Processing Time 0.028 seconds

Development of Web-Based Wind Data Analysis System for HeMOSU-1 (웹기반 해모수-1 풍황자료 분석 시스템 개발)

  • Ryu, Ki-Wahn;Park, Kun-Sung;Lee, Jong-Hwa;Oh, Soo-Yun;Kim, Ji-Young;Park, Myoung-Ho
    • Journal of Wind Energy
    • /
    • v.4 no.1
    • /
    • pp.60-67
    • /
    • 2013
  • A web-based program was developed for analyzing weather and structure data from the HeMOSU-1 offshore meteorological mast installed by the KEPCO Research Institute, and 35 km west-southwestward away from Gyeokpo located in Jeonbuk province. All of the measured data are obtained through the data transmitter and the server systems equipped on the HeMOSU-1 and the aerodynamic laboratory in Chonbuk National University respectively. The dualised server system consists of two servers, one is for logging the 1 second based raw data with 10 minute averaged values, and the other is for managing web page with processed weather data. Daily or weekly 10-min averaged data can be provided based on the input date by users. Processed weather data such as wind rose, Weibull distribution, diurnal distribution, turbulence intensity according to wind speed, wind energy density, and so forth are visualized through the web page which would be both useful and informative for developing the wind farm or designing a wind blade for the wind farm nearby southwest sea around the Korean Peninsula. The URL for this web page is http://www.hemosu.org/.

Optimized Web Design Method by Analyzing the Websites (웹사이트 분석을 통한 최적화 설계 방안)

  • Jang, Hee-Seon
    • Convergence Security Journal
    • /
    • v.15 no.2
    • /
    • pp.19-24
    • /
    • 2015
  • As the Internet usage such as Web3.0, future internet, and internet of things increases, the big data through information exchange between the users and web servers increases. Analyzing those web data, the commercial web sites use the analytic results for marketing and campaign, and non-commercial web sites also use the results to improve the user's services satisfaction. In this paper, the quantitative index is presented to analyze the web sites, and optimized web site design method is also presented through the correlation analysis of index and significance test. From the results for 138 web sites, it is observed that strong plus(+) correlation for visits-unique visitors and page views-average visit duration exists. We also observe the minus(-) correlation between bounce rate and page views per user(or ratio of new visits). In specific, to reduce the bounce rate for users, the strategy to increase the page views and ratio of new visits rather than visits and unique visitors is needed.

Realtime Adaptation Transcoding Service, supporting wireless mobile devices and RSS by using Web Structure Analysis (웹 컨텐츠의 구조 분석을 이용한 무선 단말기와 RSS로의 실시간 적응 변환 서비스)

  • Ryu Dong-Yeop;Han Seung-Hyun;Lim Young-Whan
    • Journal of the Korea Society of Computer and Information
    • /
    • v.11 no.4 s.42
    • /
    • pp.61-68
    • /
    • 2006
  • As high speed Internet service technology develops. many people are accessing wireless internet service by using mobile devices. However. an HTLM web page for PC is not very compatible with wireless internet because of its unnecessary advertisements, high resolution. and various multimedia data. This is because the main focus when creating it was on high speed. As this research indicates with the RSS channel. converted from the main part of the HTLM web page, which users show interest in. a wireless compatibleinternet page can easily be created. Consequently, web administrators could create a wireless page easily and quickly, and wireless users would be able to find information in the same manner.

  • PDF

Design and Implementation of an Interestingness Analysis System for Web Personalizatoion & Customization

  • Jung, Youn-Hong;Kim, I-I;Park, Kyoo-seok
    • Journal of Korea Multimedia Society
    • /
    • v.6 no.4
    • /
    • pp.707-713
    • /
    • 2003
  • Convenience and promptness of the internet have been not only making the electronic commerce grow rapidly in case of website, analyzing a navigation pattern of the users has been also making personalization and customization techniques develop rapidly for providing service accordant to individual interestingness. Web personalization and customization skill has been utilizing various methods, such as web log mining to use web log data and web mining to use the transaction of users etc, especially e-CRM analyzing a navigation pattern of the users. In this paper, We measure exact duration time of the users in web page and web site, compute weight about duration time each page, and propose a way to comprehend e-loyalty through the computed weight.

  • PDF

A Study on the Analysis of Color Image of the Web Pages of University Libraries (대학도서관 웹 페이지의 색채이미지 분석에 관한 연구)

  • Lee, Cheol-Chan
    • Journal of Korean Library and Information Science Society
    • /
    • v.38 no.1
    • /
    • pp.89-106
    • /
    • 2007
  • This is to present the information and direction about color scheme image and adjective image in designing web page, by analysing the color image of our country's National University library's web page which is being operated now. The analysis method is to find the RGB of objective site through color emotion standard and to abstract the color chip. It is divided by color scheme image and adjective image. The scope of research is 41 National University libraries registered in National University Libraries Association. The result is that white and grey color of background and colors like light blue and green are main. In case of adjective image, nimble image and clear image was many. next is orderly bellowing image. and elegant image.

  • PDF

Research on Training and Implementation of Deep Learning Models for Web Page Analysis (웹페이지 분석을 위한 딥러닝 모델 학습과 구현에 관한 연구)

  • Jung Hwan Kim;Jae Won Cho;Jin San Kim;Han Jin Lee
    • The Journal of the Convergence on Culture Technology
    • /
    • v.10 no.2
    • /
    • pp.517-524
    • /
    • 2024
  • This study aims to train and implement a deep learning model for the fusion of website creation and artificial intelligence, in the era known as the AI revolution following the launch of the ChatGPT service. The deep learning model was trained using 3,000 collected web page images, processed based on a system of component and layout classification. This process was divided into three stages. First, prior research on AI models was reviewed to select the most appropriate algorithm for the model we intended to implement. Second, suitable web page and paragraph images were collected, categorized, and processed. Third, the deep learning model was trained, and a serving interface was integrated to verify the actual outcomes of the model. This implemented model will be used to detect multiple paragraphs on a web page, analyzing the number of lines, elements, and features in each paragraph, and deriving meaningful data based on the classification system. This process is expected to evolve, enabling more precise analysis of web pages. Furthermore, it is anticipated that the development of precise analysis techniques will lay the groundwork for research into AI's capability to automatically generate perfect web pages.

Web Log Analysis Using Support Vector Regression

  • Jun, Sung-Hae;Lim, Min-Taik;Jorn, Hong-Seok;Hwang, Jin-Soo;Park, Seong-Yong;Kim, Jee-Yun;Oh, Kyung-Whan
    • Communications for Statistical Applications and Methods
    • /
    • v.10 no.1
    • /
    • pp.61-77
    • /
    • 2003
  • Due to the wide expansion of the internet, people can freely get information what they want with lesser efforts. However without adequate forms or rules to follow, it is getting more and more difficult to get necessary information. Because of seemingly chaotic status of the current web environment, it is sometimes called "Dizzy web" The user should wander from page to page to get necessary information. Therefore we need to construct system which properly recommends appropriate information for general user. The representative research field for this system is called Recommendation System(RS), The collaborative recommendation system is one of the RS. It was known to perform better than the other systems. When we perform the web user modeling or other web-mining tasks, the continuous feedback data is very important and frequently used. In this paper, we propose a collaborative recommendation system which can deal with the continuous feedback data and tried to construct the web page prediction system. We use a sojourn time of a user as continuous feedback data and combine the traditional model-based algorithm framework with the Support Vector Regression technique. In our experiments, we show the accuracy of our system and the computing time of page prediction compared with Pearson's correlation algorithm.algorithm.

HTML Text Extraction Using Tag Path and Text Appearance Frequency (태그 경로 및 텍스트 출현 빈도를 이용한 HTML 본문 추출)

  • Kim, Jin-Hwan;Kim, Eun-Gyung
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.25 no.12
    • /
    • pp.1709-1715
    • /
    • 2021
  • In order to accurately extract the necessary text from the web page, the method of specifying the tag and style attributes where the main contents exist to the web crawler has a problem in that the logic for extracting the main contents. This method needs to be modified whenever the web page configuration is changed. In order to solve this problem, the method of extracting the text by analyzing the frequency of appearance of the text proposed in the previous study had a limitation in that the performance deviation was large depending on the collection channel of the web page. Therefore, in this paper, we proposed a method of extracting texts with high accuracy from various collection channels by analyzing not only the frequency of appearance of text but also parent tag paths of text nodes extracted from the DOM tree of web pages.

A Ranking Algorithm for Semantic Web Resources: A Class-oriented Approach (시맨틱 웹 자원의 랭킹을 위한 알고리즘: 클래스중심 접근방법)

  • Rho, Sang-Kyu;Park, Hyun-Jung;Park, Jin-Soo
    • Asia pacific journal of information systems
    • /
    • v.17 no.4
    • /
    • pp.31-59
    • /
    • 2007
  • We frequently use search engines to find relevant information in the Web but still end up with too much information. In order to solve this problem of information overload, ranking algorithms have been applied to various domains. As more information will be available in the future, effectively and efficiently ranking search results will become more critical. In this paper, we propose a ranking algorithm for the Semantic Web resources, specifically RDF resources. Traditionally, the importance of a particular Web page is estimated based on the number of key words found in the page, which is subject to manipulation. In contrast, link analysis methods such as Google's PageRank capitalize on the information which is inherent in the link structure of the Web graph. PageRank considers a certain page highly important if it is referred to by many other pages. The degree of the importance also increases if the importance of the referring pages is high. Kleinberg's algorithm is another link-structure based ranking algorithm for Web pages. Unlike PageRank, Kleinberg's algorithm utilizes two kinds of scores: the authority score and the hub score. If a page has a high authority score, it is an authority on a given topic and many pages refer to it. A page with a high hub score links to many authoritative pages. As mentioned above, the link-structure based ranking method has been playing an essential role in World Wide Web(WWW), and nowadays, many people recognize the effectiveness and efficiency of it. On the other hand, as Resource Description Framework(RDF) data model forms the foundation of the Semantic Web, any information in the Semantic Web can be expressed with RDF graph, making the ranking algorithm for RDF knowledge bases greatly important. The RDF graph consists of nodes and directional links similar to the Web graph. As a result, the link-structure based ranking method seems to be highly applicable to ranking the Semantic Web resources. However, the information space of the Semantic Web is more complex than that of WWW. For instance, WWW can be considered as one huge class, i.e., a collection of Web pages, which has only a recursive property, i.e., a 'refers to' property corresponding to the hyperlinks. However, the Semantic Web encompasses various kinds of classes and properties, and consequently, ranking methods used in WWW should be modified to reflect the complexity of the information space in the Semantic Web. Previous research addressed the ranking problem of query results retrieved from RDF knowledge bases. Mukherjea and Bamba modified Kleinberg's algorithm in order to apply their algorithm to rank the Semantic Web resources. They defined the objectivity score and the subjectivity score of a resource, which correspond to the authority score and the hub score of Kleinberg's, respectively. They concentrated on the diversity of properties and introduced property weights to control the influence of a resource on another resource depending on the characteristic of the property linking the two resources. A node with a high objectivity score becomes the object of many RDF triples, and a node with a high subjectivity score becomes the subject of many RDF triples. They developed several kinds of Semantic Web systems in order to validate their technique and showed some experimental results verifying the applicability of their method to the Semantic Web. Despite their efforts, however, there remained some limitations which they reported in their paper. First, their algorithm is useful only when a Semantic Web system represents most of the knowledge pertaining to a certain domain. In other words, the ratio of links to nodes should be high, or overall resources should be described in detail, to a certain degree for their algorithm to properly work. Second, a Tightly-Knit Community(TKC) effect, the phenomenon that pages which are less important but yet densely connected have higher scores than the ones that are more important but sparsely connected, remains as problematic. Third, a resource may have a high score, not because it is actually important, but simply because it is very common and as a consequence it has many links pointing to it. In this paper, we examine such ranking problems from a novel perspective and propose a new algorithm which can solve the problems under the previous studies. Our proposed method is based on a class-oriented approach. In contrast to the predicate-oriented approach entertained by the previous research, a user, under our approach, determines the weights of a property by comparing its relative significance to the other properties when evaluating the importance of resources in a specific class. This approach stems from the idea that most queries are supposed to find resources belonging to the same class in the Semantic Web, which consists of many heterogeneous classes in RDF Schema. This approach closely reflects the way that people, in the real world, evaluate something, and will turn out to be superior to the predicate-oriented approach for the Semantic Web. Our proposed algorithm can resolve the TKC(Tightly Knit Community) effect, and further can shed lights on other limitations posed by the previous research. In addition, we propose two ways to incorporate data-type properties which have not been employed even in the case when they have some significance on the resource importance. We designed an experiment to show the effectiveness of our proposed algorithm and the validity of ranking results, which was not tried ever in previous research. We also conducted a comprehensive mathematical analysis, which was overlooked in previous research. The mathematical analysis enabled us to simplify the calculation procedure. Finally, we summarize our experimental results and discuss further research issues.

An Analysis and Design Model for Web-based Information System (Web 기반 시스템 분석/설계 모형 개발)

  • 김정희;박세권
    • Proceedings of the Korean Information Science Society Conference
    • /
    • 1998.10b
    • /
    • pp.620-622
    • /
    • 1998
  • Web 기반 시스템 구축시 저렴한 구축비용, 표준화된 문서환경, 클라이언트를 통일하지 않고도 어플리케이션의 구현이 가능한 장점 때문에 인터넷 프로토콜인 TCP/IP와 HTTP를 기반으로 시스템을 구축하려는 기업과 조직이 늘고 있다. 본 논문에서는 web 환경에서 시스템 구축시 적용할 수 잇는 web 기반 시스템 분석/설계 모형을 제안한다. 기존의 web 어플리케이션 개발 방법들이 데이터 분석에 기초를 둔 것과는 달리, 기업과 조직이 가지는 기능을 중심으로 web 어플리케이션을 분석하고, 파악된 기능에서 pageflow를 설계하여 사용자의 요구사항과 조직의 처리가 반영된 page와 네비게이션 경로(navigation path)를 설계할 수 있는 모형을 제시한다.