• Title/Summary/Keyword: Web Link


An Efficient Candidate Pattern Storage Tree Structure and Algorithm for Incremental Web Mining (점진적인 웹 마이닝을 위한 효율적인 후보패턴 저장 트리구조 및 알고리즘)

  • Kang, Hee-Seong;Park, Byung-Jun
    • Proceedings of the KIEE Conference / 2006.04a / pp.3-5 / 2006
  • Recent advances in internet infrastructure have resulted in a large number of huge Web sites and portals worldwide. These Web sites are visited by various types of users in many different ways. Among all the web page access sequences from different users, some occur so frequently that they may deserve the attention of interested parties. We call these frequent access patterns, and we call access sequences that may become frequent candidate patterns. Since candidate patterns play an important role in incremental Web mining, it is important to generate, add, delete, and search for them efficiently. This thesis presents a novel tree structure that can efficiently store the candidate patterns, together with a related set of algorithms for generating the tree structure, adding new patterns, deleting unnecessary patterns, and searching for the needed ones. The proposed tree structure has a kind of three-dimensional link structure, and its nodes are layered.


An Experimental Study of Cocitation Analysis on Web Information (웹 정보원의 동시인용분석에 관한 실험적 연구)

  • 정동열;최윤미
    • Journal of the Korean Society for Information Management / v.16 no.2 / pp.7-26 / 1999
  • This experimental study examines informetric analysis of the World Wide Web, based on cocitation analysis of Web pages and on the features of Web resources in the field of communication studies. Cocitation analysis is performed to examine the intellectual structure of communication studies as reflected in link counts on the Web. The selected Web resources in the field are mapped in two dimensions based on similarities in cocitation frequency, using a correlation matrix, multidimensional scaling, and cluster analysis. Cocitation analysis methods applied to the Web, using organizational homepages, personal homepages, or Web indexes, produced clusters of Web resources that had topical similarities. Although informetric analysis of Web resources is still at a preliminary stage, the study shows that the Web can be a new tool for indicating the intellectual structure of a specific research field. In addition, this study analyzes the characteristics of printed resources and Web resources, and the differences in research methods when applying cocitation analysis.

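The cocitation idea in the abstract above can be illustrated with a minimal sketch. It assumes the usual definition (two pages are cocited once for every third page that links to both) and is not the procedure used in the study; the function name `cocitation_counts` and the page labels are hypothetical.

```python
from collections import Counter
from itertools import combinations


def cocitation_counts(out_links):
    """Count, for each pair of target pages, how many source pages link to both.

    out_links maps a source page to the pages it links to (illustrative input format).
    """
    counts = Counter()
    for targets in out_links.values():
        # Deduplicate and sort so each pair has one canonical (a, b) key.
        for a, b in combinations(sorted(set(targets)), 2):
            counts[(a, b)] += 1
    return counts


# Toy link data, made up for illustration only.
links = {"p1": ["A", "B", "C"], "p2": ["A", "B"], "p3": ["B", "C"]}
pair_counts = cocitation_counts(links)
```

The resulting pair counts are the kind of raw frequencies that a correlation matrix, multidimensional scaling, or cluster analysis would then take as input.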

Design and Implementation of Customer Personalized System Using Web Log and Purchase Database

  • Lee Jae-Hoon;Chung Hyun-Sook;Lee Sung-Joo
    • International Journal of Fuzzy Logic and Intelligent Systems / v.6 no.1 / pp.21-26 / 2006
  • In this paper, we propose a customer personalization system that presents web pages customized to each user's individual characteristics. It analyzes the behavior of users who visit the shopping mall and preferentially supplies the information they need. When users actually buy items, it forecasts their access patterns on the web site and the items they are likely to purchase next, and tailors their web pages accordingly. It infers the relations among web documents and among items from the web server's log data and the purchase information in the database. For this inference it employs the Apriori algorithm, a method for finding association rules. It infers related web pages from the web log by considering each user's access pattern and access time, and infers each user's purchase pattern from the purchase information in the database. Based on these relations, it appends links to the related web pages to the user's web pages and displays the inferred goods there.
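As a rough illustration of the Apriori step mentioned in the abstract above, the following sketch mines frequent itemsets level by level and derives association rules from them. It is a textbook-style simplification, not the authors' system; the function names, the session data, and the thresholds are all assumptions made for the example.

```python
from itertools import combinations


def frequent_itemsets(transactions, min_support):
    """Return {itemset: count} for itemsets found in at least min_support transactions."""
    transactions = [frozenset(t) for t in transactions]
    counts = {}
    for t in transactions:
        for item in t:
            key = frozenset([item])
            counts[key] = counts.get(key, 0) + 1
    current = {s: c for s, c in counts.items() if c >= min_support}
    result = dict(current)
    k = 2
    while current:
        # Join step: merge frequent (k-1)-itemsets into k-item candidates.
        candidates = {a | b for a, b in combinations(list(current), 2) if len(a | b) == k}
        # Prune step: keep a candidate only if all its (k-1)-subsets are frequent.
        candidates = {c for c in candidates
                      if all(frozenset(s) in current for s in combinations(c, k - 1))}
        counts = {c: sum(1 for t in transactions if c <= t) for c in candidates}
        current = {s: c for s, c in counts.items() if c >= min_support}
        result.update(current)
        k += 1
    return result


def association_rules(itemsets, min_confidence):
    """Return (antecedent, consequent, confidence) triples meeting min_confidence."""
    rules = []
    for itemset, support in itemsets.items():
        for r in range(1, len(itemset)):
            for antecedent in map(frozenset, combinations(itemset, r)):
                confidence = support / itemsets[antecedent]
                if confidence >= min_confidence:
                    rules.append((antecedent, itemset - antecedent, confidence))
    return rules


# Hypothetical page-visit sessions, made up for illustration.
sessions = [["home", "shoes", "cart"], ["home", "shoes"], ["home", "cart"], ["shoes", "cart"]]
itemsets = frequent_itemsets(sessions, min_support=2)
rules = association_rules(itemsets, min_confidence=0.6)
```

A rule such as "visitors of `home` also visit `shoes`" is the kind of relation the abstract describes using to append related links to a user's page.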

Customer Personalized System of eCRM Using Web Log Mining and Rough Set

  • Lee, Jae-Hoon;Chung, Il-Yong;Lee, Sung-Joo
    • Proceedings of the Korean Institute of Intelligent Systems Conference / 2006.11a / pp.29-32 / 2006
  • In this paper, we propose a customer personalization system that presents web pages customized to each user's individual characteristics. It analyzes the behavior of users who visit the shopping mall and preferentially supplies the information they need. When users actually buy items, it forecasts their access patterns on the web site and the items they are likely to purchase next, and tailors their web pages accordingly. It infers the relations among web documents and among items from the web server's log data and the purchase information in the database. For this inference it employs Rough Set, a method that finds association rules and offers the most suitable cases by reducing the set of cases. It infers related web pages from the web log by considering each user's access pattern and access time, and infers each user's purchase pattern from the purchase information in the database. Based on these relations, it appends links to the related web pages to the user's web pages and displays the inferred goods there.


Research of Adaptive Transformation Method Based on Webpage Semantic Features for Small-Screen Terminals

  • Li, Hao;Liu, Qingtang;Hu, Min;Zhu, Xiaoliang
    • ETRI Journal / v.35 no.5 / pp.900-910 / 2013
  • Small-screen mobile terminals have difficulty accessing existing Web resources designed for large-screen devices. This paper presents an adaptive transformation method based on webpage semantic features to solve this problem. According to the text density and link density features of the webpages, the webpages are divided into two types: index and content. Our method uses an index-based webpage transformation algorithm and a content-based webpage transformation algorithm. Experiment results demonstrate that our adaptive transformation method is not dependent on specific software and webpage templates, and it is capable of enhancing Web content adaptation on small-screen terminals.
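The index/content split described in the abstract above can be sketched with link density alone. This is only an approximation under stated assumptions: the paper also uses text density, and the 0.5 threshold, the class name, and the function names here are hypothetical rather than taken from the paper.

```python
from html.parser import HTMLParser


class LinkTextCounter(HTMLParser):
    """Count visible text characters, and how many of them sit inside <a> elements."""

    def __init__(self):
        super().__init__()
        self.anchor_depth = 0
        self.total_chars = 0
        self.link_chars = 0

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            self.anchor_depth += 1

    def handle_endtag(self, tag):
        if tag == "a" and self.anchor_depth > 0:
            self.anchor_depth -= 1

    def handle_data(self, data):
        # Note: script and style text is counted too; a fuller version would skip it.
        n = len(data.strip())
        self.total_chars += n
        if self.anchor_depth > 0:
            self.link_chars += n


def link_density(html):
    """Fraction of text characters that belong to link text (0.0 for an empty page)."""
    counter = LinkTextCounter()
    counter.feed(html)
    counter.close()
    return counter.link_chars / counter.total_chars if counter.total_chars else 0.0


def classify_page(html, threshold=0.5):
    """Label a page 'index' when most of its text is link text, else 'content'."""
    return "index" if link_density(html) > threshold else "content"


index_html = '<ul><li><a href="/a">News</a></li><li><a href="/b">Sports</a></li></ul>'
content_html = "<p>" + "word " * 50 + '<a href="/x">more</a></p>'
```

With these two toy pages, `classify_page(index_html)` yields `"index"` and `classify_page(content_html)` yields `"content"`.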

An Implementation of the Ranking Algorithm for Web Documents based on Link Analysis (링크 분석에 기반한 웹 문서 중요도 평가 알고리즘의 구현)

  • Lim, Sung-Chae
    • Proceedings of the Korean Society of Computer Information Conference / 2010.07a / pp.75-78 / 2010
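  • Unlike conventional information retrieval (IR) systems, Web search often estimates the intrinsic importance of each Web document from the hyperlink information between documents. Among link-analysis-based algorithms, PageRank is known to have been applied to Google's Web search service. When importance is computed with PageRank, the CPU resources required grow as the number of indexed Web documents increases, and once the number of documents reaches several hundred million pages, the computation can no longer be carried out on a single server. To address this problem, this paper presents a method for using multiple servers as a cluster for PageRank computation. The proposed method connects multiple servers over a high-speed LAN and performs the iterative matrix computation in parallel, which can shorten the computation time. A multithreaded program was written to implement the server cluster, and the matrix data used in the PageRank computation is represented using only a small amount of memory.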

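The iterative computation that the abstract above parallelizes can be sketched on a single machine as follows. This is a plain power-iteration PageRank over adjacency lists, not the paper's clustered, multithreaded implementation; the damping factor 0.85, the fixed iteration count, and the handling of pages without out-links are conventional assumptions, and the toy graph is made up.

```python
def pagerank(out_links, damping=0.85, iterations=50):
    """Estimate PageRank by power iteration over an adjacency-list graph.

    out_links maps each page to the list of pages it links to. Storing only
    the links (not a dense matrix) keeps memory proportional to the link count.
    """
    pages = sorted(set(out_links) | {q for targets in out_links.values() for q in targets})
    n = len(pages)
    rank = {p: 1.0 / n for p in pages}
    for _ in range(iterations):
        # Rank held by pages without out-links is spread evenly over all pages.
        dangling = sum(rank[p] for p in pages if not out_links.get(p))
        new_rank = {p: (1.0 - damping) / n + damping * dangling / n for p in pages}
        for page, targets in out_links.items():
            if targets:
                share = damping * rank[page] / len(targets)
                for target in targets:
                    new_rank[target] += share
        rank = new_rank
    return rank


# Toy graph for illustration: three pages link to "c", nothing links to "d".
toy_links = {"a": ["b", "c"], "b": ["c"], "c": ["a"], "d": ["c"]}
ranks = pagerank(toy_links)
```

Each iteration touches every link once, which is why the cost grows with the size of the index and why splitting the pages across servers, as the paper proposes, is attractive.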

The Design and Implementation of Korean History Web Courseware Using Semantic Network (의미망을 활용한 국사과 웹 코스웨어의 설계 및 구현)

  • Park, Chan-Ghu;Yun, Hong-Won
    • The Journal of Korean Association of Computer Education / v.3 no.1 / pp.177-189 / 2000
  • This paper describes the design and implementation of Korean History Web courseware that uses a semantic network to build a learning environment from the viewpoint of cognitive flexibility theory. The most important task in designing courseware with a semantic network is building the learning environment. The first step is to analyze the learning content; after that, the types of links between learning subjects should be defined. A knowledge map should then be developed in which links of each type connect the learning subjects.


The Design and Implementation of HTML Document Integrity Management System (HTML 문서의 무결성 유지 시스템의 설계 및 구현)

  • 조이기;이영운;황인문;양수영;김원중
    • Proceedings of the Korean Institute of Information and Communication Sciences Conference / 2002.05a / pp.380-383 / 2002
  • It is difficult to manage broken links, such as dangling and inaccurate references, in a manual site that consists of a large number of HTML documents, such as the KLDP (Korean Linux Documentation Project, http://kldp.org) Web site. In this paper, we define the relationships and constraint conditions that exist between a Web site's HTML documents. We then design and implement HIMS (HTML Document Integrity Management System), which notifies the user when an integrity violation occurs or launches a trigger operation to keep integrity between HTML documents on insert, delete, and update.


A Method to Block Spam Mail Automatically Through the Connection to Link URL (링크 유알엘 접속을 통한 스팸메일 자동 차단 방법에 관한 연구)

  • Jung, Nam-Cheol
    • Journal of Digital Contents Society / v.8 no.4 / pp.451-458 / 2007
  • In this paper, I developed a method that automatically blocks spam mail by connecting to the URLs linked in it. The blocking system works as follows. First, the system extracts the URLs linked in an electronic mail delivered from any server on the internet. Next, the system connects to the web pages at those URLs. Last, the system blocks the electronic mail if those web pages contain any keyword defined as a clue to spam mail.

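The three steps listed in the abstract above (extract URLs, connect to them, check for spam keywords) can be sketched as follows. This is a minimal illustration, not the author's system: the URL regular expression is a simplification, and `fetch_page`, `is_spam`, the keyword list, and the example hosts are hypothetical.

```python
import re
from urllib.request import urlopen

# Simplified URL matcher (an assumption); trailing punctuation may be captured.
URL_PATTERN = re.compile(r"https?://[^\s\"'<>]+")


def fetch_page(url, timeout=5):
    """Download a page and return its text, replacing undecodable bytes."""
    with urlopen(url, timeout=timeout) as response:
        return response.read().decode("utf-8", errors="replace")


def is_spam(mail_body, spam_keywords, fetch=fetch_page):
    """Return True if any page linked from the mail contains a spam keyword."""
    for url in URL_PATTERN.findall(mail_body):       # step 1: extract linked URLs
        try:
            page = fetch(url).lower()                # step 2: connect to the page
        except (OSError, ValueError):
            continue                                 # unreachable link: skip it
        if any(keyword.lower() in page for keyword in spam_keywords):
            return True                              # step 3: block on a keyword hit
    return False


# Offline usage with a stand-in fetcher, so no network access is needed.
fake_pages = {
    "http://shop.example/deal": "Win a FREE prize now",
    "http://news.example/today": "Weather report",
}


def fake_fetch(url):
    return fake_pages.get(url, "")


blocked = is_spam("See http://shop.example/deal today", ["free prize"], fetch=fake_fetch)
allowed = is_spam("Read http://news.example/today", ["free prize"], fetch=fake_fetch)
```

Passing the fetcher as a parameter is a design choice made for this sketch so that the keyword check can be exercised without contacting real servers.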

Link Analysis on Institutional Repository web Network of Indian Institute of Technologies Registered in open DOAR-uncovering Patterns and Trends Hidden in the Network

  • Kumar, Kutty
    • International Journal of Knowledge Content Development & Technology / v.8 no.2 / pp.23-36 / 2018
  • Institutional repositories (IR) are promising to be extremely advantageous to scholars especially in developing countries. IR initiatives started in India during the late nineties and the popularity of this concept is growing rapidly in the higher educational and research institutions to disseminate newly emerging knowledge and expertise. The purpose of this paper is to critically analyze the network links of IR websites among four IITs that are registered in open DOAR (Directory of Open Access Repositories) web portal. The Institutional Repositories chosen for the study are IIT Delhi, IIT Hyderabad, IIT Bombay, and IIT Kanpur. The analysis of the study focused on standard graph and network cohesion metrics, such as density, diameter, eccentricity and distances, and clustering coefficient; for an even more detailed analysis advanced centrality measures and fast algorithms such as clique census are used.
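
Two of the cohesion metrics named in the abstract above, density and the clustering coefficient, can be computed as in the sketch below. It assumes an undirected graph stored as neighbor sets; the node labels and edges are a made-up toy graph, not data from the study, and the function names are hypothetical.

```python
from itertools import combinations


def density(adjacency):
    """Share of possible undirected edges that are actually present."""
    n = len(adjacency)
    if n < 2:
        return 0.0
    edges = sum(len(neighbors) for neighbors in adjacency.values()) / 2
    return 2 * edges / (n * (n - 1))


def clustering_coefficient(adjacency, node):
    """Share of a node's neighbor pairs that are themselves connected."""
    neighbors = adjacency[node]
    k = len(neighbors)
    if k < 2:
        return 0.0
    linked_pairs = sum(1 for a, b in combinations(neighbors, 2) if b in adjacency[a])
    return 2 * linked_pairs / (k * (k - 1))


# Toy undirected graph: each key maps to the set of its neighbors.
toy_graph = {
    "r1": {"r2", "r3"},
    "r2": {"r1", "r3"},
    "r3": {"r1", "r2", "r4"},
    "r4": {"r3"},
}
graph_density = density(toy_graph)
```

Here the toy graph has 4 of 6 possible edges, so its density is 2/3, and `r3` has a clustering coefficient of 1/3 because only one of its three neighbor pairs is linked.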