• Title/Summary/Keyword: Web pages

Search Result 553, Processing Time 0.032 seconds

Thai Internet Users' Personality Traits and their Preferred Web Portal's Characteristics

  • Tanya, Rattipon;Tanlamai, Uthai
    • Journal of Information Technology Applications and Management
    • /
    • v.20 no.3
    • /
    • pp.19-30
    • /
    • 2013
  • The objective of this research is to identify a web portal's specific functions and layouts that are aligned with personality trait of an individual internet user. This first stage of the overall research project intends to check whether the research instrument, namely the NEO Five Factors Inventory (NEO-FFI), is applicable to assess the personality traits of Thai internet users. Based on these personality traits, text-based description of functions/layouts of a web portal was developed and given to professional designers to mock up example web portal pages. These web portal pages were in alignment with individual personality traits. Rating data on the functions/layouts corresponding to individual personality traits were collected from an online survey of 207 Thai internet users. Results showed that respondents gave more consistent rating to the functions/layouts close to their individual personality traits identified in the text-based descriptions than in the mocked-up web portal pages.

Web Page Segmentation

  • Ahmad, Mahmood;Lee, Sungyoung
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2014.11a
    • /
    • pp.1087-1090
    • /
    • 2014
  • This paper describes an overview and research work related to web page segmentation. Over a period of time, various techniques have been used and proposed to extract meaningful information from web pages automatically. Due to voluminous amount of data this extraction demanded state of the art techniques that segment the web pages just like or close to humans. Motivation behind this is to facilitate applications that rely on the meaningful data acquired from multiple web pages. Information extraction, search engines, re-organized web display for small screen devices are few strong candidate areas where web page extraction has adequate potential and utility of usage.

Effect of Rule Identification in Acquiring Rules from Web Pages (웹 페이지의 내재 규칙 습득 과정에서 규칙식별 역할에 대한 효과 분석)

  • Kang, Ju-Young;Lee, Jae-Kyu;Park, Sang-Un
    • Journal of Intelligence and Information Systems
    • /
    • v.11 no.1
    • /
    • pp.123-151
    • /
    • 2005
  • In the world of Web pages, there are oceans of documents in natural language texts and tables. To extract rules from Web pages and maintain consistency between them, we have developed the framework of XRML(extensible Rule Markup Language). XRML allows the identification of rules on Web pages and generates the identified rules automatically. For this purpose, we have designed the Rule Identification Markup Language (RIML) that is similar to the formal Rule Structure Markup Language (RSML), both as pares of XRML. RIML is designed to identify rules not only from texts, but also from tables on Web pages, and to transform to the formal rules in RSは syntax automatically. While designing RIML, we considered the features of sharing variables and values, omitted terms, and synonyms. Using these features, rules can be identified or changed once, automatically generating their corresponding RSML rules. We have conducted an experiment to evaluate the effect of the RIML approach with real world Web pages of Amazon.com, BamesandNoble.com, and Powells.com We found that $97.7\%$ of the rules can be detected on the Web pages, and the completeness of generated rule components is $88.5\%$. This is good proof that XRML can facilitate the extraction and maintenance of rules from Web pages while building expert systems in the Semantic Web environment.

  • PDF

Mobile Web Automatic Generation using the Web users usage pattern (사용자 이용패턴을 이용한 모바일웹 컨텐츠 자동 생성에 관한 연구)

  • Ko, Hee-Ae;Kim, Jong-Keun;Sim, Kun-Ho;Meihua, Zhao;Yin, Chang-Yi;Lim, Young-Hwan
    • Journal of Digital Contents Society
    • /
    • v.9 no.4
    • /
    • pp.583-590
    • /
    • 2008
  • These days Mobile web Full Browsing is getting the issues which displays whole Web Site contents in browser of mobile device. Since the size of browser is too small, in this paper, it recommends a new method that web users can choose variable service and web contents for mobile contents by adopting new platform. The mobile web would be generated automatically to apply the web 2.0 through the web users pattern. The platform is that no one belong in data, anyone can use this, anyone can modify the data, so variable users can create new contents for proper mobile web. It is the Mobilder application which can build Mobile web pages. Any User can make mobile web pages with Mobilder Before producing the program, in this paper, it will build mobile web pages by web users navigation pattern. After analysing the web users pattern, we will produce the mobile web pages through this.

  • PDF

Extracting Specific Information in Web Pages Using Machine Learning (머신러닝을 이용한 웹페이지 내의 특정 정보 추출)

  • Lee, Joung-Yun;Kim, Jae-Gon
    • Journal of Korean Society of Industrial and Systems Engineering
    • /
    • v.41 no.4
    • /
    • pp.189-195
    • /
    • 2018
  • With the advent of the digital age, production and distribution of web pages has been exploding. Internet users frequently need to extract specific information they want from these vast web pages. However, it takes lots of time and effort for users to find a specific information in many web pages. While search engines that are commonly used provide users with web pages containing the information they are looking for on the Internet, additional time and efforts are required to find the specific information among extensive search results. Therefore, it is necessary to develop algorithms that can automatically extract specific information in web pages. Every year, thousands of international conference are held all over the world. Each international conference has a website and provides general information for the conference such as the date of the event, the venue, greeting, the abstract submission deadline for a paper, the date of the registration, etc. It is not easy for researchers to catch the abstract submission deadline quickly because it is displayed in various formats from conference to conference and frequently updated. This study focuses on the issue of extracting abstract submission deadlines from International conference websites. In this study, we use three machine learning models such as SVM, decision trees, and artificial neural network to develop algorithms to extract an abstract submission deadline in an international conference website. Performances of the suggested algorithms are evaluated using 2,200 conference websites.

Layout Analysis for Calculation of Web Page Similarity as Image

  • Mitsuhashi, Noriaki;Yamaguchi, Toru;Takama, Yasufumi
    • Proceedings of the Korean Institute of Intelligent Systems Conference
    • /
    • 2003.09a
    • /
    • pp.142-145
    • /
    • 2003
  • When we search information on the Web using search engines, they only analyze the text information collected from the source files of Web pages. However, there is a limit to analyze the layout of a Web page only from its source file, although Web page design is the most important factor for a user to estimate a page. In particular it often happens on the Web that the pages of similar design ofter similar information. We propose a method to analyze layout for comparing the design of pages by treating the displayed page as image.

  • PDF

A Study on Personalization System Using Web Log and Purchasing Database (웹 로그와 구매 DB를 이용한 개인화 시스템에 관한 연구)

  • 김영태;이성주
    • Proceedings of the Korean Institute of Intelligent Systems Conference
    • /
    • 2003.09b
    • /
    • pp.23-26
    • /
    • 2003
  • In this paper, a methodolgy for customizing web pages for indivisual users is suggested. It shows an efficient way to personalize web pages by predicting one's site access pattern. In addition, the prediction can reflect one's tendency after actual purchase. By using the APRIORI algorithm, one of the association rule search methods, the associativity among the purchase items can be inferred. This inferrence is based on the log data in a web server and database about purchase. Finally, a web page which contains the relationship, relative links on other web pages, and inferred items can be generated after this process.

  • PDF

A Study on Extracting News Contents from News Web Pages (뉴스 웹 페이지에서 기사 본문 추출에 관한 연구)

  • Lee, Yong-Gu
    • Journal of the Korean Society for information Management
    • /
    • v.26 no.1
    • /
    • pp.305-320
    • /
    • 2009
  • The news pages provided through the web contain unnecessary information. This causes low performance and inefficiency of the news processing system. In this study, news content extraction methods, which are based on sentence identification and block-level tags news web pages, was suggested. To obtain optimal performance, combinations of these methods were applied. The results showed good performance when using an extraction method which applied the sentence identification and eliminated hyperlink text from web pages. Moreover, this method showed better results when combined with the extraction method which used block-level. Extraction methods, which used sentence identification, were effective for raising the extraction recall ratio.

Web Search Personalization based on Preferences for Page Features (문서 특성에 대한 선호도 기반 웹 검색 개인화)

  • Lee, Soo-Jung
    • Journal of The Korean Association of Information Education
    • /
    • v.15 no.2
    • /
    • pp.219-226
    • /
    • 2011
  • Web personalization has focused on extracting web pages interesting to users, to help users searching wanted information efficiently on the web. One of the main methods to achieve this is by using queries, links and users' preferred words in the pages. In this study, we surveyed from the web users the features of pages that are considered important to themselves in selecting web pages. The survey results showed that the content of the pages is the most important. However, images and readability of the page are rated as high as the content for some users. Based on this result, we present a method for maintaining relative weights of major page features differently in the profile for each user, which is used for personalizing web search results. Performance of the proposed personalization method is analyzed to prove its superiority such that it yields as much as 1.5 times higher rate than the system utilizing both queries and preferred words and about 2.3 times higher rate than a generic search engine.

  • PDF

e-Cohesive Keyword based Arc Ranking Measure for Web Navigation (연관 웹 페이지 검색을 위한 e-아크 랭킹 메저)

  • Lee, Woo-Key;Lee, Byoung-Su
    • Journal of KIISE:Databases
    • /
    • v.36 no.1
    • /
    • pp.22-29
    • /
    • 2009
  • The World Wide Web has emerged as largest media which provides even a single user to market their products and publish desired information; on the other hand the user can access what kind of information abundantly enough as well. As a result web holds large amount of related information distributed over multiple web pages. The current search engines search for all the entered keywords in a single webpage and rank the resulting set of web pages as an answer to the user query. But this approach fails to retrieve the pair of web pages which contains more relevant information for users search. We introduce a new search paradigm which gives different weights to the query keywords according to their order of appearance. We propose a new arc weight measure that assigns more relevance to the pair of web pages with alternate keywords present so that the pair of web pages which contains related but distributed information can be presented to the user. Our measure proved to be effective on the similarity search in which the experimentation represented the e~arc ranking measure outperforming the conventional ones.