• Title/Summary/Keyword: web search systems

Search Result 333, Processing Time 0.023 seconds

Concept Network-based Personalized Web Search Systems (개념 네트워크 기반 사용자 인지형 웹 검색 시스템)

  • Yune, Hong-June;Noh, Joon-Ho;Kim, Han-Joon;Lee, Byung-Jeong;Kang, Soo-Yong;Chang, Jae-Young
    • Journal of Internet Computing and Services
    • /
    • v.12 no.2
    • /
    • pp.63-73
    • /
    • 2011
  • In general, conventional search engines provide the same search results for the same queries of users, and however such techniques do not consider users' characteristics. To overcome this problem, we need a new way of personalized search which returns customized search results according to users' preference. In this paper, we propose a concept network profile-based personalized web search system in which the concept network is developed for accumulating users' characteristics. The concept network-based user profile is used to expand initial search queries to achieve personalized search. The concept network is a network structure of concepts where each concept is generated whenever each query is submitted, and it can be defined as a set of keywords extracted from the selected documents. Furthermore, we have improved the concept networks by augmenting intent keywords of each concept with a set of classification tags, called folksonomy, assigned to each document. For an additional personalized search technique, we propose a new re-ranking method that analayzes the degree of overlapped search results.

Quality Dimensions Affecting the Effectiveness of a Semantic-Web Search Engine (검색 효과성에 영향을 미치는 시맨틱웹 검색시스템 품질요인에 관한 연구)

  • Han, Dong-Il;Hong, Il-Yoo
    • Asia pacific journal of information systems
    • /
    • v.19 no.1
    • /
    • pp.1-31
    • /
    • 2009
  • This paper empirically examines factors that potentially influence the success of a Web-based semantic search engine. A research model has been proposed that shows the impact of quality-related factors upon the effectiveness of a semantic search engine, based on DeLone and McLean's(2003) information systems success model. An empirical study has been conducted to test hypotheses formulated around the research model, and statistical methods were applied to analyze gathered data and draw conclusions. Implications for academics and practitioners are offered based on the findings of the study. The proposed model includes three quality dimensions of a Web-based semantic search engine-namely, information quality, system quality and service quality. These three dimensions each have measures designed to collectively assess the respective dimension. The model is intended to examine the relationship between measures of these quality dimensions and measures of two dependent constructs, including individuals' net benefit and user satisfaction. Individuals' net benefit was measured by the extent to which the user's information needs were adequately met, whereas user satisfaction was measured by a combination of the perceived satisfaction with search results and the perceived satisfaction with the overall system. A total of 23 hypotheses have been formulated around the model, and a questionnaire survey has been conducted using a functional semantic search website created by KT and Hakia, so as to collect data to validate the model. Copies of a questionnaire form were handed out in person to 160 research associates and employees working in the area of designing and developing semantic search engines. Those who received the form, 148 respondents returned valid responses. The survey form asked respondents to use the given website to answer questions concerning the system. The results of the empirical study have indicated that, of the three quality dimensions, information quality was found to have the strongest association with the effectiveness of a Web-based semantic search engine. This finding is consistent with the observation in the literature that the aspects of the information quality should serve as a basis for evaluating the search outcomes from a semantic search engine. Measures under the information quality dimension that have a positive effect on informational gratification and user satisfaction were found to be recall and currency. Under the system quality dimension, response time and interactivity, were positively related to informational gratification. On the other hand, only one measure under the service quality dimension, reliability was found to have a positive relationship with user satisfaction. The results were based on the seven hypotheses that have been accepted. One may wonder why 15 out of the 23 hypotheses have been rejected and question the theoretical soundness of the model. However, the correlations between independent variables and dependent variables came out to be fairly high. This suggests that the structural equation model yielded results inconsistent with those of coefficient analysis, because the structural equation model intends to examine the relationship among independent variables as well as the relationship between independent variables and dependent variables. The findings offer some useful implications for owners of a semantic search engine, as far as the design and maintenance of the website is concerned. First, the system should be designed to respond to the user's query as fast as possible. Also it should be designed to support the search process by recommending, revising, and choosing a search query, so as to maximize users' interactions with the system. Second, the system should present search results with maximum recall and currency to effectively meet the users' expectations. Third, it should be capable of providing online services in a reliable and trustworthy manner. Finally, effective increase in user satisfaction requires the improvement of quality factors associated with a semantic search engine, which would in turn help increase the informational gratification for users. The proposed model can serve as a useful framework for measuring the success of a Web-based semantic search engine. Applying the search engine success framework to the measurement of search engine effectiveness has the potential to provide an outline of what areas of a semantic search engine needs improvement, in order to better meet information needs of users. Further research will be needed to make this idea a reality.

Personalized Web Search using Query based User Profile (질의기반 사용자 프로파일을 이용하는 개인화 웹 검색)

  • Yoon, Sung Hee
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.17 no.2
    • /
    • pp.690-696
    • /
    • 2016
  • Search engines that rely on morphological matching of user query and web document content do not support individual interests. This research proposes a personalized web search scheme that returns the results that reflect the users' query intent and personal preferences. The performance of the personalized search depends on using an effective user profiling strategy to accurately capture the users' personal interests. In this study, the user profiles are the databases of topic words and customized weights based on the recent user queries and the frequency of topic words in click history. To determine the precise meaning of ambiguous queries and topic words, this strategy uses WordNet to calculate the semantic relatedness to words in the user profile. The experiments were conducted by installing a query expansion and re-ranking modules on the general web search systems. The results showed that this method has 92% precision and 82% recall in the top 10 search results, proving the enhanced performance.

Preliminary Performance Evaluation of a Web Crawler with Dynamic Scheduling Support (동적 스케줄링 기반 웹 크롤러의 성능분석)

  • Lee, Yong-Doo;Chae, Soo-Hwan
    • Journal of Korea Society of Industrial Information Systems
    • /
    • v.8 no.3
    • /
    • pp.12-18
    • /
    • 2003
  • A web crawler is used widely in a variety of Internet applications such as search engines. As the Internet continues to grow, high performance web crawlers become more essential. Crawl scheduling which manages the allocation of web pages to each process for downloading documents is one of the important issues. In this paper, we identify issues that are important and challenging in the crawl scheduling. To address the issues, we propose a dynamic owl scheduling framework and subsequently a system architecture for a web crawler subject to the framework. This paper presents the architecture of a web crawler with dynamic scheduling support. The result of our preliminary performance evaluation made to the proposed crawler architecture is also presented.

  • PDF

Operation of StarDB web services and its Virtual Observatory supports

  • Shin, Min-Su;Yi, Hahn
    • The Bulletin of The Korean Astronomical Society
    • /
    • v.40 no.2
    • /
    • pp.60.1-60.1
    • /
    • 2015
  • We present the current operation status of StarDB web services by showing its user access statistics. The StarDB web services started its operation in late November, allowing world-wide users to access results of new variability analysis for Northern Sky Variability Survey light curves. New analysis results of various time-series data have been added to the StarDB services. Importantly, our services have supported a simple cone search, which is an internationally well-defined catalog search interface in the international Virtual Observatory systems. We have collected user access statistics such as how users find our analysis data since its operation in later November. We expect our analysis of the StarDB operation to help Korean community members who plan and operate their own web services preparing for a future era of big survey data.

  • PDF

A Brief Survey into the Field of Automatic Image Dataset Generation through Web Scraping and Query Expansion

  • Bart Dikmans;Dongwann Kang
    • Journal of Information Processing Systems
    • /
    • v.19 no.5
    • /
    • pp.602-613
    • /
    • 2023
  • High-quality image datasets are in high demand for various applications. With many online sources providing manually collected datasets, a persisting challenge is to fully automate the dataset collection process. In this study, we surveyed an automatic image dataset generation field through analyzing a collection of existing studies. Moreover, we examined fields that are closely related to automated dataset generation, such as query expansion, web scraping, and dataset quality. We assess how both noise and regional search engine differences can be addressed using an automated search query expansion focused on hypernyms, allowing for user-specific manual query expansion. Combining these aspects provides an outline of how a modern web scraping application can produce large-scale image datasets.

WebChemDB: An Integrated Chemical Database Retrieval System

  • Hou, Bo-Kyeng;Moon, Eun-Joung;Moon, Sung-Chul;Kim, Hae-Jin
    • Genomics & Informatics
    • /
    • v.7 no.4
    • /
    • pp.212-216
    • /
    • 2009
  • WebChemDB is an integrated chemical database retrieval system that provides access to over 8 million publicly available chemical structures, including related information on their biological activities and direct links to other public chemical resources, such as PubChem, ChEBI, and DrugBank. The data are publicly available over the web, using two-dimensional (2D) and three-dimensional (3D) structure retrieval systems with various filters and molecular descriptors. The web services API also provides researchers with functionalities to programmatically manipulate, search, and analyze the data.

Extended Semantic Web Services Retrieval Model for the Intelligent Web Services (지능형 웹 서비스를 위한 확장된 시맨틱 웹서비스 검색 모델)

  • Choi, Ok-Kyung;Han, Sang-Yong;Lee, Zoon-Ky
    • The KIPS Transactions:PartD
    • /
    • v.13D no.5 s.108
    • /
    • pp.725-730
    • /
    • 2006
  • Recently Web services have become a key technology which is indispensable for e-business. Due to its ability to provide the desired information or service regardless of time and place, integrating current application systems within a single business or between multiple businesses with standardized technologies are realized using the open network and Internet. However, the current Web Services Retrieval Systems, based on text oriented search are incapable of providing reliable search results by perceiving the similarity or interrelation between the various terms. Currently there are no web services retrieval models containing such semantic web functions. This research work is purported for solving such problems by designing and implementing an extended Semantic Web Services Retrieval Model that is capable of searching for general web documents, UDDI and semantic web documents. Execution result is proposed in this paper and its efficiency and accuracy are verified through it.

A Study on Personalization System Using Web Log and Purchasing Database (웹 로그와 구매 DB를 이용한 개인화 시스템에 관한 연구)

  • 김영태;이성주
    • Proceedings of the Korean Institute of Intelligent Systems Conference
    • /
    • 2003.09b
    • /
    • pp.23-26
    • /
    • 2003
  • In this paper, a methodolgy for customizing web pages for indivisual users is suggested. It shows an efficient way to personalize web pages by predicting one's site access pattern. In addition, the prediction can reflect one's tendency after actual purchase. By using the APRIORI algorithm, one of the association rule search methods, the associativity among the purchase items can be inferred. This inferrence is based on the log data in a web server and database about purchase. Finally, a web page which contains the relationship, relative links on other web pages, and inferred items can be generated after this process.

  • PDF

Web-based Design Support System for Automotive Engine Pulley (웹 기반 자동차용 엔진 풀리 설계 지원 시스템)

  • Kim H.J.;Chun D.M.;Ahn S.H.;Hwang B.C.;Jang J.D.
    • Proceedings of the Korean Society of Precision Engineering Conference
    • /
    • 2006.05a
    • /
    • pp.639-640
    • /
    • 2006
  • Many companies in mechanical engineering fields have accumulated information of design and manufacturing. The Enterprise Resource Planning (ERP) and Product Data Management (PDM) systems help information gathering and data managing. However, these systems are not flexible to support suitable functionality for specific product because these systems deal with entire enterprise resources. To cope with this issue, a web-based design support system was constructed for the design process of automotive steel pulley. This system provided 1) search service for part design with key word and clustering map, and 2) estimation service of maximum stress. These services reduced design time by reducing iterative jobs with Computer Aided Design (CAD) and Computer Aided Engineering (CAE) for stress analysis, and by enhancing search for existing data of steel pulley.

  • PDF