Search | Korea Science

An Improved Approach to Ranking Web Documents

Gupta, Pooja;Singh, Sandeep K.;Yadav, Divakar;Sharma, A.K.
- Journal of Information Processing Systems
- /
- v.9 no.2
- /
- pp.217-236
- /
- 2013
Ranking thousands of web documents so that they are matched in response to a user query is really a challenging task. For this purpose, search engines use different ranking mechanisms on apparently related resultant web documents to decide the order in which documents should be displayed. Existing ranking mechanisms decide on the order of a web page based on the amount and popularity of the links pointed to and emerging from it. Sometime search engines result in placing less relevant documents in the top positions in response to a user query. There is a strong need to improve the ranking strategy. In this paper, a novel ranking mechanism is being proposed to rank the web documents that consider both the HTML structure of a page and the contextual senses of keywords that are present within it and its back-links. The approach has been tested on data sets of URLs and on their back-links in relation to different topics. The experimental result shows that the overall search results, in response to user queries, are improved. The ordering of the links that have been obtained is compared with the ordering that has been done by using the page rank score. The results obtained thereafter shows that the proposed mechanism contextually puts more related web pages in the top order, as compared to the page rank score.
https://doi.org/10.3745/JIPS.2013.9.2.217 인용 PDF KSCI

Performance Evaluation of Re-ranking and Query Expansion for Citation Metrics: Based on Citation Index Databases (인용 지표를 이용한 재순위화 및 질의 확장의 성능 평가 - 인용색인 데이터베이스를 기반으로 -)

HyeKyung Lee;Yong-Gu lee
- Journal of the Korean Society for Library and Information Science
- /
- v.57 no.3
- /
- pp.249-277
- /
- 2023
The purpose of this study is to explore the potential contribution of citation metrics to improving the search performance of citation index databases. To this end, the study generated ten queries in the field of library and information science and conducted experiments based on the relevance assessment using 3,467 documents retrieved from the Web of Science and 60,734 documents published in 85 SSCI journals in the field of library and information science from 2000 to 2021. The experiments included re-ranking of the top 100 search results using citation metrics and search methods, query expansion experiments using vector space model retrieval systems, and the construction of a citation-based re-ranking system. The results are as follows: 1) Re-ranking using citation metrics differed from Web of Science's performance, acting as independent metrics. 2) Combining query term frequencies and citation counts positively affected performance. 3) Query expansion generally improved performance compared to the vector space model baseline. 4) User-based query expansion outperformed system-based. 5) Combining citation counts with suitability documents affected ranking within top suitability documents.
https://doi.org/10.4275/KSLIS.2023.57.3.249 인용 PDF

A Prototype Model for Handling Fuzzy Query in Voice Search on Smartphones (스마트폰의 음성 검색에서 퍼지 쿼리 처리를 위한 프로토타입 모델)

Choi, Dae-Young
- The KIPS Transactions:PartD
- /
- v.18D no.4
- /
- pp.309-312
- /
- 2011
Handling fuzzy query in voice search on smartphones is one of the most difficult problems. It is mainly derived from the complexity and the degree of freedom of natural language. To reduce the complexity and the degree of freedom of fuzzy query in voice search on smartphones, attribute-driven approach for fuzzy query is proposed. In addition, a new page ranking algorithm based on the values of attributes for handling fuzzy query is proposed. It provides a smartphone user with location-based personalized page ranking based on user's search intentions. It is a further step toward location-based personalized web search for smartphone users. In this paper, we design a prototype model for handling fuzzy query in voice search on smartphones and show the experimental results of the proposed approach compared to existing smartphones.
https://doi.org/10.3745/KIPSTD.2011.18D.4.309 인용 PDF KSCI

A probabilistic information retrieval model by document ranking using term dependencies (용어간 종속성을 이용한 문서 순위 매기기에 의한 확률적 정보 검색)

You, Hyun-Jo;Lee, Jung-Jin
- The Korean Journal of Applied Statistics
- /
- v.32 no.5
- /
- pp.763-782
- /
- 2019
This paper proposes a probabilistic document ranking model incorporating term dependencies. Document ranking is a fundamental information retrieval task. The task is to sort documents in a collection according to the relevance to the user query (Qin et al., Information Retrieval Journal, 13, 346-374, 2010). A probabilistic model is a model for computing the conditional probability of the relevance of each document given query. Most of the widely used models assume the term independence because it is challenging to compute the joint probabilities of multiple terms. Words in natural language texts are obviously highly correlated. In this paper, we assume a multinomial distribution model to calculate the relevance probability of a document by considering the dependency structure of words, and propose an information retrieval model to rank a document by estimating the probability with the maximum entropy method. The results of the ranking simulation experiment in various multinomial situations show better retrieval results than a model that assumes the independence of words. The results of document ranking experiments using real-world datasets LETOR OHSUMED also show better retrieval results.
https://doi.org/10.5351/KJAS.2019.32.5.763 인용 PDF KSCI

Design of Query Processing System to Retrieve Information from Social Network using NLP

Virmani, Charu;Juneja, Dimple;Pillai, Anuradha
- KSII Transactions on Internet and Information Systems (TIIS)
- /
- v.12 no.3
- /
- pp.1168-1188
- /
- 2018
Social Network Aggregators are used to maintain and manage manifold accounts over multiple online social networks. Displaying the Activity feed for each social network on a common dashboard has been the status quo of social aggregators for long, however retrieving the desired data from various social networks is a major concern. A user inputs the query desiring the specific outcome from the social networks. Since the intention of the query is solely known by user, therefore the output of the query may not be as per user's expectation unless the system considers 'user-centric' factors. Moreover, the quality of solution depends on these user-centric factors, the user inclination and the nature of the network as well. Thus, there is a need for a system that understands the user's intent serving structured objects. Further, choosing the best execution and optimal ranking functions is also a high priority concern. The current work finds motivation from the above requirements and thus proposes the design of a query processing system to retrieve information from social network that extracts user's intent from various social networks. For further improvements in the research the machine learning techniques are incorporated such as Latent Dirichlet Algorithm (LDA) and Ranking Algorithm to improve the query results and fetch the information using data mining techniques.The proposed framework uniquely contributes a user-centric query retrieval model based on natural language and it is worth mentioning that the proposed framework is efficient when compared on temporal metrics. The proposed Query Processing System to Retrieve Information from Social Network (QPSSN) will increase the discoverability of the user, helps the businesses to collaboratively execute promotions, determine new networks and people. It is an innovative approach to investigate the new aspects of social network. The proposed model offers a significant breakthrough scoring up to precision and recall respectively.
https://doi.org/10.3837/tiis.2018.03.011 인용 PDF KSCI

Relaxing Queries by Combining Knowledge Abstraction and Semantic Distance Approach (지식 추상화와 의미 거리 접근법을 통합한 질의 완화 방법론)

Shin, Myung-Keun;Park, Sung-Hyuk;Lee, Woo-Key;Huh, Soon-Young
- Journal of the Korean Operations Research and Management Science Society
- /
- v.32 no.1
- /
- pp.125-136
- /
- 2007
The study on query relaxation which provides approximate answers has received attention. In recent years, some arguments have been made that semantic relationships are useful to present the relationships among data values and calculating the semantic distance between two data values can be used as a quantitative measure to express relative distance. The aim of this article is a hierarchical metricized knowledge abstraction (HiMKA) with an emphasis on combining data abstraction hierarchy and distance measure among data values. We propose the operations and the query relaxation algorithm appropriate to the HiMKA. With various experiments and comparison with other method, we show that the HiMKA is very useful for the quantified approximate query answering and our result is to offer a new methodological framework for query relaxation.
PDF KSCI

Personalized Web Search using Query based User Profile (질의기반 사용자 프로파일을 이용하는 개인화 웹 검색)

Yoon, Sung Hee
- Journal of the Korea Academia-Industrial cooperation Society
- /
- v.17 no.2
- /
- pp.690-696
- /
- 2016
Search engines that rely on morphological matching of user query and web document content do not support individual interests. This research proposes a personalized web search scheme that returns the results that reflect the users' query intent and personal preferences. The performance of the personalized search depends on using an effective user profiling strategy to accurately capture the users' personal interests. In this study, the user profiles are the databases of topic words and customized weights based on the recent user queries and the frequency of topic words in click history. To determine the precise meaning of ambiguous queries and topic words, this strategy uses WordNet to calculate the semantic relatedness to words in the user profile. The experiments were conducted by installing a query expansion and re-ranking modules on the general web search systems. The results showed that this method has 92% precision and 82% recall in the top 10 search results, proving the enhanced performance.
https://doi.org/10.5762/KAIS.2016.17.2.690 인용 PDF KSCI

Accelerating Keyword Search Processing over XML Documents using Document-level Ranking (문서 단위 순위화를 통한 XML 문서에 대한 키워드 검색 성능 향상)

Lee, Hyung-Dong;Kim, Hyoung-Joo
- Journal of KIISE:Databases
- /
- v.33 no.5
- /
- pp.538-550
- /
- 2006
XML Keyword search enables us to get information easily without knowledge of structure of documents and returns specific and useful partial document results instead of whole documents. Element level query processing makes it possible, but computational complexity, as the number of documents grows, increases significantly overhead costs. In this paper, we present document-level ranking scheme over XML documents which predicts results of element-level processing to reduce processing cost. To do this, we propose the notion of 'keyword proximity' - the correlation of keywords in a document that affects the results of element-level query processing using path information of occurrence nodes and their resemblances - for document ranking process. In benefit of document-centric view, it is possible to reduce processing time using ranked document list or filtering of low scored documents. Our experimental evaluation shows that document-level processing technique using ranked document list is effective and improves performance by the early termination for top-k query.
PDF KSCI

New Re-ranking Technique based on Concept-Network Profiles for Personalized Web Search (웹 검색 개인화를 위한 개념네트워크 프로파일 기반 순위 재조정 기법)

Kim, Han-Joon;Noh, Joon-Ho;Chang, Jae-Young
- The Journal of the Institute of Internet, Broadcasting and Communication
- /
- v.12 no.2
- /
- pp.69-76
- /
- 2012
This paper proposes a novel way of personalized web search through re-ranking the search results with user profiles of concept-network structure. Basically, personalized search systems need to be based on user profiles that contain users' search patterns, and they actively use the user profiles in order to expand initial queries or to re-rank the search results. The proposed method is a sort of a re-ranking personalized search method integrated with query expansion facility. The method identifies some documents which occur commonly among a set of different search results from the expanded queries, and re-ranks the search results by the degree of co-occurring. We show that the proposed method outperforms the conventional ones by performing the empirical web search with a number of actual users who have diverse information needs and query intents.
https://doi.org/10.7236/JIWIT.2012.12.2.69 인용 PDF KSCI

Personalized Search Technique using Users' Personal Profiles (사용자 개인 프로파일을 이용한 개인화 검색 기법)

Yoon, Sung-Hee
- The Journal of the Korea institute of electronic communication sciences
- /
- v.14 no.3
- /
- pp.587-594
- /
- 2019
This paper proposes a personalized web search technique that produces ranked results reflecting user's query intents and individual interests. The performance of personalized search relies on an effective users' profiling strategy to accurately capture their interests and preferences. User profile is a data set of words and customized weights based on recent user queries and the topic words of web documents from their click history. Personal profile is used to expand a user query to the personalized query before the web search. To determine the exact meaning of ambiguous queries and topic words, this strategy uses WordNet to calculate semantic similarities to words in the user personal profile. Experimental results with query expansion and re-ranking modules installed on general search systems shows enhanced performance with this personalized search technique in terms of precision and recall.
https://doi.org/10.13067/JKIECS.2019.14.3.587 인용 PDF KSCI HTML

Search Result 50, Processing Time 0.032 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)