• Title/Summary/Keyword: Date Filtering

Search Result 30, Processing Time 0.026 seconds

Data Sparsity and Performance in Collaborative Filtering-based Recommendation

  • Kim Jong-Woo;Lee Hong-Joo
    • Management Science and Financial Engineering
    • /
    • v.11 no.3
    • /
    • pp.19-45
    • /
    • 2005
  • Collaborative filtering is one of the most common methods that e-commerce sites and Internet information services use to personalize recommendations. Collaborative filtering has the advantage of being able to use even sparse evaluation data to predict preference scores for new products. To date, however, no in-depth investigation has been conducted on how the data sparsity effect in customers' evaluation data affects collaborative filtering-based recommendation performance. In this study, we analyzed the sparsity effect and used a hybrid method based on customers' evaluations and purchases collected from an online bookstore. Results indicated that recommendation performance decreased monotonically as sparsity increased, and that performance was more sensitive to sparsity in evaluation data rather than in purchase data. Results also indicated that the hybrid use of two different types of data (customers' evaluations and purchases) helped to improve the recommendation performance when evaluation data were highly sparse.

Improving Performance of Jaccard Coefficient for Collaborative Filtering

  • Lee, Soojung
    • Journal of the Korea Society of Computer and Information
    • /
    • v.21 no.11
    • /
    • pp.121-126
    • /
    • 2016
  • In recommender systems based on collaborative filtering, measuring similarity is very critical for determining the range of recommenders. Data sparsity problem is fundamental in collaborative filtering systems, which is partly solved by Jaccard coefficient combined with traditional similarity measures. This study proposes a new coefficient for improving performance of Jaccard coefficient by compensating for its drawbacks. We conducted experiments using datasets of various characteristics for performance analysis. As a result of comparison between the proposed and the similarity metric of Pearson correlation widely used up to date, it is found that the two metrics yielded competitive performance on a dense dataset while the proposed showed much better performance on a sparser dataset. Also, the result of comparing the proposed with Jaccard coefficient showed that the proposed yielded far better performance as the dataset is denser. Overall, the proposed coefficient demonstrated the best prediction and recommendation performance among the experimented metrics.

Performance of GHICW(Group-wise Hybrid Interference Cancellation Scheme based on Wiener filtering) in Multi Rate DS-CDMA System (하이브리드 위너 필터링 간섭제거 기법을 이용한 다중 데이터 율 DS/CDMA 시스템의 성능 분석)

  • 정재필;최원태;박상규
    • Proceedings of the IEEK Conference
    • /
    • 2000.11a
    • /
    • pp.145-148
    • /
    • 2000
  • This paper presents the performance of a GHICW(Group-wise Hybrid Interference Cancellation scheme based on Wiener filtering) receiver for the multi-rate DS-CDMA system. Our scheme has a small processing delay and a simple hardware complexity compared to ordinary interference cancellation schemes by grouping users with the same date rate. The performance improvement of the low rate user is obtained by using a Wiener filter which precisely estimates the high rate users' bit.

  • PDF

Robust $H_{\infty}$ FIR Sampled-Date Filtering for Uncertain Time-Varying Systems with Unknown Nonlinearity

  • Ryu, Hee-Seob;Byung-Moon;Kwon, Oh-Kyu
    • Transactions on Control, Automation and Systems Engineering
    • /
    • v.3 no.2
    • /
    • pp.83-88
    • /
    • 2001
  • The robust linear H(sub)$\infty$ FIR filter, which guarantees a prescribed H(sub)$\infty$ performance, is designed for continuous time-varying systems with unknown cone-bounded nonlinearity. The infinite horizon filtering for time-varying systems is systems is investigated in therms of two Riccati equations by the finite moving horizon.

  • PDF

가상 커뮤니티 공간에서 블로거를 위한 추천시스템

  • Kim, Jae-Gyeong;O, Hyeok;An, Do-Hyeon
    • Proceedings of the Korea Inteligent Information System Society Conference
    • /
    • 2005.11a
    • /
    • pp.415-424
    • /
    • 2005
  • The rapid growth of blog has caused information overload where bloggers in the virtual community space are no longer able to effectively choose the blogs they are exposed to. Recommender systems have been widely advocated as a way of coping with the problem of information overload in e-business environment. Collaborative Filtering (CF) is the most successful recommendation method to date and used in many of the recommender systems. Therefore, we propose a CF-based recommender system for bloggers in the virtual community space. Our proposed methodology consists of three main phases: In the first phase, we apply the "Interest Value" to a recommender system. The Interest Value is a quantity value about user preference in virtual community, and can measure the opinion of users accurately. Next phase, we generate the neighborhood group based on the Interest Value. In the final phase, we use the Community Likeness Score (CLS) to generate the top-n recommendation list. The methodology is explained step by step with an illustrative example and is verified with real data of a blog service provider.

  • PDF

A Study on Extracting the Landuse Change Information of Seoul Using LANDSAT(MSS, TM) Data (1972~1985) (LANDAST(MSS, TM) Data를 이용(利用)한 서울시(市)의 토지이용(土地利用) 경년변화(經年變化)의 추출(抽出)에 관한 연구(硏究) (1972~1985년))

  • Ahn, Chul Ho;Ahn, Ki Won;Kim, Yong Il
    • KSCE Journal of Civil and Environmental Engineering Research
    • /
    • v.9 no.4
    • /
    • pp.113-124
    • /
    • 1989
  • In this study, we tried to extract the land-use change information of Seoul city using the multiple date images of the same geographic area. Multiple date image set is MSS('72, '79, '81, '93) and TM('85), and we carried out geometric correction, digitizing(due to the administrative boundary) in pre-processing process. In addition, we performed land-use classification with MLC(Maximum Likelihood Classifier) after improving the predictive accuracy of classification by filtering technique. At the stage of classification, ground truth data, topographic maps, aerial photographs were used to select the training field and statistical data of that time were compared with the classification result to prove the accuracy. As a result, urban area in Seoul has been increased('72 : 25.8 %${\rightarrow}$'81 : 43.0 %${\rightarrow}$'85 : 51.9 %) and Forest area decreased ('72 : 39.0 %${\rightarrow}$'85 : 28.4 %) as we estimated. Finally, it is concluded that the utilzation of satellite imagery is very effective, economical and helpful in the urban land-use/land-cover monitoring.

  • PDF

Probabilistic filtering for a biological knowledge discovery system with text mining and automatic inference (텍스트 마이닝 및 자동 추론 기반 생물학 지식 발견 시스템을 위한 확률 기반 필터링)

  • Lee, Hee-Jin;Park, Jong-C.
    • Journal of the Korea Society of Computer and Information
    • /
    • v.17 no.2
    • /
    • pp.139-147
    • /
    • 2012
  • In this paper, we discuss the structure of biological knowledge discovery system based on text mining and automatic inference. Given a set of biology documents, the system produces a new hypothesis in an integrated manner. The text mining module of the system first extracts the 'event' information of predefined types from the documents. The inference module then produces a new hypothesis based on the extracted results. Such an integrated system can use information more up-to-date and diverse than other automatic knowledge discovery systems use. However, for the success of such an integrated system, the precision of the text mining module becomes crucial, as any hypothesis based on a single piece of false positive information would highly likely be erroneous. In this paper, we propose a probabilistic filtering method that filters out false positives from the extraction results. Our proposed method shows higher performance over an occurrence-based baseline method.

OLAP4R: A Top-K Recommendation System for OLAP Sessions

  • Yuan, Youwei;Chen, Weixin;Han, Guangjie;Jia, Gangyong
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.11 no.6
    • /
    • pp.2963-2978
    • /
    • 2017
  • The Top-K query is currently played a key role in a wide range of road network, decision making and quantitative financial research. In this paper, a Top-K recommendation algorithm is proposed to solve the cold-start problem and a tag generating method is put forward to enhance the semantic understanding of the OLAP session. In addition, a recommendation system for OLAP sessions called "OLAP4R" is designed using collaborative filtering technique aiming at guiding the user to find the ultimate goals by interactive queries. OLAP4R utilizes a mixed system architecture consisting of multiple functional modules, which have a high extension capability to support additional functions. This system structure allows the user to configure multi-dimensional hierarchies and desirable measures to analyze the specific requirement and gives recommendations with forthright responses. Experimental results show that our method has raised 20% recall of the recommendations comparing the traditional collaborative filtering and a visualization tag of the recommended sessions will be provided with modified changes for the user to understand.

Blog Intelligence (블로그 인텔리전스)

  • Kim, Jae-Kyeong;Kim, Hyea-Kyeong;O, Hyouk
    • Journal of Information Technology Services
    • /
    • v.7 no.3
    • /
    • pp.71-85
    • /
    • 2008
  • The rapid growth of blog has caused information overload where bloggers in the virtual community space are no longer able to effectively choose the blogs they are exposed to. Recommender systems have been widely advocated as a way of coping with the problem of information overload in e-business environment. Collaborative Filtering (CF) is the most successful recommendation method to date and used in many of the recommender systems. In this research, we propose a CF-based recommender system for bloggers to find their similar bloggers or preferable virtual community without burdensome search effort. For such a purpose, we apply the "Interest Value" to CF recommender systems. The Interest Value is the quantity value about users' transaction data in virtual community, and can measure the opinion of users accurately. Based on the Interest Value, the neighborhood group is generated, and virtual community list is recommended using the Community Likeness Score (ClS). Our experimental results upon real data of Korean Blog site show that the methodology is capable of dealing with the information overload issue in virtual community space. And Interest Value is proved to have the potential to meet the challenge of recommendation methodologies in virtual community space.