• Title/Summary/Keyword: multimedia data mining


An Extended Dynamic Web Page Recommendation Algorithm Based on Mining Frequent Traversal Patterns (빈발 순회패턴 탐사에 기반한 확장된 동적 웹페이지 추천 알고리즘)

  • Lee KeunSoo;Lee Chang Hoon;Yoon Sun-Hee;Lee Sang Moon;Seo Jeong Min
    • Journal of Korea Multimedia Society / v.8 no.9 / pp.1163-1176 / 2005
  • The Web is the largest distributed information space, but an individual's capacity to read and digest content is essentially fixed. In such Web environments, mining traversal patterns is an important Web mining problem with many application domains, including system design and information services. Conventional traversal pattern mining systems use the inter-page associations within sessions, with only a very restricted mechanism (based on a vector or matrix) for generating frequent K-Pagesets. We extend a family of novel algorithms (termed WebPR, for Web Page Recommend) that mine frequent traversal patterns and then the pagesets to recommend. We add a WebPR(A) algorithm to the WebPR family and propose a new winWebPR(T) algorithm, which introduces a window concept into WebPR(T). Experiments covering the two extended algorithms, run on two real data sets from the LadyAsiana and KBS media server sites, show that our method outperforms conventional methods.
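
The abstract does not spell out the WebPR algorithms, so the following is only a generic illustration of the idea of counting frequent page pairs (2-pagesets) inside a sliding window over each session. It is a minimal sketch, not the authors' winWebPR(T): the function name `frequent_pagesets_in_window`, the `window` and `min_count` parameters, and the toy sessions are all assumptions made for this example.

```python
from collections import Counter
from itertools import combinations
from typing import Dict, FrozenSet, List


def frequent_pagesets_in_window(
    sessions: List[List[str]], window: int, min_count: int
) -> Dict[FrozenSet[str], int]:
    """Count page pairs that co-occur within `window` consecutive clicks."""
    counts: Counter = Counter()
    for session in sessions:
        seen_in_session = set()
        # Slide a fixed-size window over the click sequence.
        for start in range(max(1, len(session) - window + 1)):
            pages = set(session[start:start + window])
            for pair in combinations(sorted(pages), 2):
                seen_in_session.add(frozenset(pair))
        # Each session contributes at most once per pair.
        counts.update(seen_in_session)
    return {pair: c for pair, c in counts.items() if c >= min_count}


# Hypothetical toy sessions, invented for illustration only.
sessions = [
    ["home", "news", "sports"],
    ["home", "news"],
    ["news", "sports", "home"],
    ["home", "shop", "cart", "news"],
]
result = frequent_pagesets_in_window(sessions, window=2, min_count=2)
```

With a window of 2, only pages visited back to back are paired, so a page reached several clicks later in the same session does not count toward the pair.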


Dynamic Decision Tree for Data Mining (데이터마이닝을 위한 동적 결정나무)

  • Choi, Byong-Su;Cha, Woon-Ock
    • Communications for Statistical Applications and Methods / v.16 no.6 / pp.959-969 / 2009
  • The decision tree is a typical tool for data classification. This tool is implemented in DAVIS (Huh and Song, 2002). All the visualization tools and statistical clustering tools implemented in DAVIS can communicate with the decision tree. This paper presents methods for applying data visualization techniques to the decision tree, using a real data set.

How Query by humming, a Music Information Retrieval System, is Being Used in the Music Education Classroom

  • Bradshaw, Brian
    • Journal of Multimedia Information System / v.4 no.3 / pp.99-106 / 2017
  • This study presents a qualitative and quantitative analysis of how query by humming is being used by music educators in the classroom. Query by humming is a subdivision of music information retrieval. To define what a music information retrieval system is, I first need to define information retrieval. Berger and Lafferty (1999) define information retrieval as "someone doing a query to a retrieval system, a user begins with an information need. This need is an ideal document- perfect fit for the user, but almost certainly not present in the retrieval system's collection of documents. From this ideal document, the user selects a group of identifying terms. In the context of traditional IR, one could view this group of terms as akin to expanded query." Music Information Retrieval has its background in information systems, data mining, intelligent systems, library science, music history and music theory. Three rounds of surveys using QuestionPro were completed. The study found that there were variances in knowledge, training, and level of awareness of query by humming and music information retrieval systems. Those variances were related to music specialty, the level taught, and the age of the respondents.

Comparison of Feature Selection Processes for Image Retrieval Applications

  • Choi, Young-Mee;Choo, Moon-Won
    • Journal of Korea Multimedia Society / v.14 no.12 / pp.1544-1548 / 2011
  • Feature selection, the process of choosing a subset of the original features, is considered a crucial preprocessing step in image processing applications. A large pool of techniques has already been developed in the machine learning and data mining fields. In this paper, two approaches, classification without feature selection and classification with feature selection, are investigated to compare their predictive effectiveness. Color co-occurrence features are used to define image features. The standard Sequential Forward Selection algorithm is used to identify relevant features and redundancy among them. Four color spaces, RGB, YCbCr, HSV, and Gaussian space, are considered for computing color co-occurrence features. A gray-level image feature is also considered for performance comparison. The experimental results are presented.
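
The Sequential Forward Selection step named in this abstract can be sketched as a greedy loop that keeps adding whichever feature most improves a criterion. This is a minimal sketch rather than the authors' implementation: the `score` callback, the stopping rule (stop when no candidate improves the score), and the toy feature names and relevance values are assumptions made for illustration.

```python
from typing import Callable, FrozenSet, List, Sequence


def sequential_forward_selection(
    features: Sequence[str],
    score: Callable[[FrozenSet[str]], float],
    max_features: int,
) -> List[str]:
    """Greedily add the feature that most improves `score`."""
    selected: List[str] = []
    best_score = float("-inf")
    while len(selected) < max_features:
        candidates = [f for f in features if f not in selected]
        if not candidates:
            break
        # Score the current subset extended by each remaining feature.
        scored = [(score(frozenset(selected + [f])), f) for f in candidates]
        top_score, top_feature = max(scored)
        if top_score <= best_score:
            break  # no candidate improves the criterion
        selected.append(top_feature)
        best_score = top_score
    return selected


# Hypothetical toy criterion: per-feature relevance, with "hue" and
# "saturation" treated as redundant with each other (invented values).
RELEVANCE = {"hue": 0.5, "saturation": 0.4, "gray": 0.3, "noise": 0.0}


def toy_score(subset: FrozenSet[str]) -> float:
    total = sum(RELEVANCE[f] for f in subset)
    if {"hue", "saturation"} <= subset:
        total -= 0.45  # redundancy penalty
    return total


chosen = sequential_forward_selection(list(RELEVANCE), toy_score, max_features=3)
```

In practice the criterion would be a classifier's validation accuracy on the candidate subset; the toy score above only mimics the relevance and redundancy trade-off the abstract mentions.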

An Efficient Menu Recommendation System with Data Mining on User Preference (사용자 선호도 기반 데이터마이닝을 통한 효율적인 메뉴 추천 시스템)

  • Park, Byeong-Seok;Kang, Seong-Hun;Cho, Hyun-Woo;Jeong, Young-Sik
    • Proceedings of the Korea Information Processing Society Conference / 2015.10a / pp.1549-1552 / 2015
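  • With the recent rapid spread of smart devices such as smartphones, research on personalized services such as recommender systems has been actively pursued. However, despite their wide range of potential uses, these services are limited to specific fields such as marketing or provide only low-level QoS, so recommender systems in Korea are still at the introductory stage. A recommender system represents the basic and rating information of objects, such as items to recommend, as textual meta-information. Filtering based on this meta-information gives users results that do not take nearby routes or personal tastes into account. Research is therefore needed on an optimized algorithm that interacts with the user and recommends menus in consideration of health, taste, meal history, statistics, and similar factors. In this paper, we propose UBRS, a system that uses optimized content-based filtering to identify a user's input patterns and tastes and recommend menus.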

Analysis of Traffic Accident using Association Rule Model

  • Ihm, Sun-Young;Park, Young-Ho
    • Journal of Multimedia Information System / v.5 no.2 / pp.111-114 / 2018
  • Traffic accident analysis is important for reducing the occurrence of accidents. In this paper, we analyze traffic accidents with the Apriori algorithm to find association rules for traffic accidents in Korea. We first design the traffic accident analysis model and then collect traffic accident data. We preprocess the collected data and derive new variables and attributes for analysis. Next, we analyze the data using statistical methods and the Apriori algorithm. The results show that many large-scale accidents were caused by vans in the daytime. Medium-scale accidents occurred more in the daytime than at night, and involved cars more than vans. Small-scale accidents occurred more at night than in the daytime, although the numbers were similar. Also, among small-scale accidents, car-human accidents occurred more often than car-car accidents.
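
To make the Apriori step concrete, here is a minimal sketch of frequent-itemset mining and rule confidence. It is a textbook-style Apriori written for illustration, not the authors' pipeline: the attribute values, the records, and the `min_support` threshold are invented and are not data from the paper.

```python
from itertools import combinations
from typing import Dict, FrozenSet, List, Set


def apriori(transactions: List[Set[str]], min_support: float) -> Dict[FrozenSet[str], float]:
    """Return every itemset whose support is at least `min_support`."""
    n = len(transactions)

    def support(itemset: FrozenSet[str]) -> float:
        return sum(1 for t in transactions if itemset <= t) / n

    current = {frozenset([item]) for t in transactions for item in t}
    frequent: Dict[FrozenSet[str], float] = {}
    k = 1
    while current:
        # Keep only the candidates that meet the support threshold.
        level = {c: support(c) for c in current}
        level = {c: s for c, s in level.items() if s >= min_support}
        frequent.update(level)
        # Join step: merge frequent k-itemsets into (k+1)-item candidates.
        k += 1
        joined = {a | b for a in level for b in level if len(a | b) == k}
        # Prune step: every (k-1)-item subset of a candidate must be frequent.
        current = {
            c for c in joined
            if all(frozenset(s) in level for s in combinations(c, k - 1))
        }
    return frequent


def confidence(frequent: Dict[FrozenSet[str], float],
               antecedent: FrozenSet[str], consequent: FrozenSet[str]) -> float:
    """confidence(A -> B) = support(A and B together) / support(A)."""
    return frequent[antecedent | consequent] / frequent[antecedent]


# Invented toy records; these are NOT data from the paper.
records = [
    {"van", "day", "large"},
    {"van", "day", "large"},
    {"car", "day", "medium"},
    {"car", "night", "small"},
    {"car", "day", "medium"},
]
frequent = apriori(records, min_support=0.4)
rule_conf = confidence(frequent, frozenset({"van", "day"}), frozenset({"large"}))
```

On these toy records the rule {van, day} -> {large} holds in every record containing its antecedent, which is the kind of pattern the abstract reports for large-scale accidents.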

A Design of XML-Based Distributed MDR Retrieval System for Data Preparation (데이터준비를 위한 XML 기반의 분산 MDR 검색 시스템 설계)

  • Ko Sucbum;Youn Sungdae
    • Journal of Korea Multimedia Society / v.7 no.9 / pp.1329-1338 / 2004
  • The purpose of data mining is to extract multi-dimensional information from a large database. The only information that we can extract from a large database is the column name, data type, or simple comments included in the columns of database tables. With such unstructured and scarce information, it is very difficult and time-consuming to collect and cleanse data by analyzing the purpose, characteristics, and schema of each column during the data preparation step. To solve this problem, we propose solutions for reducing the time spent on the data preparation step in a relational database environment. That is, we propose useful elements to be considered during the data preparation step, and these elements are then organized into an MDR (Metadata Registry), which is becoming an international standard as ISO/IEC 11179. Finally, we propose an XML-based distributed MDR retrieval system that is convertible among heterogeneous systems and heterogeneous DBMSs.


Activity Data Modeling and Visualization Method for Human Life Activity Recognition (인간의 일상동작 인식을 위한 동작 데이터 모델링과 가시화 기법)

  • Choi, Jung-In;Yong, Hwan-Seung
    • Journal of Korea Multimedia Society / v.15 no.8 / pp.1059-1066 / 2012
  • As smartphones have developed, they have come to include diverse functions and many sensors that can describe the user's state. As a result, studies on activity recognition and life pattern recognition using smartphone sensors have increased rapidly. This research proposes a model of activity data for classifying the data extracted in existing activity recognition studies. Activity data is divided into two parts: physical activity and logical activity. In this paper, the activity data modeling is a theoretical analysis. We classify the basic activities (walking, standing, sitting, lying) as physical activity and the other activities, which involve an object, target, and place, as logical activity. We then suggest a method of visualizing the modeled data for users. Our approach will contribute to generalizing human life by modeling activity data. It can also help visualize users' activity data in existing activity recognition studies.

Improved Statistical Language Model for Context-sensitive Spelling Error Candidates (문맥의존 철자오류 후보 생성을 위한 통계적 언어모형 개선)

  • Lee, Jung-Hun;Kim, Minho;Kwon, Hyuk-Chul
    • Journal of Korea Multimedia Society / v.20 no.2 / pp.371-381 / 2017
  • The performance of statistical context-sensitive spelling error correction depends on the quality and quantity of the data used for the statistical language model. In general, the size and quality of data in a statistical language model are proportional. However, as the amount of data increases, processing becomes slower and more storage space is required. We suggest an improved statistical language model to solve this problem, and we propose an effective spelling error candidate generation method based on the new statistical language model. The proposed statistical model and the correction method based on it improve both the performance of spelling error correction and the processing speed.

Design of a Personalized Web Mining System Using a Sequence Association Rule (스퀀스 연관규칙을 이용한 개인화 웹 마이닝 설계)

  • Yun, Jong-Chan;Youn, Sung-Dae
    • Journal of Korea Multimedia Society / v.10 no.9 / pp.1106-1116 / 2007
  • Recently, e-commerce on the web has grown rapidly in scale and complexity, just as web site designs and web servers have become more complicated. Given these complexities, it is difficult to analyse web users' data, since web users follow so many different web paths. Existing association rule mining algorithms identify all items with a high correlation. However, although users often only want to find items that interest them, it is still difficult to find the desired rules among the many association rules found by existing algorithms. In this paper, we propose a system that links each node with the sequence association rule and links all routes after finding the path corresponding to a user with the association rule, one of the data mining techniques that identify user patterns in web user paths. The suggested system helps us construct individualized or customer-segmented sites using the sequence association rule in order to match the paths of web users with user characteristics.
