• Title/Summary/Keyword: term weighting

Search Result 110, Processing Time 0.031 seconds

Document Summarization using Weighting based on Cloud (클라우드 기반의 가중치에 의한 문서요약)

  • Park, Sun;Kim, Chul Won
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2013.10a
    • /
    • pp.305-306
    • /
    • 2013
  • In this paper, we proposes a document summarization method using the weighting based on cloud. The proposed method can minimize the user intervention to use the relevance feedback. It also can improve the quality of document summaries because the inherent semantic of the sentence set are well reflected by term weighting derived from semantic feature using nonnegative matrix factorizaitno based cloud.

  • PDF

Document Summarization using Weighting based on Cloud (클라우드 기반의 가중치에 의한 문서요약)

  • Park, Sun;Kim, Chul Won
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2013.10a
    • /
    • pp.968-969
    • /
    • 2013
  • In this paper, we proposes a document summarization method using the weighting based on cloud. The proposed method can minimize the user intervention to use the relevance feedback. It also can improve the quality of document summaries because the inherent semantic of the sentence set are well reflected by term weighting derived from semantic feature using nonnegative matrix factorizaitno based cloud.

  • PDF

A Study on Negation Handling and Term Weighting Schemes and Their Effects on Mood-based Text Classification (감정 기반 블로그 문서 분류를 위한 부정어 처리 및 단어 가중치 적용 기법의 효과에 대한 연구)

  • Jung, Yu-Chul;Choi, Yoon-Jung;Myaeng, Sung-Hyon
    • Korean Journal of Cognitive Science
    • /
    • v.19 no.4
    • /
    • pp.477-497
    • /
    • 2008
  • Mood classification of blog text is an interesting problem, with a potential for a variety of services involving the Web. This paper introduces an approach to mood classification enhancements through the normalized negation n-grams which contain mood clues and corpus-specific term weighting(CSTW). We've done experiments on blog texts with two different classification methods: Enhanced Mood Flow Analysis(EMFA) and Support Vector Machine based Mood Classification(SVMMC). It proves that the normalized negation n-gram method is quite effective in dealing with negations and gave gradual improvements in mood classification with EMF A. From the selection of CSTW, we noticed that the appropriate weighting scheme is important for supporting adequate levels of mood classification performance because it outperforms the result of TF*IDF and TF.

  • PDF

A Study for the System Light-Weighting Technology of Urban Railway Vehicle (도시철도차량의 시스템 경량화 기술에 관한 연구)

  • Park, Kwang-Bok;Jung, Hyun-Seung;Kim, Jong-Woon
    • Proceedings of the KSR Conference
    • /
    • 2011.10a
    • /
    • pp.2965-2972
    • /
    • 2011
  • The 21th century is carried out a rearing activity of green industries as a part of friendly echo policy for preservation of earth environment. Korean government is spreaded on a whole country to setting up a goal to a sustainable growth power of national for low carbon green growth. The recently rolling stock is received the footlights as a best friendly echo transportation of transport system of automobile, airplane, ship etc., also that is playing a big role to reduction of $CO_2$ greenhouse gas and a discharge of air pollution. This study was carried out the investigation and analysis for technology trend of the system light-weighting of a recently urban railway vehicle on commercial operating in domestic and foreign countries. As the results are proposed a short-term development technology and a long-term development technology of the system light-weighting to be developed for reduction weight of urban railway vehicle.

  • PDF

A Study on the Pivoted Inverse Document Frequency Weighting Method (피벗 역문헌빈도 가중치 기법에 대한 연구)

  • Lee, Jae-Yun
    • Journal of the Korean Society for information Management
    • /
    • v.20 no.4 s.50
    • /
    • pp.233-248
    • /
    • 2003
  • The Inverse Document Frequency (IDF) weighting method is based on the hypothesis that in the document collection the lower the frequency of a term is, the more important the term is as a subject word. This well-known hypothesis is, however, somewhat questionable because some low frequency terms turn out to be insufficient subject words. This study suggests the pivoted IDF weighting method for better retrieval effectiveness, on the assumption that medium frequency terms are more important than low frequency terms. We thoroughly evaluated this method on three test collections and it showed performance improvements especially at high ranks.

A Generalized Least Square Method using Dead Zone (불감대를 사용한 최소자승법의 일반화)

  • 이하정;최종호
    • The Transactions of the Korean Institute of Electrical Engineers
    • /
    • v.37 no.10
    • /
    • pp.727-732
    • /
    • 1988
  • In this paper, a parameter estimation method of linear systems with bounded output disturbances is studied. The bound of the disturbances is assumed to known Weighting factors are proposed to modify LS(Least Square) algorithm in the parameter estimation method. The conditions of weighting factors are given so that the estimation method has good convergence properties. This condition is more relaxed form than other known conditions. The compensation term in the estimation equations is represented by a function of the output prediction error and this function should lie in a specified region on x-y plane to satisfy these conditions of weighting factors. A set of weighting factor is selected and an algorithm is proposed using this set of weighting factor. The proposed algorithm is compared with another existing algorithm by simulation and its performance in parameter estimation id discussed.

  • PDF

A New Algorithm for Recursive Short-term Load Forecasting (순환형식에 의한 기분거좌상측 알고리)

  • Young-Moon Park;Sung-Chul Oh
    • The Transactions of the Korean Institute of Electrical Engineers
    • /
    • v.32 no.5
    • /
    • pp.183-188
    • /
    • 1983
  • This paper deals with short-term load forecasting. The load model is represented by the state variable form to exploit the Kalman filter technique. The load model is derived from Taylor series expansion and remainder term is considered as noise term. In order to solve recursive filter form, among various algorithm of solving Kalman filter, this paper uses exponential data weighting technique. This paper also deals with the asymptotic stability of filter. Case studies are carried out for the hourly power demand forecasting of the Korea electrical system.

  • PDF

Development of a Clustering Model for Automatic Knowledge Classification (지식 분류의 자동화를 위한 클러스터링 모형 연구)

  • 정영미;이재윤
    • Journal of the Korean Society for information Management
    • /
    • v.18 no.2
    • /
    • pp.203-230
    • /
    • 2001
  • The purpose of this study is to develop a document clustering model for automatic classification of knowledge. Two test collections of newspaper article texts and journal article abstracts are built for the clustering experiment. Various feature reduction criteria as well as term weighting methods are applied to the term sets of the test collections, and cosine and Jaccard coefficients are used as similarity measures. The performances of complete linkage and K-means clustering algorithms are compared using different feature selection methods and various term weights. It was found that complete linkage clustering outperforms K-means algorithm and feature reduction up to almost 10% of the total feature sets does not lower the performance of document clustering to any significant extent.

  • PDF

Gain Compensation Method for Codebook-Based Speech Enhancement (코드북 기반 음성향상 기법을 위한 게인 보상 방법)

  • Jung, Seungmo;Kim, Moo Young
    • Journal of the Institute of Electronics and Information Engineers
    • /
    • v.51 no.9
    • /
    • pp.165-170
    • /
    • 2014
  • Speech enhancement techniques that remove surrounding noise are stressed to preprocessor of speech recognition. Among the various speech enhancement techniques, Codebook-based Speech Enhancement (CBSE) operates efficiently in non-stationary noise environments. But, CBSE has some problems that inaccurate gains can be estimated if mismatch occur between input noisy signal and trained speech/noise codevectors. In this paper, the Normalized Weighting Factor (NWF) is calculated by long-term noise estimation algorithm based on Signal-to-Noise Ratio, compensated to the conventional inaccurate gains. The proposed CBSE shows better performance than conventional CBSE.

A Comparative Study of Automaic Indexing Techniques in Pharmacology and Libray & Infomation Science (학문의 주제별 특성에 따른 자동 색인 기법의 비교 연구 - 약학분야와 도서관. 정보학 분야를 중심으로 -)

  • 조수련;사공철
    • Journal of the Korean Society for information Management
    • /
    • v.5 no.2
    • /
    • pp.99-126
    • /
    • 1988
  • The purpose of this ptudy is to presenet a relevant automaitc technigue in accordance with the statistical term characteristie in a collection comprising different subjecits, by comparing and evaluating two automatic indexing technigues (Inverse Document Fregnency Weighting Technigue and Term Discrimiantion Value Weighting Technigues) intht fields of Pharmacology and Library & Information Science.

  • PDF