• Title/Summary/Keyword: summary measure

Automatic Music Summarization Using Similarity Measure Based on Multi-Level Vector Quantization (다중레벨 벡터양자화 기반의 유사도를 이용한 자동 음악요약)

  • Kim, Sung-Tak;Kim, Sang-Ho;Kim, Hoi-Rin
    • The Journal of the Acoustical Society of Korea / v.26 no.2E / pp.39-43 / 2007
  • Music summarization refers to a technique that automatically extracts the most important and representative segments of music content. In this paper, we propose and evaluate a technique that provides the repeated part of the music content as the music summary. To extract a repeated segment, the proposed algorithm uses a weighted sum of similarity measures based on multi-level vector quantization, for either a fixed-length or an optimal-length summary. Two similarity measures are proposed: a count-based measure and a distance-based measure. The count-based measure uses the number of identical codewords at the same position in two segments, and the distance-based measure uses the Mahalanobis distance between features that have the same codeword at the same position. The fixed-length music summary is evaluated by measuring the overlap ratio between hand-made repeated parts and automatically generated ones. The optimal-length music summary is evaluated by calculating how much of the repeated parts of the music content the automatically generated summary includes. From the experiments we observed that the optimal-length summary captures the repeated parts of music content more effectively, in terms of summary length, than the fixed-length summary.
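The count-based measure described in this abstract can be illustrated with a minimal sketch. It assumes each segment has already been quantized into a sequence of integer codeword indices; the function name `count_similarity` and the normalization by segment length are assumptions made for illustration, not details taken from the paper, and the multi-level weighting and the Mahalanobis-based measure are not reproduced.

```python
def count_similarity(codes_a, codes_b):
    """Fraction of positions at which two equal-length codeword sequences agree.

    Hypothetical helper: normalizing by segment length is an assumption.
    """
    if len(codes_a) != len(codes_b):
        raise ValueError("segments must have the same length")
    if not codes_a:
        return 0.0
    # Count positions where both segments carry the same codeword.
    matches = sum(1 for a, b in zip(codes_a, codes_b) if a == b)
    return matches / len(codes_a)


# Two made-up segments of codeword indices (illustrative data only).
seg1 = [3, 7, 7, 1, 4]
seg2 = [3, 7, 2, 1, 5]
similarity = count_similarity(seg1, seg2)  # 3 of 5 positions match
```

A segment pair with a high value would be a candidate for the repeated part; how the paper combines such scores across quantization levels is not stated in the abstract.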

A summary-concept based analysis on the representative values and the measures of spread with the 9th grade Korean mathematics textbook (중학교 3학년 수학교과서 통계단원에 나타난 요약개념 분석)

  • Lee, Young-Ha;Lee, Eun-Hee
    • The Mathematical Education / v.50 no.4 / pp.489-505 / 2011
  • This study analyzes the focus of textbooks in the statistical chapters on "measures of representative (central tendency) and of spread". Applying the summary-concept criteria of Juhyeon Nam (2007), four aspects of the chapters were examined: (1) the definition and teleological validity of the measures of representative; (2) the definition and practical value of the measures of spread; (3) the distributional form in relation to the measures of representative and of spread; and (4) location and scale preservation or invariance of the measures of representative and of spread. For the measures of representative, some definitions were insufficient to check the teleological validity of the measure. Most definitions of the measures of spread were based on practical viewpoints, but no preparation for future statistical inference was found, even by implication. Some books mention the measures of representative and of spread for distributions, but we could not find any comment on the correspondence between the sample mean and the expectation of a distribution or the population mean. However, it is encouraging that some books check the validity of the corresponding measures using the location and scale preservation or invariance property, which was not found in the previous curriculum.

Development of Practical Data Mining Methods for Database Summarization

  • Lee, Do-Heon
    • The Journal of Information Technology and Database / v.4 no.2 / pp.33-45 / 1998
  • Database summarization is the procedure of obtaining generalized and representative descriptions that express the content of a large database at a glance. We present a top-down summary refinement procedure to discover database summaries. The procedure exploits attribute concept hierarchies that represent ISA relationships among domain concepts. It begins with the most generalized summary and proceeds to find more specialized ones by stepwise refinement. This top-down paradigm has at least two important advantages over the previous bottom-up methods. First, it provides a natural way of reflecting the user's own discovery preferences interactively. Second, it does not produce the excessively large intermediate results that make the bottom-up approach hard to apply in practical environments. The proposed procedure can also be easily extended to distributed databases. An information content measure for a database summary is derived in order to identify the more informative summaries among the discovered results.

A Review on Intelligent Compaction Techniques in Railroad Construction

  • Oh, Jeongho
    • International Journal of Railway / v.7 no.3 / pp.80-84 / 2014
  • The purpose of this paper is to review Intelligent Compaction (IC) techniques, which are regarded as relatively new to railroad roadbed construction. Most civil structures are built on a roadbed that is supposed to provide adequate load-bearing support to the upper structure through a qualified compaction process. However, structural failures attributed to inadequate compaction control are not uncommon at field sites. Unlike the traditional compaction control method, which checks field density at several locations, IC techniques continuously measure various compaction quality indices that represent compaction uniformity. In this paper, a literature review of IC techniques was conducted to provide a concise summary in the following categories: 1) background of IC techniques; 2) summary of IC vendors and basic principles; 3) modeling of IC behavior; and 4) case studies along with correlations between IC and other measurements. In summary, IC technologies seem promising for future railroad construction: they can achieve better compaction quality control, so that the serviceability of the railroad can be ensured while minimizing rehabilitation and maintenance activities.

A modification of McFadden's R2 for binary and ordinal response models

  • Ejike R. Ugba;Jan Gertheiss
    • Communications for Statistical Applications and Methods / v.30 no.1 / pp.49-63 / 2023
  • Many studies on summary measures of the predictive strength of categorical response models consider the likelihood ratio index (LRI), also known as the McFadden-R2, a better option than many other measures. We propose a simple modification of the LRI that adjusts for the effect of the number of response categories on the measure and that also rescales its values, mimicking an underlying latent measure. The modified measure is applicable to both binary and ordinal response models fitted by maximum likelihood. Results from simulation studies and a real data example on the olfactory perception of boar taint show that the proposed measure outperforms most of the widely used goodness-of-fit measures for binary and ordinal models. Interestingly, the proposed R2 proves quite invariant to an increasing number of response categories of an ordinal model.
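For orientation, the unmodified likelihood ratio index that this abstract starts from is 1 minus the ratio of the fitted model's log-likelihood to that of the intercept-only (null) model. The sketch below computes only that standard index for a binary response; the modification proposed in the paper is not given in the abstract and is not reproduced here. The helper names and the probabilities in `p_model` are hypothetical, made-up values rather than output of a real maximum-likelihood fit.

```python
import math


def bernoulli_loglik(y, p):
    """Log-likelihood of binary outcomes y under predicted probabilities p."""
    return sum(math.log(pi) if yi == 1 else math.log(1.0 - pi)
               for yi, pi in zip(y, p))


def mcfadden_r2(loglik_model, loglik_null):
    """Standard likelihood ratio index: 1 - lnL(model) / lnL(null)."""
    return 1.0 - loglik_model / loglik_null


# Illustrative data only: outcomes and hypothetical fitted probabilities.
y = [1, 0, 1, 1, 0, 1]
p_model = [0.8, 0.3, 0.7, 0.9, 0.2, 0.6]
# The null model predicts the observed proportion of ones for every case.
p_null = [sum(y) / len(y)] * len(y)

r2 = mcfadden_r2(bernoulli_loglik(y, p_model), bernoulli_loglik(y, p_null))
```

A value of 0 means the model does no better than the null model, and values closer to 1 indicate a larger improvement in log-likelihood.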

An Estimation of Health-Adjusted Life Expectancy(HALE) for Koreans (한국인의 건강보정 기대여명의 측정)

  • Kang, Eun-Jeong;Kim, Na-Yeon;Yoon, Seok-Jun
    • Health Policy and Management / v.18 no.1 / pp.108-126 / 2008
  • Summary measures of population health (SMPH) are indices that describe morbidity as well as mortality. They can be divided into health-adjusted life years, which is a life expectancy measure, and disability-adjusted life years, which represents the gap between the ideal health status and the current health status. This study aims to estimate health-adjusted life expectancy (HALE), a measure of health-adjusted life years, by calculating life expectancy adjusted for health status using the EQ-5D. The mortality data were obtained from the 2005 life table published by the National Statistical Office, and health status by sex and age was obtained from the EQ-5D scores in the third National Health and Nutrition Examination Survey in 2005. With these mortality and morbidity data, health-adjusted life expectancy was calculated using Sullivan's method. The results showed that the health-adjusted life expectancy of males and females was 67.49 and 69.61 years, respectively, while the life expectancy of males and females was 75.14 and 81.89 years. In other words, Korean males and females lose 7.65 and 12.28 years, respectively, to the decrease in quality of life due to diseases and/or injuries. Put another way, males lose 10.2% of their life expectancy and females 15.0%. This study suggests that it may be possible to monitor the population's health-adjusted life expectancy by continuing to include health-related quality of life measures such as the EQ-5D in national health surveys like the National Health and Nutrition Examination Survey.
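The years-lost and percentage figures in this abstract follow directly from the reported life expectancy and HALE values. The short sketch below redoes that arithmetic; the variable and function names are hypothetical, and it does not implement Sullivan's method itself, only the subtraction and ratio applied to the published results.

```python
# Values reported in the abstract (years).
life_expectancy = {"male": 75.14, "female": 81.89}
hale = {"male": 67.49, "female": 69.61}


def years_lost(sex):
    """Life expectancy minus health-adjusted life expectancy, in years."""
    return round(life_expectancy[sex] - hale[sex], 2)


def percent_lost(sex):
    """Years lost as a percentage of life expectancy."""
    gap = life_expectancy[sex] - hale[sex]
    return round(100.0 * gap / life_expectancy[sex], 1)


summary = {sex: (years_lost(sex), percent_lost(sex)) for sex in life_expectancy}
# summary == {"male": (7.65, 10.2), "female": (12.28, 15.0)}
```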

Automatic Quality Evaluation with Completeness and Succinctness for Text Summarization (완전성과 간결성을 고려한 텍스트 요약 품질의 자동 평가 기법)

  • Ko, Eunjung;Kim, Namgyu
    • Journal of Intelligence and Information Systems / v.24 no.2 / pp.125-148 / 2018
  • Recently, as the demand for big data analysis has increased, cases of analyzing unstructured data and using the results have also increased. Among the various types of unstructured data, text is used as a means of communicating information in almost all fields. In addition, many analysts are interested in text because the amount of data is very large and it is relatively easy to collect compared to other unstructured and structured data. Among the various text analysis applications, the following have been actively studied: document classification, which classifies documents into predetermined categories; topic modeling, which extracts major topics from a large number of documents; sentiment analysis or opinion mining, which identifies emotions or opinions contained in texts; and text summarization, which summarizes the main contents of one or several documents. In particular, text summarization is actively applied in business through news summary services, privacy policy summary services, etc. In addition, much research has been done in academia on the extraction approach, which selectively provides the main elements of the document, and the abstraction approach, which extracts elements of the document and composes new sentences by combining them. However, techniques for evaluating the quality of automatically summarized documents have not made much progress compared to techniques for automatic text summarization. Most existing studies dealing with the quality evaluation of summarization manually summarized the documents, used these summaries as reference documents, and measured the similarity between the automatic summary and the reference document. Specifically, automatic summarization is performed on the full text through various techniques, and the result is compared with the reference document, which is an ideal summary, to measure the quality of the automatic summarization.
Reference documents are provided in two major ways. The most common is manual summarization, in which a person creates an ideal summary by hand. Since this method requires human intervention in preparing the summary, writing the summary takes a lot of time and cost, and the evaluation result may differ depending on the subjectivity of the summarizer. Therefore, to overcome these limitations, attempts have been made to measure the quality of summary documents without human intervention. As a representative attempt, a method has recently been devised that reduces the size of the full text and measures the similarity between the reduced full text and the automatic summary. In this method, the more often the frequent terms of the full text appear in the summary, the better the quality of the summary is judged to be. However, since summarization essentially means condensing a large amount of content while minimizing content omissions, it is unreasonable to say that a "good summary" based only on frequency is always a "good summary" in its essential meaning. To overcome the limitations of these previous studies of summarization evaluation, this study proposes an automatic quality evaluation method for text summarization based on the essential meaning of summarization. Specifically, succinctness is defined as an element indicating how little content is duplicated among the sentences of the summary, and completeness is defined as an element indicating how little of the content is left out of the summary.
To evaluate the practical applicability of the proposed methodology, 29,671 sentences were extracted from TripAdvisor's hotel reviews, the reviews were summarized for each hotel, and the results of experiments evaluating the quality of the summaries according to the proposed methodology are presented. The paper also provides a way to integrate completeness and succinctness, which are in a trade-off relationship, into an F-score, and proposes a method for performing optimal summarization by changing the threshold of sentence similarity.
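One common way to fold two trade-off quantities into a single F-score is the balanced harmonic mean, as used for precision and recall. The sketch below applies that form to completeness and succinctness scores in the range 0 to 1. Whether the paper uses exactly this balanced form is an assumption, since the abstract does not give the formula, and the function name `f_score` is hypothetical.

```python
def f_score(completeness, succinctness):
    """Balanced harmonic mean of two scores in [0, 1].

    Assumption: the balanced (F1-style) form; the paper's exact
    definition is not stated in the abstract.
    """
    total = completeness + succinctness
    if total == 0:
        return 0.0  # avoid division by zero when both scores are zero
    return 2.0 * completeness * succinctness / total


# Illustrative values only: a summary that is fairly complete but redundant.
score = f_score(0.8, 0.6)
```

Because the harmonic mean is pulled toward the smaller of the two inputs, a summary cannot score well by maximizing completeness alone at the expense of succinctness, or the reverse.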

A Study of Information System Availability Guarantee Methods and Application

  • Kim, Hee Wan
    • International Journal of Advanced Culture Technology / v.8 no.3 / pp.292-299 / 2020
  • This paper presents evaluation criteria for information system availability, aimed at guaranteeing availability (the service target level) from both the perspective of the SLA contract and a technical point of view. To verify the effectiveness of the measures against information system failure and for guaranteeing availability, three cases were examined. In summary, the failure time was reduced by 32% to 62% after applying the availability guarantee measures, which supports the effectiveness of the proposed evaluation of information system availability.

Implementation of Reliability Measure and Distribution (신뢰성 척도 및 분포의 적용)

  • Choi, Sung-Woon
    • Journal of the Korea Safety Management & Science / v.7 no.5 / pp.175-184 / 2005
  • This paper presents a practical guide to the implementation of reliability distributions. The applicability and properties of various reliability distributions are then illustrated. The main objective of this study is to present how to use a summary of reliability distributions with respect to total life cycle management. This paper provides insight into the benefits of using reliability distributions properly.

Finding Interesting Genes Using Reliability in Various Gene Expression Models

  • Lee, Eun-Kyung;Cook, Dianne;Hoffman, Heike
    • Genomics & Informatics / v.9 no.1 / pp.28-36 / 2011
  • Most statistical methods for finding interesting genes focus on summary values with large fold-changes or large variations. Very few methods consider the probe-level data. We developed a new reliability measure that incorporates the probe-level data. This reliability measure is useful for exploring microarray data without ignoring the probe-level data. It is easy to calculate, and it can be used alongside other statistical methods as a good guideline for finding truly differentially expressed genes. Instead of filtering out genes before the analysis, we use all genes in the analysis and make decisions with the new reliability measure.