• Title/Summary/Keyword: frequency-based method

Search Result 6,108, Processing Time 0.038 seconds

Automatic Quality Evaluation with Completeness and Succinctness for Text Summarization (완전성과 간결성을 고려한 텍스트 요약 품질의 자동 평가 기법)

  • Ko, Eunjung;Kim, Namgyu
    • Journal of Intelligence and Information Systems
    • /
    • v.24 no.2
    • /
    • pp.125-148
    • /
    • 2018
  • Recently, as the demand for big data analysis increases, cases of analyzing unstructured data and using the results are also increasing. Among the various types of unstructured data, text is used as a means of communicating information in almost all fields. In addition, many analysts are interested in the amount of data is very large and relatively easy to collect compared to other unstructured and structured data. Among the various text analysis applications, document classification which classifies documents into predetermined categories, topic modeling which extracts major topics from a large number of documents, sentimental analysis or opinion mining that identifies emotions or opinions contained in texts, and Text Summarization which summarize the main contents from one document or several documents have been actively studied. Especially, the text summarization technique is actively applied in the business through the news summary service, the privacy policy summary service, ect. In addition, much research has been done in academia in accordance with the extraction approach which provides the main elements of the document selectively and the abstraction approach which extracts the elements of the document and composes new sentences by combining them. However, the technique of evaluating the quality of automatically summarized documents has not made much progress compared to the technique of automatic text summarization. Most of existing studies dealing with the quality evaluation of summarization were carried out manual summarization of document, using them as reference documents, and measuring the similarity between the automatic summary and reference document. Specifically, automatic summarization is performed through various techniques from full text, and comparison with reference document, which is an ideal summary document, is performed for measuring the quality of automatic summarization. Reference documents are provided in two major ways, the most common way is manual summarization, in which a person creates an ideal summary by hand. Since this method requires human intervention in the process of preparing the summary, it takes a lot of time and cost to write the summary, and there is a limitation that the evaluation result may be different depending on the subject of the summarizer. Therefore, in order to overcome these limitations, attempts have been made to measure the quality of summary documents without human intervention. On the other hand, as a representative attempt to overcome these limitations, a method has been recently devised to reduce the size of the full text and to measure the similarity of the reduced full text and the automatic summary. In this method, the more frequent term in the full text appears in the summary, the better the quality of the summary. However, since summarization essentially means minimizing a lot of content while minimizing content omissions, it is unreasonable to say that a "good summary" based on only frequency always means a "good summary" in its essential meaning. In order to overcome the limitations of this previous study of summarization evaluation, this study proposes an automatic quality evaluation for text summarization method based on the essential meaning of summarization. Specifically, the concept of succinctness is defined as an element indicating how few duplicated contents among the sentences of the summary, and completeness is defined as an element that indicating how few of the contents are not included in the summary. In this paper, we propose a method for automatic quality evaluation of text summarization based on the concepts of succinctness and completeness. In order to evaluate the practical applicability of the proposed methodology, 29,671 sentences were extracted from TripAdvisor 's hotel reviews, summarized the reviews by each hotel and presented the results of the experiments conducted on evaluation of the quality of summaries in accordance to the proposed methodology. It also provides a way to integrate the completeness and succinctness in the trade-off relationship into the F-Score, and propose a method to perform the optimal summarization by changing the threshold of the sentence similarity.

Developments of Greenhouse Gas Generation Models and Estimation Method of Their Parameters for Solid Waste Landfills (폐기물매립지에서의 온실가스 발생량 예측 모델 및 변수 산정방법 개발)

  • Park, Jin-Kyu;Kang, Jeong-Hee;Ban, Jong-Ki;Lee, Nam-Hoon
    • KSCE Journal of Civil and Environmental Engineering Research
    • /
    • v.32 no.6B
    • /
    • pp.399-406
    • /
    • 2012
  • The objective of this research is to develop greenhouse gas generation models and estimation method of their parameters for solid waste landfills. Two models obtained by differentiating the Modified Gompertz and Logistic models were employed to evaluate two parameters of a first-order decay model, methane generation potential ($L_0$) and methane generation rate constant (k). The parameters were determined by the statistical comparison of predicted gas generation rate data using the two models and actual landfill gas collection data. The values of r-square obtained from regression analysis between two data showed that one model by differentiating the Modified Gompetz was 0.92 and the other model by differentiating the Logistic was 0.94. From this result, the estimation methods showed that $L_0$ and k values can be determined by regression analysis if landfill gas collection data are available. Also, new models based on two models obtained by differentiating the Modified Gompertz and Logistic models were developed to predict greenhouse gas generation from solid waste landfills that actual landfill generation data could not be available. They showed better prediction than LandGEM model. Frequency distribution of the ratio of Qcs (LFG collection system) to Q (prediction value) was used to evaluate the accuracy of the models. The new models showed higher accuracy than LandGEM model. Thus, it is concluded that the models developed in this research are suitable for the prediction of greenhouse gas generation from solid waste landfills.

Comparison of the Recent Trend of Chemistry Education Research Based on the Analysis of the Domestic and Foreign Journals (국내외 학술지를 토대로 분석한 화학교육 연구의 최근 동향 비교)

  • Han, Jae-Young;Lee, Sang-Chul
    • Journal of the Korean Chemical Society
    • /
    • v.56 no.2
    • /
    • pp.290-296
    • /
    • 2012
  • This study analyzed the research papers published in three (2 domestic and 1 foreign) journals, in order to understand the recent trend of chemistry education research. We selected Journal of the Korean Chemical Society (JKCS) and Journal of the Korean Association for Science Education (JKASE) as the domestic journals, and Journal of Chemical Education (JCE) as a foreign journal. The papers published from 2000 to 2009 were analyzed. As the result, the chemistry education research theme focused on 'teaching method and education technology', 'learner's characteristics', and 'chemical concept and experiment' in the order of frequency. The research on 'curriculum and textbooks' was performed often in JKCS reflecting Korean social environment. The most researched chemistry education goal was the 'conceptual understanding/change' followed by 'achievement/grade' in JCE and 'experiment/inquiry skill' in JKCS, and 'attitude/interest/motivation' in JKASE. The research subjects were focused to 'middle or high school students' in JKCS, in contrast to the 'university students' in JCE. More concern to the higher education is required in the domestic research. The most frequently used research method was 'survey/ examination' followed by 'experimental research' in JCE and JKASE and 'data/material analysis' in JKCS. We discussed the implication on future chemistry education research.

Forecasting the Precipitation of the Next Day Using Deep Learning (딥러닝 기법을 이용한 내일강수 예측)

  • Ha, Ji-Hun;Lee, Yong Hee;Kim, Yong-Hyuk
    • Journal of the Korean Institute of Intelligent Systems
    • /
    • v.26 no.2
    • /
    • pp.93-98
    • /
    • 2016
  • For accurate precipitation forecasts the choice of weather factors and prediction method is very important. Recently, machine learning has been widely used for forecasting precipitation, and artificial neural network, one of machine learning techniques, showed good performance. In this paper, we suggest a new method for forecasting precipitation using DBN, one of deep learning techniques. DBN has an advantage that initial weights are set by unsupervised learning, so this compensates for the defects of artificial neural networks. We used past precipitation, temperature, and the parameters of the sun and moon's motion as features for forecasting precipitation. The dataset consists of observation data which had been measured for 40 years from AWS in Seoul. Experiments were based on 8-fold cross validation. As a result of estimation, we got probabilities of test dataset, so threshold was used for the decision of precipitation. CSI and Bias were used for indicating the precision of precipitation. Our experimental results showed that DBN performed better than MLP.

Measuring Plate Thickness Using Spatial Local Wavenumber Filtering (국소 공간 웨이브넘버 필터링 기법을 이용한 평판 구조물 두께 측정)

  • Kang, To;Lee, Jeong Han;Han, Soon Woo;Park, Jin Ho;Park, Gyuhae;Jeon, Jun Young
    • Journal of the Korean Society for Nondestructive Testing
    • /
    • v.36 no.5
    • /
    • pp.370-376
    • /
    • 2016
  • Corrosion on the surface of a structure can generate cracks or cause walls to thin. This can lead to fracturing, which can eventually lead to fatalities and property loss. In an effort to prevent this, laser imaging technology has been used over the last ten years to detect thin-plate structure, or relatively thin piping. The most common laser imaging was used to develop a new technology for inspecting and imaging a desired area in order to scan various structures for thin-plate structure and thin piping. However, this method builds images by measuring waves reflected from defects, and subsequently has a considerable time delay of a few milliseconds at each scanning point. In addition, the complexity of the system is high, due to additional required components, such as laser-focusing parts. This paper proposes a laser imaging method with an increased scanning speed, based on excitation and the measurement of standing waves in structures. The wavenumber of standing waves changes at sections with a geometrical discontinuity, such as thickness. Therefore, it is possible to detect defects in a structure by generating standing waves with a single frequency and scanning the waves at each point by with the laser scanning system. The proposed technique is demonstrated on a wall-thinned plate with a linear thickness variation.

A Study on the Effects of Search Language on Web Searching Behavior: Focused on the Differences of Web Searching Pattern (검색 언어가 웹 정보검색행위에 미치는 영향에 관한 연구 - 웹 정보검색행위의 양상 차이를 중심으로 -)

  • Byun, Jeayeon
    • Journal of the Korean Society for Library and Information Science
    • /
    • v.52 no.3
    • /
    • pp.289-334
    • /
    • 2018
  • Even though information in many languages other than English is quickly increasing, English is still playing the role of the lingua franca and being accounted for the largest proportion on the web. Therefore, it is necessary to investigate the key features and differences between "information searching behavior using mother tongue as a search language" and "information searching behavior using English as a search language" of users who are non-mother tongue speakers of English to acquire more diverse and abundant information. This study conducted the experiment on the web searching which is applied in concurrent think-aloud method to examine the information searching behavior and the cognitive process in Korean search and English search through the twenty-four undergraduate students at a private university in South Korea. Based on the qualitative data, this study applied the frequency analysis to web search pattern under search language. As a result, it is active, aggressive and independent information searching behavior in Korean search, while information searching behavior in English search is passive, submissive and dependent. In Korean search, the main features are the query formulation by extract and combine the terms from various sources such as users, tasks and system, the search range adjustment in diverse level, the smooth filtering of the item selection in search engine results pages, the exploration and comparison of many items and the browsing of the overall contents of web pages. Whereas, in English search, the main features are the query formulation by the terms principally extracted from task, the search range adjustment in limitative level, the item selection by rely on the relevance between the items such as categories or links, the repetitive exploring on same item, the browsing of partial contents of web pages and the frequent use of language support tools like dictionaries or translators.

Research of Non-integeral Spatial Interpolation for Precise Identifying Soybean Location under Plastic Mulching

  • Cho, Yongjin;Yun, Yeji;Lee, Kyou-seung;Oh, Jong-woo;Lee, DongHoon
    • Proceedings of the Korean Society for Agricultural Machinery Conference
    • /
    • 2017.04a
    • /
    • pp.156-156
    • /
    • 2017
  • Most crop damages have been occurred by vermin(e.g., wild birds and herbivores) during the period between seeding and the cotyledon level. In this study, to minimize the damage by vermin and acquire the benefits such as protection against weeds and maintenance of water content in soil, immediately vinyl mulching after seeding was devised. Vinyl mulching has been generally covered with black color vinyl, that crop seeding locations cannot be detected by visible light range. Before punching vinyl, non-contact and non-destructive methods that can continuously determine the locations are necessary. In this study, a crop position detection method was studied that uses infrared thermal image sensor to determine the cotyledon position under vinyl mulch. The moving system for acquiring image arrays has been developed for continuously detecting crop locations under plastic mulching on the field. A sliding mechanical device was developed to move the sensor, which were arranged in the form of a linear array, perpendicular to the array using a micro-controller integrated with a stepping motor. The experiments were conducted while moving 4.00 cm/s speed of the IR sensor by the rotational speed of the stepping motor based on a digital pulse width modulation signal from the micro-controller. The acquired images were calibrated with the spatial image correlation. The collected data were processed using moving averaging on interpolation to determine the frame where the variance was the smallest in resolution units of 1.02 cm. For this study, the spline method was relatively faster than the other polynomial interpolation methods, because it has a lower maximum order of formulation when using a system such as the tridiagonal linear equation system which provided the capability of real-time processing. The temperature distribution corresponding to the distance between the crops was 10 cm, and the more clearly the leaf pattern of the crop was visually confirmed. The frequency difference was decreased, as the number of overlapped pixels was increased. Also the wave pattern of points where the crops were recognized were reduced.

  • PDF

Two-dimensional Inundation Analysis Using Stochastic Rainfall Variation and Geographic Information System (추계학적 강우변동생성 기법과 GIS를 연계한 2차원 침수해석)

  • Lee, Jin-Young;Cho, Wan-Hee;Han, Kun-Yeun;Ahn, Ki-Hong
    • Journal of the Korean Association of Geographic Information Studies
    • /
    • v.13 no.1
    • /
    • pp.101-113
    • /
    • 2010
  • Recently actual rainfall pattern is decreasing rainy days and increasing in rainfall intensity and the frequency of flood occurrence is also increased. To consider recent situation, Engineers use deterministic methods like a PMP(Probable Maximum Precipitation). If design storm wouldn't occur, increasing of design criteria is extravagant. In addition, the biggest structure cause trouble with residents and environmental problem. And then it is necessary to study considering probability of rainfall parameter in each sub-basin for design of water structure. In this study, stochastic rainfall patterns are generated by using log-ratio method, Johnson system and multivariate Monte Carlo simulation. Using the stochastic rainfall patterns, hydrological analysis, hydraulic analysis and 2nd flooding analysis were performed based on GIS for their applicability. The results of simulations are similar to the actual damage area so the methodology of this study should be used about making a flood risk map or regidental shunting rout map against the region.

Effect of Temperature and Aging on the Relationship Between Dynamic and Static Elastic Modulus of Concrete (온도와 재령이 콘크리트의 동탄성계수와 정 탄성계수의 상관관계에 미치는 영향)

  • 한상훈;김진근;박우선;김동현
    • Journal of the Korea Concrete Institute
    • /
    • v.13 no.6
    • /
    • pp.610-618
    • /
    • 2001
  • This paper investigates the relationships between dynamic elastic modulus and static elastic modulus or compressive strength according to curing temperature, aging, and cement type. Based on this investigation, the new model of the relationships we proposed. Impact echo method estimates the resonant frequency of specimens and uniaxial compression test measures the static elastic modulus and compressive strength. Type I and V cement concretes, which have the water-cement ratios of 0.40 and 0.50, are cured under the isothermal curing temperatures of 10, 23, and 50$\^{C}$ Cement type and aging have no large influence on the relationship between dynamic and static elastic modulus, but the ratio of dynamic and static elastic modulus comes close to 1 as temperature increases. Initial chord elastic modulus which is calculated at lower strain level of stress-strain curve, has the similar value to dynamic elastic modulus. The relationship between dynamic elastic modulus and compressive strength has the same tendency as the relationship between dynamic and static elastic modulus according to cement type, temperature and aging. The proposcd relationship equations between dynamic elastic modulus and static elastic modulus or compressive strength properly estimates the variation of relationships according to cement type md temperature.

The Effects of Modified Constraint Induced Therapy on Upper Extremity Functions of Children With Hemiparesis (수정된 건측 상지 운동 제한 치료가 편마비 아동의 손 기능 향상에 미치는 효과)

  • Ko, Myung-Sook;Jeon, Hye-Seon;Kwon, Oh-Yun;Yoo, Eun-Young
    • Physical Therapy Korea
    • /
    • v.12 no.2
    • /
    • pp.81-89
    • /
    • 2005
  • The purpose of this study was to investigate the effect of Modified Constraint-Induced Therapy (MCIT) on the effected upper extremity of children with hemiparesis. Four children with hemiparetic upper extremity caused by brain injuries were trained by MCIT for ten weeks. During the same period, all of the subjects were also involved in thirty-minute regular physical therapy and occupational therapy. During the treatment period, the unaffected upper extremities of the subjects were restrained by a specially designed hand splint or a mitten for five hours a day, five days per week. For two hours out of the five-hour restraint period, the affected upper extremities were intensively trained by performing various functional tasks, which were individually structured to emphasize use of the affected arm. A single-subject design with A-B-A reversal was employed in this study. The affected limb motor ability was evaluated by Melbourne Assessment, measuring the time to grasp and release nine pegs, and measuring grasping power. As a consequence of this study, the affected limb motor test scores of all four subjects in the baseline period were improved during the treatment period. Furthermore, the treatment effect was maintained during a one-month follow-up period. The results of this study support the assumption that MCIT is an effective therapeutic method to improve the sensory and motor abilities of hemiparetic children. It also increases the frequency of functional use of the hemiparetic hands of brain-injured children. Based on the results of this study, it can also be assumed that the modified CIT method is especially beneficial to these children by reducing the negative emotional effects of forceful restraint of the unaffected upper extremity. To optimize the functional recovery of the paretic upper extremity by CIT, the restriction period per day should be decided individually, according to the characteristics of the individual.

  • PDF