• Title/Summary/Keyword: 시간 고려 (time consideration)

Relationship among Night Eating and Nutrient Intakes Status in University Students (대학생에서 야식의 섭취가 영양소 섭취 상태에 미치는 영향)

  • Hong, Seung-Hee;Yeon, Jee-Young;Bae, Yun-Jung
    • Journal of the East Asian Society of Dietary Life / v.23 no.3 / pp.297-310 / 2013
  • This study investigated the relationship between night eating and nutrient intake status in university students. A total of 271 subjects (155 male, 116 female) completed a 3-day food record and were divided into three groups according to the percentage of energy obtained from night eating: non-night eating, <25% night-eating, and $\geq 25\%$ night-eating. There were no significant differences in age, height, weight, percent body fat, or BMI among the groups. The proportions of morning anorexia and insomnia were below 2% and 10%, respectively, and did not differ among the groups. In the male subjects, energy intake in the $\geq 25\%$ night-eating group was significantly higher than in the other groups, whereas the nutrient density (ND, nutrient intake per 1,000 kcal) and index of nutritional quality (INQ) of vitamin $B_1$, vitamin $B_2$, vitamin C, calcium, and iron were significantly lower than in the other groups. In the female subjects, energy intake in the <25% night-eating group was significantly higher than in the non-night eating group, and the ND and INQ of vitamin C in the <25% night-eating group were also significantly higher than in the non-night eating group. In addition, within the male subjects, the INQ of vitamin $B_1$, vitamin $B_2$, vitamin C, calcium, and phosphorus showed significant negative correlations with food intake, energy intake, and the percentage of energy from night eating after adjustment for age. These results suggest that male university students who obtain 25% or more of their energy from night snacks have lower micronutrient quality for vitamin $B_1$, vitamin $B_2$, vitamin C, and calcium.
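
The two indices named in the abstract can be sketched as follows. ND follows the abstract's own definition (nutrient intake per 1,000 kcal). The INQ formula used here, the subject's nutrient density divided by the density implied by a reference intake, is the commonly used definition and is an assumption, since the abstract does not spell it out. The function names and all numeric values are hypothetical.

```python
def nutrient_density(nutrient_intake, energy_kcal):
    """Nutrient intake per 1,000 kcal of energy intake (ND)."""
    return nutrient_intake / energy_kcal * 1000.0


def inq(nutrient_intake, energy_kcal, reference_intake, reference_energy_kcal):
    """Index of nutritional quality: observed ND divided by reference ND.

    Assumed definition; the abstract only names the index.
    """
    observed = nutrient_density(nutrient_intake, energy_kcal)
    reference = nutrient_density(reference_intake, reference_energy_kcal)
    return observed / reference


def night_eating_group(night_kcal, total_kcal):
    """Assign one of the three groups by share of energy from night eating."""
    if night_kcal == 0:
        return "non-night eating"
    share = night_kcal / total_kcal * 100.0
    return "<25% night-eating" if share < 25.0 else ">=25% night-eating"


# Hypothetical subject: 50 mg vitamin C on 2,500 kcal, against a
# hypothetical reference of 100 mg on 2,000 kcal.
print(nutrient_density(50, 2500))
print(inq(50, 2500, 100, 2000))
print(night_eating_group(500, 2000))
```

An INQ below 1 means the diet supplies less of the nutrient per unit of energy than the reference diet, which is the sense in which the abstract reports lower micronutrient quality.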

Korean Sentence Generation Using Phoneme-Level LSTM Language Model (한국어 음소 단위 LSTM 언어모델을 이용한 문장 생성)

  • Ahn, SungMahn;Chung, Yeojin;Lee, Jaejoon;Yang, Jiheon
    • Journal of Intelligence and Information Systems / v.23 no.2 / pp.71-88 / 2017
  • Language models were originally developed for speech recognition and language processing. Given a set of example sentences, a language model predicts the next word or character from sequential input data. N-gram models have been widely used, but they cannot model the correlation between input units efficiently because they are probabilistic models based on the frequency of each unit in the training set. Recently, with the development of deep learning algorithms, recurrent neural network (RNN) and long short-term memory (LSTM) models have been widely used for neural language modeling (Ahn, 2016; Kim et al., 2016; Lee et al., 2016). These models can reflect dependencies between objects that are entered sequentially into the model (Gers and Schmidhuber, 2001; Mikolov et al., 2010; Sundermeyer et al., 2012). To train a neural language model, texts need to be decomposed into words or morphemes. However, because a training set of sentences generally includes a huge number of words or morphemes, the dictionary becomes very large, which increases model complexity. In addition, word-level or morpheme-level models can generate only the vocabulary contained in the training set. Furthermore, for morphologically rich languages such as Turkish, Hungarian, Russian, Finnish, or Korean, morpheme analyzers are more likely to make errors in the decomposition process (Lankinen et al., 2016). Therefore, this paper proposes a phoneme-level language model for Korean based on LSTM models. A phoneme, such as a vowel or a consonant, is the smallest unit that makes up Korean text. We constructed language models using three or four LSTM layers. Each model was trained using the stochastic gradient algorithm and more advanced optimization algorithms such as Adagrad, RMSprop, Adadelta, Adam, Adamax, and Nadam. A simulation study was conducted on Old Testament texts using the deep learning package Keras, based on Theano.
After pre-processing the texts, the dataset included 74 unique characters, including vowels, consonants, and punctuation marks. We then constructed input vectors of 20 consecutive characters, each paired with the following 21st character as the output. In total, the dataset included 1,023,411 input-output pairs, which we divided into training, validation, and test sets in a 70:15:15 proportion. All simulations were conducted on a system equipped with an Intel Xeon CPU (16 cores) and an NVIDIA GeForce GTX 1080 GPU. We compared the loss function evaluated on the validation set, the perplexity evaluated on the test set, and the time taken to train each model. All the optimization algorithms except the stochastic gradient algorithm showed similar validation loss and perplexity, which were clearly superior to those of the stochastic gradient algorithm. The stochastic gradient algorithm also took the longest to train for both the 3- and 4-layer LSTM models. On average, the 4-layer LSTM model took 69% longer to train than the 3-layer LSTM model. However, its validation loss and perplexity were not significantly improved and even became worse under specific conditions. On the other hand, when comparing the automatically generated sentences, the 4-layer LSTM model tended to generate sentences that were closer to natural language than the 3-layer LSTM model. Although there were slight differences in the completeness of the generated sentences between the models, sentence generation performance was quite satisfactory under all simulation conditions: the models generated only legitimate Korean letters, and the use of postpositions and the conjugation of verbs were almost grammatically perfect. The results of this study are expected to be widely used for Korean language processing in the fields of language processing and speech recognition, which are the basis of artificial intelligence systems.
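
The data preparation described above (phoneme decomposition, 20-symbol windows with the 21st symbol as the target, and a 70:15:15 split) can be sketched as follows. This is a minimal illustration, not the authors' pipeline. The decomposition uses standard Unicode Hangul syllable arithmetic, and the function names, the use of conjoining jamo code points, and the unshuffled sequential split are all assumptions.

```python
# Conjoining jamo tables, following the Unicode Hangul syllable layout:
# 19 initial consonants, 21 vowels, 27 final consonants (plus "no final").
CHOSEONG = [chr(0x1100 + i) for i in range(19)]
JUNGSEONG = [chr(0x1161 + i) for i in range(21)]
JONGSEONG = [""] + [chr(0x11A8 + i) for i in range(27)]


def to_phonemes(text):
    """Split precomposed Hangul syllables into jamo; pass other characters through."""
    out = []
    for ch in text:
        code = ord(ch) - 0xAC00
        if 0 <= code < 11172:  # precomposed syllable block U+AC00..U+D7A3
            lead, rest = divmod(code, 21 * 28)
            vowel, tail = divmod(rest, 28)
            out.append(CHOSEONG[lead])
            out.append(JUNGSEONG[vowel])
            if tail:
                out.append(JONGSEONG[tail])
        else:
            out.append(ch)
    return out


def make_windows(symbols, window=20):
    """Pair each run of `window` symbols with the symbol that follows it."""
    return [(symbols[i:i + window], symbols[i + window])
            for i in range(len(symbols) - window)]


def split_70_15_15(pairs):
    """Sequential 70:15:15 split (assumed; the abstract gives only the proportion)."""
    n = len(pairs)
    a, b = n * 70 // 100, n * 85 // 100
    return pairs[:a], pairs[a:b], pairs[b:]


symbols = to_phonemes("한국어 문장을 음소 단위로 분해한 뒤 창을 만든다.")
train, valid, test = split_70_15_15(make_windows(symbols))
print(len(symbols), len(train), len(valid), len(test))
```

Each pair would still need to be mapped to integer or one-hot indices over the symbol inventory before being fed to an LSTM.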

Effect of Strength Increasing Sizes on the Quality of Fiberboard (섬유판(纖維板)의 증강(增强)사이즈제(齊)가 재질(材質)에 미치는 영향(影響))

  • Shin, Dong So;Lee, Hwa Hyoung
    • Journal of Korean Society of Forest Science / v.30 no.1 / pp.19-29 / 1976
  • The fiberboard and paper mills in this country are strongly affected by price hikes and shortages of phenolic resins, since phenolic acid as a raw material depends on imports. The fiberboard industry needs to replace them with other sizes and to stabilize their price and supply while improving board quality. The present study was therefore carried out to examine the effect of strength-increasing sizes, namely urea formaldehyde resin (anion and cation types) and urea-melamine copolymer resin, on the quality of wet-formed hardboard, and to compare them with two types of proprietary modified melamine resin and with the ordinary size, phenol resin. Asplund pulp was prepared as the fibrous material from wood wastes consisting of 20 percent lauan and 80 percent pine. After the sizing agents were added with alum in the beater at a pH of 4.5 for 10 minutes, the stock was formed into wet sheets, prepared, and then hot pressed with the following cycle: $180^{\circ}C$, $50-6-5kg/cm^2$, 1-2-7 minutes. The properties of the hardboard were examined after air conditioning. The results are summarized as follows: 1. Specific gravity differs significantly among hardboards treated with different strength-increasing resins, but is not affected by an increase in resin content. Modified melamine resin gives the highest specific gravity. The middle group comprises the cation type of urea resin, the anion type of urea resin, and the acid colloid of urea-melamine copolymer resin. Phenolic resin gives the lowest. 2. The moisture content of hardboard differs significantly both among the resins and with the amount of each resin applied. The moisture content decreases as the resin content increases, but there is no difference between 2 and 3 percent. 3. Water absorption differs significantly both among the adhesives used and with the amount of paraffin wax emulsion.
The water resistance increases in proportion to the content of paraffin wax emulsion. To satisfy the KS F standards for water resistance, a proprietary modified melamine resin (p-6100) and a modified cation type of urea resin (p-1500) do not require any paraffin wax emulsion, whereas the anion type of urea resin, the cation type of urea resin, and urea-melamine copolymer resin require 1 percent paraffin wax emulsion, and phenolic resin requires 2 percent. 4. The flexural strength of hardboard differs significantly both among the resins and with the amount of each resin. Modified melamine resin shows the highest flexural strength. The middle group comprises urea-melamine copolymer resin, p-1500, the anion type of urea resin, and the cation type of urea resin. Phenolic resin shows the lowest. The cause may be attributable to the combined factors of pressing temperature, sizing effect, and the thermal efficiency of the electrically heated press platens. 5. Considering the economic advantages and the properties of the hardboard, it is proposed that urea-melamine copolymer resin and the cation type of urea resin be used for the development of the fiberboard industry. It is desirable to further develop the modified urea-melamine copolymer resin and the cation type of urea resin through continued study.

A Clinical and Pathological Analysis of Children with Membranoproliferative Glomerulonephritis According to the Clinical Manifestations at Presentation (발견 양상에 따른 소아 막증식성 사구체신염의 임상적 및 병리조직학적분석)

  • Jeon Chang-Ho;Kang Mi-Seon;Chung Woo-Yeong
    • Childhood Kidney Diseases / v.8 no.2 / pp.186-194 / 2004
  • Purpose: Membranoproliferative glomerulonephritis (MPGN) has been diagnosed in an increasing number of asymptomatic cases. These cases have been detected by school urinary screening tests, even though the total number of MPGN cases shows a decreasing trend. We analyzed the clinical and pathological characteristics of children with MPGN according to the clinical manifestations at the time of disease presentation. Methods: A total of 18 patients diagnosed with idiopathic MPGN by percutaneous renal biopsy from January 1990 to February 2004 were included in our study. The patients were divided into two groups, the school urinary screening (A) group and the symptomatic (S) group, according to the clinical manifestations at the time of disease presentation. Results: Of the 18 patients, 8 (44.4%) were in the S group and 10 (55.6%) were in the A group. The mean serum total protein, albumin, and $C_3$ levels in the S group were significantly lower than those in the A group ($4.9{\pm}1.2\;g/dL\;vs.\;7.0{\pm}0.5\;g/dL,\;P=0.002;\;2.8{\pm}0.9\;g/dL\;vs.\;4.1{\pm}0.3\;g/dL,\;P=0.002;\;63.9{\pm}36.4\;mg/dL\;vs.\;100.8{\pm}39.5\;mg/dL,\;P=0.041$, respectively). The mean total protein in 24-hour collected urine was significantly higher in the S group than in the A group ($3684.0{\pm}2601.3\;mg/m^2\;vs.\;559.4{\pm}4.6.9\;mg/m^2$, respectively, P=0.001). Hypocomplementemia was observed in 11 (61.1%) of the 18 patients at disease onset: 7 (87.5%) in the S group and 4 (40%) in the A group. However, hypocomplementemia had decreased to 6 (33.3%) of the 18 patients at the final follow-up: 3 (37.5%) in the S group and 3 (30%) in the A group.
By pathologic type, hypocomplementemia was observed at disease onset in 8 patients (61.5%) with type I disease, 1 patient (100%) with type II disease, and 2 patients (50%) with type III disease, and at the last follow-up in 4 patients (30.8%) with type I disease, 1 patient (100%) with type II disease, and 1 patient (33.3%) with type III disease. The incidences of cellular crescent formation and tubular atrophy, as observed on light microscopy, were higher in the S group than in the A group. The mean grades of capillary wall thickening and mesangial proliferation were significantly higher in the S group. Conclusion: MPGN diagnosed in patients with only asymptomatic urinary abnormalities has been increasing, and it is now more frequent in asymptomatic patients than in patients presenting with symptoms. Our results suggest that MPGN should be considered in the renal biopsy diagnosis regardless of the serum $C_3$ level when urinary abnormalities are found by school urinary screening tests.

Weight loss effects of Bariatric Surgery after nutrition education in extremely obese patients (고도비만환자에서 베리아트릭 수술 (Bariatric Surgery) 후 영양교육이 체중감량에 미치는 효과)

  • Jeong, Eun-Ha;Lee, Hong-Chan;Yim, Jung-Eun
    • Journal of Nutrition and Health / v.48 no.1 / pp.30-45 / 2015
  • Purpose: This study was designed to determine the characteristics of extremely obese patients undergoing bariatric surgery and to evaluate how the number of postsurgical individual nutrition education sessions they received affected weight loss. Methods: This was a retrospective study based on the medical records of extremely obese patients over the 15 months after gastric banding. A total of 60 people were selected as study subjects and were divided into the Less Educated Group and the More Educated Group according to the average number of individual nutrition education sessions they received. For both groups, we investigated general characteristics, health-related lifestyle habits, obesity-related complications and existing symptoms, and eating habits before surgery; body composition measurements and obesity indices at 1, 3, 6, 9, 12, and 15 months before and after surgery; and biochemical parameters at 6 months before and after surgery. Results: According to the body composition measurements, body fat and weight decreased rapidly until 6 months after surgery and more slowly thereafter. The More Educated Group, who received nutrition education more often, showed significantly lower body fat and weight than the Less Educated Group at 15 months after surgery. The More Educated Group also showed significantly lower BMI and degree of obesity than the Less Educated Group at 15 months after surgery. We confirmed that BMI at 15 months was inversely proportional to the number of individual nutrition education sessions, and that this relationship was more pronounced after surgery than before surgery. Conclusion: Long-term nutrition education is a key factor in maintaining the effects of bariatric surgery on weight and body fat reduction in extremely obese patients.
As a next step, considering the characteristics of the study subjects, individual nutrition education is recommended as a prospective postsurgical intervention for obesity, in order to monitor blood pressure, obesity-related complications, existing symptoms, and changes in eating habits and health-related lifestyle habits, and at the same time to judge the actual effect of the nutrition education method.

Clinical Results after Repair of Rotator Cuff Tear in Patients with Accompanying AC Joint Pathology: Clinical Comparison of Non-operative Treatment (회전근개 파열과 동반된 견봉 쇄골 관절 병변이 회전근개 봉합술 후 결과에 미치는 영향: 비수술적 치료를 통한 임상적 비교)

  • Yoo, Moon-Jib;Seo, Joong-Bae;Lee, Dae-Hee;Kim, Sung-Jin
    • Clinics in Shoulder and Elbow / v.15 no.2 / pp.86-90 / 2012
  • Purpose: We studied the need for distal clavicle resection by comparing rotator cuff tear patients who underwent non-surgical treatment with and without acromioclavicular joint pathology. Materials and Methods: We studied 45 cases that had been followed up for at least 9 months after rotator cuff repair in our hospital between January 2005 and June 2011. The acromioclavicular joint pathology group and the control group were classified by physical examination and MRI findings. The changes over time in shoulder abduction, internal rotation, and external rotation strength and in the ASES and KSS scores of the two groups were measured and analyzed. Results: In the group with rotator cuff injury complicated by acromioclavicular joint pathology, the strength measurements for abduction, internal rotation, and external rotation were 8.05 (${\pm}4.54$), 11.33 (${\pm}6.05$), and 10.24 (${\pm}5.27$) preoperatively and improved to 13.26 (${\pm}5.50$), 17.51 (${\pm}6.80$), and 15.60 (${\pm}5.37$) postoperatively, while the KSS and ASES scores were 49.07 (${\pm}15.28$) and 48.65 (${\pm}13.27$) preoperatively, improving to 84.48 (${\pm}10.96$) and 84.65 (${\pm}9.86$), respectively. In the group without acromioclavicular pathology, the strength for abduction, internal rotation, and external rotation was 6.42 (${\pm}3.11$), 7.59 (${\pm}4.81$), and 7.93 (${\pm}4.49$) preoperatively, improving to 15.85 (${\pm}7.35$), 19.18 (${\pm}9.14$), and 16.95 (${\pm}5.70$) postoperatively, while the KSS and ASES scores went from 42.12 (${\pm}6.43$) and 41.37 (${\pm}7.42$) to 83.44 (${\pm}6.30$) and 83.17 (${\pm}7.01$), respectively. The measurements for the two groups, however, did not show a statistically significant difference (p>0.05). Conclusion: Both rotator cuff injury groups, with and without AC joint pathology, showed improved strength and ASES and KSS scores, with no statistically significant difference between the groups.
As such, conservative treatment is thought to be an acceptable alternative to distal clavicle resection.

Methods for Integration of Documents using Hierarchical Structure based on the Formal Concept Analysis (FCA 기반 계층적 구조를 이용한 문서 통합 기법)

  • Kim, Tae-Hwan;Jeon, Ho-Cheol;Choi, Joong-Min
    • Journal of Intelligence and Information Systems / v.17 no.3 / pp.63-77 / 2011
  • The World Wide Web is a very large distributed digital information space. From its origins in 1991, the web has grown to encompass diverse information resources such as personal home pages, online digital libraries, and virtual museums. Some estimates suggest that the web currently includes over 500 billion pages in the deep web. The ability to search and retrieve information from the web efficiently and effectively is an enabling technology for realizing its full potential. With powerful workstations and parallel processing technology, efficiency is not a bottleneck. In fact, some existing search tools sift through gigabyte-size precompiled web indexes in a fraction of a second. But retrieval effectiveness is a different matter. Current search tools retrieve too many documents, of which only a small fraction are relevant to the user query. Furthermore, the most relevant documents do not necessarily appear at the top of the query output. Also, current search tools cannot retrieve, from a gigantic number of documents, the documents related to a retrieved document. The most important problem for many current search systems is to increase the quality of search, that is, to provide related documents and to reduce the number of unrelated documents in the search results as far as possible. To address this problem, CiteSeer proposed ACI (Autonomous Citation Indexing) of the articles on the World Wide Web. A "citation index" indexes the links between articles that researchers make when they cite other articles. Citation indexes are very useful for a number of purposes, including literature search and analysis of the academic literature. In detail, references contained in academic articles are used to give credit to previous work in the literature and provide a link between the "citing" and "cited" articles. A citation index indexes the citations that an article makes, linking the article with the cited works.
Citation indexes were originally designed mainly for information retrieval. The citation links allow navigating the literature in unique ways. Papers can be located independent of language and of the words in the title, keywords, or document. A citation index allows navigation backward in time (the list of cited articles) and forward in time (which subsequent articles cite the current article?). However, CiteSeer cannot index links between articles that researchers do not make, because it indexes only the links that researchers make when they cite other articles. For the same reason, CiteSeer does not scale easily. These problems led us to design a more effective search system. This paper presents a method that extracts the subject and predicate of each sentence in a document. A document is converted into a tabular form in which each extracted predicate is checked against its possible subjects and objects. We build a hierarchical graph of a document from the table and then integrate the graphs of the documents. From the graph of the entire document set, the area of each document is calculated relative to the integrated documents, and relations among the documents are marked by comparing these areas. The paper also proposes a method for the structural integration of documents that retrieves documents from the graph, which makes it easier for the user to find information. We compared the performance of the proposed approaches with the Lucene search engine using its ranking formulas. As a result, the F-measure is about 60%, which is about 15% better.
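
The hierarchical structure in formal concept analysis comes from pairing each set of objects with exactly the attributes they share. The following is a minimal sketch of that step on a toy cross-table. It is not the authors' implementation: the abstract describes a predicate-by-subject/object table, whereas the document-by-term context, the function names, and the brute-force enumeration here are assumptions made for illustration.

```python
from itertools import combinations


def extent(context, attrs):
    """Objects that have every attribute in `attrs`."""
    return frozenset(o for o, has in context.items() if attrs <= has)


def intent(context, objs):
    """Attributes shared by every object in `objs` (all attributes if empty)."""
    if not objs:
        return frozenset(set().union(*context.values()))
    return frozenset(set.intersection(*(set(context[o]) for o in objs)))


def concepts(context):
    """Enumerate all formal concepts (extent, intent) by closing attribute subsets.

    Exponential in the number of attributes, so suitable for toy inputs only.
    """
    all_attrs = sorted(set().union(*context.values()))
    found = set()
    for r in range(len(all_attrs) + 1):
        for combo in combinations(all_attrs, r):
            e = extent(context, frozenset(combo))
            found.add((e, intent(context, e)))
    return found


# Hypothetical context: three documents and the terms they contain.
ctx = {"d1": {"a", "b"}, "d2": {"b", "c"}, "d3": {"b"}}
for e, i in sorted(concepts(ctx), key=lambda c: -len(c[0])):
    print(sorted(e), sorted(i))
```

Ordering the concepts by extent inclusion gives the concept lattice, the kind of hierarchy on which document graphs can be built and merged.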

The Influence of Ventilation and Shade on the Mean Radiant Temperature of Summer Outdoor (통풍과 차양이 하절기 옥외공간의 평균복사온도에 미치는 영향)

  • Lee, Chun-Seok;Ryu, Nam-Hyung
    • Journal of the Korean Institute of Landscape Architecture / v.40 no.5 / pp.100-108 / 2012
  • The purpose of this study was to evaluate the influence of shading and ventilation on the mean radiant temperature (MRT) of outdoor space in summer. Wind speed (WS), air temperature (AT), and globe temperature (GT) were recorded every minute from the $1^{st}$ of May to the $30^{th}$ of September 2011 at a height of 1.2 m above the ground in four experimental plots with different shading and ventilation conditions, using a measuring system consisting of a vane-type anemometer (Barini Design's BDTH), a resistance temperature detector (RTD, Pt-100), a standard black globe (${\O}$ 150mm), and data acquisition systems (National Instrument's Labview and Compfile Techs' Moacon). To implement the four ventilation and shading conditions, three hexahedral steel frames and one natural plot were established in an open grass field. Two of the steel frames measured $3m(W){\times}3m(L){\times}1.5m(H)$ and had every vertical side covered with transparent polyethylene film to prevent lateral ventilation (Ventilation Blocking Plot: VP); on one of them, an additional shading curtain was applied to the top side (Shading and Ventilation Blocking Plot: SVP). The third frame measured $1.5m(W){\times}1.5m(L){\times}1.5m(H)$, and only its top side was covered by the shading curtain, without the lateral film (Shading Plot: SP). The last plot was in natural condition without any shading or wind-blocking material (Natural Open Plot: NP). Based on 13,262 records from 44 sunny days, the 24-hour time series of AT and GT were analyzed and compared; statistical analysis was performed on 7,172 records from the daytime period from 7 A.M. to 8 P.M., while the relation of MRT to solar radiation and wind speed was analyzed using the records from the hottest period, from 11 A.M. to 4 P.M. The major findings were as follows: 1.
The peak AT was $40.8^{\circ}C$ at VP and $35.6^{\circ}C$ at SP, a difference of about $5^{\circ}C$, but the difference in average AT was very small, within ${\pm}1^{\circ}C$. 2. The peak GT differed by $12^{\circ}C$, at $52.5^{\circ}C$ at VP and $40.6^{\circ}C$ at SP, while the gap in average GT between the two plots was $6^{\circ}C$. Comparing all four plots, including NP and SVP, shading decreased GT by $6^{\circ}C$ while wind blocking increased GT by $3^{\circ}C$. 3. According to the calculated MRT, shading had a cooling effect, reducing MRT by a maximum of $13^{\circ}C$ and an average of $9^{\circ}C$, while wind blocking had a heating effect, increasing MRT by an average of $3^{\circ}C$. In other words, the MRT of a shaded area with natural ventilation could be up to about $16^{\circ}C$ lower than that of a sunny site with wind blocking. 4. The regression and correlation tests showed that shading is more important than ventilation in reducing MRT, while both play an important role in improving outdoor thermal comfort. In summary, the results of this study showed that shade is the most important factor and ventilation the second most important factor in improving outdoor thermal comfort during summer daylight hours. Therefore, the more shade is provided by forests, shade trees, and the like, the more effectively the microclimate of an outdoor space can be conditioned, reducing heat energy that is useless or even harmful for human activities. Furthermore, a carefully designed wind corridor or outdoor ventilation system can improve the thermal environment even of urban areas.
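
The abstract does not state how MRT was calculated from the recorded WS, AT, and GT. One commonly cited option is the forced-convection globe formula from ISO 7726, sketched below as an assumption rather than as the authors' method. The function name, the default emissivity of 0.95, and the sample readings are hypothetical; the 0.15 m diameter matches the standard black globe named in the abstract.

```python
def mean_radiant_temperature(globe_c, air_c, wind_ms,
                             diameter_m=0.15, emissivity=0.95):
    """Estimate MRT (deg C) from globe temperature, air temperature, and wind speed.

    Assumed ISO 7726 forced-convection form:
    MRT = [(Tg + 273.15)^4 + 1.1e8 * v^0.6 / (eps * D^0.4) * (Tg - Ta)]^(1/4) - 273.15
    """
    convection = 1.1e8 * wind_ms ** 0.6 / (emissivity * diameter_m ** 0.4)
    return ((globe_c + 273.15) ** 4
            + convection * (globe_c - air_c)) ** 0.25 - 273.15


# Hypothetical readings: a globe warmer than the air under a light breeze.
print(round(mean_radiant_temperature(45.0, 33.0, 1.0), 1))
```

When the globe is warmer than the air, the convective correction pushes the estimate above the globe reading, and the gap widens with wind speed, which is why wind speed has to be logged alongside GT and AT.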

A Study on Market Expansion Strategy via Two-Stage Customer Pre-segmentation Based on Customer Innovativeness and Value Orientation (고객혁신성과 가치지향성 기반의 2단계 사전 고객세분화를 통한 시장 확산 전략)

  • Heo, Tae-Young;Yoo, Young-Sang;Kim, Young-Myoung
    • Journal of Korea Technology Innovation Society / v.10 no.1 / pp.73-97 / 2007
  • R&D into future technologies should be conducted in conjunction with technological innovation strategies that are linked to corporate survival within a framework of information and knowledge-based competitiveness. As such, future technology strategies should be ensured through open R&D organizations. The development of future technologies should not be conducted simply on the basis of future forecasts, but should take into account customer needs in advance and reflect them in the development of the future technologies or services. This research selects as segmentation variables customers' attitude towards accepting future telecommunication technologies and their value orientation in everyday life, as these factors will have the greatest effect on the demand for future telecommunication services, and uses them to segment the future telecom service market. It thereby seeks to segment the market from the stage of technology R&D activities and to employ the results in formulating technology development strategies. Based on customer attitude towards accepting new technologies, two groups were derived, and a hierarchical customer segmentation model was provided to conduct a secondary segmentation of the two groups on the basis of their respective customer value orientation. A survey was conducted in June 2006 on 800 consumers aged 15 to 69, residing in Seoul and five other major South Korean cities, through one-on-one interviews. The sample was divided into two sub-groups according to level of acceptance of new technology: a sub-group demonstrating a high level of technology acceptance (39.4%) and a sub-group with a comparatively lower level of technology acceptance (60.6%). Each of these two sub-groups was further divided into 5 smaller sub-groups (10 smaller sub-groups in total) through two rounds of segmentation.
The ten sub-groups were then analyzed in terms of their detailed characteristics, including general demographic characteristics, usage patterns of existing telecom services such as mobile service, broadband internet, and wireless internet, ownership of computing or information devices, and the desire or intention to purchase one. Through these steps, we were able to show statistically that each of these 10 sub-groups responded to telecom services as an independent market. Through correspondence analysis, the target segmentation groups were positioned in such a way as to facilitate the entry of future telecommunication services into the market, as well as their diffusion and transferability.

Automatic Quality Evaluation with Completeness and Succinctness for Text Summarization (완전성과 간결성을 고려한 텍스트 요약 품질의 자동 평가 기법)

  • Ko, Eunjung;Kim, Namgyu
    • Journal of Intelligence and Information Systems / v.24 no.2 / pp.125-148 / 2018
  • Recently, as the demand for big data analysis increases, cases of analyzing unstructured data and using the results are also increasing. Among the various types of unstructured data, text is used as a means of communicating information in almost all fields. In addition, many analysts are interested in text because the amount of data is very large and it is relatively easy to collect compared to other unstructured and structured data. Among the various text analysis applications, document classification, which classifies documents into predetermined categories; topic modeling, which extracts major topics from a large number of documents; sentiment analysis or opinion mining, which identifies emotions or opinions contained in texts; and text summarization, which summarizes the main contents of one or several documents, have been actively studied. In particular, text summarization is actively applied in business through news summary services, privacy policy summary services, etc. In addition, much research has been done in academia on the extraction approach, which selectively provides the main elements of the document, and the abstraction approach, which extracts elements of the document and composes new sentences by combining them. However, techniques for evaluating the quality of automatically summarized documents have not made as much progress as techniques for automatic text summarization. Most existing studies on the quality evaluation of summarization manually summarized documents, used the results as reference documents, and measured the similarity between the automatic summary and the reference document. Specifically, an automatic summary is produced from the full text through various techniques, and its quality is measured by comparison with a reference document, which is an ideal summary.
Reference documents are provided in two major ways. The most common is manual summarization, in which a person creates an ideal summary by hand. Since this method requires human intervention in preparing the summary, it takes a lot of time and cost, and the evaluation result may differ depending on the subjectivity of the summarizer. Therefore, in order to overcome these limitations, attempts have been made to measure the quality of summary documents without human intervention. As a representative attempt, a method has recently been devised that reduces the size of the full text and measures the similarity between the reduced full text and the automatic summary. In this method, the more of the frequent terms in the full text appear in the summary, the better the quality of the summary is judged to be. However, since summarization essentially means condensing a large amount of content while minimizing content omissions, it is unreasonable to say that a "good summary" based only on frequency is always a "good summary" in the essential sense. To overcome the limitations of this previous approach to summarization evaluation, this study proposes an automatic quality evaluation method for text summarization based on the essential meaning of summarization, built on the concepts of succinctness and completeness. Specifically, succinctness is defined as an element indicating how little content is duplicated among the sentences of the summary, and completeness is defined as an element indicating how little of the content is left out of the summary.
To evaluate the practical applicability of the proposed methodology, 29,671 sentences were extracted from TripAdvisor's hotel reviews, the reviews were summarized for each hotel, and the quality of the summaries was evaluated according to the proposed methodology. The paper also provides a way to integrate completeness and succinctness, which are in a trade-off relationship, into an F-score, and proposes a method for performing optimal summarization by changing the sentence similarity threshold.
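
The trade-off between completeness and succinctness can be sketched as follows. This is an illustration under stated assumptions, not the authors' measure: the abstract gives only verbal definitions, so the word-overlap (Jaccard) similarity, the two scoring formulas, the harmonic-mean F-score, and all function names are hypothetical choices.

```python
def jaccard(a, b):
    """Word-set overlap between two sentences (assumed similarity measure)."""
    wa, wb = set(a.lower().split()), set(b.lower().split())
    return len(wa & wb) / len(wa | wb) if wa | wb else 0.0


def completeness(source, summary, threshold):
    """Share of source sentences covered by some sufficiently similar summary sentence."""
    covered = sum(1 for s in source
                  if any(jaccard(s, t) >= threshold for t in summary))
    return covered / len(source)


def succinctness(summary, threshold):
    """One minus the share of summary sentences that duplicate an earlier one."""
    duplicates = sum(1 for i, t in enumerate(summary)
                     if any(jaccard(t, u) >= threshold for u in summary[:i]))
    return 1.0 - duplicates / len(summary)


def f_score(c, s):
    """Harmonic mean of completeness and succinctness."""
    return 2.0 * c * s / (c + s) if c + s else 0.0


# Hypothetical hotel-review sentences and a two-sentence summary.
source = ["the room was clean", "the room was very clean",
          "breakfast was cold", "staff were friendly"]
summary = ["the room was clean", "breakfast was cold"]
c = completeness(source, summary, 0.5)
s = succinctness(summary, 0.5)
print(c, s, f_score(c, s))
```

Sweeping the threshold and keeping the value that maximizes the F-score is one way to read the abstract's remark about choosing the summary by changing the sentence similarity threshold.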