• Title/Summary/Keyword: study summary

Subject-Balanced Intelligent Text Summarization Scheme (주제 균형 지능형 텍스트 요약 기법)

  • Yun, Yeoil;Ko, Eunjung;Kim, Namgyu
    • Journal of Intelligence and Information Systems / v.25 no.2 / pp.141-166 / 2019
  • Channels such as social media and SNS now generate enormous amounts of data, and the share of unstructured data represented as text has grown geometrically. Because it is difficult to read all of this text, it is important to access it quickly and grasp its key points. To meet this need for efficient understanding, many studies on text summarization have been proposed for handling and using very large amounts of text. In particular, many recent methods use machine learning and artificial intelligence algorithms to generate summaries objectively and effectively, an approach called "automatic summarization". However, almost all text summarization methods proposed to date construct the summary based on the frequency of content in the original documents. Such summaries tend to leave out low-weight subjects that are mentioned less often in the original text. If a summary covers only the major subjects, bias arises and information is lost, so it is hard to ascertain every subject the documents contain. To avoid this bias, one can summarize with an eye to balance between the topics in a document so that every subject can be ascertained, but an imbalance in the distribution of those subjects still remains. To retain the balance of subjects in a summary, it is necessary to consider the proportion of every subject in the original documents and also to allocate the portions of the subjects equally, so that sentences on minor subjects are sufficiently included. In this study, we propose a "subject-balanced" text summarization method that secures balance among all subjects and minimizes the omission of low-frequency subjects. For subject-balanced summaries, we use two summary evaluation metrics, "completeness" and "succinctness".
Completeness means that the summary should fully cover the contents of the original documents, and succinctness means that the summary has minimal duplication within itself. The proposed method has three phases. The first phase constructs subject term dictionaries. Topic modeling is used to calculate topic-term weights, which indicate the degree to which each term is related to each topic. From the derived weights, the terms highly related to each topic can be identified, and the subjects of the documents can be found from topics composed of terms with similar meanings. A few terms that represent each subject well are then selected; in this method they are called "seed terms". However, the seed terms are too few to explain each subject adequately, so a well-constructed subject dictionary needs enough additional terms similar to the seed terms. Word2Vec is used for this word expansion: it finds terms similar to the seed terms. Word vectors are created by Word2Vec modeling, and from those vectors the similarity between any two terms can be derived using cosine similarity. The higher the cosine similarity between two terms, the stronger the relationship between them is taken to be. Terms with high similarity to the seed terms of each subject are therefore selected, and after filtering these expanded terms the subject dictionary is finally constructed. The next phase allocates subjects to every sentence in the original documents. To grasp the content of each sentence, frequency analysis is first conducted on the terms in the subject dictionaries. The TF-IDF weight of each subject is calculated after the frequency analysis, which shows how much each sentence says about each subject. However, TF-IDF weights are unbounded above, so the weights for every subject in each sentence are normalized to values between 0 and 1.
Each sentence is then allocated to the subject with the maximum TF-IDF weight, so that a sentence group is constructed for each subject. The last phase is summary generation. Sen2Vec is used to compute the similarity between subject sentences, and a similarity matrix is formed. By repeatedly selecting sentences, it is possible to generate a summary that fully covers the contents of the original documents and minimizes duplication within itself. To evaluate the proposed method, 50,000 TripAdvisor reviews were used to construct the subject dictionaries and 23,087 reviews were used to generate summaries. A comparison between summaries from the proposed method and frequency-based summaries verified that summaries from the proposed method better retain the balance of all subjects that the documents originally have.
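
The cosine similarity used for term expansion and the second phase of the abstract above (scaling per-subject weights to the range 0 to 1, then assigning each sentence to its highest-weight subject) can be sketched in Python as follows. This is a minimal illustration, not the authors' implementation: the abstract does not state the normalization formula, so min-max scaling is an assumption, and the function names `cosine_similarity`, `min_max_normalize`, and `allocate_subjects` are hypothetical.

```python
import math


def cosine_similarity(u, v):
    """Cosine similarity between two equal-length numeric vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0 or norm_v == 0:
        return 0.0  # treat a zero vector as unrelated to everything
    return dot / (norm_u * norm_v)


def min_max_normalize(values):
    """Rescale values to [0, 1] (assumed scheme); a constant list maps to zeros."""
    lo, hi = min(values), max(values)
    if hi == lo:
        return [0.0 for _ in values]
    return [(v - lo) / (hi - lo) for v in values]


def allocate_subjects(weights):
    """weights[i][s] is the raw weight of subject s in sentence i.

    Normalizes each subject's weights across sentences, then labels every
    sentence with the index of its largest normalized weight.
    """
    n_sentences = len(weights)
    n_subjects = len(weights[0])
    columns = [
        min_max_normalize([weights[i][s] for i in range(n_sentences)])
        for s in range(n_subjects)
    ]
    normalized = [
        [columns[s][i] for s in range(n_subjects)] for i in range(n_sentences)
    ]
    labels = [max(range(n_subjects), key=lambda s: row[s]) for row in normalized]
    return normalized, labels
```

Grouping sentences by the returned labels gives one sentence group per subject, which is the input the summary-generation phase would then draw from.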

The Effects of Argumentation-based General Chemistry Laboratory on Preservice Science Teachers' Understanding of Chemistry Concepts and Writing (논의가 강조된 일반화학실험이 예비교사의 글쓰기 능력 및 화학개념 이해에 미치는 효과)

  • Nam, Jeong-Hee;Koh, Mi-Rye;Bak, Deok-Chan;Lim, Jai-Hang;Lee, Dong-Won;Choi, Ae-Ran
    • Journal of The Korean Association For Science Education / v.31 no.8 / pp.1077-1091 / 2011
  • The purpose of this study was to examine the effects of an argumentation-based general chemistry laboratory on preservice science teachers' understanding of chemistry concepts and writing. Five argumentation-based general chemistry laboratory activities were developed using the Science Writing Heuristic (SWH) approach. A Summary Writing Test and a Chemistry Concepts Test were developed as tools to examine the effects of this approach. Both argumentation-based and traditional general chemistry laboratory activities were implemented for the experimental group (23 students), and only traditional general chemistry laboratory activities were implemented for the comparative group (16 students). The results indicated significant differences between the two groups in understanding of chemistry concepts and in summary writing. The experimental group showed significantly higher mean scores than the comparative group on both. In the analysis of the sub-components of Summary Writing, there was no significant difference between the two groups in 'Big Idea.' However, the experimental group gained significantly higher mean scores in 'argumentation,' 'understanding of science concepts,' and 'rhetoric structure.' The results showed that argumentation-based general chemistry laboratory programs were effective in improving understanding of chemistry concepts and writing in the general chemistry laboratory.

An application of datamining approach to CQI using the discharge summary (퇴원요약 데이터베이스를 이용한 데이터마이닝 기법의 CQI 활동에의 황용 방안)

  • 선미옥;채영문;이해종;이선희;강성홍;호승희
    • Proceedings of the Korea Intelligent Information System Society Conference / 2000.11a / pp.289-299 / 2000
  • This study applies a data mining approach to CQI (Continuous Quality Improvement) using the discharge summary. First, we found a process variation in the hospital infection rate using the SPC (Statistical Process Control) technique. Second, the importance of factors influencing hospital infection was inferred through decision tree analysis, which is a classification method in data mining. The most important factor was surgery, followed by comorbidity and length of operation. Comorbidity was further divided by age and principal diagnosis, and length of operation was further divided by age and chief complaint. Twenty-four rules of hospital infection were generated by the decision tree analysis. Of these, 9 rules with predictive power greater than 50% were suggested as guidelines for hospital infection control. The optimum range of the target group in hospital infection control was identified through the information gain summary. Association rule analysis, another data mining method, was performed to analyze the relationship between principal diagnosis and comorbidity. The confidence score, which measures the degree of association, was highest between urinary tract infection and causal bacillus, followed by the score between postoperative wound disruption and postoperative wound infection. This study demonstrated how a data mining approach could be used to provide information to support prospective surveillance of hospital infection. The data mining technique can also be applied to various areas for CQI using other hospital databases.
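
The confidence score mentioned in the abstract above is, in standard association rule mining, the share of records containing the antecedent that also contain the consequent. The Python sketch below shows that textbook definition only; the function name `rule_confidence` and the record layout (one set of codes per discharge record) are assumptions for illustration and are not taken from the study.

```python
def rule_confidence(records, antecedent, consequent):
    """Confidence of the rule antecedent -> consequent.

    records: iterable of sets of codes (hypothetical layout, one set per record).
    Returns count(antecedent and consequent) / count(antecedent),
    or 0.0 when no record contains the antecedent.
    """
    antecedent = set(antecedent)
    both = antecedent | set(consequent)
    records = [set(r) for r in records]
    n_antecedent = sum(1 for r in records if antecedent <= r)
    if n_antecedent == 0:
        return 0.0
    n_both = sum(1 for r in records if both <= r)
    return n_both / n_antecedent
```

A rule with confidence close to 1 means the consequent almost always accompanies the antecedent in the records, which is the sense in which the score measures the degree of association.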

The Effect of e-Learning Contents' Information Presentation Method on Teaching Presence and Academic Achievement (e-러닝 콘텐츠의 정보제시방식이 교수실재감 및 학업성취도에 미치는 효과)

  • Kim, Jinha;Kim, Kyunghee;Lee, Seongju
    • The Journal of Korean Association of Computer Education / v.22 no.3 / pp.79-87 / 2019
  • This study examined the effect on learning of e-learning contents with differing degrees of dual coding, media richness, and cognitive load. To do so, the summary and explanation presentation methods in e-learning contents were divided according to the quantity and kind of information, and their effects on teaching presence and academic achievement were examined. The summary presentation method was produced as a text type and a text+illustration type, and the explanation presentation method as an audio type and an audio+video type. The results of this study are as follows. First, in the summary method, the text+illustration type had significantly higher teaching presence than the text type. Second, in the explanation method, the audio type was found to be significantly higher than the audio+video type. Third, the interaction between the summary method and the explanation method was found to be significant for teaching presence and academic achievement.

Study of Cursive Calligraphy of Wu Zhen (吳鎮)'s Ink Bamboo Collection

  • Deng, Zhuoren;Lee, Jaewoo
    • International Journal of Advanced Culture Technology / v.10 no.2 / pp.69-78 / 2022
  • The purpose of this paper is to summarize the cursive script of traditional calligraphy and develop further possibilities based on a study of the painting and postscript of Ink Bamboo, painted by Wu Zhen (吳鎮) during the Yuan Dynasty. The second section summarizes Wu Zhen's life, in addition to "Ink Bamboo" and its painting postscript. The third and fourth sections analyze the cursive script in the painting postscript of Ink Bamboo, including the left-and-right structure, head prefix symbols, and bottom prefix symbols. The paper's focus is the study of cursive script, and the theories and methods of the characters proposed by Dr. Cai Yonggui (Fujian Normal University) and Dr. Liu Dongqin (Southeast University) are used to provide a summary. The research results are presented so as to develop further possibilities for this type of traditional calligraphy.

Multi-Sized cumulative Summary Structure Driven Light Weight in Frequent Closed Itemset Mining to Increase High Utility

  • Siva S;Shilpa Chaudhari
    • Journal of information and communication convergence engineering / v.21 no.2 / pp.117-129 / 2023
  • High-utility itemset mining (HUIM) has emerged as a key data-mining paradigm for object-of-interest identification and recommendation systems that serve as frequent itemset identification tools, product or service recommendation systems, etc. Recently, it has gained widespread attention owing to its increasing role in business intelligence, top-N recommendation, and other enterprise solutions. Despite this increasing significance, most existing solutions, including frequent itemset mining, HUIM, and high average- and fast high-utility itemset mining, cannot provide swift and accurate predictions and are limited in coping with real-time enterprise demands. Moreover, complex computations and high memory exhaustion limit their scalability as enterprise solutions. To address these limitations, this study proposes a model that extracts high-utility frequent closed itemsets based on an improved cumulative summary list structure (CSLFC-HUIM) to reduce the search space to an optimal set of candidate items. Moreover, it employs the lift score as the minimum threshold, called the cumulative utility threshold, to prune the search space to an optimal set of itemsets in a nested-list structure, which improves computational time, cost, and memory exhaustion. Simulations over different datasets revealed that the proposed CSLFC-HUIM model outperforms other existing methods, such as closed- and frequent closed-HUIM variants, in terms of execution time and memory consumption, making it suitable for different mined items and allied intelligence of business goals.
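
The abstract above uses the lift score as a pruning threshold. As a rough illustration of that idea only, the Python sketch below computes the standard lift of a pair of itemsets and drops candidate pairs whose lift falls below a minimum. It is not the CSLFC-HUIM algorithm: the cumulative summary list structure and utility values are not modeled, and the names `support`, `lift`, and `prune_by_lift` are hypothetical.

```python
def support(transactions, itemset):
    """Fraction of transactions that contain every item in itemset."""
    itemset = set(itemset)
    hits = sum(1 for t in transactions if itemset <= set(t))
    return hits / len(transactions)


def lift(transactions, a, b):
    """Standard lift: support(a and b) / (support(a) * support(b))."""
    support_a = support(transactions, a)
    support_b = support(transactions, b)
    if support_a == 0 or support_b == 0:
        return 0.0
    return support(transactions, set(a) | set(b)) / (support_a * support_b)


def prune_by_lift(transactions, candidate_pairs, min_lift):
    """Keep only the (a, b) candidate pairs whose lift reaches min_lift."""
    return [
        (a, b) for a, b in candidate_pairs if lift(transactions, a, b) >= min_lift
    ]
```

A lift above 1 indicates that the two itemsets co-occur more often than independence would predict, so a threshold of 1 or higher removes uninformative candidates before any further mining step.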

Validity Verification of a Korean Version of Recovery Scale(Client Assessment Summary) for Alcoholics (알코올중독자의 회복척도 CAS(Client Assessment Summary) 한국어판의 타당도 검증)

  • Rhee, Young-Sun;Kim, Soo-Youn
    • Journal of the Korea Academia-Industrial cooperation Society / v.17 no.11 / pp.386-394 / 2016
  • This study investigates the validity of a Korean version of the Client Assessment Summary (CAS), which is a tool used to assess the recovery of alcoholics. We investigated the Korean CAS's suitability for assessing the recovery of alcoholics in general in Korea. We analyzed the data of 205 abstaining alcoholics in order to determine the validity of the Korean CAS. We undertook relationship analyses of CAS contents, reliability, and construct validity through factor analysis. In addition, we assessed ARS, abstinence period, abstinence self-efficacy, illness insight, and motivation change variables. The factor analysis, performed after verifying content suitability by assessing 12 questions and 4 factors, confirmed the tool's construct validity, with relatively high values ($R^2$ = 76.26%, communality $\geq 0.6$, and KMO = 0.92). Moreover, internal consistency was acceptable (Cronbach's alpha = 0.92), and the correlations among ARS, abstinence self-efficacy, illness insight, and motivation change variables confirmed the validity of the Korean CAS. The proposed Korean CAS is expected to be useful for assessing the recovery of alcoholics academically and clinically, thereby eventually contributing to successful recoveries from alcoholism.
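
The internal consistency figure reported in the abstract above is Cronbach's alpha, which for k items is k/(k-1) multiplied by one minus the ratio of the summed item variances to the variance of the total scores. The Python sketch below computes that standard formula with sample variances; the function name `cronbach_alpha` and the row-per-respondent layout are assumptions for illustration, and no data from the study is reproduced.

```python
from statistics import variance


def cronbach_alpha(scores):
    """Cronbach's alpha for scores[r][i] = respondent r's score on item i.

    Uses sample variances (n - 1 denominator) for items and total scores.
    """
    n_respondents = len(scores)
    k = len(scores[0])
    if k < 2 or n_respondents < 2:
        raise ValueError("need at least two items and two respondents")
    item_variances = [variance([row[i] for row in scores]) for i in range(k)]
    total_variance = variance([sum(row) for row in scores])
    if total_variance == 0:
        raise ValueError("total scores have zero variance")
    return (k / (k - 1)) * (1 - sum(item_variances) / total_variance)
```

Values near 1 indicate that the items vary together across respondents, which is why a result such as 0.92 is read as acceptable internal consistency.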

Characteristics of Scientific Method for the 8th Grade Students' Inquiry Reports (8학년 학생들의 탐구 보고서에 나타난 과학방법의 특징)

  • Shin, Mi-Young;Choe, Seung-Urn
    • Journal of the Korean earth science society / v.29 no.4 / pp.341-351 / 2008
  • The purpose of this study was to investigate the scientific methods of inquiry that eighth graders used in their reports. We developed a framework, 'Analysis of Scientific Methods and Information Sources', from the perspective of the Nature of Science to analyze students' planning methods, data analysis, and information sources. We then compared the results with the levels of the questions to find out whether they affected students' 'Scientific Method'. In addition, we analyzed students' responses to the survey questionnaire, e.g., how they liked Scientific Method. The results are as follows. First, 'planning method' consisted of 'consultant' and 'activities'. The 'activities' were 'experiment', 'correlational study', and 'observation'. Students planned by using 'consultant' more than 'activities'. When they planned 'activities', most were 'experiment'. Second, 'data analysis' consisted of 'summary', 'table', 'chart', 'graph', and so on. Students frequently analyzed their data by using 'summary'. The types of 'summary' were divided into 'simple summary' and 'relational statement'. Third, 'information sources' consisted of 'computer', 'library', and 'professional consultant'. Most of the students gathered information from 'computer'. Fourth, the types of 'planning method' and 'summary' were affected by the levels of the questions. Fifth, some of the students reported difficulty with 'planning method' because the collected information was less reliable, insufficient, or full of difficult technical terms.

A Study on Development of Korean Failure Rate Databook (한국형 고장률 데이터 북 개발에 대한 연구)

  • Paik, Soonheum;Lim, Jae-hak
    • Journal of Applied Reliability / v.17 no.4 / pp.305-315 / 2017
  • Purpose: The purpose of this research is to propose a procedure and methodology for developing a failure rate databook suited to the Korean operating environment. Methods: To this end, we investigate failure databooks used in other countries and study the procedures and methodologies for collecting failure data, organizing the data, estimating failure rates, and summarizing results. Results: We develop a procedure for developing a failure databook, the items for data collection, a database schema for part details and part summaries, and the contents of the failure databook, taking into account the application environment in Korea. Conclusion: The results of our research could be used to develop a Korean failure rate databook and to research reliability prediction models, and could ultimately help improve the accuracy of reliability prediction.

The Effects of the Science Writing Heuristic Approach on the Middle School Students' Achievements (중학생의 성취 수준에 따른 탐구적 과학 글쓰기(Science Writing Heuristic) 수업의 효과)

  • Shin, Soyoung;Choi, Aeran;Park, Jong-Yoon
    • Journal of The Korean Association For Science Education / v.33 no.5 / pp.952-962 / 2013
  • The purpose of this study was to investigate the effects of the Science Writing Heuristic (SWH) approach on students' summary writing, logical thinking, and course achievement. Participants in this study were 132 female students from a girls' middle school. The SWH approach was used for two experimental classes, and the typical teacher-centered instructional approach was used for two comparative classes. A summary writing test, a logical thinking test (GALT), and a course achievement test were administered before and after the instruction period. The results indicated that the SWH approach helped students find big ideas, understand science concepts, develop logical thinking abilities, and do well in the course. The study also suggested that the SWH approach was effective for low-achieving students.