• Title/Summary/Keyword: summarization

AQS: An Analytical Query System for Multi-Location Rice Evaluation Data

  • Nazareno, Franco;Jung, Seung-Hyun;Kang, Yu-Jin;Lee, Kyung-Hee;Cho, Wan-Sup
    • Journal of Korea Society of Industrial Information Systems / v.15 no.2 / pp.59-67 / 2010
  • Rice varietal information exchange is vital for agricultural experiments and trials. With the growing volume of rice data gathered around the world and numerous research and development achievements, effective collection and convenient dissemination of data are important issues to address. This data is continuously collected through various international cooperation and network programs. The new challenge faced by rice breeders, scientists, and crop information specialists is acquiring this information anytime and anywhere in order to perform rapid analysis and obtain significant results in rice research, thereby supporting rice production. To address these constraints, we propose an Online Analytical Query System, a web query application that provides breeders and rice scientists around the world with a fast web search engine for rice varieties, giving users the freedom to choose the trials in which a variety was used, trait observation parameters, geographical or weather conditions, and location specifications. The application uses data warehouse techniques and OLAP to summarize the agricultural trials conducted, and statistical analysis to identify outstanding varieties used in these trials, consolidated in a Model-View-Controller Web framework.
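
The OLAP-style summarization mentioned in this abstract can be illustrated with a roll-up over a small fact table. This is only a minimal sketch: the dimension names, the `roll_up` helper, and the trial rows are made-up placeholders, not the schema or data of the AQS system.

```python
from collections import defaultdict

# Hypothetical fact rows: (variety, location, year, yield in t/ha).
# All values are invented for illustration only.
TRIALS = [
    ("variety_A", "site_1", 2008, 5.0),
    ("variety_A", "site_1", 2009, 6.0),
    ("variety_A", "site_2", 2009, 4.0),
    ("variety_B", "site_2", 2009, 7.0),
]
DIMENSIONS = ("variety", "location", "year")


def roll_up(rows, keep):
    """Average the yield measure over every dimension not listed in `keep`."""
    idx = [DIMENSIONS.index(d) for d in keep]
    sums = defaultdict(float)
    counts = defaultdict(int)
    for row in rows:
        key = tuple(row[i] for i in idx)
        sums[key] += row[-1]   # the measure is the last field of each row
        counts[key] += 1
    return {key: sums[key] / counts[key] for key in sums}


# Coarse view: one mean yield per variety.
by_variety = roll_up(TRIALS, ["variety"])
# Finer view (drill-down): mean yield per variety and location.
by_variety_location = roll_up(TRIALS, ["variety", "location"])
```

Keeping fewer dimensions gives a coarser summary, and adding a dimension back corresponds to a drill-down. A real data warehouse would precompute or index these aggregates rather than scan the rows each time.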

News in a Nutshell: A Korean Headline-Style Summarization Dataset (요점만 남긴 신문 기사: 한국어 표제 형식 문서 요약 데이터셋)

  • Kwon, Hongseok;Go, Byunghyun;Park, Juhong;Lee, Myungjee;Oh, Jaeyoung;Heo, Dam;Lee, Jonghyeok
    • Annual Conference on Human and Language Technology / 2020.10a / pp.47-53 / 2020
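  • Document summarization is the task of generating a concise summary that retains only the key content of a given document, and it is one of the major areas of natural language processing. Recent advances in learning deep neural representations from large amounts of data have driven rapid progress in summarization technology. Such data-driven approaches require high-quality data for model training. However, for less widely studied languages such as Korean, data are hard to obtain, and building a dataset takes considerable time and cost. This paper introduces a large-scale dataset for Korean document summarization. The dataset consists of 206,822 article-summary pairs, and each summary comprises several sentences in headline style. To verify the suitability of the training data, we conduct a manual evaluation and analyze several key properties, and we train and evaluate several existing summarization systems on it to provide baselines for its future use as a summarization benchmark dataset. The dataset can be downloaded using the script at https://github.com/hong8e/KHS.git.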

A Keyphrase Extraction Model for Each Conference or Journal (학술대회 및 저널별 기술 핵심구 추출 모델)

  • Jeong, Hyun Ji;Jang, Gwangseon;Kim, Tae Hyun;Sin, Donggu
    • Proceedings of the Korean Institute of Information and Communication Sciences Conference / 2022.10a / pp.81-83 / 2022
  • Understanding research trends is necessary for selecting research topics and exploring related work. Most researchers search for representative keywords of the domains or technologies that interest them in order to understand research trends. However, some conferences in artificial intelligence and data mining now publish hundreds to thousands of papers each year, which makes it difficult for researchers to follow research trends in those domains. In this paper, we propose an automatic technology keyphrase extraction method that helps researchers understand research trends for each conference or journal. Keyphrase extraction, which extracts important terms or phrases from a text, is a fundamental technology for natural language processing tasks such as summarization and search. Previous keyphrase extraction methods based on pretrained language models extract keyphrases from long texts, so their performance degrades on short texts such as paper titles. We propose a technology keyphrase extraction model that is robust on short texts and considers the importance of each word.

Media-based Analysis of Gasoline Inventory with Korean Text Summarization (한국어 문서 요약 기법을 활용한 휘발유 재고량에 대한 미디어 분석)

  • Sungyeon Yoon;Minseo Park
    • The Journal of the Convergence on Culture Technology / v.9 no.5 / pp.509-515 / 2023
  • Despite the continued development of alternative energy, fuel consumption is increasing. In particular, the price of gasoline fluctuates greatly with international oil prices, and gas stations adjust their gasoline inventory in response. In this study, news datasets are used to analyze gasoline consumption patterns through fluctuations in gasoline inventory. First, news datasets are collected by web crawling. Second, the news datasets are summarized using KoBART, which summarizes Korean text. Finally, the data are preprocessed and the fluctuation factors are derived through an N-gram language model and TF-IDF. Through this study, it is possible to analyze and predict gasoline consumption patterns.
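
The final step described in this abstract, scoring n-grams with TF-IDF, can be sketched in plain Python. This is a simplified illustration, not the authors' pipeline: it extracts word n-grams rather than fitting an N-gram language model, it skips crawling and KoBART summarization, and the sample sentences and function names are invented.

```python
import math
from collections import Counter


def ngrams(tokens, n):
    """Return the contiguous word n-grams of a token list."""
    return [" ".join(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def tf_idf(docs, n=2):
    """Score each n-gram per document as (relative frequency) * log(N / df)."""
    term_lists = [ngrams(doc.lower().split(), n) for doc in docs]
    df = Counter()
    for terms in term_lists:
        df.update(set(terms))          # document frequency: count once per doc
    scores = []
    for terms in term_lists:
        tf = Counter(terms)
        total = len(terms) or 1        # avoid division by zero on short docs
        scores.append({
            term: (count / total) * math.log(len(docs) / df[term])
            for term, count in tf.items()
        })
    return scores


# Invented stand-ins for already-summarized news sentences.
SUMMARIES = [
    "gasoline inventory falls as oil prices rise",
    "gasoline inventory rises as demand slows",
    "oil prices rise on supply concerns",
]
scores = tf_idf(SUMMARIES, n=2)
```

Bigrams shared by several summaries, such as "gasoline inventory", receive lower scores than bigrams unique to one summary, which is why TF-IDF is useful for surfacing document-specific factors.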

Research on the Development Direction of Language Model-based Generative Artificial Intelligence through Patent Trend Analysis (특허 동향 분석을 통한 언어 모델 기반 생성형 인공지능 발전 방향 연구)

  • Daehee Kim;Jonghyun Lee;Beom-seok Kim;Jinhong Yang
    • The Journal of Korea Institute of Information, Electronics, and Communication Technology / v.16 no.5 / pp.279-291 / 2023
  • In recent years, language model-based generative AI technologies have made remarkable progress. In particular, they have attracted considerable attention because of their growing potential in fields such as summarization and code writing. Reflecting this interest, the number of patent applications related to generative AI has been increasing rapidly. Forecasting is key to understanding these trends and developing strategies accordingly, since predictions can be used to better understand future technology trends and develop more effective strategies. In this paper, we analyzed patents filed to date to identify the direction of development of language model-based generative AI. In particular, we took an in-depth look at research and invention activities in each country, focusing on application trends by year and by detailed technology. Through this analysis, we tried to understand the detailed technologies contained in the core patents and predict the future development trends of generative AI.

A Study on the Effect of University Online Learning Platform Usability on Course Satisfaction (대학 비대면 강의 플랫폼 이용성이 강의 만족도에 미치는 영향에 관한 연구)

  • Hyun Soo Chae;Jee Yeon Lee
    • Journal of the Korean Society for Library and Information Science / v.58 no.1 / pp.225-254 / 2024
  • The study aims to understand undergraduates' and graduate students' perceptions and satisfaction with online learning platforms and to verify the relationship between usability factors and satisfaction with online courses. The literature review facilitated the summarization of major factors to be considered in the online learning platform development process and established the research model. The follow-up survey verified the perceptions of university constituents regarding the fulfillment of the university online learning platforms' user interface principles, platforms' usability, satisfaction with platforms, and satisfaction with online courses. Causal relationships between variables were tested and modeled by analyzing survey results. We also confirmed that the same model can be applied to different types of learners and various types of online learning methods. This study is significant in verifying that the fulfillment of the platforms' user interface design principles can affect satisfaction with online courses using the platforms based on learners' evaluation results. We expect that the research model proposed in this study can contribute to the improvement and development of online learning environments in the future.

A Study on The Records of [The Book of Supernumerary Embryo Preservation] and [The Book of Supernumerary Embryo Donation] Enacted by "The Law on Bioethics and Safety" ("생명윤리 및 안전에 관한 법률"이 정해준 [잔여배아보관실적대장]과 [잔여배아제공실적대장]의 작성에 관한 연구)

  • Yoon, San-Hyun;Ko, Yong;Lim, Jin-Ho
    • Clinical and Experimental Reproductive Medicine / v.34 no.4 / pp.253-273 / 2007
  • Objective: This study sought ways to let a manager or superintendent inspect rationally and consistently, and to let an embryologist precisely record, [The Book of Supernumerary Embryo Preservation] and [The Book of Supernumerary Embryo Donation]. Methods: Based on the data collected between 1994 and 2004 in Clinic 44 (Maria Fertility Hospital), [The Present State about Production and Use of Embryos], [The Preservation of Supernumerary Embryos and Their Thaw State], [The Present State about Thaw and Use of Frozen Embryos], [The Present State about Donation and Charge of Frozen Embryos], [The Book about Frozen Embryo Discard], and [The Summarization Book about Management and Use of Frozen Embryos] were designed and recorded. Results: The production, use, preservation, discard, and donation quantities of human embryos, the use and discard quantities of thawed embryos, and the cumulative embryo preservation quantity could be totaled in [The Present State about Production and Use of Embryos in Clinic 44]. Also, [The Preservation of Supernumerary Embryos and Their Thaw State in Clinic 44] supported "the supernumerary embryo preservation quantity" etc. In addition, [The Present State about Thaw and Use of Frozen Embryos in Clinic 44] or [The Book about Frozen Embryo Discard in Clinic 44] supported "the use and discard quantity of thawed embryos" etc. Moreover, "the embryo donation quantity" could be totaled in [The Present State about Donation and Charge of Frozen Embryos in Clinic 44]. Finally, [The Summarization Book about Management and Use of Frozen Embryos in Clinic 44] could be used for rational and consistent management or inspection. Conclusion: The present results suggest that these documents can not only serve as standard data for recording [The Book about Supernumerary Embryo Preservation in Clinic] and [The Book about Supernumerary Embryo Donation in Clinic] but can also be preserved as treatment references.

The Influence of Mother's and Father's Conflict Resolution Styles on Adolescents' Use of Swear Words: The Mediating Role of Aggression (부와 모의 갈등해결양식이 청소년의 욕설사용에 미치는 영향: 공격성의 매개역할)

  • Lee, Bohyun;Lee, Eunhee
    • The Journal of the Convergence on Culture Technology / v.4 no.2 / pp.107-114 / 2018
  • This study examines the influence of mothers' and fathers' conflict resolution styles (aggressive and compromising) on adolescents' use of swear words. It also investigates whether aggression mediates the relationship between parents' conflict resolution styles and their children's use of swear words. To this end, a self-report questionnaire was administered to 570 students attending six middle schools in Gyeongnam Province. After excluding incomplete and insincere responses, 477 responses were used as the raw data. The results are summarized as follows. First, an aggressive conflict resolution style with mothers is positively correlated with students' use of swear words: as the conflict resolution style with mothers becomes more aggressive, the children's use of swear words increases accordingly. Second, aggression was confirmed to have a mediating effect on teenagers' use of swear words associated with mothers' and fathers' aggressive conflict resolution styles. Therefore, if conflict between children and parents is not appropriately resolved, the children's aggression accumulates and their use of swear words increases.

Topic Modeling of News Article about International Construction Market Using Latent Dirichlet Allocation (Latent Dirichlet Allocation 기법을 활용한 해외건설시장 뉴스기사의 토픽 모델링(Topic Modeling))

  • Moon, Seonghyeon;Chung, Sehwan;Chi, Seokho
    • KSCE Journal of Civil and Environmental Engineering Research / v.38 no.4 / pp.595-599 / 2018
  • A sufficient understanding of the status of the overseas construction market is crucial for achieving profitability in international construction projects. Many researchers have regarded news articles as a good data source for assessing market conditions, since they include market information such as political, economic, and social issues. Because the text data are large and unstructured, various text-mining techniques have been studied to reduce the manpower, time, and cost needed to summarize the data. However, extracting the needed information from news articles is limited by the presence of various topics in the data. This research aims to overcome these problems and contribute to summarizing market status by performing topic modeling with Latent Dirichlet Allocation. Under the assumption that 10 topics exist in the corpus, the topics included projects for user convenience (topic-2), private support for solving poverty problems in Africa (topic-4), and so on. By grouping the topics in the news articles, the results could improve the extraction of useful information and the summarization of market status.
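
Topic modeling with Latent Dirichlet Allocation, as named in this abstract, can be sketched with a small collapsed Gibbs sampler. This is a toy illustration under stated assumptions: the abstract does not say which inference method or hyperparameters the authors used, the corpus below is invented, and two topics are used here instead of the paper's 10 because the toy corpus is tiny.

```python
import random


def lda_gibbs(docs, num_topics, iterations=200, alpha=0.1, beta=0.01, seed=0):
    """Collapsed Gibbs sampling for LDA; returns topic-word counts and vocab."""
    rng = random.Random(seed)
    vocab = sorted({w for doc in docs for w in doc})
    word_id = {w: i for i, w in enumerate(vocab)}
    vocab_size = len(vocab)
    doc_topic = [[0] * num_topics for _ in docs]
    topic_word = [[0] * vocab_size for _ in range(num_topics)]
    topic_total = [0] * num_topics
    assignments = []
    # Random initial topic assignment for every token.
    for d, doc in enumerate(docs):
        z_doc = []
        for w in doc:
            z = rng.randrange(num_topics)
            z_doc.append(z)
            doc_topic[d][z] += 1
            topic_word[z][word_id[w]] += 1
            topic_total[z] += 1
        assignments.append(z_doc)
    for _ in range(iterations):
        for d, doc in enumerate(docs):
            for i, w in enumerate(doc):
                v = word_id[w]
                z = assignments[d][i]
                # Remove the token's current assignment from the counts.
                doc_topic[d][z] -= 1
                topic_word[z][v] -= 1
                topic_total[z] -= 1
                # Conditional probability of each topic for this token.
                weights = [
                    (doc_topic[d][k] + alpha)
                    * (topic_word[k][v] + beta)
                    / (topic_total[k] + vocab_size * beta)
                    for k in range(num_topics)
                ]
                z = rng.choices(range(num_topics), weights=weights)[0]
                assignments[d][i] = z
                doc_topic[d][z] += 1
                topic_word[z][v] += 1
                topic_total[z] += 1
    return topic_word, vocab


def top_words(topic_word, vocab, n=3):
    """List the n most frequent words of each topic."""
    return [
        [vocab[i] for i in sorted(range(len(vocab)), key=lambda i: -row[i])[:n]]
        for row in topic_word
    ]


# Invented, pre-tokenized stand-ins for news articles.
DOCS = [
    ["port", "contract", "bid", "port"],
    ["bid", "contract", "tender"],
    ["aid", "poverty", "water", "aid"],
    ["water", "poverty", "aid"],
]
topic_word, vocab = lda_gibbs(DOCS, num_topics=2)
topics = top_words(topic_word, vocab)
```

The top words per topic are what a reader would inspect to label topics such as "projects for user convenience". On a real news corpus one would normally use an established library implementation and tune the number of topics rather than rely on a sketch like this.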

An Epistemological Inquiry on the Development of Statistical Concepts (통계적 개념 발달에 관한 인식론적 고찰)

  • Lee, Young-Ha;Nam, Joo-Hyun
    • The Mathematical Education / v.44 no.3 s.110 / pp.457-475 / 2005
  • We inquired into what secondary-school statistics classes have been aiming at, that is, their epistemological objects. We now recognize that the main obstacle to systematic articulation is the lack of a clear view of what the statistical concepts are. This study focuses on the ingredients of statistical concepts, which are to form the basis for the systematic articulation of statistics courses, especially those for school children. We therefore required that these ingredients be i) directly related to the contents of statistics, ii) psychologically developing, iii) as mutually exclusive as possible, and iv) exhaustive enough to cover all statistical concepts. We examined what statisticians have been doing and how, along with various previous views on these matters. We suggest that three concepts form the core of conceptual development in statistics: the concept of distributions, the summarizing ability, and the concept of samples. By the concept of distributions we mean the frequency view of each random category, which develops with age from counting to probability. Summarizing ability is another important resource for grounding one's inquiry in the data set; a summary is to be viewed not only as a number but also as a value reflecting a random phenomenon. Inductive generalization is one of the most hazardous things. Statistical induction is a scientific way of addressing this hazard, and it starts from distinguishing chance from inevitable consequences. One's inductive logic grows along with one's deductive arguments; nevertheless, the two are different. The concept of samples reflects one's view of the sample data and the way one combines one's logic with the data within one's hypothesis. With these three concepts in mind, we examined the Korean statistics curriculum from K to 12. Distributional concepts are dealt with throughout but are not well sequenced. Summarization is introduced in the 1st, 5th, 7th, and 10th grades, as a numerical value only. One activity on the concept of samples is given in the 6th grade, after which the curriculum jumps to statistical reasoning in the elective courses 'Mathematics I' or 'Probability and Statistics' in grades 11-12. We suggest further studies on the developmental stages of these three conceptual features in order to obtain a firm basis for the successive articulation of statistics.
