• Title/Summary/Keyword: Text consistency

Search Result 51, Processing Time 0.031 seconds

A Study on the Direction of Development of Related Policies with Game-related Issue Analysis: Using Text Mining and Spline Function Analysis of Newspaper Articles (게임 관련 이슈 분석을 통한 관련 정책 발전 방향에 관한 연구: 운형함수와 텍스트마이닝 분석을 활용하여)

  • Jang, You-mi;Yoo, Han-byeol
    • The Journal of the Korea Contents Association
    • /
    • v.22 no.3
    • /
    • pp.513-528
    • /
    • 2022
  • The purpose of this study is to analyze Korean game-related issues and policies to increase the effectiveness of related policies in the future and to increase the consistency of social norms of the policies. this study analyzes related issues by analyzing changes in Korean newspaper articles using spline function and text mining methods, and analyzes the contents of newspaper articles at the time of amplification of issues to present major issues and development directions. As a result of the analysis, game-related issues appeared in various topic, and there are not only support from the government and local governments but also coexisted with game-related regulations (taxation, gambling regulations, game addiction disease, and prevention of fee expansion). Despite regulations, the government presents preemptive responses to problems caused by the application of metabuses and NFTs to games, fostering game-related experts, start-up support, and supporting manpower departure as policy implications.

Analysis of Traffic Improvement Measures in Transportation Impact Assessment Using Text Mining : Focusing on City Development Projects in Gyeonggi Province (텍스트마이닝을 활용한 교통영향평가 교통개선대책 분석 : 경기도 도시개발사업을 대상으로)

  • Eun Hye Yang;Hee Chan Kang;Woo-Young Ahn
    • The Journal of The Korea Institute of Intelligent Transport Systems
    • /
    • v.22 no.2
    • /
    • pp.182-194
    • /
    • 2023
  • Traffic impact assessment plays a crucial role in resolving traffic issues that may arise during the implementation of urban and transportation projects. However, reported results diverge, presumably because the items reviewed differ. In this study, we analyze traffic improvement measures approved for traffic impact assessment, identify key items, and present items that should be included in assessments. Specifically, TF-IDF and N-gram analysis and text mining were performed with focus on urban development projects approved in Gyeonggi Province. The results obtained show that keywords associated with newly established transportation infrastructure, such as roads and intersections, were essential assessment items, followed by the locations of entrances and exits and pedestrian connectivity. We recommend that considerations of the items presented in this study be incorporated into future traffic impact assessment guidelines and standards to improve the consistency and objectivity of the assessment process.

A Study on the Traceability Analysis between Non-standardized Documents (비정형화된 문서간 추적성 분석에 관한 연구)

  • Kim, EunHee;An, Kyung Ik;Song, Duck Yong
    • Korean Journal of Computational Design and Engineering
    • /
    • v.20 no.4
    • /
    • pp.328-336
    • /
    • 2015
  • We proposed a methodology to automatically extract the requirements from the documents and check the consistency and traceability among them. The documents include not only text but also PDF or image files. We also suggest a method to visualize the result with maps, numbers, and graphs. By comparing the results with those of manual reviews from experts, we show that it is necessary to use knowledge-based method in future instead of the wordbased method for improving the reliability. The results can be applied effectively for already existing documents.

Thesaurus Development for HiTEL Service (하이텔 메뉴검색용 시소러스의 개발에 관한 연구)

  • 최석두
    • Journal of the Korean Society for information Management
    • /
    • v.13 no.1
    • /
    • pp.227-241
    • /
    • 1996
  • We present development results for a Hangul thesaurus which was provided to improve performance of the intelligent information retrieval system. The important stages and methods in the process of term acquisition, classification, creation of the consistency-effectiveness relationship using HiTEL menu and text of dictionary are described. To cany out our study we have built a thesaurus management system and also describe its utility functions.

  • PDF

Semi-Automatic Scoring for Short Korean Free-Text Responses Using Semi-Supervised Learning (준지도학습 방법을 이용한 한국어 서답형 문항 반자동 채점)

  • Cheon, Min-Ah;Seo, Hyeong-Won;Kim, Jae-Hoon;Noh, Eun-Hee;Sung, Kyung-Hee;Lim, EunYoung
    • Korean Journal of Cognitive Science
    • /
    • v.26 no.2
    • /
    • pp.147-165
    • /
    • 2015
  • Through short-answer questions, we can reflect the depth of students' understanding and higher-order thinking skills. Scoring for short-answer questions may take long time and may be an issue on consistency of grading. To alleviate such the suffering, automated scoring systems are widely used in Europe and America, but are in the initial stage in research in Korea. In this paper, we propose a semi-automatic scoring system for short Korean free-text responses using semi-supervised learning. First of all, based on the similarity score between students' answers and model answers, the proposed system grades students' answers and the scored answers with high reliability have been included in the model answers through the thorough test. This process repeats until all answers are scored. The proposed system is used experimentally in Korean and social studies in Nationwide Scholastic Achievement Test. We have confirmed that the processing time and the consistency of grades are promisingly improved. Using the system, various assessment methods have got to be developed and comparative studies need to be performed before applying to school fields.

A Study on FTA Rules of WTO (WTO의 FTA룰에 관한 연구)

  • Lee, Gyun
    • Journal of Arbitration Studies
    • /
    • v.17 no.1
    • /
    • pp.183-215
    • /
    • 2007
  • The purpose of this paper is to study of WTO regulations related FTA such as Understanding on the Interpretation of Article XXIV of the General Agreement on Tariffs and Trade(GATT) 1994 and General Agreement on Trade in Service(GATS). In this study, the First introduced FTA rules of WTO in the chapter 2. The WTO agreement includes the "General Agreement on Tariffs an Trade(GATT) 1994". This instrument, known as "GATT 1994", is based on upon the original General Agreement on Tariffs and Trade referred to as "GATT 1947". The Second analyzed the relations between FTA and Article XXIV of GATT 1994 in the chapter 3. The Article XXIV of GATT 1994 is an agreement between the distinctive members for liberalizing trade. The Article XXIV of GATT 1994 is consist of three parts such as customs unions, free-trade area, and interim agreements that WTO is referred to as "Regional Trade Agreement(RTA)". There is a difference between the customs unions and the free-trade area. In the customs unions rules, the members should have the same tarifficatio and the same trade provision against non-members, but in the free-trade are a rules, the member is not necessary to have the same tarifficatio and the same trade provision against non-members. But, the both rules have a liberalization of trade in a common as a revoking tariffs and the government regulations for interfering with trade. In this case, however, the both rules include an inconsistency ele ment under WTO rules such as Most-Favoured-Nation Treatment(MFN) and National Treatment on Internal Taxation and Regulation(NTITR). This study reviewed neither inconsistency nor consistency on the both rules with the RTA of WTO under Article XXIV of GATT 1994. The Third analyzed the relations between FTA and Article V of GATS under WTO in the chapter 4. The GATS is a rule of WTO for the growing importance of trade in services for the growth and development of the world conomy. The GATS is a new rule rather than GATT's rule for concerning goods trade. The Article V of GATS under WTO is a rule that makes based on upon the Article XXIV of GATT. Therefore, If it is to be examined the Article V of GATS, it should be referred to a and an interpretation of the text of the Article XXIV of GATT. However, the Article V of GATS is on the undeveloped stage compare to the Article XXIV of GATT. Because, the statistics of WTO showed that the RTAs under the Article XXIV of GATT have 150 cases completed between nations, but the RTAs under the Article IV of GATS have 10 cases completed between nations. The Forth examined the interpretation of FTA rules under WTO in the chapter 5. Concerning the consistency issue of customs unions and free-trade area under the Article XXIV of GATT, the working parties in customs unions and in free-trade area have been reviewed the consistency is sue which had been not if to GATT. However, the parties finished to get up with one accord the both that are a consistency of argument and an inconsistency of argument with the interpretation of the Article XXIV of GATT. The interpretation of the Article XXIV of GATT has been raised as the issues when EEC by Rome Treaty established in 1957. However, the consistency is sue only agreed 6 working parties out of 69 working parties finished the reviewing of the interpretation up to the end of 1994. Also the consistency issue concerned with the special privilege measure of the customs unions and tree-trade area under the Article XXIV of GATT discussed only 3 cases between working parties up to now and did not accepted as an issue for working parties' report. In conclusion in the chapter 6, this study raised the issues of WTO that are a conference of a new round under WTO and the issues of clarity between FTA rule and WTO regulation.

  • PDF

Feature Extraction to Detect Hoax Articles (낚시성 인터넷 신문기사 검출을 위한 특징 추출)

  • Heo, Seong-Wan;Sohn, Kyung-Ah
    • Journal of KIISE
    • /
    • v.43 no.11
    • /
    • pp.1210-1215
    • /
    • 2016
  • Readership of online newspapers has grown with the proliferation of smart devices. However, fierce competition between Internet newspaper companies has resulted in a large increase in the number of hoax articles. Hoax articles are those where the title does not convey the content of the main story, and this gives readers the wrong information about the contents. We note that the hoax articles have certain characteristics, such as unnecessary celebrity quotations, mismatch in the title and content, or incomplete sentences. Based on these, we extract and validate features to identify hoax articles. We build a large-scale training dataset by analyzing text keywords in replies to articles and thus extracted five effective features. We evaluate the performance of the support vector machine classifier on the extracted features, and a 92% accuracy is observed in our validation set. In addition, we also present a selective bigram model to measure the consistency between the title and content, which can be effectively used to analyze short texts in general.

PC-SAN: Pretraining-Based Contextual Self-Attention Model for Topic Essay Generation

  • Lin, Fuqiang;Ma, Xingkong;Chen, Yaofeng;Zhou, Jiajun;Liu, Bo
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.14 no.8
    • /
    • pp.3168-3186
    • /
    • 2020
  • Automatic topic essay generation (TEG) is a controllable text generation task that aims to generate informative, diverse, and topic-consistent essays based on multiple topics. To make the generated essays of high quality, a reasonable method should consider both diversity and topic-consistency. Another essential issue is the intrinsic link of the topics, which contributes to making the essays closely surround the semantics of provided topics. However, it remains challenging for TEG to fill the semantic gap between source topic words and target output, and a more powerful model is needed to capture the semantics of given topics. To this end, we propose a pretraining-based contextual self-attention (PC-SAN) model that is built upon the seq2seq framework. For the encoder of our model, we employ a dynamic weight sum of layers from BERT to fully utilize the semantics of topics, which is of great help to fill the gap and improve the quality of the generated essays. In the decoding phase, we also transform the target-side contextual history information into the query layers to alleviate the lack of context in typical self-attention networks (SANs). Experimental results on large-scale paragraph-level Chinese corpora verify that our model is capable of generating diverse, topic-consistent text and essentially makes improvements as compare to strong baselines. Furthermore, extensive analysis validates the effectiveness of contextual embeddings from BERT and contextual history information in SANs.

A Study on the Distinctive Features of "Hwangjenaegyeongtaeso(黃帝內經太素)" by Yang Sangseon and his Medical Theory ("황제내경태소(黃帝內經太素)"의 특징(特徵) 및 양상선(楊上善)의 의학이론(醫學理論)에 대한 연구(硏究))

  • Lee, Sang-Hyup;Kim, Joong-Han
    • Journal of Korean Medical classics
    • /
    • v.22 no.2
    • /
    • pp.35-69
    • /
    • 2009
  • Yang Shangseon(楊上善)'s "Hwangjenaegyeongtaeso(黃帝內經太素)" was the first commentary book of "Hwangjenaegyeong(黃帝內經)", its importance often mentioned in level with Wang Bing (王冰)'s "Somun(素問)" "Yeongchu(靈樞)". The distinctive feature of Yang Sangseon(楊上善)'s commentary is that it is easy to comprehend in accordance with an organized classification, and that the explanations are simple and clear. Despite strict application of the Eumyang(陰陽, Yinyang) theory and Five phases[五行] theory throughout the text, should there be sentences which fall out of consistency with the basic theories, he added his own substantial commentary. His medical theory gives attention to the Meridian system[經絡], lays emphasis on developing the soul[神], and has a unique opinion about the Opening closing and pivot[開闔樞] theory along with the Myeongmun(命門). To explain the methods for preserving health[養生], he adopted the Nojang philosophy(老莊思想); to enrich the vitality he adopted the Buddhist philosophy(佛敎思想); and to analyze physiologic and pathogenic factors, he adopted the Confucian philosophy(儒家思想).

  • PDF

DNA 염기 서열의 단편 조립 프로그램 개발

  • Lee, Byung-Uk;Park, Kie-Jung;Park, Wan;Park, Yong-Ha
    • Microbiology and Biotechnology Letters
    • /
    • v.25 no.6
    • /
    • pp.560-565
    • /
    • 1997
  • DNA fragment assembly is a major concem in shot-gun DNA sequencing project. It is to reconstruct a consensus DNA sequence from a collection of random oritented fragments. We developed a computer program that is useful for DNA fragment assembly. Inputs to the program are DNA fragment sequences including IUB-IUPAC bases. The program produces the most probable reconstruction ot the original DNA sequence as a text format or a PostScript format. The program consists of four phases: the first phase quickly eliminates fragment pairs that can not possibly overlap. In the second phase, the quality of overlap between each pair is calculated to a score. In the third phase, overlap pairs are sorted by their scores and consistency of the overlaps is checked. The last phase determines consensus sequences and displays them. The performance of fragment assembly program was tested on a set of DNA fragment sequences which were generated from long DNA sequences of GenBank by a fragmentation program.

  • PDF