• Title/Summary/Keyword: Smart Document

Search Result 119, Processing Time 0.026 seconds

Knowledge Extraction Methodology and Framework from Wikipedia Articles for Construction of Knowledge-Base (지식베이스 구축을 위한 한국어 위키피디아의 학습 기반 지식추출 방법론 및 플랫폼 연구)

  • Kim, JaeHun;Lee, Myungjin
    • Journal of Intelligence and Information Systems
    • /
    • v.25 no.1
    • /
    • pp.43-61
    • /
    • 2019
  • Development of technologies in artificial intelligence has been rapidly increasing with the Fourth Industrial Revolution, and researches related to AI have been actively conducted in a variety of fields such as autonomous vehicles, natural language processing, and robotics. These researches have been focused on solving cognitive problems such as learning and problem solving related to human intelligence from the 1950s. The field of artificial intelligence has achieved more technological advance than ever, due to recent interest in technology and research on various algorithms. The knowledge-based system is a sub-domain of artificial intelligence, and it aims to enable artificial intelligence agents to make decisions by using machine-readable and processible knowledge constructed from complex and informal human knowledge and rules in various fields. A knowledge base is used to optimize information collection, organization, and retrieval, and recently it is used with statistical artificial intelligence such as machine learning. Recently, the purpose of the knowledge base is to express, publish, and share knowledge on the web by describing and connecting web resources such as pages and data. These knowledge bases are used for intelligent processing in various fields of artificial intelligence such as question answering system of the smart speaker. However, building a useful knowledge base is a time-consuming task and still requires a lot of effort of the experts. In recent years, many kinds of research and technologies of knowledge based artificial intelligence use DBpedia that is one of the biggest knowledge base aiming to extract structured content from the various information of Wikipedia. DBpedia contains various information extracted from Wikipedia such as a title, categories, and links, but the most useful knowledge is from infobox of Wikipedia that presents a summary of some unifying aspect created by users. These knowledge are created by the mapping rule between infobox structures and DBpedia ontology schema defined in DBpedia Extraction Framework. In this way, DBpedia can expect high reliability in terms of accuracy of knowledge by using the method of generating knowledge from semi-structured infobox data created by users. However, since only about 50% of all wiki pages contain infobox in Korean Wikipedia, DBpedia has limitations in term of knowledge scalability. This paper proposes a method to extract knowledge from text documents according to the ontology schema using machine learning. In order to demonstrate the appropriateness of this method, we explain a knowledge extraction model according to the DBpedia ontology schema by learning Wikipedia infoboxes. Our knowledge extraction model consists of three steps, document classification as ontology classes, proper sentence classification to extract triples, and value selection and transformation into RDF triple structure. The structure of Wikipedia infobox are defined as infobox templates that provide standardized information across related articles, and DBpedia ontology schema can be mapped these infobox templates. Based on these mapping relations, we classify the input document according to infobox categories which means ontology classes. After determining the classification of the input document, we classify the appropriate sentence according to attributes belonging to the classification. Finally, we extract knowledge from sentences that are classified as appropriate, and we convert knowledge into a form of triples. In order to train models, we generated training data set from Wikipedia dump using a method to add BIO tags to sentences, so we trained about 200 classes and about 2,500 relations for extracting knowledge. Furthermore, we evaluated comparative experiments of CRF and Bi-LSTM-CRF for the knowledge extraction process. Through this proposed process, it is possible to utilize structured knowledge by extracting knowledge according to the ontology schema from text documents. In addition, this methodology can significantly reduce the effort of the experts to construct instances according to the ontology schema.

A Method for Evaluating News Value based on Supply and Demand of Information Using Text Analysis (텍스트 분석을 활용한 정보의 수요 공급 기반 뉴스 가치 평가 방안)

  • Lee, Donghoon;Choi, Hochang;Kim, Namgyu
    • Journal of Intelligence and Information Systems
    • /
    • v.22 no.4
    • /
    • pp.45-67
    • /
    • 2016
  • Given the recent development of smart devices, users are producing, sharing, and acquiring a variety of information via the Internet and social network services (SNSs). Because users tend to use multiple media simultaneously according to their goals and preferences, domestic SNS users use around 2.09 media concurrently on average. Since the information provided by such media is usually textually represented, recent studies have been actively conducting textual analysis in order to understand users more deeply. Earlier studies using textual analysis focused on analyzing a document's contents without substantive consideration of the diverse characteristics of the source medium. However, current studies argue that analytical and interpretive approaches should be applied differently according to the characteristics of a document's source. Documents can be classified into the following types: informative documents for delivering information, expressive documents for expressing emotions and aesthetics, operational documents for inducing the recipient's behavior, and audiovisual media documents for supplementing the above three functions through images and music. Further, documents can be classified according to their contents, which comprise facts, concepts, procedures, principles, rules, stories, opinions, and descriptions. Documents have unique characteristics according to the source media by which they are distributed. In terms of newspapers, only highly trained people tend to write articles for public dissemination. In contrast, with SNSs, various types of users can freely write any message and such messages are distributed in an unpredictable way. Again, in the case of newspapers, each article exists independently and does not tend to have any relation to other articles. However, messages (original tweets) on Twitter, for example, are highly organized and regularly duplicated and repeated through replies and retweets. There have been many studies focusing on the different characteristics between newspapers and SNSs. However, it is difficult to find a study that focuses on the difference between the two media from the perspective of supply and demand. We can regard the articles of newspapers as a kind of information supply, whereas messages on various SNSs represent a demand for information. By investigating traditional newspapers and SNSs from the perspective of supply and demand of information, we can explore and explain the information dilemma more clearly. For example, there may be superfluous issues that are heavily reported in newspaper articles despite the fact that users seldom have much interest in these issues. Such overproduced information is not only a waste of media resources but also makes it difficult to find valuable, in-demand information. Further, some issues that are covered by only a few newspapers may be of high interest to SNS users. To alleviate the deleterious effects of information asymmetries, it is necessary to analyze the supply and demand of each information source and, accordingly, provide information flexibly. Such an approach would allow the value of information to be explored and approximated on the basis of the supply-demand balance. Conceptually, this is very similar to the price of goods or services being determined by the supply-demand relationship. Adopting this concept, media companies could focus on the production of highly in-demand issues that are in short supply. In this study, we selected Internet news sites and Twitter as representative media for investigating information supply and demand, respectively. We present the notion of News Value Index (NVI), which evaluates the value of news information in terms of the magnitude of Twitter messages associated with it. In addition, we visualize the change of information value over time using the NVI. We conducted an analysis using 387,014 news articles and 31,674,795 Twitter messages. The analysis results revealed interesting patterns: most issues show lower NVI than average of the whole issue, whereas a few issues show steadily higher NVI than the average.

The Security Risk and Countermeasures of Blockchain based Virtual Currency Trading (블록체인 기반 가상화폐 거래의 보안 위험 및 대응방안)

  • Chung, Young-Seek;Cha, Jae-Sang
    • The Journal of Korea Institute of Information, Electronics, and Communication Technology
    • /
    • v.11 no.1
    • /
    • pp.100-106
    • /
    • 2018
  • Since the concept of virtual currency called Bitcoin was announced in 2008, the blockchain technology, which is the basis of Bitcoin, is attracting attention as an important platform technology in the era of the 4th industrial revolution that can change our society in the future. Although Existing electronic financial transactions store and manage all transaction history at a reliable central organization such as government and bank, blockchain-based electronic financial transactions are composed of a distributed structure in which all participants participating in the transaction store and manage the transaction history, it is possible to secure transaction transparency while reducing system construction and operation costs. Besides the virtual currency that started with bit coins, the technology of these blockchains has been extended in various fields such as smart contracts and document management. The key technology area of this blockchain is security based on proven cryptographic technology to make it difficult to forge and hack, but there are security risks such as security vulnerabilities in the virtual currency trading service, We will discuss security risks in using virtual currency and discuss countermeasures. Especially security accidents of virtual currency exchanges are occurring frequently recently, the damage of users who trade the virtual currency is also increasing, we propose security threats and security countermeasures against virtual currency exchanges.

Digital Forensic Investigation of HBase (HBase에 대한 디지털 포렌식 조사 기법 연구)

  • Park, Aran;Jeong, Doowon;Lee, Sang Jin
    • KIPS Transactions on Computer and Communication Systems
    • /
    • v.6 no.2
    • /
    • pp.95-104
    • /
    • 2017
  • As the technology in smart device is growing and Social Network Services(SNS) are becoming more common, the data which is difficult to be processed by existing RDBMS are increasing. As a result of this, NoSQL databases are getting popular as an alternative for processing massive and unstructured data generated in real time. The demand for the technique of digital investigation of NoSQL databases is increasing as the businesses introducing NoSQL database in their system are increasing, although the technique of digital investigation of databases has been researched centered on RDMBS. New techniques of digital forensic investigation are needed as NoSQL Database has no schema to normalize and the storage method differs depending on the type of database and operation environment. Research on document-based database of NoSQL has been done but it is not applicable as itself to other types of NoSQL Database. Therefore, the way of operation and data model, grasp of operation environment, collection and analysis of artifacts and recovery technique of deleted data in HBase which is a NoSQL column-based database are presented in this paper. Also the proposed technique of digital forensic investigation to HBase is verified by an experimental scenario.

A framework of management for preventing illegal distribution of pdf bookscan file (PDF 형식 북스캔 파일 불법 유통 방지를 위한 관리 프레임워크)

  • Lee, Kuk-Heon;Chung, Hyun-Ji;Ryu, Dae-Gull;Lee, Sang-Jin
    • Journal of the Korea Institute of Information Security & Cryptology
    • /
    • v.23 no.5
    • /
    • pp.897-907
    • /
    • 2013
  • Since various smart devices are being developed, a growing number of people are reading eBooks instead of paper books. However, people started making eBooks on their own by scanning paper books because there are not enough eBooks provided from market. The term "Bookscan" was made with this reason. The number of bookscan company is increasing because the equipment is too expensive. However, the commercial activity of bookscan company is against copyright law. Also bookscan files are in danger of being illegally distributed on web, because bookscan companies are not protecting copyright. Publication market follows the same procedure with sound market which was collapsed due to copyright problem. Therefore, the technical methods should be prepared for law system against bookscan. The previous ICOP(Illegal Copyrights Obstruction Program) system has been applied to sound and movie files, but not applied to publication. This paper suggests the framework for bookscan file management based on practical mechanism.

A Meta-analysis of Related Factors Depression of Korea University Student (한국 대학생의 우울 관련 요인에 대한 메타분석)

  • Jeon, Byoung-Jin;Song, Bo-Kyong;Ko, Koung-Min;Kim, Ji-Yoon;Park, Sang-Eun;Yu, Yi-Seul;Lee, Du-Ri;Choi, Young-Ju
    • The Journal of Korean society of community based occupational therapy
    • /
    • v.5 no.2
    • /
    • pp.43-55
    • /
    • 2015
  • Objective : This study was a meta-analysis of previous studies to examine the integration of related factors depression University students of Korea, and to determine the relative importance among the relevant factors based on it. Methods : 2000-2014 papers posted on the National Science and Technology Information Center (NDSL), Nurimedia (DBpia), Academic Research Information Service (RISS), Korea Research Information(KISS), provide the text of the Library of Congress were collected using the service. The Key words a 'University Student', 'Depression', 'Depression Factors' was used. Used the Down & Black level, evidence-based checklist was developed by the research (1998) (checklist) had analyzed the selected document metadata to assess the quality. Results : 47-studies selected research groups are divided into five factors(self-esteem, suicidal ideation, positive thinking, stresses, Internet and smartphone addiction). Using meta-analysis, we analyzed the effect sizes, statistical heterogeneity and publication amenities. As a result, the self-esteem of the five factors were not found heterogeneity. Effect size is a self-esteem and suicidal ideation "large effect size", positive thinking and stress "medium effect size", internet and smart phone addiction"small effect size". Conclusion : Self-esteem and suicidal ideation are among the factors associated with depression in University students of Korea was found that the most relevant. It identified the factors associated with depression in college students, and could utilized as basis for the prevention of depression.

Renaissance of Geographic Education in the United States since 1980: Its Dynamic Process and Implications to Geographic Education in Korea (1980년대 이후 美國 地理敎育 復興運動의 展開過程과 그 示唆點: 地理學, 地理敎育, 그리고 敎育政策의 關係)

  • Seo, Tae-Yeol
    • Journal of the Korean Geographical Society
    • /
    • v.28 no.2
    • /
    • pp.163-178
    • /
    • 1993
  • The purpose of this paper is to provide a better understanding of the unprecedented reform movement of geographic education in the United States since 1980 and extract some implications from this movement for geographic education in Korea. For the purpose, the history to this movement was reviewed through following three stages. In the first stage(1980~1984: form :HSGP" to :"Guideline"), the voluntary improvement movement appeared at California and the orgni-zational movement began in 1982 such as the Committee on Geography and International Knowledge. The national educational refrom imperatives, presented at "A Nation at Risk", and "Back to Basics" movement provided good opportunities to resurrect geography as a basic subject. For next real resurrection movement, the very important document "Guidelines for Geographic Education" was published at 1984. In the second stage(1985~1989: from "Guide-lines" to "Public"), the "Guideline" gave power-full motives and foci for reconstructiong the contents of geography, especially by the five fundamental themes(Location, Place, Relation-ships within Places, Movement, and Region). Also GENIP as the symbol of unity of all four major geography organization(AAG, NCGE, NGS, AGS) contributed to expanding and stren-gthening geography education. Also Geography Educagtion Program of NGS was a smart and well organized program to improve geographic education through it's a five strategies: Grass-roots organization(Alliances), Teacher education, Pu-blic awareness, Educational materials develo-pment, Targeted outreach to education decision-makers. In the late 1980s, the last focus of movement was the Public awareness and Edua-ction decision-making. In the third stage(1990-present: from "Public" to "Core Subject"), the initiative pendulum swung from geography organization to nation curricu-lum. In this National Curriculum, Geography was approved as a "Core Subject" and The 1994 National Geography Assessment Framework was constructed to assess the outcome of student's education in geography in grades, 4,8, and 12. Some Implications extracted from the process and contents of renaissance movement of geogr-aphic education in the Uinted States since 1980 are as follows. First, It shows the importance of the unity and target assignment among the geography organization. Second, interactive relationship between the academic geography and school geography develops each other. Third, teacher education, including pre-service education, including pre-service education and in-service education, is a key element to improve the quality of geography. And teacher organization is a good clearing house to exchange information for good geography. Forth, the positive and active response to changes in socketies such as globalism and inter-nationalizing, national education policy, and the trend of pedagogy is needed to rejuvenate geo-graphic education. Above all, we need to establish a well organized and powerfull program, sophisticated activities strategies, and long-term implementa-tion plan if we want more and better school geography.

  • PDF

A Study on the Land-Use Related Assessment Factors in Korean Environmental Impact Assessment (환경영향평가 토지환경 분야의 토지이용 평가항목 고찰 연구)

  • Park, Sang-Jin;Lee, Dong Kun;Jeong, Seulgi
    • Journal of Environmental Impact Assessment
    • /
    • v.30 no.5
    • /
    • pp.297-304
    • /
    • 2021
  • The environmental impact assessment(EIA) project in Korea has undergone changes and revisions in various evaluation items for about 30 years after the introduction of the Environmental Conservation Act (1997). However, despite the importance of land use evaluation items under the current EIA Act, there are insufficient studies to consider. Therefore, this study focused on the land-use evaluation items based on the EIA guidelines, reviewed 90 of the evaluation documents and consultation documents, and tried to suggest implications and supplementary points forthe domestic EIA land-use evaluation items. As a result, the paradigm was changing from land efficiency centered on development in the past to land efficiency centered on the natural environment and resource conservation. However, in spite of the manual for fitting the paradigm change, opinions on the conservation of the natural environment are still being drawn in the consultation document, so it needs improvement. Two improvements in the impact assessment process suggested in this study are the establishment of standardized spatial data and a quantitative impact and reduction method evaluation tool based on it. In particular, there is a need for a plan evaluation tool for land use arrangement and distribution that can solve the needs of minimizing damage to the natural environment and securing green space and a green network.

A Study on the Choice of Export Payment Types by Applying the Characteristics of the New Trade & Logistics Environment (신(新)무역물류환경의 특성을 적용한 수출대금 결제유형 선택연구)

  • Chang-bong Kim;Dong-jun Lee
    • Korea Trade Review
    • /
    • v.48 no.4
    • /
    • pp.303-320
    • /
    • 2023
  • Recently, import and export companies have been using T/T remittance and Surrender B/L more frequently than L/C when selecting the process and method of trade payment settlement. The new trade and logistics environment is thriving in the era of the Fourth Industrial Revolution (4IR). Document-based trade transactions are undergoing a digitalization as bills of lading or smart contracts are being developed. The purpose of this study is to verify whether exporters choose export payment types based on negotiating factors. In addition, we would like to discuss the application of the characteristics of the new trade and logistics environment. Data for analysis was collected through surveys. The collection method consisted of direct visits to the company, e-mail, fax, and online surveys. The survey distribution period is from February 1, 2023, to April 30, 2023. The questionnaire was distributed in 2,000 copies, and 447 copies were collected. The final 336 copies were used for analysis, excluding 111 copies that were deemed inappropriate for the purpose of this study. The results of the study are shown below. First, among the negotiating factors, the product differentiation of exporters did not significantly affect the selection of export payment types. Second, among the negotiating factors, the greater the purchasing advantage recognized by exporters, the higher the possibility of using the post-transfer method. In addition to analyzing the results, this study suggests that exporters should consider adopting new payment methods, such as blockchain technology-based bills of lading and trade finance platforms, to adapt to the characteristics of the evolving trade and logistics environment. Therefore, exporters should continue to show interest in initiatives aimed at digitizing trade documents as a response to the challenges posed by bills of lading. In future studies, it is necessary to address the lack of social awareness in Korea by conducting advanced research abroad.