• Title/Summary/Keyword: Task variety

Multi-Vector Document Embedding Using Semantic Decomposition of Complex Documents (복합 문서의 의미적 분해를 통한 다중 벡터 문서 임베딩 방법론)

  • Park, Jongin;Kim, Namgyu
    • Journal of Intelligence and Information Systems
    • /
    • v.25 no.3
    • /
    • pp.19-41
    • /
    • 2019
  • With the rapidly increasing demand for text data analysis, research and investment in text mining are being actively conducted not only in academia but also in various industries. Text mining is generally conducted in two steps. In the first step, the text of the collected documents is tokenized and structured to convert the original documents into a computer-readable form. In the second step, tasks such as document classification, clustering, and topic modeling are conducted according to the purpose of the analysis. Until recently, text mining studies have focused on the second step. However, with the discovery that the text structuring process substantially influences the quality of the analysis results, various embedding methods have been actively studied to improve that quality by preserving the meaning of words and documents when representing text data as vectors. Unlike structured data, which can be directly applied to a variety of operations and traditional analysis techniques, unstructured text must first undergo a structuring task that transforms the original document into a form the computer can understand. Mapping arbitrary objects into a space of a specific dimension while maintaining their algebraic properties, in order to structure text data, is called "embedding." Recently, attempts have been made to embed not only words but also sentences, paragraphs, and entire documents. In particular, as the demand for document embedding has increased rapidly, many algorithms have been developed to support it. Among them, doc2Vec, which extends word2Vec and embeds each document into one vector, is the most widely used. However, the traditional document embedding method represented by doc2Vec generates a vector for each document using all the words included in the document. 
This imposes a limitation: the document vector is affected not only by core words but also by miscellaneous words. Additionally, traditional document embedding schemes usually map each document to a single corresponding vector. Therefore, it is difficult to accurately represent a complex document with multiple subjects as a single vector using the traditional approach. In this paper, we propose a new multi-vector document embedding method to overcome these limitations of the traditional document embedding methods. This study targets documents that explicitly separate body content and keywords. For a document without keywords, the method can be applied after extracting keywords through various analysis methods. However, since this is not the core subject of the proposed method, we describe the process of applying the proposed method to documents with predefined keywords. The proposed method consists of (1) Parsing, (2) Word Embedding, (3) Keyword Vector Extraction, (4) Keyword Clustering, and (5) Multiple-Vector Generation. The specific process is as follows. First, all text in a document is tokenized, and each token is represented as an N-dimensional real-valued vector through word embedding. After that, to overcome the limitation that traditional document embedding is affected by miscellaneous words as well as core words, the vectors corresponding to the keywords of each document are extracted to form a set of keyword vectors for each document. Next, clustering is conducted on the set of keywords of each document to identify the multiple subjects included in the document. Finally, a multi-vector is generated from the vectors of the keywords constituting each cluster. Experiments on 3,147 academic papers revealed that the traditional single-vector approach cannot properly map complex documents because of interference among subjects in each vector. 
With the proposed multi-vector based method, we ascertained that complex documents can be vectorized more accurately by eliminating the interference among subjects.
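
The five steps described in this abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not name the clustering algorithm or say how each cluster becomes a vector, so plain k-means and a per-cluster mean are assumptions here, and `word_vectors`, `kmeans`, and `multi_vector_embedding` are hypothetical names. The word vectors are assumed to come from an already trained word embedding.

```python
import math
import random


def mean_vector(vectors):
    """Component-wise mean of a non-empty list of equal-length vectors."""
    dim = len(vectors[0])
    return [sum(v[i] for v in vectors) / len(vectors) for i in range(dim)]


def kmeans(vectors, k, iterations=20, seed=0):
    """Plain k-means (an assumed choice); returns one cluster label per vector."""
    rng = random.Random(seed)
    centroids = rng.sample(vectors, k)
    labels = [0] * len(vectors)
    for _ in range(iterations):
        # Assign every vector to its nearest centroid.
        labels = [
            min(range(k), key=lambda c: math.dist(v, centroids[c]))
            for v in vectors
        ]
        # Move each centroid to the mean of its members.
        for c in range(k):
            members = [v for v, lab in zip(vectors, labels) if lab == c]
            if members:
                centroids[c] = mean_vector(members)
    return labels


def multi_vector_embedding(word_vectors, keywords, k):
    """Return one vector per keyword cluster instead of one per document."""
    # Step 3: keep only the vectors of the document's keywords.
    keyword_vectors = [word_vectors[w] for w in keywords if w in word_vectors]
    if not keyword_vectors:
        return []
    k = min(k, len(keyword_vectors))
    # Step 4: cluster the keyword vectors to separate the subjects.
    labels = kmeans(keyword_vectors, k)
    # Step 5: summarize each non-empty cluster (mean is an assumption).
    result = []
    for c in range(k):
        members = [v for v, lab in zip(keyword_vectors, labels) if lab == c]
        if members:
            result.append(mean_vector(members))
    return result


# Toy 2-dimensional word vectors, invented for illustration only.
toy_vectors = {
    "neural": [1.0, 0.0],
    "embedding": [0.9, 0.1],
    "festival": [0.0, 1.0],
    "tourism": [0.1, 0.9],
}
doc_vectors = multi_vector_embedding(
    toy_vectors, ["neural", "embedding", "festival", "tourism"], k=2
)
```

With the toy data, the document is represented by two vectors, one per subject, rather than by a single vector that blends both subjects.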

Knowledge Extraction Methodology and Framework from Wikipedia Articles for Construction of Knowledge-Base (지식베이스 구축을 위한 한국어 위키피디아의 학습 기반 지식추출 방법론 및 플랫폼 연구)

  • Kim, JaeHun;Lee, Myungjin
    • Journal of Intelligence and Information Systems
    • /
    • v.25 no.1
    • /
    • pp.43-61
    • /
    • 2019
  • The development of artificial intelligence technologies has accelerated with the Fourth Industrial Revolution, and AI-related research has been actively conducted in a variety of fields such as autonomous vehicles, natural language processing, and robotics. Since the 1950s, this research has focused on solving cognitive problems related to human intelligence, such as learning and problem solving. The field of artificial intelligence has achieved more technological advances than ever, owing to recent interest in the technology and research on various algorithms. The knowledge-based system is a sub-domain of artificial intelligence that aims to enable artificial intelligence agents to make decisions by using machine-readable and processable knowledge constructed from complex and informal human knowledge and rules in various fields. A knowledge base is used to optimize information collection, organization, and retrieval, and recently it has been used together with statistical artificial intelligence such as machine learning. More recently, the purpose of the knowledge base has been to express, publish, and share knowledge on the web by describing and connecting web resources such as pages and data. These knowledge bases are used for intelligent processing in various fields of artificial intelligence, such as the question answering systems of smart speakers. However, building a useful knowledge base is a time-consuming task and still requires a lot of effort from experts. In recent years, much research and technology in knowledge-based artificial intelligence has used DBpedia, one of the biggest knowledge bases, which aims to extract structured content from the various information in Wikipedia. DBpedia contains various information extracted from Wikipedia, such as titles, categories, and links, but the most useful knowledge comes from Wikipedia infoboxes, which present a user-created summary of some unifying aspect of an article. 
This knowledge is created by the mapping rules between infobox structures and the DBpedia ontology schema defined in the DBpedia Extraction Framework. In this way, DBpedia can expect high reliability in terms of the accuracy of knowledge, because it generates knowledge from semi-structured infobox data created by users. However, since only about 50% of all wiki pages in Korean Wikipedia contain an infobox, DBpedia has limitations in terms of knowledge scalability. This paper proposes a method to extract knowledge from text documents according to the ontology schema using machine learning. To demonstrate the appropriateness of this method, we explain a knowledge extraction model that follows the DBpedia ontology schema and is trained on Wikipedia infoboxes. Our knowledge extraction model consists of three steps: classification of documents into ontology classes, classification of the sentences appropriate for extracting triples, and value selection and transformation into an RDF triple structure. The structure of Wikipedia infoboxes is defined by infobox templates that provide standardized information across related articles, and the DBpedia ontology schema can be mapped to these infobox templates. Based on these mapping relations, we classify the input document according to infobox categories, which correspond to ontology classes. After determining the class of the input document, we classify the appropriate sentences according to the attributes belonging to that class. Finally, we extract knowledge from the sentences classified as appropriate and convert it into triples. To train the models, we generated a training data set from a Wikipedia dump by adding BIO tags to sentences, and trained about 200 classes and about 2,500 relations for extracting knowledge. Furthermore, we conducted comparative experiments with CRF and Bi-LSTM-CRF for the knowledge extraction process. 
Through this proposed process, it is possible to utilize structured knowledge by extracting knowledge according to the ontology schema from text documents. In addition, this methodology can significantly reduce the effort of the experts to construct instances according to the ontology schema.
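
The last step of the extraction model, value selection and transformation into triples, can be sketched as follows. This is an illustrative assumption rather than the paper's code: the BIO tags are taken as the given output of a sequence labeler such as a CRF or Bi-LSTM-CRF, and the tag names, the subject `ExampleCorp`, and the helpers `bio_spans` and `to_triples` are all hypothetical.

```python
def bio_spans(tokens, tags):
    """Collect the token spans marked by B-/I- tags as (label, text) pairs."""
    spans = []
    label, words = None, []
    for token, tag in zip(tokens, tags):
        if tag.startswith("B-"):
            # A new span begins; flush any span still open.
            if words:
                spans.append((label, " ".join(words)))
            label, words = tag[2:], [token]
        elif tag.startswith("I-") and words and tag[2:] == label:
            # Continue the current span.
            words.append(token)
        else:
            # "O" or an inconsistent I- tag closes the current span.
            if words:
                spans.append((label, " ".join(words)))
            label, words = None, []
    if words:
        spans.append((label, " ".join(words)))
    return spans


def to_triples(subject, tokens, tags):
    """Turn tagged values into (subject, predicate, object) triples."""
    return [(subject, label, value) for label, value in bio_spans(tokens, tags)]


# Invented example sentence and tags, for illustration only.
tokens = ["The", "company", "was", "founded", "in", "1998", "by", "Jane", "Doe"]
tags = ["O", "O", "O", "O", "O", "B-foundingYear", "O", "B-founder", "I-founder"]
triples = to_triples("ExampleCorp", tokens, tags)
```

Here the tag label plays the role of the ontology property, so each recovered span yields one triple for the document's subject; serializing the triples as RDF is left out of the sketch.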

A Study on the Quality of Life of Elderly People with Dementia and the Environmental Factor of Facilities (치매노인의 삶의 질과 시설 환경 요인에 관한 연구)

  • Park, Sejeong;Kim, Hangon
    • 한국노년학
    • /
    • v.29 no.4
    • /
    • pp.1361-1381
    • /
    • 2009
  • A variety of social issues have recently arisen in our society due to rapid social changes. In particular, how to approach elderly people who suffer from dementia is never an easy task, and for that reason few in-depth studies have focused on their quality of life. The purpose of this study was to examine the quality of life of elderly people with dementia and the relationship between their quality of life and the environments of facilities for them, in an attempt to lay the foundation for developing programs tailored to the environments of the facilities and for setting relevant policy. It is ultimately meant to improve the quality of life of the elderly with dementia and the environments of facilities for them. The subjects in this study were elderly people with dementia who were housed in senior residential and medical welfare facilities in Daegu and Gyeongsangbukdo. The collected data were analyzed with SPSS 12.0, using frequency analysis, cross-tabulation, and multiple logistic regression analysis. As a result, facility environments were identified as one of the variables that had a significant impact on the quality of life of elderly people with dementia. There are some suggestions about how to boost their quality of life: First, good environments should be prepared in consideration of the characteristics of elderly people with dementia so that they are satisfied with their own quality of life, and the way of looking at their potential should be changed. Second, main caregivers were found to affect the quality of life of elderly people with dementia, so programs that focus on improving the relationship between elderly people with dementia and their main caregivers are required. Third, there should be a change in the environments of the facilities. The facilities should be well equipped to respond successfully to the symptoms of elderly people with dementia. 
To redress their poor accessibility to the facilities, infrastructure involving nursing homes and professional personnel should be built by utilizing the Internet, and the facilities and local community should make concerted efforts to provide quality care to elderly people in need of it.

Retrospect and prospect of political geography and general-synoptic part of human geography in Korea (한국 정치지리학과 인문지리학 일반 50년의 회고)

  • Im, Duck-Soon
    • Journal of the Korean Geographical Society
    • /
    • v.31 no.2
    • /
    • pp.295-308
    • /
    • 1996
  • 1. Retrospect of Political Geographic Studies since Liberation, 1945 : 1) Period from 1945 to mid 1960s : Political geography did not exist as a science in Korea at the time of liberation from Japan in 1945, and there were no pure political geographers in Korea at that time. In 1947, Moon-Hwa Pyo, an economics professor, published a book titled Outline of Korean Geopolitics. This was the first book in the field of political geography, and it was available at that time in the logical descriptions. Bok-Hyon Choi was the first political geographer, who in 1959 wrote a book titled Political Geography for the college students of Seoul National University. Professor Choi introduced American-style political geography through this book. In 1963, Kie-Joo Hyong published an article titled "Korean Unification: Possibility from the Geopolitical Viewpoint," which was the first article published by a young Korean scholar who studied geography in this country. 2) Period from late 1960s to late 1980s : Both Yoon Cha and Duck-Soon Im frequently published articles on political geography or geopolitics in 1968-1969. In 1969, they engaged in geopolitical debates on the Korean geopolitical structure and the application of rimland theory to the Korean peninsula through a magazine named Joung-Kyong Younku (Political and Economic Research). The debates played an important role in introducing political geography (or geopolitics) to the political sciences, especially international political science. Active research continued in the 1970s. In that atmosphere, the first Korean book of political geography written by a post-liberation scholar (Duck-Soon Im), titled Principles of Political Geography, was published in 1973. This book was much influenced by American political geography after the Second World War. In the 1980s, research continued even more actively. In particular, administrative districts, capital cities, and sub-capital cities were frequently studied during this period. 
3) Period from late 1980s to Present: Recent Studies : 1985 was a productive year for articles on political geography. The first Ph.D. thesis in political geography in our country was published in the same year, and many M.A. articles have been produced since 1985. Research in political geography from the late 1980s to the present fell into several categories. Capital cities, Korean unification, administrative districts, urban politics, elections, sub-capital cities, and defense walls were important research categories. Reviewing the research from 1945 to the present, I found eight categories of political geography in Korea: capital cities, administrative districts, geopolitical structure of the Korean peninsula, division and unification of Korea, sub-capital cities, defense walls, elections, and urban politics. Each category includes several scholars. 2. Study Tasks and Prospects in Korean Political Geography: In relation to Korean circumstances there are three study tasks. The first task of the Korean people is the unification of the two Koreas. Political geographers of Korea must al survey titled Survey Methods of Human Geography for collegians. This book was the first one on the survey part in Korea. The book, however, is insufficient in comprehensiveness in aspects too. I think that the important tasks of general-synoptic human geography in Korea are (1) publication of comprehensive books of human geography in the aspects and methodologies for collegians and (2) acceptance by the academic world of human geography in Korea of variety in methodologies of human geography for future progress.

The Effects of Evaluation Attributes of Cultural Tourism Festivals on Satisfaction and Behavioral Intention (문화관광축제 방문객의 평가속성 만족과 행동의도에 관한 연구 - 2006 광주김치대축제를 중심으로 -)

  • Kim, Jung-Hoon
    • Journal of Global Scholars of Marketing Science
    • /
    • v.17 no.2
    • /
    • pp.55-73
    • /
    • 2007
  • Festivals are an indispensable feature of cultural tourism (Formica & Uysal, 1998). Cultural tourism festivals are increasingly being used as instruments for promoting tourism and boosting the regional economy, so much research related to festivals has been undertaken from a variety of perspectives. Plans to revisit a particular festival have been viewed as an important research topic both in academia and in the tourism industry. Festivals have therefore frequently been labeled as cultural events. Cultural tourism festivals have become a crucial component of the attractiveness of tourism destinations (Prentice, 2001). As a result, a considerable number of tourist studies have been carried out on diverse cultural tourism festivals (Backman et al., 1995; Crompton & Mckay, 1997; Park, 1998; Clawson & Knetch, 1996). Much of the previous literature empirically shows the close linkage between tourist satisfaction and behavioral intention in festivals. The main objective of this study is to investigate the effects of the evaluation attributes of cultural tourism festivals on satisfaction and behavioral intention. To accomplish the research objective, evaluation items of cultural tourism festivals were identified through a literature review and an empirical study. Using a varimax rotation with Kaiser normalization, the research obtained four factors from the 18 evaluation attributes of cultural tourism festivals. Some empirical studies have examined the relationship between behavioral intention and actual behavior. To understand the relationship between tourist satisfaction and behavioral intention, this study proposes five hypotheses and a hypothesized model. The analysis is based on primary data collected from visitors who participated in the '2006 Gwangju Kimchi Festival'. In total, 700 self-administered questionnaires were distributed and 561 usable questionnaires were obtained. Respondents rated the 18 satisfaction items on a scale from 1 (strongly disagree) to 7 (strongly agree). 
Dimensionality and stability of the scale were evaluated by a factor analysis with varimax rotation. Four factors emerged with eigenvalues greater than 1, which explained 66.40% of the total variance, with Cronbach's alpha ranging from 0.774 to 0.876. The four factors were named advertisement and guides, programs, food and souvenirs, and convenient facilities. To test and estimate the hypothesized model, a two-step approach to Structural Equation Modeling was used, with an initial measurement model and a subsequent structural model. The AMOS 4.0 analysis package was used to conduct the analysis. In estimating the model, the maximum likelihood procedure was used. This study used the Chi-square test, which is the most common model goodness-of-fit test. In addition, following the literature on Structural Equation Modeling, this study used further model fit indexes besides the Chi-square test to assess the fit of the suggested model: the goodness-of-fit index (GFI) and root mean square error of approximation (RMSEA) as absolute fit indexes, and the normed fit index (NFI) and non-normed fit index (NNFI) as incremental fit indexes. The results of the t-tests and ANOVAs revealed significant differences (0.05 level); therefore H1 (tourist satisfaction levels differ by demographic traits) is supported. According to the multiple regression analysis and AMOS, H2 (tourist satisfaction positively influences revisit intention), H3 (tourist satisfaction positively influences word of mouth), H4 (evaluation attributes of cultural tourism festivals influence tourist satisfaction), and H5 (tourist satisfaction positively influences behavioral intention) are also supported. The conclusions of this study are as follows: First, there were differences in satisfaction levels in accordance with the demographic information of visitors. Not all visitors had the same degree of satisfaction with their cultural tourism festival experience. 
Therefore it is necessary to understand the satisfaction of tourists if the experiences that are provided are to meet their expectations. So, in making festival plans, the organizer should consider demographic variables in explaining and segmenting visitors to cultural tourism festivals. Second, satisfaction with the evaluation attributes of cultural tourism festivals had a significant direct impact on visitors' intention to revisit such festivals and on the word-of-mouth publicity they shared. The results indicated that visitor satisfaction is a significant antecedent of the intention to revisit such festivals. Festival organizers should strive to forge long-term relationships with visitors. In addition, it is also necessary to understand how the intention to revisit a festival changes over time and to identify the critical satisfaction factors. Third, it is confirmed that behavioral intention was enhanced by satisfaction. The strong link between the satisfaction and behavioral intentions of visitors is ensured by high-quality advertisement and guides, programs, food and souvenirs, and convenient facilities. Thus, examining revisit intention from a time viewpoint may be of great significance for both practical and theoretical reasons. Additionally, festival organizers should give special attention to visitor satisfaction, as satisfied visitors are more likely to return sooner. The findings of this research have several practical implications for festival managers. The promotion of cultural festivals should be based on an understanding of tourist satisfaction for the long-term success of tourism. This study can help managers carry out this task in a more informed and strategic manner by examining the effects of demographic traits on the level of tourist satisfaction and on behavioral intention. In other words, differentiated marketing strategies should be stressed and executed by the relevant parties. 
The limitations of this study are as follows; the results of this study cannot be generalized to other cultural tourism festivals because we have not explored the many different kinds of festivals. A future study should be a comparative analysis of other festivals of different visitor segments. Also, further efforts should be directed toward developing more comprehensive temporal models that can explain behavioral intentions of tourists.
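
The abstract reports Cronbach's alpha for each of the four factors as a reliability measure. As a reminder of how that statistic is computed, the sketch below applies the standard formula, alpha = k/(k-1) × (1 − sum of item variances / variance of the total score), to a small set of invented responses. The data and the function name `cronbach_alpha` are assumptions for illustration and are not taken from the study.

```python
from statistics import variance


def cronbach_alpha(responses):
    """Cronbach's alpha for rows of respondents and columns of items."""
    k = len(responses[0])  # number of items in the scale
    # Sample variance of each item across respondents.
    item_variances = [variance([row[i] for row in responses]) for i in range(k)]
    # Sample variance of each respondent's total score.
    total_variance = variance([sum(row) for row in responses])
    return (k / (k - 1)) * (1 - sum(item_variances) / total_variance)


# Invented 7-point-scale answers from five respondents to three items.
sample = [
    [5, 6, 5],
    [3, 4, 4],
    [6, 7, 6],
    [2, 3, 2],
    [4, 4, 5],
]
alpha = cronbach_alpha(sample)
```

Values closer to 1 indicate that the items of a factor vary together, which is the sense in which the reported range of 0.774 to 0.876 supports the stability of the scale.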

Research for Space Activities of Korea Air Force - Political and Legal Perspective (우리나라 공군의 우주력 건설을 위한 정책적.법적고찰)

  • Shin, Sung-Hwan
    • The Korean Journal of Air & Space Law and Policy
    • /
    • v.18
    • /
    • pp.135-183
    • /
    • 2003
  • Aerospace force is a determining factor in modern war. The combat field is expanding to space. Thus, the legitimacy of establishing an aerospace force is no longer a matter of debate; instead, "how should we establish aerospace force" has become the issue for the military. The standard limiting the military use of space should be non-aggressive use, as asserted by the U.S., rather than non-military use, as asserted by the former Soviet Union. The former Soviet Union's argument is not strongly supported even by the current Russian government, and realistically it is hard to apply. Thus, a multi-purpose satellite used for military surveillance or a commercial satellite employed for military communication is allowed under the U.S. principle of peaceful use of space. In this regard, the Air Force may be free to develop a military surveillance satellite and a communication satellite with civilian research institutes. Although the MTCR, entered into with the U.S., restricts the development of space launch vehicles for export purposes, the development of a space launch vehicle by the Korea Air Force or the Korea Aerospace Research Institute is beyond the scope of application of the MTCR, and the Air Force may simply operate a satellite in orbit for military purposes. The primary task of a multi-purpose satellite is remote sensing; a high-resolution SAR sensor is mainly employed for military use. Therefore, a system that enables the Air Force, the Korea Aerospace Research Institute, and the Agency for Defense Development to conduct joint research and development should be instituted. The U.S. Air Force has dismantled its own space launch vehicles step by step and has instead increased its use of private space launch vehicles. In addition, military communication has been operated separately from civil communication services or broadcasting services due to the special circumstances unique to the military setting. 
However, joint operation of communication facilities by military and civil users is preferred because it reduces the financial burden resulting from separate operation of a military satellite. During the Gulf War, U.S. armed forces employed commercial satellites for military communication. Korea's participation in space technology research started somewhat late, considering its economic scale. In terms of budget, Korea is to spend 5 trillion won over 15 years on space activities, whereas Japan has an annual budget of 2 trillion won for the same activities. Because the space industry is not a profit-making business during its initial fostering period, government support is inevitable. The space development programs of other countries are entirely supported by their governments, and only recently has private industry started participating in limited areas such as communication satellites and broadcasting satellites. In particular, Korea's space industry is in its infancy and therefore largely depends on government support. Government support should take the form of investment or financial contribution rather than loans or borrowing. Compared with advanced countries in the space industry, Korea needs more budget and more professional research staff. Naturally, for efficient and systematic space development and for the prevention of overlapping and dispersed efforts, it is necessary to enact space-related statutes that would provide a clear vision for Korean space development. Furthermore, the fact that a variety of departments are running their own space development programs calls for a single, centralized space-industry development system. 
Prior to discussing how to coordinate or integrate space programs between the Agency for Defense Development and the Korea Aerospace Research Institute, it is a prerequisite to establish a "Space Operations Center" in the Air Force, which would determine policy and strategy for operating space forces. Establishing the "Space Operations Center" requires policy determinations by the Ministry of National Defense and the Joint Chiefs of Staff. In particular, a space surveillance system using a military surveillance satellite and a communication satellite, which would lay the foundation for independent defense, should be established with reference to Japan's space force plan. To resolve issues related to the MTCR, the Air Force would use the space launch vehicle of the Korea Aerospace Research Institute. Moreover, a defense budget should be appropriated for using multi-purpose satellites and communication satellites. The Ministry of National Defense needs to appropriate a budget of 2.5 trillion won for space operations, which is equivalent to Japan's surveillance satellite operating budget.
