• Title/Summary/Keyword: 구체화

Search Result 1,693, Processing Time 0.024 seconds

Multi-Vector Document Embedding Using Semantic Decomposition of Complex Documents (복합 문서의 의미적 분해를 통한 다중 벡터 문서 임베딩 방법론)

  • Park, Jongin;Kim, Namgyu
    • Journal of Intelligence and Information Systems
    • /
    • v.25 no.3
    • /
    • pp.19-41
    • /
    • 2019
  • According to the rapidly increasing demand for text data analysis, research and investment in text mining are being actively conducted not only in academia but also in various industries. Text mining is generally conducted in two steps. In the first step, the text of the collected document is tokenized and structured to convert the original document into a computer-readable form. In the second step, tasks such as document classification, clustering, and topic modeling are conducted according to the purpose of analysis. Until recently, text mining-related studies have been focused on the application of the second steps, such as document classification, clustering, and topic modeling. However, with the discovery that the text structuring process substantially influences the quality of the analysis results, various embedding methods have actively been studied to improve the quality of analysis results by preserving the meaning of words and documents in the process of representing text data as vectors. Unlike structured data, which can be directly applied to a variety of operations and traditional analysis techniques, Unstructured text should be preceded by a structuring task that transforms the original document into a form that the computer can understand before analysis. It is called "Embedding" that arbitrary objects are mapped to a specific dimension space while maintaining algebraic properties for structuring the text data. Recently, attempts have been made to embed not only words but also sentences, paragraphs, and entire documents in various aspects. Particularly, with the demand for analysis of document embedding increases rapidly, many algorithms have been developed to support it. Among them, doc2Vec which extends word2Vec and embeds each document into one vector is most widely used. However, the traditional document embedding method represented by doc2Vec generates a vector for each document using the whole corpus included in the document. This causes a limit that the document vector is affected by not only core words but also miscellaneous words. Additionally, the traditional document embedding schemes usually map each document into a single corresponding vector. Therefore, it is difficult to represent a complex document with multiple subjects into a single vector accurately using the traditional approach. In this paper, we propose a new multi-vector document embedding method to overcome these limitations of the traditional document embedding methods. This study targets documents that explicitly separate body content and keywords. In the case of a document without keywords, this method can be applied after extract keywords through various analysis methods. However, since this is not the core subject of the proposed method, we introduce the process of applying the proposed method to documents that predefine keywords in the text. The proposed method consists of (1) Parsing, (2) Word Embedding, (3) Keyword Vector Extraction, (4) Keyword Clustering, and (5) Multiple-Vector Generation. The specific process is as follows. all text in a document is tokenized and each token is represented as a vector having N-dimensional real value through word embedding. After that, to overcome the limitations of the traditional document embedding method that is affected by not only the core word but also the miscellaneous words, vectors corresponding to the keywords of each document are extracted and make up sets of keyword vector for each document. Next, clustering is conducted on a set of keywords for each document to identify multiple subjects included in the document. Finally, a Multi-vector is generated from vectors of keywords constituting each cluster. The experiments for 3.147 academic papers revealed that the single vector-based traditional approach cannot properly map complex documents because of interference among subjects in each vector. With the proposed multi-vector based method, we ascertained that complex documents can be vectorized more accurately by eliminating the interference among subjects.

The Politics and Governance of 'Maeul' Community Archives in South Korea (마을공동체 아카이브의 거버넌스 모델 연구)

  • Lee, Kyong Rae
    • The Korean Journal of Archival Studies
    • /
    • no.45
    • /
    • pp.51-82
    • /
    • 2015
  • Maeul-making, which is to restore inherent characteristics of maeul as a living community has been proceeded by local communities themselves since the 1990s when political democracy and local government in Korean society has been progressed in full-scale. Although New Maeul Movement has been done in the 1970s before and after, it is different from maeul-making because it was focused mainly on improving physical environments of rural communities and initiated by government. The development of maeul community archives in Korea has been related closely to such a maeul-making since the 1990s. Maeul-based community archives, maeul community archives had been begun to build as part of maeul-making and grass-root movement by the 2000s. Initiated by self-motivated communities, maeul community archives were carried out through cooperations between civic activists and residents in maeul communities and voluntary professional archivists from outside. Although records about the maeul community has been collected by mainstream cultural institutions such as public archives, museum, local historical association, and local cultural center, it was at this time to collect records of the maeul community by self-motivated local residents. This tendency of 'independent' maeul community archives, however, is currently entering upon a new phase with the city of Seoul's project (2012) to support making a maeul community, that is, the governance phase based on private-government partnership. At this point of time, it is important for maeul community archives to be built on privately-led governance model that guarantees their autonomy and at the same time bring government's knowhow and supports into them, as opposed to the way captured or driven unilaterally by government. This article explores the growth of maeul community archives and collections in Korean society through a range of self-motivated bodies; the interaction with government; and as a result of those interactions, the creation of maeul community archives based on governance. To introduce and explicate the motivations behind maeul archiving endeavors, this article will first sketch something of the historical, social, and political context in which 'maeul' communities have arisen, collapsed, and restored. It will then examine in more detail some specific examples of maeul community archives as grass-root movement of maeul community. The third section will attempt to identify the governance model of maeul community archives under the auspices of the city of Seoul and its limitations. Finally through these activities, it will suggest the ways in which maeul community archives commit themselves to their duty of grass-root movement of community and at the same time, secure sustainability, that is, concrete ways of privately initiated governance model.

A Status Analysis for the Standards on Permission of Altering Cultural Heritage's Current State Focusing on the Results of Handling Application Cases on Permission of State-Designated Cultural Heritage (Historic Site) for the Last Five Years (2015~2019) (문화재 현상변경 인·허가 검토기준 마련을 위한 실태분석 연구 - 최근 5년(2015~2019)간 국가지정문화재(사적)의 허가신청 안건 처리결과를 중심으로 -)

  • CHO, Hongseok;SUH, Hyunjung;CHOI, Jisu
    • Korean Journal of Heritage: History & Science
    • /
    • v.54 no.3
    • /
    • pp.24-51
    • /
    • 2021
  • Since June 2006, there have been active efforts to systematize the permission system including the amendment of [Cultural Heritage Protection Act]. Cultural Heritage Administration prepared standards on reviewing each type of cultural heritages(CH) in 2015, promoted a project on the modification of permission standards and showed remarkable performances in quantitative aspects. But as there has been little change for the cases applied for permission, additional studies on policy are required to improve the management efficiency and reduce the citizens'inconvenience. In response, this study aims to identify the actual management status on the current state alteration permission system, and establish practically utilizable reference materials at permission review. While historic sites(HS) constitute a relatively small proportion in state-designated CHs, they are subject to the designation of permission standards. Also, with their location in the downtown area, the application rate is high (51.4%) and the results are commonly utilizable to other types of CH. We constructed a DB based on the minutes of Cultural Heritage Committee(CHC) on HS and categorized similar features in permission handling results. The result of the analysis is as follows. Out of a total of 5,243 cases for permission applied for HS, 1,734 cases of cultural heritage areas(CHA) and 3,509 cases of historic and cultural environment preservation areas(HCEPA) have been applied. CHA has a great proportion of the applications for events and festivals, which are highly related to CHs or representing the local area. There is a high permission rate on applications for the purpose of public service by local governments. Meanwhile, HCEPA has a high proportion of applying for the installation and extension of buildings and facilities at the private level. Thus, negative decisions were made for tall buildings, massed facilities, or suspected scattering of similar acts. Our actual condition analysis has identified a total of 78 types of harmful acts which may influence the preservation of CHs. 31 types in CHA and 37 types in HCEPA are categorized. Especially, 10 common types of permission have been confirmed in both sectors. As a result, it is expected to secure consistency in the permission administration, enhance the management efficiency and improve the public's satisfaction over the regulatory administration by providing practically utilizable reference materials for altering the current state of CH and for decision making on the part of CHC.

An Analysis of the Managerial Level's Gender Gap and "Glass Ceiling" of the Corporation (기업 관리직의 젠더 격차와 "유리천장" 분석)

  • Cho, Heawon;Hahm, Inhee
    • 한국사회정책
    • /
    • v.23 no.2
    • /
    • pp.49-81
    • /
    • 2016
  • This study agrees with the idea that a situation centered perspective provides a useful contribution in understanding women's attitude on organizations. Women's occupational experiences are less related to their "femaleness" than to the structural constraints inherent in the occupational positions women fill. So characteristics of the organizational situation including gender composition and hierarchical status may "shape and define" women's experience on the job. The present study examined the managerial level's gender gap and "glass ceiling" of the corporation. According to Kanter, if the ratio of women to men in organizations begins to shift, as affirmative action and new hiring and promotion policies promised, forms of relationships and corporate culture should also change. However, the mere presence of women on workplace may not, in itself, result in women-friendly work condition. This study analyzes "Korean Women Manger Panel survey(2010 3rd. wave)" to examine how much gender gap of the managerial level persists and when the glass ceiling effect emerges. Using t-test and ANOVA, various aspects of the gender gap within managerial level were verified. The most significant finding is the glass ceiling effect starts from very low level of management. Policy implications from the statistical analysis of the Panel survey are: 1) We need to increase the absolute number of the women managers for securing middle level women leadership pipe line. 2) We need to confront the fact that the glass ceiling starts from the very low managerial level, and to explore more realistic way to break up the vicious circle for the tokenism. and 3) We need to looking beyond numbers in approaching women's matter at work. At the cultural and institutional level, work-family programs and policies, women's ratings of their competence, and family-friendly organization's climate should be considered.

On the Problem of Virtue in Confucian and Neoconfucian Philosophy (유학 및 신유학 철학에서의 덕의 문제)

  • Gabriel, Werner
    • (The)Study of the Eastern Classic
    • /
    • no.50
    • /
    • pp.89-120
    • /
    • 2013
  • The concept of virtue seems to be one of the rare cases where the European and the Chinese traditions coincide. The meaning of the Latin word virtus and of Greek $aret{\acute{e}}$ seems to be similar to the Chinese $d{\acute{e}}$德. Most striking in virtue is that it is a capacity for self-realisation through action which is unique to man. On the other hand, there is something physical about it. It is the strength to do something. This strength overcomes the resistance of what is naturally given, it transforms the world, turns the natural world into a human one. In the Chinese tradition, $d{\acute{e}}$ 德, i.e. virtue, is therefore always connected with $da{\grave{o}}$ 道, the totality of natural forces. In the Chinese tradition, as opposed to the European one, virtue is itself considered to be a natural force that is present in man. This force sustains man's connectedness, unity and harmony with the surrounding world. Things exist through the unity of principle理 and ether氣. But the knowledge of this unity is due to principle. Moral and legal norms are shifted totally to the sphere of principle. Therefore their have found the final dissolution from a heroic models. Above all the classical Confucians, but also the other schools, would reply to this that there is nothing more precise than a concrete successful action. Its result fits the world perfectly. The difference is due to the differing interest of ethical thought. In the case of the Confucians the path is more direct. The actor establishes a precise pattern for other actions. Education therefore lies in detailed knowledge about forms of behaviour, not so much in conceptual differentiation. It is quite possible that generalisation may be a methodical prerequisite for success in this endeavour. That problem, too, is discussed. But the success of conceptualisation lies in the successful performance of individual actions, not in shaping actions in accordance with normative concepts.

Development of the forecasting model for import volume by item of major countries based on economic, industrial structural and cultural factors: Focusing on the cultural factors of Korea (경제적, 산업구조적, 문화적 요인을 기반으로 한 주요 국가의 한국 품목별 수입액 예측 모형 개발: 한국의, 한국에 대한 문화적 요인을 중심으로)

  • Jun, Seung-pyo;Seo, Bong-Goon;Park, Do-Hyung
    • Journal of Intelligence and Information Systems
    • /
    • v.27 no.4
    • /
    • pp.23-48
    • /
    • 2021
  • The Korean economy has achieved continuous economic growth for the past several decades thanks to the government's export strategy policy. This increase in exports is playing a leading role in driving Korea's economic growth by improving economic efficiency, creating jobs, and promoting technology development. Traditionally, the main factors affecting Korea's exports can be found from two perspectives: economic factors and industrial structural factors. First, economic factors are related to exchange rates and global economic fluctuations. The impact of the exchange rate on Korea's exports depends on the exchange rate level and exchange rate volatility. Global economic fluctuations affect global import demand, which is an absolute factor influencing Korea's exports. Second, industrial structural factors are unique characteristics that occur depending on industries or products, such as slow international division of labor, increased domestic substitution of certain imported goods by China, and changes in overseas production patterns of major export industries. Looking at the most recent studies related to global exchanges, several literatures show the importance of cultural aspects as well as economic and industrial structural factors. Therefore, this study attempted to develop a forecasting model by considering cultural factors along with economic and industrial structural factors in calculating the import volume of each country from Korea. In particular, this study approaches the influence of cultural factors on imports of Korean products from the perspective of PUSH-PULL framework. The PUSH dimension is a perspective that Korea develops and actively promotes its own brand and can be defined as the degree of interest in each country for Korean brands represented by K-POP, K-FOOD, and K-CULTURE. In addition, the PULL dimension is a perspective centered on the cultural and psychological characteristics of the people of each country. This can be defined as how much they are inclined to accept Korean Flow as each country's cultural code represented by the country's governance system, masculinity, risk avoidance, and short-term/long-term orientation. The unique feature of this study is that the proposed final prediction model can be selected based on Design Principles. The design principles we presented are as follows. 1) A model was developed to reflect interest in Korea and cultural characteristics through newly added data sources. 2) It was designed in a practical and convenient way so that the forecast value can be immediately recalled by inputting changes in economic factors, item code and country code. 3) In order to derive theoretically meaningful results, an algorithm was selected that can interpret the relationship between the input and the target variable. This study can suggest meaningful implications from the technical, economic and policy aspects, and is expected to make a meaningful contribution to the export support strategies of small and medium-sized enterprises by using the import forecasting model.

The Operation Plan of the Community-Linked Extracurricular Education program for Lifelong Education for the Persons with Disabilities Based on the Memorandum of Understanding (MOU) of Extracurricular Education between Chosun University and Daegu University (조선대학교-대구대학교 비교과 교육 업무협약(MOU) 기반 지역 연계 장애인평생교육 비교과프로그램 운영 방략)

  • Kim, Young-Jun;Kim, Wha-Soo;Rhee, Kun-Yong
    • The Journal of the Convergence on Culture Technology
    • /
    • v.8 no.2
    • /
    • pp.273-280
    • /
    • 2022
  • Based on the MOU between Chosun University and Daegu University, this study was conducted with the aim of exploring the operation strategy of a extracurricular education program on the theme of lifelong education for the disabled in community connection. In front-line university sites, extracurricular education programs are often recognized as forms and procedures to assist in subject learning at the major or liberal arts level, but they have a very important status and identity considering that they are classified as "learning competency reinforcement support", "career psychological counseling support", "employment and start-up support", "subject-linked extracurricular education". Accordingly, the extracurricular education programs has the nature and advantage of covering not only the level of the one-time trend program itself, but also various community -linked problem-solving learning, including students' major learning and employment linkage. As part of the above, this study aims to present a strategy for the operation of a extracurricular education programs with the main theme and content of "lifelong education for the disabled" by viewing Chosun University and Daegu University. The contents of the study were largely presented as "organizational operation strategy between two universities," "operation strategy of curriculum between two universities," and "comprehensive system for extracurricular education programs operation of lifelong education for the disabled between the two universities". First, the first research content, "Organized Operation Strategy between Two Universities," was schematized in detail the process of collaborating and communicating with Chosun University's center of extracurricular activities, Daegu University Lifelong Education Center, and other committees and departments. The second research content, "The Curriculum Operation Strategy between Two Universities", is a detailed schematic diagram of the learning contents, methods, and procedures to be organized in the extracurricular education program. The third study, "Comprehensive System of extracurricular education program Operation for Lifelong Education for the Disabled between Two Universities," presents the results of synthesizing the basis elements essential for operating the extracurricular education program at the level of a roadmap. As a result of the study, it was possible to see the project tasks that could be promoted in-depth through the operation of a extracurricular education program on lifelong education for the disabled through the MOU between the two universities.

Professional Baseball Viewing Culture Survey According to Corona 19 using Social Network Big Data (소셜네트워크 빅데이터를 활용한 코로나 19에 따른 프로야구 관람문화조사)

  • Kim, Gi-Tak
    • Journal of Korea Entertainment Industry Association
    • /
    • v.14 no.6
    • /
    • pp.139-150
    • /
    • 2020
  • The data processing of this study focuses on the textom and social media words about three areas: 'Corona 19 and professional baseball', 'Corona 19 and professional baseball', and 'Corona 19 and professional sports' The data was collected and refined in a web environment and then processed in batch, and the Ucinet6 program was used to visualize it. Specifically, the web environment was collected using Naver, Daum, and Google's channels, and was summarized into 30 words through expert meetings among the extracted words and used in the final study. 30 extracted words were visualized through a matrix, and a CONCOR analysis was performed to identify clusters of similarity and commonality of words. As a result of analysis, the clusters related to Corona 19 and Pro Baseball were composed of one central cluster and five peripheral clusters, and it was found that the contents related to the opening of professional baseball according to the corona 19 wave were mainly searched. The cluster related to Corona 19 and unrelated to professional baseball consisted of one central cluster and five peripheral clusters, and it was found that the keyword of the position of professional baseball related to the professional baseball game according to Corona 19 was mainly searched. Corona 19 and the cluster related to professional sports consisted of one central cluster and five peripheral clusters, and it was found that the keywords related to the start of professional sports according to the aftermath of Corona 19 were mainly searched.

The Effect of Domain Specificity on the Performance of Domain-Specific Pre-Trained Language Models (도메인 특수성이 도메인 특화 사전학습 언어모델의 성능에 미치는 영향)

  • Han, Minah;Kim, Younha;Kim, Namgyu
    • Journal of Intelligence and Information Systems
    • /
    • v.28 no.4
    • /
    • pp.251-273
    • /
    • 2022
  • Recently, research on applying text analysis to deep learning has steadily continued. In particular, researches have been actively conducted to understand the meaning of words and perform tasks such as summarization and sentiment classification through a pre-trained language model that learns large datasets. However, existing pre-trained language models show limitations in that they do not understand specific domains well. Therefore, in recent years, the flow of research has shifted toward creating a language model specialized for a particular domain. Domain-specific pre-trained language models allow the model to understand the knowledge of a particular domain better and reveal performance improvements on various tasks in the field. However, domain-specific further pre-training is expensive to acquire corpus data of the target domain. Furthermore, many cases have reported that performance improvement after further pre-training is insignificant in some domains. As such, it is difficult to decide to develop a domain-specific pre-trained language model, while it is not clear whether the performance will be improved dramatically. In this paper, we present a way to proactively check the expected performance improvement by further pre-training in a domain before actually performing further pre-training. Specifically, after selecting three domains, we measured the increase in classification accuracy through further pre-training in each domain. We also developed and presented new indicators to estimate the specificity of the domain based on the normalized frequency of the keywords used in each domain. Finally, we conducted classification using a pre-trained language model and a domain-specific pre-trained language model of three domains. As a result, we confirmed that the higher the domain specificity index, the higher the performance improvement through further pre-training.

Anti-inflammatory Activity of Sorghum bicolor (L.) Moench var. Hwanggeumchal Grains in Lipopolysaccharide-stimulated RAW264.7 Murine Macrophage Cell Line (지질다당류-자극된 마우스 대식세포주 RAW264.7에서 황금찰수수 종자의 항염증 활성)

  • Jun, Do Youn;Woo, Hyun Joo;Ko, Jee Youn;Kim, Young Ho
    • Journal of Life Science
    • /
    • v.32 no.12
    • /
    • pp.929-937
    • /
    • 2022
  • To investigate the anti-inflammatory activity of the grains of sorghum, three Sorghum bicolor (L.) Moench variants (Hwanggeumchal, Huinchal, and Chal) being cultivated in Korea, the 80% ethanol (EtOH) extracts of individual sorghum grains were compared for their inhibitory activity against nitric oxide (NO) production in lipopolysaccharide (LPS)-stimulated RAW264.7 murine macrophage cell line. Among them, the EtOH extract of sorghum Hwanggeumchal grains could exert the highest inhibitory effect on the LPS-induced NO production. However, under these conditions, the viability of RAW264.7 cells was not affected. When the EtOH extract of sorghum Hwanggeumchal grains was sequentially fractionated with n-hexane, methylene chloride (MC), ethyl acetate (EtOAc), and n-butanol, the anti-NO production activity was predominantly detected in both MC and EtOAc fractions. In particular, treatment with the MC fraction reduced dose-dependently the expression levels of iNOS, COX-2 and pro-inflammatory cytokines (IL-1β, IL-6, and TNF-α) in LPS-stimulated RAW264.7 cells. Simultaneously, the MC fraction could prevent LPS-induced activating phosphorylation of p38 mitogen-activated protein kinase (MAPK), c-Jun N-terminal kinase (JNK) and extracellular signal-regulated kinase (ERK). HPLC analysis of the MC fraction showed gentisic acid and naringenin as the major phenolic components. Both gentisic acid and naringenin commonly exhibited a potent inhibitory activity against LPS-induced NO production in RAW264.7 cells. Together, these results provide the evidence of the inhibitory activity of Hwanggeumchal grains on LPS-induce inflammatory responses in RAW264.7 murine macrophage cells and also suggest that sorghum grains possess beneficial health effects which can be applicable in development of the grain-based functional foods.