• Title/Summary/Keyword: 용어(用語)

Search Result 3,808, Processing Time 0.031 seconds

The Etymological Study in Synonyms of Chinese Character 'Xing(行)' (『광아소증(廣雅疏證)』의 '행(行)'의자(義字) 훈고(訓詁)에 보이는 성동성근자(聲同聲近字)에 대한 고찰)

  • 서한용
    • Journal of Sinology and China Studies
    • /
    • v.78
    • /
    • pp.47-65
    • /
    • 2019
  • Guang-ya(廣雅) compiled by Zhang Ji(張揖), in about A.D. 227, is a dictionary of synonyms. Wang Nian-sun(王念孫) discriminated among synonyms by Guang-ya-shu-zheng(廣雅疏證) in about A.D. 1795. In his book, Wang Nian-sun(王念孫) annotated synonyms of Chinese characters in Guang-ya(廣雅). At the same time, he also tried to find out the homophonic and synonymic relationship in Chinese characters. These are the foundation of the graphonomy of Chinese characters. So we can say, Wang Nian-sun(王念孫) made the greatest contribution to the theoretical construction to the etymology of Chinese characters. The outstanding dictionaries of Chinese characters were Shuo-wen(說文) by Xu-shen(許愼), Shuo-wen-jie-zi-zhu(說文解字注) by Duan Yu-cai(段玉裁) and Shuo-wen-tong-xun-ding-sheng(說文通訓定聲) by Zhu Jun-sheng(朱駿聲). These books described Chinese characters and also analyzed the homophonic and synonymic relationship in Chinese characters. The representative dictionaries in the etymology of Chinese characters was Wen-shi(文始) by Zhang Tai-yan(章太炎). The homophonic and synonymic relationship in Chinese characters was described and analyzed in this book. This report consists of five chapters. The first chapter gives the purpose of the etymological study in synonyms of Chinese character 'Xing(行)'. The second chapter gives analysis of the Yi-ti-zi(異體字) in synonyms of Chinese character 'Xing(行)' in Guang-ya-shu-zheng(廣雅疏證). The third chapter gives analysis of the Jia-jie-zi(假借字) in synonyms of Chinese character 'Xing(行)' in Guang-ya-shu-zheng(廣雅疏證). The fourth chapter gives analysis of the Tong-yuan-zi(同源字) in synonyms of Chinese character 'Xing(行)' in Guang-ya-shu-zheng(廣雅疏證). This report also gives analysis of the etymological study in synonyms of Chinese character 'Xing(行)' in Shuo-wen(說文), Shuo-wen-jie-zi-zhu(說文解字注), Shuo-wen-tong-xun-ding-sheng(說文通訓定聲). The concluding chapter provides the summary of the preceding chapters and the description of conclusion.

A Survey of Korean Consumers' Awareness on Animal Welfare of Laying Hens (산란계 동물복지에 대한 국내 소비자의 인지도 조사)

  • Hong, Eui-Chul;Kang, Hwan-Ku;Park, Ki-Tae;Jeon, Jin-Joo;Kim, Hyun-Soo;Kim, Chan-Ho;Kim, Sang-Ho
    • Korean Journal of Poultry Science
    • /
    • v.45 no.3
    • /
    • pp.219-228
    • /
    • 2018
  • This study was conducted twice to investigate egg purchase behavior and perception on animal welfare of Korean consumers. This study included women, who were the main decision makers and caretakers in the household, and men with one-person household. This survey was conducted with by the Computer Assisted Web Interview and Gang Survey methods. On the key considerations factor, the highest response rate was considered to be 'price', and the response rate of considering 'packing date' increased in the second survey. At a reasonable price based on 10 eggs, the response rate was the highest at 53.8% and 42.9% in both the first and second surveys and the appropriate price averages were 2,482 won and 2,132 won, respectively. The highest rate of purchase of egg consumers from 'Large Mart' followed by 'Medium sized supermarket' and 'Chain supermarket'. As for the awareness about animal welfare, the recognition ratio (73.5%) was higher in the result of the second survey than the first. The cognitive period of animal welfare was 59.0% before the insecticide egg crisis and 41.0% thereafter. Regarding whether or not they have ever seen an animal welfare certification mark and an animal welfare animal farm certification mark, 59.6% of respondents said that they saw it for the first time and 37.6% answered that they knew the animal welfare certification mark. On the animal welfare system, the 'free-range' response rate was the highest at 85.8%. The 'free-range' fit response decreased by 34.2%p, while the 'barn' and 'European type' fit response increased by 13.2%p and 24.1%p, respectively. The number of 'I have never seen' and 'I have ever eaten' responses to the recognition and eating experience of animal welfare certified eggs decreased while the number of those who answered 'Have ever seen' and 'Have eaten' increased. The answer of purchasing animal welfare certified eggs at department stores, organic farming cooperatives, and internet shopping malls was higher than that of buying conventional eggs. Of the total respondents, 92.0% were willing to purchase an animal welfare egg before the price was offered, but after offering the prices of animal welfare eggs, the intention to purchase was 62.7%, which was about 30%p lower than before. The reason for purchasing an animal welfare certified egg was the highest score of 71.0% for 'I think it is likely to be high in food safety', and 38.1% for 'I think the price is high' for lack of intention to purchase. In the sensory evaluation of animal welfare eggs, egg color and skin texture of conventional eggs were significantly higher than those of certified welfare eggs (P<0.05), and boiled eggs showed that egg whites of animal welfare certified eggs were more (P<0.05). As a result, the results of this study will contribute to the activation of the animal welfare certification system for laying hens by providing basic data on consumer awareness to animal welfare certified farmers.

An Intelligence Support System Research on KTX Rolling Stock Failure Using Case-based Reasoning and Text Mining (사례기반추론과 텍스트마이닝 기법을 활용한 KTX 차량고장 지능형 조치지원시스템 연구)

  • Lee, Hyung Il;Kim, Jong Woo
    • Journal of Intelligence and Information Systems
    • /
    • v.26 no.1
    • /
    • pp.47-73
    • /
    • 2020
  • KTX rolling stocks are a system consisting of several machines, electrical devices, and components. The maintenance of the rolling stocks requires considerable expertise and experience of maintenance workers. In the event of a rolling stock failure, the knowledge and experience of the maintainer will result in a difference in the quality of the time and work to solve the problem. So, the resulting availability of the vehicle will vary. Although problem solving is generally based on fault manuals, experienced and skilled professionals can quickly diagnose and take actions by applying personal know-how. Since this knowledge exists in a tacit form, it is difficult to pass it on completely to a successor, and there have been studies that have developed a case-based rolling stock expert system to turn it into a data-driven one. Nonetheless, research on the most commonly used KTX rolling stock on the main-line or the development of a system that extracts text meanings and searches for similar cases is still lacking. Therefore, this study proposes an intelligence supporting system that provides an action guide for emerging failures by using the know-how of these rolling stocks maintenance experts as an example of problem solving. For this purpose, the case base was constructed by collecting the rolling stocks failure data generated from 2015 to 2017, and the integrated dictionary was constructed separately through the case base to include the essential terminology and failure codes in consideration of the specialty of the railway rolling stock sector. Based on a deployed case base, a new failure was retrieved from past cases and the top three most similar failure cases were extracted to propose the actual actions of these cases as a diagnostic guide. In this study, various dimensionality reduction measures were applied to calculate similarity by taking into account the meaningful relationship of failure details in order to compensate for the limitations of the method of searching cases by keyword matching in rolling stock failure expert system studies using case-based reasoning in the precedent case-based expert system studies, and their usefulness was verified through experiments. Among the various dimensionality reduction techniques, similar cases were retrieved by applying three algorithms: Non-negative Matrix Factorization(NMF), Latent Semantic Analysis(LSA), and Doc2Vec to extract the characteristics of the failure and measure the cosine distance between the vectors. The precision, recall, and F-measure methods were used to assess the performance of the proposed actions. To compare the performance of dimensionality reduction techniques, the analysis of variance confirmed that the performance differences of the five algorithms were statistically significant, with a comparison between the algorithm that randomly extracts failure cases with identical failure codes and the algorithm that applies cosine similarity directly based on words. In addition, optimal techniques were derived for practical application by verifying differences in performance depending on the number of dimensions for dimensionality reduction. The analysis showed that the performance of the cosine similarity was higher than that of the dimension using Non-negative Matrix Factorization(NMF) and Latent Semantic Analysis(LSA) and the performance of algorithm using Doc2Vec was the highest. Furthermore, in terms of dimensionality reduction techniques, the larger the number of dimensions at the appropriate level, the better the performance was found. Through this study, we confirmed the usefulness of effective methods of extracting characteristics of data and converting unstructured data when applying case-based reasoning based on which most of the attributes are texted in the special field of KTX rolling stock. Text mining is a trend where studies are being conducted for use in many areas, but studies using such text data are still lacking in an environment where there are a number of specialized terms and limited access to data, such as the one we want to use in this study. In this regard, it is significant that the study first presented an intelligent diagnostic system that suggested action by searching for a case by applying text mining techniques to extract the characteristics of the failure to complement keyword-based case searches. It is expected that this will provide implications as basic study for developing diagnostic systems that can be used immediately on the site.

The Effect of the Use of Concept Mapping on Science Achievement and the Scientific Attitude in Ocean Units of Earth Science (해양단원 개념도 활용 수업이 과학성취도 및 태도에 미치는 효과)

  • Han, Jung-Hwa;Kim, Kwang-Hui;Park, Soo-Kyong
    • Journal of the Korean earth science society
    • /
    • v.23 no.6
    • /
    • pp.461-473
    • /
    • 2002
  • Concept mapping is a device for representing the conceptual structure of a subject discipline in a two dimensional form which is analogous to a road map. In the teaching and learning of earth science, each concept depends on its relationships to many others for meaning. Using concept mapping in teaching helps teachers and students to be more aware of the key concepts and relationships among them. The purpose of this study is to investigate the effect of the use of concept mapping on science achievement and the scientific attitude in ocean units of earth science. The results of this study are as follows; first, the science achievement of a group of concept mapping teaching is significantly higher than that of the group of traditional teaching. Also, when the achievement levels are compared among different cognitive ability groups, the effect is more significant in mid or lower level student groups than in high level groups. The use of concept mapping is more effective when the concepts have a distinct concept hierarchy. Second, the scores of the test of ‘attitude toward scientific inquiry’ and ‘application of scientific attitude’ of the group of concept mapping teaching are significantly higher than those of the group of traditional teaching, whereas the scores of the test of ‘interest in science learning’ of concept mapping teaching is not different from those of group of traditional teaching. Third, the survey on the use of concept mapping shows a positive response across the tested groups. The use of concept mapping is more beneficial in fostering the comprehension of the topic. A concept map of student's own construction facilitates the assessment of learning, thus promising the usefulness of concept mapping as a means of evaluation. In regard to retention aspect, concept mapping is considered to be more effective in confirming and remembering the topic, while less effective in the aspects of activity and interest. In conclusion, the use of concept maps makes learning an active meaningful process and improves student's academic achievement and scientific attitude. If the concept mapping is more effectively as an active teaching strategy, more meaningful learning will be attained.

Evaluation of External Quality of Brand Soybeans (콩 시판 브랜드 제품의 외관 품질 평가)

  • Jong, Seung-Keun;Woo, Shun-Hee;Kim, Hong-Sig
    • KOREAN JOURNAL OF CROP SCIENCE
    • /
    • v.52 no.3
    • /
    • pp.239-248
    • /
    • 2007
  • Although high nutritional values and continuous identification of important functional substances of soybean [Glycine max (L.) Merrill.] promote consumption of soybean products worldwide, informations on quality of brand soybean is not enough for consumers. Total of 100 brand soybeans [32 for soypaste and source, 45 black testa (lage), and 17 black testa (small) or medicinal soybean and beansprout soybean] were collected at supermarkets and several external quality factors were analyzed. Brand soybeans were marked with the environmental friendly and intimating words along with soybean (white or yellow), black soybean (black-, frost-, late frost-, green or inner-green-), medicinal soybean and beansprout soybean. Among 100 brand soybeans 30% was 1 kg package and 59% was 500 g package, difference between printed and actual weights of 70% brand soybeans was ${\pm}1%$ and weights of 2/3 of brand soybeans were higher than printed weight. Range of 100 seed weights of soypaste and source, black testa (large) and black testa (small) and beansprout soybeans were $23.7{\sim}47.8g$, $21.9{\sim}44.5g$ and $9.5{\sim}15.0g$, respectively. Although ranges of 100 seed weights of soypaste and source and black testa (large) soybeans were similar, 63% of soypaste and source were less than 29 g, while 78% of black testa (large) soybeans were higher than 30 g. Although average and highest percentages of seeds separated with 6.7 mm sieve were similar with 87.4% and 99.9% for soypaste and source soybean and 86.5% and 99.5% for black testa (large) soybean, respectively, the lowest percentages were 70.7% for soypaste and source soybean and 14.4% for black testa (large) soybean. When 100 seed weight was greater than 35 g, 90% of seeds were remained on 6.7 mm sieve. On the other hand 100 g weight and percentage of seeds remained on 6.7 mm sieve showed significantly positive correlations [r=0.7488** for soypaste and source soybean and r=0.7874** for black testa (large) soybean when 100 seed weight was $20{\sim}30g$. Based on hilum color and/or appearance, 76% of brand soybeans collected (more than 90% in yellow testa soybeans) were found to be mixed more than 10% with other cultivars or landraces. Foreign materials such as sand, piece of clothe, wood piece, dead insects, other soybeans were found in 20% of brand soybeans. Average test weight of brand soybeans was 762g $L^{-1}$ with a range of $645{\sim}820g\;L^{-1}$. Soybeans from local markets were as good as brand soybeans in 100 seed weight, uniformity of seeds, weight of foreign materials and test weight.

A Study on Modernization of International Conventions Relating to Aviation Security and Implementation of National Legislation (항공보안 관련 국제협약의 현대화와 국내입법의 이행 연구)

  • Lee, Kang-Bin
    • The Korean Journal of Air & Space Law and Policy
    • /
    • v.30 no.2
    • /
    • pp.201-248
    • /
    • 2015
  • In Korea the number of unlawful interference act on board aircrafts has been increased continuously according to the growth of aviation demand, and there were 55 incidents in 2000, followed by 354 incidents in 2014, and an average of 211 incidents a year over the past five years. In 1963, a number of states adopted the Convention on Offences and Certain Other Acts Committed on Board Aircraft (the Tokyo Convention 1963) as the first worldwide international legal instrument on aviation security. The Tokyo Convention took effect in 1969 and, shortly afterward, in 1970 the Convention for the Suppression of Unlawful Seizure of Aircraft(the Hague Convention 1970) was adopted, and the Convention for the Suppression of Unlawful Acts Against the Safety of Civil Aviation(the Montreal Convention 1971) was adopted in 1971. After 9/11 incidents in 2001, to amend and supplement the Montreal Convention 1971, the Convention on the Suppression of Unlawful Acts Relating to International Civil Aviation(the Beijing Convention 2010) was adopted in 2010, and to supplement the Hague Convention 1970, the Protocol Supplementary to the Convention for the Suppression of Unlawful Seizure of Aircraft(the Beijing Protocol 2010) was adopted in 2010. Since then, in response to increased cases of unruly behavior on board aircrafts which escalated in both severity and frequency,, the Montreal Protocol which is seen as an amendment to the Convention on Offences and Certain Other Acts Committed on Board Aircraft(the Tokyo Convention 1963) was adopted in 2014. Korea ratified the Tokyo Convention 1963, the Hague Convention 1970, the Montreal Convention 1971, the Montreal Supplementary Protocol 1988, and the Convention on the Marking of Plastic Explosive 1991 which have proven to be effective. Under the Tokyo Convention ratified in 1970, Korea further enacted the Aircraft Navigation Safety Act in 1974, as well as the Aviation Safety and Security Act that replaced the Aircraft Navigation Safety Act in August 2002. Meanwhile, the title of the Aviation Safety and Security Act was changed to the Aviation Security Act in April 2014. The Aviation Security Act is essentially an implementing legislation of the Tokyo Convention and Hague Convention. Also the language of the Aviation Security Act is generally broader than the unruly and disruptive behavior in Sections 1-3 of the model legislation in ICAO Circular 288. The Aviation Security Act has reflected the considerable parts of the implementation of national legislation under the Beijing Convention and Beijing Protocol 2010, and the Montreal Protocol 2014 that are the modernized international conventions relating to aviation security. However, in future, when these international conventions would come into effect and Korea would ratify them, the national legislation that should be amended or provided newly in the Aviation Security Act are as followings : The jurisdiction, the definition of 'in flight', the immunity from the actions against the aircraft commander, etc., the compulsory delivery of the offender by the aircraft commander, etc., the strengthening of penalty on the person breaking the law, the enlargement of application to the accomplice, and the observance of international convention. Among them, particularly the Korean legislation is silent on the scope of the jurisdiction. Therefore, in order for jurisdiction to be extended to the extra-territorial cases of unruly and disruptive offences, it is desirable that either the Aviation Security Act or the general Crime Codes should be revised. In conclusion, in order to meet the intelligent and diverse aviation threats, the Korean government should review closely the contents of international conventions relating to aviation security and the current ratification status of international conventions by each state, and make effort to improve the legislation relating to aviation security and the aviation security system for the ratification of international conventions and the implementation of national legislation under international conventions.

The Role of the Soft Law for Space Debris Mitigation in International Law (국제법상 우주폐기물감축 연성법의 역할에 관한 연구)

  • Kim, Han-Taek
    • The Korean Journal of Air & Space Law and Policy
    • /
    • v.30 no.2
    • /
    • pp.469-497
    • /
    • 2015
  • In 2009 Iridium 33, a satellite owned by the American Iridium Communications Inc. and Kosmos-2251, a satellite owned by the Russian Space Forces, collided at a speed of 42,120 km/h and an altitude of 789 kilometers above the Taymyr Peninsula in Siberia. NASA estimated that the satellite collision had created approximately 1,000 pieces of debris larger than 10 centimeters, in addition to many smaller ones. By July 2011, the U.S. Space Surveillance Network(SSN) had catalogued over 2,000 large debris fragments. On January 11, 2007 China conducted a test on its anti-satellite missile. A Chinese weather satellite, the FY-1C polar orbit satellite, was destroyed by the missile that was launched using a multistage solid-fuel. The test was unprecedented for having created a record amount of debris. At least 2,317 pieces of trackable size (i.e. of golf ball size or larger) and an estimated 150,000 particles were generated as a result. As far as the Space Treaties such as 1967 Outer Space Treaty, 1968 Rescue Agreement, 1972 Liability Convention, 1975 Registration Convention and 1979 Moon Agreement are concerned, few provisions addressing the space environment and debris in space can be found. In the early years of space exploration dating back to the late 1950s, the focus of international law was on the establishment of a basic set of rules on the activities undertaken by various states in outer space.. Consequently environmental issues, including those of space debris, did not receive the priority they deserve when international space law was originally drafted. As shown in the case of the 1978 "Cosmos 954 Incident" between Canada and USSR, the two parties settled it by the memorandum between two nations not by the Space Treaties to which they are parties. In 1994 the 66th conference of International Law Association(ILA) adopted "International Instrument on the Protection of the Environment from Damage Caused by Space Debris". The Inter-Agency Space Debris Coordination Committee(IADC) issued some guidelines for the space debris which were the basis of "the UN Space Debris Mitigation Guidelines" which had been approved by the Committee on the Peaceful Uses of Outer Space(COPUOS) in its 527th meeting. On December 21 2007 this guideline was approved by UNGA Resolution 62/217. The EU has proposed an "International Code of Conduct for Outer Space Activities" as a transparency and confidence-building measure. It was only in 2010 that the Scientific and Technical Subcommittee began considering as an agenda item the long-term sustainability of outer space. A Working Group on the Long-term Sustainability of Outer Space Activities was established, the objectives of which include identifying areas of concern for the long-term sustainability of outer space activities, proposing measures that could enhance sustainability, and producing voluntary guidelines to reduce risks to long-term sustainability. By this effort "Guidelines on the Long-term Sustainability of Outer Space Activities" are being under consideration. In the case of "Declaration of Legal Principles Governing the Activities of States in the Exp1oration and Use of Outer Space" adopted by UNGA Resolution 1962(XVIII), December 13 1963, the 9 principles proclaimed in that Declaration, although all of them incorporated in the Space Treaties, could be regarded as customary international law binding all states considering the time and opinio juris by the responses of the world. Although the soft law such as resolutions, guidelines are not binding law, there are some provisions which have a fundamentally norm-creating character and customary international law. In November 12 1974 UN General Assembly recalled through a Resolution 3232(XXIX) "Review of the role of International Court of Justice" that the development of international law may be reflected, inter alia, by the declarations and resolutions of the General Assembly which may to that extend be taken into consideration by the judgements of the International Court of Justice. We are expecting COPUOS which gave birth 5 Space Treaties that it could give us binding space debris mitigation measures to be implemented based on space debris mitigation soft law in the near future.

A Study on Differences of Contents and Tones of Arguments among Newspapers Using Text Mining Analysis (텍스트 마이닝을 활용한 신문사에 따른 내용 및 논조 차이점 분석)

  • Kam, Miah;Song, Min
    • Journal of Intelligence and Information Systems
    • /
    • v.18 no.3
    • /
    • pp.53-77
    • /
    • 2012
  • This study analyses the difference of contents and tones of arguments among three Korean major newspapers, the Kyunghyang Shinmoon, the HanKyoreh, and the Dong-A Ilbo. It is commonly accepted that newspapers in Korea explicitly deliver their own tone of arguments when they talk about some sensitive issues and topics. It could be controversial if readers of newspapers read the news without being aware of the type of tones of arguments because the contents and the tones of arguments can affect readers easily. Thus it is very desirable to have a new tool that can inform the readers of what tone of argument a newspaper has. This study presents the results of clustering and classification techniques as part of text mining analysis. We focus on six main subjects such as Culture, Politics, International, Editorial-opinion, Eco-business and National issues in newspapers, and attempt to identify differences and similarities among the newspapers. The basic unit of text mining analysis is a paragraph of news articles. This study uses a keyword-network analysis tool and visualizes relationships among keywords to make it easier to see the differences. Newspaper articles were gathered from KINDS, the Korean integrated news database system. KINDS preserves news articles of the Kyunghyang Shinmun, the HanKyoreh and the Dong-A Ilbo and these are open to the public. This study used these three Korean major newspapers from KINDS. About 3,030 articles from 2008 to 2012 were used. International, national issues and politics sections were gathered with some specific issues. The International section was collected with the keyword of 'Nuclear weapon of North Korea.' The National issues section was collected with the keyword of '4-major-river.' The Politics section was collected with the keyword of 'Tonghap-Jinbo Dang.' All of the articles from April 2012 to May 2012 of Eco-business, Culture and Editorial-opinion sections were also collected. All of the collected data were handled and edited into paragraphs. We got rid of stop-words using the Lucene Korean Module. We calculated keyword co-occurrence counts from the paired co-occurrence list of keywords in a paragraph. We made a co-occurrence matrix from the list. Once the co-occurrence matrix was built, we used the Cosine coefficient matrix as input for PFNet(Pathfinder Network). In order to analyze these three newspapers and find out the significant keywords in each paper, we analyzed the list of 10 highest frequency keywords and keyword-networks of 20 highest ranking frequency keywords to closely examine the relationships and show the detailed network map among keywords. We used NodeXL software to visualize the PFNet. After drawing all the networks, we compared the results with the classification results. Classification was firstly handled to identify how the tone of argument of a newspaper is different from others. Then, to analyze tones of arguments, all the paragraphs were divided into two types of tones, Positive tone and Negative tone. To identify and classify all of the tones of paragraphs and articles we had collected, supervised learning technique was used. The Na$\ddot{i}$ve Bayesian classifier algorithm provided in the MALLET package was used to classify all the paragraphs in articles. After classification, Precision, Recall and F-value were used to evaluate the results of classification. Based on the results of this study, three subjects such as Culture, Eco-business and Politics showed some differences in contents and tones of arguments among these three newspapers. In addition, for the National issues, tones of arguments on 4-major-rivers project were different from each other. It seems three newspapers have their own specific tone of argument in those sections. And keyword-networks showed different shapes with each other in the same period in the same section. It means that frequently appeared keywords in articles are different and their contents are comprised with different keywords. And the Positive-Negative classification showed the possibility of classifying newspapers' tones of arguments compared to others. These results indicate that the approach in this study is promising to be extended as a new tool to identify the different tones of arguments of newspapers.

Efficient Topic Modeling by Mapping Global and Local Topics (전역 토픽의 지역 매핑을 통한 효율적 토픽 모델링 방안)

  • Choi, Hochang;Kim, Namgyu
    • Journal of Intelligence and Information Systems
    • /
    • v.23 no.3
    • /
    • pp.69-94
    • /
    • 2017
  • Recently, increase of demand for big data analysis has been driving the vigorous development of related technologies and tools. In addition, development of IT and increased penetration rate of smart devices are producing a large amount of data. According to this phenomenon, data analysis technology is rapidly becoming popular. Also, attempts to acquire insights through data analysis have been continuously increasing. It means that the big data analysis will be more important in various industries for the foreseeable future. Big data analysis is generally performed by a small number of experts and delivered to each demander of analysis. However, increase of interest about big data analysis arouses activation of computer programming education and development of many programs for data analysis. Accordingly, the entry barriers of big data analysis are gradually lowering and data analysis technology being spread out. As the result, big data analysis is expected to be performed by demanders of analysis themselves. Along with this, interest about various unstructured data is continually increasing. Especially, a lot of attention is focused on using text data. Emergence of new platforms and techniques using the web bring about mass production of text data and active attempt to analyze text data. Furthermore, result of text analysis has been utilized in various fields. Text mining is a concept that embraces various theories and techniques for text analysis. Many text mining techniques are utilized in this field for various research purposes, topic modeling is one of the most widely used and studied. Topic modeling is a technique that extracts the major issues from a lot of documents, identifies the documents that correspond to each issue and provides identified documents as a cluster. It is evaluated as a very useful technique in that reflect the semantic elements of the document. Traditional topic modeling is based on the distribution of key terms across the entire document. Thus, it is essential to analyze the entire document at once to identify topic of each document. This condition causes a long time in analysis process when topic modeling is applied to a lot of documents. In addition, it has a scalability problem that is an exponential increase in the processing time with the increase of analysis objects. This problem is particularly noticeable when the documents are distributed across multiple systems or regions. To overcome these problems, divide and conquer approach can be applied to topic modeling. It means dividing a large number of documents into sub-units and deriving topics through repetition of topic modeling to each unit. This method can be used for topic modeling on a large number of documents with limited system resources, and can improve processing speed of topic modeling. It also can significantly reduce analysis time and cost through ability to analyze documents in each location or place without combining analysis object documents. However, despite many advantages, this method has two major problems. First, the relationship between local topics derived from each unit and global topics derived from entire document is unclear. It means that in each document, local topics can be identified, but global topics cannot be identified. Second, a method for measuring the accuracy of the proposed methodology should be established. That is to say, assuming that global topic is ideal answer, the difference in a local topic on a global topic needs to be measured. By those difficulties, the study in this method is not performed sufficiently, compare with other studies dealing with topic modeling. In this paper, we propose a topic modeling approach to solve the above two problems. First of all, we divide the entire document cluster(Global set) into sub-clusters(Local set), and generate the reduced entire document cluster(RGS, Reduced global set) that consist of delegated documents extracted from each local set. We try to solve the first problem by mapping RGS topics and local topics. Along with this, we verify the accuracy of the proposed methodology by detecting documents, whether to be discerned as the same topic at result of global and local set. Using 24,000 news articles, we conduct experiments to evaluate practical applicability of the proposed methodology. In addition, through additional experiment, we confirmed that the proposed methodology can provide similar results to the entire topic modeling. We also proposed a reasonable method for comparing the result of both methods.

An Ontology Model for Public Service Export Platform (공공 서비스 수출 플랫폼을 위한 온톨로지 모형)

  • Lee, Gang-Won;Park, Sei-Kwon;Ryu, Seung-Wan;Shin, Dong-Cheon
    • Journal of Intelligence and Information Systems
    • /
    • v.20 no.1
    • /
    • pp.149-161
    • /
    • 2014
  • The export of domestic public services to overseas markets contains many potential obstacles, stemming from different export procedures, the target services, and socio-economic environments. In order to alleviate these problems, the business incubation platform as an open business ecosystem can be a powerful instrument to support the decisions taken by participants and stakeholders. In this paper, we propose an ontology model and its implementation processes for the business incubation platform with an open and pervasive architecture to support public service exports. For the conceptual model of platform ontology, export case studies are used for requirements analysis. The conceptual model shows the basic structure, with vocabulary and its meaning, the relationship between ontologies, and key attributes. For the implementation and test of the ontology model, the logical structure is edited using Prot$\acute{e}$g$\acute{e}$ editor. The core engine of the business incubation platform is the simulator module, where the various contexts of export businesses should be captured, defined, and shared with other modules through ontologies. It is well-known that an ontology, with which concepts and their relationships are represented using a shared vocabulary, is an efficient and effective tool for organizing meta-information to develop structural frameworks in a particular domain. The proposed model consists of five ontologies derived from a requirements survey of major stakeholders and their operational scenarios: service, requirements, environment, enterprise, and county. The service ontology contains several components that can find and categorize public services through a case analysis of the public service export. Key attributes of the service ontology are composed of categories including objective, requirements, activity, and service. The objective category, which has sub-attributes including operational body (organization) and user, acts as a reference to search and classify public services. The requirements category relates to the functional needs at a particular phase of system (service) design or operation. Sub-attributes of requirements are user, application, platform, architecture, and social overhead. The activity category represents business processes during the operation and maintenance phase. The activity category also has sub-attributes including facility, software, and project unit. The service category, with sub-attributes such as target, time, and place, acts as a reference to sort and classify the public services. The requirements ontology is derived from the basic and common components of public services and target countries. The key attributes of the requirements ontology are business, technology, and constraints. Business requirements represent the needs of processes and activities for public service export; technology represents the technological requirements for the operation of public services; and constraints represent the business law, regulations, or cultural characteristics of the target country. The environment ontology is derived from case studies of target countries for public service operation. Key attributes of the environment ontology are user, requirements, and activity. A user includes stakeholders in public services, from citizens to operators and managers; the requirements attribute represents the managerial and physical needs during operation; the activity attribute represents business processes in detail. The enterprise ontology is introduced from a previous study, and its attributes are activity, organization, strategy, marketing, and time. The country ontology is derived from the demographic and geopolitical analysis of the target country, and its key attributes are economy, social infrastructure, law, regulation, customs, population, location, and development strategies. The priority list for target services for a certain country and/or the priority list for target countries for a certain public services are generated by a matching algorithm. These lists are used as input seeds to simulate the consortium partners, and government's policies and programs. In the simulation, the environmental differences between Korea and the target country can be customized through a gap analysis and work-flow optimization process. When the process gap between Korea and the target country is too large for a single corporation to cover, a consortium is considered an alternative choice, and various alternatives are derived from the capability index of enterprises. For financial packages, a mix of various foreign aid funds can be simulated during this stage. It is expected that the proposed ontology model and the business incubation platform can be used by various participants in the public service export market. It could be especially beneficial to small and medium businesses that have relatively fewer resources and experience with public service export. We also expect that the open and pervasive service architecture in a digital business ecosystem will help stakeholders find new opportunities through information sharing and collaboration on business processes.