• Title/Summary/Keyword: Third

Search Result 33,402, Processing Time 0.07 seconds

A Comparison of the Designation Characteristics of Korean Scenic Sites Policies and National Park System in the United States (국내 명승 정책과 미국 국립공원 시스템의 지정 특성 비교)

  • Lee, Won-Ho;Kim, Dong-Hyun;Janet, R. Balsom
    • Journal of the Korean Institute of Traditional Landscape Architecture
    • /
    • v.38 no.3
    • /
    • pp.25-34
    • /
    • 2020
  • This study examined the definition and major values, the designated procedures and types, and the designation trend in Korean scenic sites and national parks in the United States. Based on this, the analysis of the characteristics of the designation of the two natural heritages. The results are as follows; First, Scenic Sites has characteristics of complex heritage that includes academic, historical, and humanities values on the basis of landscape. As a natural heritage based on public nature, the U.S. National Park aims to contribute to the people's natural heritage and satisfy both ecological and historical values through the protection of the landscape. Second, the designation of a scenic sites are decided through deliberation by the Cultural Heritage Committee after the request of the owner, manager, or local government or by the authority of the head of the Cultural Heritage Administration. The designated survey is divided into basic resource surveys and resource surveys by type. Since the initial designation of the Sogeumgang Mountain in Cheonghakdong, Myeongju in 1970, the number of designated scenic sites was low until the 2000s, but the number of designated scenic sites has increased rapidly since 2006 due to the policy to promote the scenic site, and the proportion of natural and historical and cultural scenic sites has been balanced. The designation of the U.S. national park is decided by the Congress or the president, and the National Park Service makes a series of decisions on whether to conduct a special resource study of provisional resources through a preliminary inspection survey, whether to satisfy the criteria for designation of national parks based on the results of special resource research, and to prioritize them. The U.S. National Parks have been expanded not only by Congress but also by the president's empowerment to designate them as national monuments. With the integrated operation of the National Park Service, the number of designated cases increased as the national park included the heritage sites under the control of various ministries. In addition, a number of historical areas were designated by the enactment of the Historical Site Act, and recreational areas were designated to provide leisure space and classified and managed in a total of 18 units. Third, the comparison of the designation characteristics of the two heritage properties confirmed that the designation of natural heritage with complex value, the classification of types according to complementary designation system and resource characteristics, the establishment of the competent ministry and the balancing of the heritage according to the designation policy. The two heritages had the characteristics of complex natural heritages that met ecological, historical and academic values at the same time based on landscape and public nature. In addition, both countries have identified a system for deliberating the designation of heritage through a basic resource survey and an in-depth designation survey, and classified each type according to the characteristics of the resource. In addition, the policies for promoting scenic sites in Korea and the integrated operation of the National Park Service in the U.S. influenced the designated aspects of the two heritage sites, balancing natural heritage with historical and cultural heritage. Fourth, the resource types and conservation management methods of Scenic site and National Park were largely related. The natural areas of the U.S. National Park include types of natural monuments in Korea as major resources, and have characteristics similar to natural scenic sites. In addition, historical resources were similar to the criteria for designation of historical and cultural scenic sites in terms of landscape, and the aspects of war and celebrity-related relics were related to the types of historic sites. In terms of conservation management, the natural area of the U.S. national park has a way of keeping the original ecosystem intact, but the Korean natural heritage protection system is likely to be useful for focusing on the resource of viscosity. Meanwhile, historical resources include historical sites and historical and cultural scenic sites in the traditional era, but historical relics in the U.S. National Parks have set a time limit to modern times for war history and celebrity-related relics, and the active provision of entertainment programs based on existing resources was derived as a difference.

A Study of the Reactive Movement Synchronization for Analysis of Group Flow (그룹 몰입도 판단을 위한 움직임 동기화 연구)

  • Ryu, Joon Mo;Park, Seung-Bo;Kim, Jae Kyeong
    • Journal of Intelligence and Information Systems
    • /
    • v.19 no.1
    • /
    • pp.79-94
    • /
    • 2013
  • Recently, the high value added business is steadily growing in the culture and art area. To generated high value from a performance, the satisfaction of audience is necessary. The flow in a critical factor for satisfaction, and it should be induced from audience and measures. To evaluate interest and emotion of audience on contents, producers or investors need a kind of index for the measurement of the flow. But it is neither easy to define the flow quantitatively, nor to collect audience's reaction immediately. The previous studies of the group flow were evaluated by the sum of the average value of each person's reaction. The flow or "good feeling" from each audience was extracted from his face, especially, the change of his (or her) expression and body movement. But it was not easy to handle the large amount of real-time data from each sensor signals. And also it was difficult to set experimental devices, in terms of economic and environmental problems. Because, all participants should have their own personal sensor to check their physical signal. Also each camera should be located in front of their head to catch their looks. Therefore we need more simple system to analyze group flow. This study provides the method for measurement of audiences flow with group synchronization at same time and place. To measure the synchronization, we made real-time processing system using the Differential Image and Group Emotion Analysis (GEA) system. Differential Image was obtained from camera and by the previous frame was subtracted from present frame. So the movement variation on audience's reaction was obtained. And then we developed a program, GEX(Group Emotion Analysis), for flow judgment model. After the measurement of the audience's reaction, the synchronization is divided as Dynamic State Synchronization and Static State Synchronization. The Dynamic State Synchronization accompanies audience's active reaction, while the Static State Synchronization means to movement of audience. The Dynamic State Synchronization can be caused by the audience's surprise action such as scary, creepy or reversal scene. And the Static State Synchronization was triggered by impressed or sad scene. Therefore we showed them several short movies containing various scenes mentioned previously. And these kind of scenes made them sad, clap, and creepy, etc. To check the movement of audience, we defined the critical point, ${\alpha}$and ${\beta}$. Dynamic State Synchronization was meaningful when the movement value was over critical point ${\beta}$, while Static State Synchronization was effective under critical point ${\alpha}$. ${\beta}$ is made by audience' clapping movement of 10 teams in stead of using average number of movement. After checking the reactive movement of audience, the percentage(%) ratio was calculated from the division of "people having reaction" by "total people". Total 37 teams were made in "2012 Seoul DMC Culture Open" and they involved the experiments. First, they followed induction to clap by staff. Second, basic scene for neutralize emotion of audience. Third, flow scene was displayed to audience. Forth, the reversal scene was introduced. And then 24 teams of them were provided with amuse and creepy scenes. And the other 10 teams were exposed with the sad scene. There were clapping and laughing action of audience on the amuse scene with shaking their head or hid with closing eyes. And also the sad or touching scene made them silent. If the results were over about 80%, the group could be judged as the synchronization and the flow were achieved. As a result, the audience showed similar reactions about similar stimulation at same time and place. Once we get an additional normalization and experiment, we can obtain find the flow factor through the synchronization on a much bigger group and this should be useful for planning contents.

A Study on Perception and Attitudes of Health Workers Towards the Organization and Activities of Urban Health Centers (도시보건소 직원의 보건소 업무에 대한 인식 및 견해)

  • Lee, Jae-Mu;Kang, Pock-Soo;Lee, Kyeong-Soo;Kim, Cheon-Tae
    • Journal of Yeungnam Medical Science
    • /
    • v.12 no.2
    • /
    • pp.347-365
    • /
    • 1995
  • A survey was conducted to study perception and attitudes of health workers towards health center's activities and organization of health services, from August 15 to September 30, 1994. The study population was 310 health workers engaged in seven urban health centers in Taegu City area. A questionnaire method was used to collect data and response rate was 81.3 percent or 252 respondents. The following are summaries of findings: Profiles of study population: Health workers were predominantly female(62.3%); had college education(60.3%); and held medical and nursing positions(39.6%), technicians(30.6%) and public health/administrative positions(29.8%). Perceptions on health center's resources: Slightly more than a half(51.1%) of respondents expressed that physical facilities of the centers are inadequate; equipments needed are short(39.0%); human resource is inadequate(44.8%); and health budget allocated is insufficient(38.5%) to support the performance of health center's activities. Decentralization and health services: The majority revealed that the decentralization of government system would affect the future activities of health centers(51.9%) which may have to change. However, only one quarter of respondents(25.4%) seemed to view the decentralization positively as they expect that it would help perform health activities more effectively. The majority of the respondents(78.6%) insisted that the function and organization of the urban health centers should be changed. Target workload and job satisfaction: A large proportion (43.3%) of respondents felt that present target setting systems for various health activities are unrealistic in terms of community needs and health center's situation while only 11.1 percent responded it positively; the majority(57.5%) revealed that they need further training in professional fields to perform their job more effectively; more than one third(35.7%) expressed that they enjoy their professional autonomy in their job performance; and a considerable proportion (39.3%) said they are satisfied with their present work. Regarding the personnel management, more worker(47.3%) perceived it negatively than positive(11.5%) as most of workers seemed to think the personnel management practiced at the health centers is not fair or justly done. Health services rendered: Among health services rendered, health workers perceived the following services are most successfully delivered; they are, in order of importance, Tb control, curative services, and maternal and child health care. Such areas as health education, oral health, environmental sanitation, and integrated health services are needed to be strengthening. Regarding the community attitudes towards health workers, 41.3 percent of respondents think they are trusted by the community they serve. New areas of concern identified which must be included in future activities of health centers are, in order of priority, health care of elderly population, home health care, rehabilitation services, and such chronic diseases control programs as diabetes, hypertension, school health and mental health care. In conclusion, the study revealed that health workers seemed to have more negative perceptions and attitudes than positive ones towards organization and management of health services and activities performed by the urban health centers where they are engaged. More specifically, the majority of health workers studied revealed to have the following areas of health center's organization and management inadequate or insufficient to support effective performance of their health activities: Namely, physical facilities and equipments required are inadequate; human and financial resources are insufficient; personnel management is unsatisfactory; setting of service target system is unrealistic in terms of the community needs. However, respondents displayed a number of positive perceptions, particularly to those areas as further training needs and implementation of decentralization of government system which will bring more autonomy of local government as they perceived these change would bring the necessary changes to future activities of the health center. They also displayed positive perceptions in their job autonomy and have job satisfactions.

  • PDF

A Study on the Usage of Miào(廟) and Gōng(宮) in Zhou Dynasty through the Mentions to Them in the Scripture Sentences of 『Chūn-qiū(春秋)』 - In the Process of Investigating the Existence of Zhou Dynasty's System to Regulate the Number of Zōng-miào(宗廟) 【1/2】 (『춘추』 경문에서의 묘(廟)·궁(宮) 언급을 통한 주대(周代)의 그 쓰임 사례 일고찰 - 주대의 묘수제(廟數制) 실재 여부에 대한 궁구 과정에서 【1/2】-)

  • Seo, Jeong-Hwa
    • The Journal of Korean Philosophical History
    • /
    • no.57
    • /
    • pp.57-90
    • /
    • 2018
  • In this discussion, as a way to verify the existence of the system to regulate Zhou dynasty's $z{\bar{o}}ng-mi{\grave{a}}o$(宗廟) numbers, the discussion was focused on '$mi{\grave{a}}o$ (廟)' and '$g{\bar{o}}ng$(宮)' in the records of "$Ch{\bar{u}}n-qi{\bar{u}}$(春秋)". As for the parts where the contents of scripture sentences were not specific, the context of the case was investigated through the writings in "$Zu{\breve{o}}-zhu{\grave{a}}n$(左傳)" and other materials. In the cases of the usage of the letter, '$mi{\grave{a}}o$(廟 : a ruler's house, a nation's royal court)', in the scripture sentences in "$Ch{\bar{u}}n-qi{\bar{u}}$(春秋)", the followings need to be noticed. In $t{\grave{a}}i-mi{\grave{a}}o$(太廟) of State $L{\check{u}}$(魯), nationwide events and a ruler's political ancestral rite, $d{\grave{i}}$(?) ritual, were performed, and fancy tools for ancestral rites used in those rituals were equipped. As for the $z{\bar{o}}ng-mi{\grave{a}}o$(宗廟) of a ruler of those times, a ritual of royal court, $ch{\acute{a}}o$(朝) rite, was performed. The usage case of the letter, '$g{\bar{o}}ng$(宮 : house)', is as the following. In $g{\bar{o}}ng$(宮) where a ruler's personal family lived was a family ancestral rite for them carried out. The record about the ornate decorating for the $hu{\acute{a}}n-g{\bar{o}}ng$ house(桓宮), which can be said to have been the political base of $s{\bar{a}}n-hu{\acute{a}}n-sh{\grave{i}}$(三桓氏), three politically noble families of State $L{\check{u}}$(魯), is outstanding. The $x{\bar{i}}-g{\bar{o}}ng$ house(西宮) during $X{\bar{i}}-g{\bar{o}}ng$(魯 僖公)'s reign and a $x{\bar{i}}n-g{\bar{o}}ng$ house(新宮 : a newly built house) destroyed by fire at the third year of $Ch{\acute{e}}ng-g{\bar{o}}ng$(魯 成公), are assumed to have been a ruler's another house, such as the $ch{\check{u}}-g{\bar{o}}ng$ house(楚宮) in which $Xi{\bar{a}}ng-g{\bar{o}}ng$(魯 襄公) used to enjoy staying, which is different from the viewpoint that it might be a $m{\acute{i}}-g{\bar{o}}ng$ shrine(?宮 : a house constructed as a shrine for the deceased father or the deceased grand father) that had been formed since Han dynasty. It has been discussed that, regarding the records that the '$w{\check{u}}-g{\bar{o}}ng$ house(武宮) was built' and that the '$y{\acute{a}}ng-g{\bar{o}}ng$ house(煬宮) was built', certain buildings were established with the symbols of '$w{\check{u}}$(武 : martial arts and force of arms)' and '$y{\acute{a}}ng$(煬 : to burn and get rid of everything)', and the events that a lord stood as its lord continued. Therefore, its main goal was not the performance of a dutiful ancestral rite by a ruler of those times for deceased rulers, for instance, $W{\check{u}}-g{\bar{o}}ng$(魯 武公) or $Y{\acute{a}}ng-g{\bar{o}}ng$(魯 煬公), but display of certain political symbolism through the ritual. This symbolism is most obvious with the $hu{\acute{a}}n-g{\bar{o}}ng$ house(桓宮) and the $x{\bar{i}}-g{\bar{o}}ng$ house(僖宮). As a consequence, all $mi{\grave{a}}os$(廟) and $g{\bar{o}}ngs$(宮) in scripture sentences had the functions of a shrine in some part, but it has been verified that they were not the buildings set up as a shrine to follow '$z{\bar{o}}ng-mi{\grave{a}}o$(宗廟)'s number regulation system' of '$ti{\bar{a}}nz{\check{i}}-7-mi{\grave{a}}o$(天子七廟 : an emperor owns seven $mi{\grave{a}}os$(廟))' or '$zh{\bar{u}}h{\acute{o}}u-5-mi{\grave{a}}o$(諸侯五廟 : a lord owns five $mi{\grave{a}}os$(廟))'.

Application and Expansion of the Harm Principle to the Restrictions of Liberty in the COVID-19 Public Health Crisis: Focusing on the Revised Bill of the March 2020 「Infectious Disease Control and Prevention Act」 (코로나19 공중보건 위기 상황에서의 자유권 제한에 대한 '해악의 원리'의 적용과 확장 - 2020년 3월 개정 「감염병의 예방 및 관리에 관한 법률」을 중심으로 -)

  • You, Kihoon;Kim, Dokyun;Kim, Ock-Joo
    • The Korean Society of Law and Medicine
    • /
    • v.21 no.2
    • /
    • pp.105-162
    • /
    • 2020
  • In the pandemic of infectious disease, restrictions of individual liberty have been justified in the name of public health and public interest. In March 2020, the National Assembly of the Republic of Korea passed the revised bill of the 「Infectious Disease Control and Prevention Act.」 The revised bill newly established the legal basis for forced testing and disclosure of the information of confirmed cases, and also raised the penalties for violation of self-isolation and treatment refusal. This paper examines whether and how these individual liberty limiting clauses be justified, and if so on what ethical and philosophical grounds. The authors propose the theories of the philosophy of law related to the justifiability of liberty-limiting measures by the state and conceptualized the dual-aspect of applying the liberty-limiting principle to the infected patient. In COVID-19 pandemic crisis, the infected person became the 'Patient as Victim and Vector (PVV)' that posits itself on the overlapping area of 'harm to self' and 'harm to others.' In order to apply the liberty-limiting principle proposed by Joel Feinberg to a pandemic with uncertainties, it is necessary to extend the harm principle from 'harm' to 'risk'. Under the crisis with many uncertainties like COVID-19 pandemic, this shift from 'harm' to 'risk' justifies the state's preemptive limitation on individual liberty based on the precautionary principle. This, at the same time, raises concerns of overcriminalization, i.e., too much limitation of individual liberty without sufficient grounds. In this article, we aim to propose principles regarding how to balance between the precautionary principle for preemptive restrictions of liberty and the concerns of overcriminalization. Public health crisis such as the COVID-19 pandemic requires a population approach where the 'population' rather than an 'individual' works as a unit of analysis. We propose the second expansion of the harm principle to be applied to 'population' in order to deal with the public interest and public health. The new concept 'risk to population,' derived from the two arguments stated above, should be introduced to explain the public health crisis like COVID-19 pandemic. We theorize 'the extended harm principle' to include the 'risk to population' as a third liberty-limiting principle following 'harm to others' and 'harm to self.' Lastly, we examine whether the restriction of liberty of the revised 「Infectious Disease Control and Prevention Act」 can be justified under the extended harm principle. First, we conclude that forced isolation of the infected patient could be justified in a pandemic situation by satisfying the 'risk to the population.' Secondly, the forced examination of COVID-19 does not violate the extended harm principle either, based on the high infectivity of asymptomatic infected people to others. Thirdly, however, the provision of forced treatment can not be justified, not only under the traditional harm principle but also under the extended harm principle. Therefore it is necessary to include additional clauses in the provision in order to justify the punishment of treatment refusal even in a pandemic.

Evaluating efficiency of application the skin flash for left breast IMRT. (왼쪽 유방암 세기변조방사선 치료시 Skin Flash 적용에 대한 유용성 평가)

  • Lim, Kyoung Dal;Seo, Seok Jin;Lee, Je Hee
    • The Journal of Korean Society for Radiation Therapy
    • /
    • v.30 no.1_2
    • /
    • pp.49-63
    • /
    • 2018
  • Purpose : The purpose of this study is investigating the changes of treatment plan and comparing skin dose with or without the skin flash. To investigate optimal applications of the skin flash, the changes of skin dose of each plans by various thicknesses of skin flash were measured and analyzed also. Methods and Material : Anthropomorphic phantom was scanned by CT for this study. The 2 fields hybrid IMRT and the 6 fields static IMRT were generated from the Eclipse (ver. 13.7.16, Varian, USA) RTP system. Additional plans were generated from each IMRT plans by changing skin flash thickness to 0.5 cm, 1.0 cm, 1.5 cm, 2.0 cm and 2.5 cm. MU and maximum doses were measured also. The treatment equipment was 6MV of VitalBeam (Varian Medical System, USA). Measuring device was a metal oxide semiconductor field-effect transistor(MOSFET). Measuring points of skin doses are upper (1), middle (2) and lower (3) positions from center of the left breast of the phantom. Other points of skin doses, artificially moved to medial and lateral sides by 0.5 cm, were also measured. Results : The reference value of 2F-hIMRT was 206.7 cGy at 1, 186.7 cGy at 2, and 222 cGy at 3, and reference values of 6F-sIMRT were measured at 192 cGy at 1, 213 cGy at 2, and 215 cGy at 3. In comparison with these reference values, the first measurement point in 2F-hIMRT was 261.3 cGy with a skin flash 2.0 cm and 2.5 cm, and the highest dose difference was 26.1 %diff. and 5.6 %diff, respectively. The third measurement point was 245.3 cGy and 10.5 %diff at the skin flash 2.5 cm. In the 6F-sIMRT, the highest dose difference was observed at 216.3 cGy and 12.7 %diff. when applying the skin flash 2.0 cm for the first measurement point and the dose difference was the largest at the application point of 2.0 cm, not the skin flash 2.5 cm for each measurement point. In cases of medial 0.5 cm shift points of 2F-hIMRT and 6F-sIMRT without skin flash, the measured value was -75.2 %diff. and -70.1 %diff. at 2F, At -14.8, -12.5, and -21.0 %diff. at the 1st, 2nd and 3rd measurement points, respectively. Generally, both treatment plans showed an increase in total MU, maximum dose and %diff as skin flash thickness increased, except for some results. The difference of skin dose using 0.5 cm thickness of skin flash was lowest lesser than 20 % in every conditions. Conclusion : Minimizing the thickness of skin flash by 0.5 cm is considered most ideal because it makes it possible to keep down MUs and lowering maximum doses. In addition, It was found that MUs, maximum doses and differences of skin doses did not increase infinitely as skin flash thickness increase by. If the error margin caused by PTV or other factors is lesser than 1.0 cm, It is considered that there will be many advantages in with the skin flash technique comparing without it.

  • PDF

A Study on Knowledge Entity Extraction Method for Individual Stocks Based on Neural Tensor Network (뉴럴 텐서 네트워크 기반 주식 개별종목 지식개체명 추출 방법에 관한 연구)

  • Yang, Yunseok;Lee, Hyun Jun;Oh, Kyong Joo
    • Journal of Intelligence and Information Systems
    • /
    • v.25 no.2
    • /
    • pp.25-38
    • /
    • 2019
  • Selecting high-quality information that meets the interests and needs of users among the overflowing contents is becoming more important as the generation continues. In the flood of information, efforts to reflect the intention of the user in the search result better are being tried, rather than recognizing the information request as a simple string. Also, large IT companies such as Google and Microsoft focus on developing knowledge-based technologies including search engines which provide users with satisfaction and convenience. Especially, the finance is one of the fields expected to have the usefulness and potential of text data analysis because it's constantly generating new information, and the earlier the information is, the more valuable it is. Automatic knowledge extraction can be effective in areas where information flow is vast, such as financial sector, and new information continues to emerge. However, there are several practical difficulties faced by automatic knowledge extraction. First, there are difficulties in making corpus from different fields with same algorithm, and it is difficult to extract good quality triple. Second, it becomes more difficult to produce labeled text data by people if the extent and scope of knowledge increases and patterns are constantly updated. Third, performance evaluation is difficult due to the characteristics of unsupervised learning. Finally, problem definition for automatic knowledge extraction is not easy because of ambiguous conceptual characteristics of knowledge. So, in order to overcome limits described above and improve the semantic performance of stock-related information searching, this study attempts to extract the knowledge entity by using neural tensor network and evaluate the performance of them. Different from other references, the purpose of this study is to extract knowledge entity which is related to individual stock items. Various but relatively simple data processing methods are applied in the presented model to solve the problems of previous researches and to enhance the effectiveness of the model. From these processes, this study has the following three significances. First, A practical and simple automatic knowledge extraction method that can be applied. Second, the possibility of performance evaluation is presented through simple problem definition. Finally, the expressiveness of the knowledge increased by generating input data on a sentence basis without complex morphological analysis. The results of the empirical analysis and objective performance evaluation method are also presented. The empirical study to confirm the usefulness of the presented model, experts' reports about individual 30 stocks which are top 30 items based on frequency of publication from May 30, 2017 to May 21, 2018 are used. the total number of reports are 5,600, and 3,074 reports, which accounts about 55% of the total, is designated as a training set, and other 45% of reports are designated as a testing set. Before constructing the model, all reports of a training set are classified by stocks, and their entities are extracted using named entity recognition tool which is the KKMA. for each stocks, top 100 entities based on appearance frequency are selected, and become vectorized using one-hot encoding. After that, by using neural tensor network, the same number of score functions as stocks are trained. Thus, if a new entity from a testing set appears, we can try to calculate the score by putting it into every single score function, and the stock of the function with the highest score is predicted as the related item with the entity. To evaluate presented models, we confirm prediction power and determining whether the score functions are well constructed by calculating hit ratio for all reports of testing set. As a result of the empirical study, the presented model shows 69.3% hit accuracy for testing set which consists of 2,526 reports. this hit ratio is meaningfully high despite of some constraints for conducting research. Looking at the prediction performance of the model for each stocks, only 3 stocks, which are LG ELECTRONICS, KiaMtr, and Mando, show extremely low performance than average. this result maybe due to the interference effect with other similar items and generation of new knowledge. In this paper, we propose a methodology to find out key entities or their combinations which are necessary to search related information in accordance with the user's investment intention. Graph data is generated by using only the named entity recognition tool and applied to the neural tensor network without learning corpus or word vectors for the field. From the empirical test, we confirm the effectiveness of the presented model as described above. However, there also exist some limits and things to complement. Representatively, the phenomenon that the model performance is especially bad for only some stocks shows the need for further researches. Finally, through the empirical study, we confirmed that the learning method presented in this study can be used for the purpose of matching the new text information semantically with the related stocks.

Development of Information Extraction System from Multi Source Unstructured Documents for Knowledge Base Expansion (지식베이스 확장을 위한 멀티소스 비정형 문서에서의 정보 추출 시스템의 개발)

  • Choi, Hyunseung;Kim, Mintae;Kim, Wooju;Shin, Dongwook;Lee, Yong Hun
    • Journal of Intelligence and Information Systems
    • /
    • v.24 no.4
    • /
    • pp.111-136
    • /
    • 2018
  • In this paper, we propose a methodology to extract answer information about queries from various types of unstructured documents collected from multi-sources existing on web in order to expand knowledge base. The proposed methodology is divided into the following steps. 1) Collect relevant documents from Wikipedia, Naver encyclopedia, and Naver news sources for "subject-predicate" separated queries and classify the proper documents. 2) Determine whether the sentence is suitable for extracting information and derive the confidence. 3) Based on the predicate feature, extract the information in the proper sentence and derive the overall confidence of the information extraction result. In order to evaluate the performance of the information extraction system, we selected 400 queries from the artificial intelligence speaker of SK-Telecom. Compared with the baseline model, it is confirmed that it shows higher performance index than the existing model. The contribution of this study is that we develop a sequence tagging model based on bi-directional LSTM-CRF using the predicate feature of the query, with this we developed a robust model that can maintain high recall performance even in various types of unstructured documents collected from multiple sources. The problem of information extraction for knowledge base extension should take into account heterogeneous characteristics of source-specific document types. The proposed methodology proved to extract information effectively from various types of unstructured documents compared to the baseline model. There is a limitation in previous research that the performance is poor when extracting information about the document type that is different from the training data. In addition, this study can prevent unnecessary information extraction attempts from the documents that do not include the answer information through the process for predicting the suitability of information extraction of documents and sentences before the information extraction step. It is meaningful that we provided a method that precision performance can be maintained even in actual web environment. The information extraction problem for the knowledge base expansion has the characteristic that it can not guarantee whether the document includes the correct answer because it is aimed at the unstructured document existing in the real web. When the question answering is performed on a real web, previous machine reading comprehension studies has a limitation that it shows a low level of precision because it frequently attempts to extract an answer even in a document in which there is no correct answer. The policy that predicts the suitability of document and sentence information extraction is meaningful in that it contributes to maintaining the performance of information extraction even in real web environment. The limitations of this study and future research directions are as follows. First, it is a problem related to data preprocessing. In this study, the unit of knowledge extraction is classified through the morphological analysis based on the open source Konlpy python package, and the information extraction result can be improperly performed because morphological analysis is not performed properly. To enhance the performance of information extraction results, it is necessary to develop an advanced morpheme analyzer. Second, it is a problem of entity ambiguity. The information extraction system of this study can not distinguish the same name that has different intention. If several people with the same name appear in the news, the system may not extract information about the intended query. In future research, it is necessary to take measures to identify the person with the same name. Third, it is a problem of evaluation query data. In this study, we selected 400 of user queries collected from SK Telecom 's interactive artificial intelligent speaker to evaluate the performance of the information extraction system. n this study, we developed evaluation data set using 800 documents (400 questions * 7 articles per question (1 Wikipedia, 3 Naver encyclopedia, 3 Naver news) by judging whether a correct answer is included or not. To ensure the external validity of the study, it is desirable to use more queries to determine the performance of the system. This is a costly activity that must be done manually. Future research needs to evaluate the system for more queries. It is also necessary to develop a Korean benchmark data set of information extraction system for queries from multi-source web documents to build an environment that can evaluate the results more objectively.

KNU Korean Sentiment Lexicon: Bi-LSTM-based Method for Building a Korean Sentiment Lexicon (Bi-LSTM 기반의 한국어 감성사전 구축 방안)

  • Park, Sang-Min;Na, Chul-Won;Choi, Min-Seong;Lee, Da-Hee;On, Byung-Won
    • Journal of Intelligence and Information Systems
    • /
    • v.24 no.4
    • /
    • pp.219-240
    • /
    • 2018
  • Sentiment analysis, which is one of the text mining techniques, is a method for extracting subjective content embedded in text documents. Recently, the sentiment analysis methods have been widely used in many fields. As good examples, data-driven surveys are based on analyzing the subjectivity of text data posted by users and market researches are conducted by analyzing users' review posts to quantify users' reputation on a target product. The basic method of sentiment analysis is to use sentiment dictionary (or lexicon), a list of sentiment vocabularies with positive, neutral, or negative semantics. In general, the meaning of many sentiment words is likely to be different across domains. For example, a sentiment word, 'sad' indicates negative meaning in many fields but a movie. In order to perform accurate sentiment analysis, we need to build the sentiment dictionary for a given domain. However, such a method of building the sentiment lexicon is time-consuming and various sentiment vocabularies are not included without the use of general-purpose sentiment lexicon. In order to address this problem, several studies have been carried out to construct the sentiment lexicon suitable for a specific domain based on 'OPEN HANGUL' and 'SentiWordNet', which are general-purpose sentiment lexicons. However, OPEN HANGUL is no longer being serviced and SentiWordNet does not work well because of language difference in the process of converting Korean word into English word. There are restrictions on the use of such general-purpose sentiment lexicons as seed data for building the sentiment lexicon for a specific domain. In this article, we construct 'KNU Korean Sentiment Lexicon (KNU-KSL)', a new general-purpose Korean sentiment dictionary that is more advanced than existing general-purpose lexicons. The proposed dictionary, which is a list of domain-independent sentiment words such as 'thank you', 'worthy', and 'impressed', is built to quickly construct the sentiment dictionary for a target domain. Especially, it constructs sentiment vocabularies by analyzing the glosses contained in Standard Korean Language Dictionary (SKLD) by the following procedures: First, we propose a sentiment classification model based on Bidirectional Long Short-Term Memory (Bi-LSTM). Second, the proposed deep learning model automatically classifies each of glosses to either positive or negative meaning. Third, positive words and phrases are extracted from the glosses classified as positive meaning, while negative words and phrases are extracted from the glosses classified as negative meaning. Our experimental results show that the average accuracy of the proposed sentiment classification model is up to 89.45%. In addition, the sentiment dictionary is more extended using various external sources including SentiWordNet, SenticNet, Emotional Verbs, and Sentiment Lexicon 0603. Furthermore, we add sentiment information about frequently used coined words and emoticons that are used mainly on the Web. The KNU-KSL contains a total of 14,843 sentiment vocabularies, each of which is one of 1-grams, 2-grams, phrases, and sentence patterns. Unlike existing sentiment dictionaries, it is composed of words that are not affected by particular domains. The recent trend on sentiment analysis is to use deep learning technique without sentiment dictionaries. The importance of developing sentiment dictionaries is declined gradually. However, one of recent studies shows that the words in the sentiment dictionary can be used as features of deep learning models, resulting in the sentiment analysis performed with higher accuracy (Teng, Z., 2016). This result indicates that the sentiment dictionary is used not only for sentiment analysis but also as features of deep learning models for improving accuracy. The proposed dictionary can be used as a basic data for constructing the sentiment lexicon of a particular domain and as features of deep learning models. It is also useful to automatically and quickly build large training sets for deep learning models.

A Study on the Tree Surgery Problem and Protection Measures in Monumental Old Trees (천연기념물 노거수 외과수술 문제점 및 보존 관리방안에 관한 연구)

  • Jung, Jong Soo
    • Korean Journal of Heritage: History & Science
    • /
    • v.42 no.1
    • /
    • pp.122-142
    • /
    • 2009
  • This study explored all domestic and international theories for maintenance and health enhancement of an old and big tree, and carried out the anatomical survey of the operation part of the tree toward he current status of domestic surgery and the perception survey of an expert group, and drew out following conclusion through the process of suggesting its reform plan. First, as a result of analyzing the correlation of the 67 subject trees with their ages, growth status. surroundings, it revealed that they were closely related to positional characteristic, damage size, whereas were little related to materials by fillers. Second, the size of the affected part was the most frequent at the bough sheared part under $0.09m^2$, and the hollow size by position(part) was the biggest at 'root + stem' starting from the behind of the main root and stem As a result of analyzing the correlation, the same result was elicited at the group with low correlation. Third, the problem was serious in charging the fillers (especially urethane) in the big hollow or exposed root produced at the behind of the root and stem part, or surface-processing it. The benefit by charging the hollow part was analyzed as not so much. Fourth, the surface-processing of fillers currently used (artificial bark) is mainly 'epoxy+woven fabric+cork', but it is not flexible, so it has brought forth problems of frequent cracks and cracked surface at the joint part with the treetextured part. Fifth, the correlation with the external status of the operated part was very high with the closeness, surface condition, formation of adhesive tissue and internal survey result. Sixth, the most influential thing on flushing by the wrong management of an old and big tree was banking, and a wrong pruning was the source of the ground part damage. In pruning a small bough can easily recover itself from its damage as its formation of adhesive tissue when it is cut by a standard method. Seventh, the parameters affecting the times of related business handling of an old and big tree are 'the need of the conscious reform of the manager and related business'. Eighth, a reform plan in an institutional aspect can include the arrangement of the law and organization of the old and big tree management and preservation at an institutional aspect. This study for preparing a reform plan through the status survey of the designated old and big tree, has a limit inducing a reform plan based on the status survey through individual research, and a weak point suggesting grounds by any statistical data. This can be complemented by subsequent studies.