• Title/Summary/Keyword: 직접평가

Search Result 4,210, Processing Time 0.039 seconds

The Present State of Domestic Acceptance of Various International Conventions for the Prevention of Marine Pollution (해양오염방지를 위한 각종 국제협약의 국내 수용 현황)

  • Kim, Kwang-Soo
    • Journal of the Korean Society of Marine Environment & Safety
    • /
    • v.12 no.4 s.27
    • /
    • pp.293-300
    • /
    • 2006
  • Domestic laws such as the Korea Marine Pollution Prevention Law (KMPPL), which has been made and amended according to the conclusions and amendments of various international conventions for the prevention of marine pollution such as MARPOL 73/78, were reviewed and compared with the major contents of the relevant international conventions. Alternative measures for legislating new laws or amending existing laws such as KMPPL to accept the major contents of existing international conventions were proposed. Annex VI of MARPOL 73/78, in which the regulations for the prevention of air pollution from ships have been adopted, has recently been accepted in KMPPL, which applies to ships as moving sources of air pollution at sea, rather than in the Korea Air Environment Conservation Law, which applies to automobiles and industrial installations on land. The major contents of LC 72/95 have been accepted in KMPPL. However, a few of the substances requiring special care in Annex II of 72LC, a few of the items on the characteristics and composition of matter in the criteria governing the issue of permits for dumping at sea in Annex III of 72LC, and a few of the items among wastes or other matter that may be considered for dumping in Annex I of the 96 Protocol have not yet been accepted in KMPPL. The major contents of OPRC 90 have been accepted in KMPPL. However, oil pollution emergency plans for sea ports and oil handling facilities, and a national contingency plan for preparedness and response, have not yet been accepted in KMPPL. The waste-oil-related articles of the Basel Convention, which regulates and prohibits transboundary movement of hazardous waste, should be accepted in KMPPL in order to prevent the transfer of scrap-purpose tanker ships containing oil/water mixtures and chemicals remaining on board from advanced countries to developing and/or underdeveloped countries. The International Convention for the Control of Harmful Anti-Fouling Systems on Ships should be accepted in KMPPL rather than in the Korea Noxious Chemicals Management Law. The International Convention for Ship's Ballast Water/Sediment Management should be accepted in KMPPL or by a new law in order to protect the domestic marine ecosystem and coastal environment from the invasion of harmful exotic species through the discharge of ships' ballast water.


The Histologic Type of Lung Cancer in Idiopathic Pulmonary Fibrosis : the Difference According to the Presence of Fibrosis at Cancer Location (특발성 폐섬유화증에서 발생한 폐암의 조직형의 특성 : 폐암 위치의 섬유화 유무에 따른 조직형의 차이)

  • Kwon, Sung-Youn;Kim, Deog-Kyeom;Lee, Suk-Young;Yoo, Chul-Gyu;Lee, Choon-Taek;Kim, Young-Whan;Im, Jung-Gi;Shim, Young-Soo;Han, Sung-Koo
    • Tuberculosis and Respiratory Diseases
    • /
    • v.49 no.4
    • /
    • pp.441-452
    • /
    • 2000
  • Background : It is well known that the prevalence of lung cancer is higher in idiopathic pulmonary fibrosis (IPF) patients than in the general population. This high prevalence is explained by the concept of 'scar carcinoma'. There have been several reports on the prevalence of each histologic type of lung cancer in IPF, with conflicting results. Despite the high proportion of smokers in almost all previous reports, none considered the smoking history of patients. Therefore we studied fibrosis-associated lung cancer and smoking-associated lung cancer separately. The purpose of this study is to investigate the proportion of lung cancer in IPF that is fibrosis associated and to determine the most common histologic type of fibrosis-associated lung cancer in IPF. Method : A retrospective review of medical records and radiologic studies was performed for cases of lung cancer with IPF. We investigated smoking history, the sequence of diagnosis of lung cancer and IPF, the histologic type of lung cancer, and the cancer location, especially whether the location was associated with fibrosis. To evaluate the proportion of fibrosis-associated lung cancer, the lung cancers in IPF were categorized according to the presence of fibrosis at the cancer location. Results : Fifty-seven patients were subjects of this analysis. Six (11%) cases were diagnosed as lung cancer during follow-up for IPF, and both diseases were diagnosed simultaneously in the others. Ninety-four percent of patients were smokers and the average smoking amount was $47.1 \pm 21.9$ pack-years. Among the patients with IPF and lung cancer, 42 (80.8%) cases were considered "fibrosis associated". The remainder were "not fibrosis associated" and were probably due to smoking or other causes. Although the most frequent histologic type overall was squamous cell carcinoma, adenocarcinoma was the predominant histologic type in "fibrosis associated lung cancer." Conclusion : Considering the proportion of "not fibrosis associated" lung cancer in the patients with IPF and lung cancer, a significant proportion of lung cancer in IPF may not be fibrosis induced. This may influence the distribution of histologic types of lung cancer in IPF.


Extension Method of Association Rules Using Social Network Analysis (사회연결망 분석을 활용한 연관규칙 확장기법)

  • Lee, Dongwon
    • Journal of Intelligence and Information Systems
    • /
    • v.23 no.4
    • /
    • pp.111-126
    • /
    • 2017
  • Recommender systems based on association rule mining contribute significantly to sellers' sales by reducing the time consumers spend searching for the products they want. Recommendations based on the frequency of transactions such as orders can effectively screen out the products that are statistically marketable among multiple products. A product with a high possibility of sales, however, can be omitted from the recommendations if it records an insufficient number of transactions at the beginning of the sale. Products missing from the associated recommendations may lose the chance of exposure to consumers, which leads to a decline in the number of transactions. In turn, diminished transactions may create a vicious circle of lost opportunity to be recommended. Thus, initial sales are likely to remain stagnant for a certain period of time. Products that are susceptible to fashion or seasonality, such as clothing, may be greatly affected. This study aimed to expand association rules so that the list of recommendations includes products whose initial transaction frequency is low despite their high sales potential. The particular purpose is to predict the strength of the direct connection between two unconnected items from the properties of the paths located between them. An association between two items revealed in transactions can be interpreted as an interaction between them, which can be expressed as a link in a social network whose nodes are items. The first step calculates the centralities of the nodes in the middle of the paths that indirectly connect two nodes that have no direct connection. The next step identifies the number of such paths and the shortest among them. These values are used as independent variables in a regression analysis to predict the future connection strength between the nodes. The strength of the connection between the two nodes of the model, which is defined by the number of nodes between the two nodes, is measured after a certain period of time. The regression analysis results confirm that the number of paths between the two products, the distance of the shortest path, and the number of neighboring items connected to the products are significantly related to their potential strength. This study used actual order transaction data collected over three months, from February to April 2016, from an online commerce company. To reduce the complexity of the analysis as the scale of the network grows, the analysis was performed only on miscellaneous goods. Two consecutively purchased items were chosen from each customer's transactions to obtain a pair of antecedent and consequent, which provides a link for constituting a social network. The direction of the link was determined by the order in which the goods were purchased. The social network of associated items was built from the data excluding the last ten days of the collection period, and the independent variables were extracted from it. The model predicts the number of links to be connected in the next ten days from the explanatory variables. Of the 5,711 previously unconnected links, 611 were newly connected during the last ten days. In experiments, the proposed model demonstrated excellent predictions. Of the 571 links that the proposed model predicted, 269 were confirmed to have been connected. This is 4.4 times the average of 61 that can be found without any prediction model. This study is expected to be useful for industries in which new products launch quickly with short life cycles, since their exposure time is critical. It can also be used to detect diseases that are rarely found in the early stages of medical treatment because of their low incidence. Since the complexity of social network analysis is sensitive to the number of nodes and links that make up the network, this study was conducted on a particular category, miscellaneous goods. Future research should consider that this condition may limit the opportunity to detect unexpected associations between products belonging to different categories.
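
The path-based explanatory variables described in this abstract can be illustrated with a minimal sketch. It assumes a directed item graph built from consecutively purchased items, as the abstract describes; the function names, the path-length cutoff `max_len`, and the toy transactions are hypothetical and are not taken from the paper. The last helper reproduces the abstract's baseline arithmetic (571 predicted links, 611 of 5,711 candidates actually connected).

```python
from collections import defaultdict, deque

def build_item_graph(transactions):
    """Directed item graph: an edge a -> b for each pair of consecutively purchased items."""
    graph = defaultdict(set)
    for items in transactions:
        for a, b in zip(items, items[1:]):
            if a != b:
                graph[a].add(b)
    return graph

def shortest_path_length(graph, src, dst):
    """Breadth-first distance in edges from src to dst, or None if unreachable."""
    seen, queue = {src}, deque([(src, 0)])
    while queue:
        node, dist = queue.popleft()
        if node == dst:
            return dist
        for nxt in graph.get(node, ()):
            if nxt not in seen:
                seen.add(nxt)
                queue.append((nxt, dist + 1))
    return None

def count_simple_paths(graph, src, dst, max_len=3):
    """Number of simple paths from src to dst that use at most max_len edges (assumed cutoff)."""
    def walk(node, visited, depth):
        if node == dst:
            return 1
        if depth == max_len:
            return 0
        return sum(walk(n, visited | {n}, depth + 1)
                   for n in graph.get(node, ()) if n not in visited)
    return walk(src, {src}, 0)

def pair_features(graph, src, dst):
    """Explanatory variables for one currently unconnected item pair."""
    return {
        "n_paths": count_simple_paths(graph, src, dst),
        "shortest": shortest_path_length(graph, src, dst),
        "src_out_degree": len(graph.get(src, ())),
        "dst_in_degree": sum(1 for nbrs in graph.values() if dst in nbrs),
    }

def expected_random_hits(n_predicted, n_positive, n_candidates):
    """Hits expected when n_predicted links are picked at random from the candidates."""
    return n_predicted * n_positive / n_candidates

# Toy purchase sequences (hypothetical data).
toy = [["hat", "scarf", "gloves"], ["hat", "bag", "gloves"], ["scarf", "bag"]]
graph = build_item_graph(toy)
features = pair_features(graph, "hat", "gloves")

baseline = expected_random_hits(571, 611, 5711)   # about 61
lift = 269 / baseline                             # about 4.4
```

In a fuller version these features would feed a regression model; the regression itself is omitted here because the abstract does not specify its form.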

The Usefulness of Product Display of Online Store by the Product Type of Usage Situation - Focusing on the moderate effect of the product portability - (사용상황별 제품유형에 따른 온라인 점포 제품디스플레이의 유용성 - 제품 휴대성의 조절효과를 중심으로 -)

  • Lee, Dong-Il;Choi, Seung-Hoon
    • Journal of Distribution Research
    • /
    • v.16 no.2
    • /
    • pp.1-24
    • /
    • 2011
  • 1. Introduction: In contrast to the offline purchasing environment, an online store cannot offer consumers the sense of touch or direct visual information about its products. So the builder of an online shopping mall should provide more concrete and detailed product information (Kim 2008), and Alba (1997) also predicted that the quality of the offered information is determined by the post-purchase consumer satisfaction. In practice, many fashion and apparel online shopping malls offer pictures of the product on a real-person model to enhance the usefulness of product information. On the other hand, virtual product experience has been suggested as a way of overcoming online consumers' limited perceptual capability (Jiang & Benbasat 2005). However, the adoption and facilitation of virtual reality (VR) tools requires high investment and technical specialty compared to text/picture product information offerings (Shaffer 2006). This could create an entry barrier to online shopping for small retailers, and it could sometimes demand a high level of perceptual effort from consumers. So the expensive technological solution could negatively affect consumer decision-making processes. Nevertheless, most previous research on online product information provision suggests that VR is the more effective tool. 2. Research Model and Hypothesis: The research model suggests that the VR effect could be moderated by the product type according to usage situation. Product types are defined as portable products and installed products, and the information offering types as a still picture of the product, a picture of the product with a real-person model, and VR. 3. Methods and Results: 3.1. Experimental design and measured variables: We designed a 2 (product types) X 3 (product information types) experimental setting and measured dependent variables such as information usefulness, attitude toward the shopping mall, overall product quality, purchase intention, and revisiting intention. Information usefulness and attitude toward the shopping mall were measured with multi-item scales. In the reliability test, the Cronbach's alpha of each variable was greater than 0.6. Thus, we confirmed the internal consistency of the items. 3.2. Manipulation check: The main concern of this study is to verify the moderating effect of the product type by usage situation. The manipulation check indicates that our experimental manipulation of the moderating effect of the product type was successful. 3.3. Results: As the results indicate, the information types had a significant main effect on only one dependent variable (attitude toward the shopping mall). As predicted, VR had the highest mean value compared to the other information types. Thus, H1 was partially supported. However, no main effect of the product types was found. To evaluate H2 and H3, a two-way ANOVA was conducted. As the results indicate, there were interaction effects of the information types and the product types on three dependent variables (information usefulness, overall product quality, and purchase intention). As predicted, the picture of the product with the real-person model had the highest mean among the information types for the portable product. On the other hand, VR had the highest mean among the information types for the installed product. Thus, H2 and H3 were supported. 4. Implications: The present study found a moderating effect of the product type by usage situation. Based on the findings, the following managerial implications are asserted. First, it was found that information types affect only the attitude toward the shopping mall. This finding means that VR effects are not enough for understanding the product itself. Therefore, we must consider when and how to use VR tools. Second, it was found that there are interaction effects on information usefulness, overall product quality, and purchase intention. This finding suggests that considering the usage situation helps consumers understand the product and promotes their purchase intention. In conclusion, online retailers must fully consider not only product attributes but also product usage situations when they want to meet the needs of consumers.
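
The reliability check mentioned in this abstract (Cronbach's alpha above 0.6 for each multi-item scale) can be sketched as follows. This is a minimal illustration of the standard formula $\alpha = \frac{k}{k-1}\left(1 - \frac{\sum_i s_i^2}{s_T^2}\right)$, where $k$ is the number of items, $s_i^2$ the variance of item $i$, and $s_T^2$ the variance of the summed score. The function name and the response matrix are hypothetical and are not the study's data.

```python
from statistics import variance

def cronbach_alpha(responses):
    """Cronbach's alpha for a list of respondents, each a list of k item scores."""
    k = len(responses[0])
    # Sample variance of each item across respondents.
    item_vars = [variance([r[i] for r in responses]) for i in range(k)]
    # Sample variance of each respondent's summed score.
    total_var = variance([sum(r) for r in responses])
    return k / (k - 1) * (1 - sum(item_vars) / total_var)

# Hypothetical 5 respondents x 3 items on a 5-point scale.
scores = [[4, 5, 4], [3, 3, 2], [5, 5, 5], [2, 3, 2], [4, 4, 5]]
alpha = cronbach_alpha(scores)
passes_threshold = alpha > 0.6   # the cut-off reported in the abstract
```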

  • Seasonal Variation of Water Quality in a Shallow Eutrophic Reservoir (얕은 부영양 저수지의 육수학적 특성-계절에 따른 수질변화)

    • Kim, Ho-Sub;Hwang, Soon-Jin
      • Korean Journal of Ecology and Environment
      • /
      • v.37 no.2 s.107
      • /
      • pp.180-192
      • /
      • 2004
    • This study was carried out to assess the seasonal variation of water quality and the effect of pollutant loading from the watershed in a shallow eutrophic reservoir (Shingu reservoir) from November 2002 to February 2004. A stable thermocline, greater than $1^{\circ}C$ per meter of water depth, formed in May, and low DO concentration (< 2 mg $O_2\;L^{-1}$) was observed in the hypolimnion from May to September 2003. The ratio of euphotic depth to mixing depth ($Z_{eu}/Z_{m}$) ranged 0.2 ${\sim}$ 1.1, and the depth of the mixed layer exceeded that of the photic layer during the study period, except for May when $Z_{eu}$ and $Z_{m}$ were 4 and 4.3 m, respectively. Inorganic nitrogen accounted for most of the total nitrogen (Avg. 58.7%), which ranged 1.1 ${\sim}$ 4.5 ${\mu}g\;N\;L^{-1}$, and a sharp increase of $NH_3$-N and $NO_3$-N was evident during the spring season. TP concentration in the water column ranged 43.9 ${\sim}$ 126.5 ${\mu}g\;P\;L^{-1}$, and POP accounted for most of the TP in the water column (Avg. 80%). During the study period, DIP concentration in the water column was < 10 ${\mu}g\;P\;L^{-1}$ except for July and August, when DIP concentration in the hypolimnion was 22.3 and 56.7 ${\mu}g\;P\;L^{-1}$, respectively. Chl. a concentration increased in July (99 ${\mu}g\;L^{-1}$) and November 2003 (109 ${\mu}g\;L^{-1}$), when P loading through the two inflows was high, and showed a close relationship with TP concentration (r = 0.55, P < 0.008, n = 22). Mean Chl. a concentration ranged from 13.5 to 84.5 ${\mu}g\;L^{-1}$ in the water column, and the lowest and highest concentrations were observed in February 2004 (13.5 ${\pm}$ 1.0 ${\mu}g\;L^{-1}$) and November 2003 (84.5 ${\pm}$ 29.0 ${\mu}g\;L^{-1}$), respectively. TP concentration in inflow water increased with discharge (r = 0.69, P < 0.001), and 40.5% of the annual total P loading was introduced on 25 July, when there was heavy rainfall. Annual total P loading from the watershed was 159.0 kg P $yr^{-1}$, and DIP loading was 126.3 kg P $yr^{-1}$ (77.7% of TP loading). The loading of TN (5.0 ton $yr^{-1}$) was 30 times higher than that of TP (159.0 kg P $yr^{-1}$), and 78% of TN was in the form of inorganic nitrogen, 3.9 ton $yr^{-1}$ in mass. P loading in Shingu reservoir was 1.6 g ${\cdot}$ $m^{-2}$ ${\cdot}$ $yr^{-1}$, which exceeded the excessive critical loading of the Vollenweider-OECD critical loading model. The results of this study indicate that P loading from the watershed was the major factor causing eutrophication and temporal variation of water quality in Shingu reservoir. A decrease of 71% in TP loading (159 kg $yr^{-1}$) is necessary for improvement to a mesotrophic level. Management of the sediment, where the anaerobic condition was evident in summer and P that can be utilized by existing algae may therefore be released, may also be considered.
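
The loading figures quoted in this abstract can be cross-checked with simple arithmetic. The sketch below uses only numbers stated in the abstract as inputs; the function name is hypothetical, and the "implied area" is a derived quantity that assumes areal loading equals annual load divided by reservoir surface area, which the abstract does not state explicitly.

```python
def phosphorus_budget(tp_kg=159.0, tn_kg=5000.0, inorganic_n_kg=3900.0,
                      areal_p_g_m2=1.6, tp_cut=0.71):
    """Derived ratios and targets from the annual loads reported in the abstract."""
    return {
        # TN load relative to TP load (the abstract rounds this to "30 times").
        "tn_to_tp": tn_kg / tp_kg,
        # Share of TN load that is inorganic nitrogen (3.9 of 5.0 ton).
        "inorganic_n_share": inorganic_n_kg / tn_kg,
        # TP load remaining after the recommended 71% reduction.
        "target_tp_kg": tp_kg * (1 - tp_cut),
        # Areal P loading after the same reduction.
        "target_areal_p_g_m2": areal_p_g_m2 * (1 - tp_cut),
        # Assumption: areal loading = annual load / surface area.
        "implied_area_m2": tp_kg * 1000 / areal_p_g_m2,
    }

budget = phosphorus_budget()
```

Under these inputs the 71% reduction corresponds to a target of roughly 46 kg P per year, or about 0.46 g per square meter per year.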

    The Definition of Outer Space and the Air/Outer Space Boundary Question (우주의 법적 지위와 경계획정 문제)

    • Lee, Young-Jin
      • The Korean Journal of Air & Space Law and Policy
      • /
      • v.30 no.2
      • /
      • pp.427-468
      • /
      • 2015
    • To date, we have considered the theoretical views, the standpoints of states, and the discourse within the international community, such as the UN Committee on the Peaceful Uses of Outer Space (COPUOS), regarding the Air/Outer Space Boundary Question, which is one of the first issues of UN COPUOS and marks the starting point of the outer space area. As mentioned above, discussions in the United Nations and among scholars within each state regarding the delimitation issue have often seen a division between those in favor of a functional approach (the functionalists) and those seeking the delineation of a boundary (the spatialists). The spatialists emphasize that the boundary between air and outer space should be delimited because outer space has the status of a type of public domain from which sovereign jurisdiction is excluded, as stated in Article 2 of the Outer Space Treaty. In contrast, Article 1 of the Chicago Convention is evidence of the acknowledgement of sovereignty over airspace as international customary law, the binding force of which exists independently of the Convention. The functionalists, backed initially by the major space powers, which viewed any boundary demarcation as possibly restricting their access to space, whether for peaceful or non-military purposes, considered it insufficient or inadequate to delimit a boundary of outer space without obvious scientific and technological evidence. Over the last 50 years and more there have been large developments in the exploration and use of outer space. But a large number of states, including those taking the functionalist view, have taken a negative attitude. As the element of location is a decisive factor in the choice of the legal regime to be applied, a purely functional approach to the regulation of activities in the space above the Earth does not offer a solution. It is therefore welcome that there is clear evidence of growing recognition of, and national practice concerning, a spatial approach to the problem, which is gaining support from a large number of States as well as from publicists. The search for a solution to the problem of demarcating the two different legal regimes governing the space above Earth has undoubtedly been facilitated, and a number of countries including Russia have already advocated accepting the lowest perigee boundary of outer space at a height of 100 km. As a matter of fact, the lowest perigee at which space objects are still able to continue orbiting the earth has already been imposed as a natural criterion for the delimitation of outer space. This delimitation of outer space has also been evidenced by the constant practice of a large number of States and their tacit consent to space activities accomplished so far at this distance and beyond it. Of course, there are still numerous opposing views on the delineation of an outer space boundary among space powers such as the U.S.A., England, and France. Therefore, to solve the legal issues faced by the international community in outer space activities, such as the delimitation problem, a positive and peaceful will for international cooperation is needed first of all. From this viewpoint, President John F. Kennedy once described the rationale behind outer space activities in his famous "Moon speech" given at Rice University in 1962. He called upon Americans and all mankind to strive for peaceful cooperation and coexistence in our future outer space activities. And Kennedy explained, "There is no strife, ... nor any international conflict in outer space as yet. But its hazards are hostile to us all: Its conquest deserves the best of all mankind, and its opportunity for peaceful cooperation may never come again." This speech still seems to offer us, in the contemporary era, ample suggestions for further peaceful cooperation in outer space activities, including the delimitation of outer space.

    A Study on the Differences of Information Diffusion Based on the Type of Media and Information (매체와 정보유형에 따른 정보확산 차이에 대한 연구)

    • Lee, Sang-Gun;Kim, Jin-Hwa;Baek, Heon;Lee, Eui-Bang
      • Journal of Intelligence and Information Systems
      • /
      • v.19 no.4
      • /
      • pp.133-146
      • /
      • 2013
    • Now that use of the internet is routine, users receive and share information through a variety of media. Through the use of the internet, information delivery media are diversifying from traditional one-way communication media, such as newspapers, TV, and radio, into two-way communication media. In contrast to traditional media, blogs enable individuals to directly upload and share news, so their speed of information diffusion can be expected to differ from that of news media that convey information unilaterally. Therefore this study focused on the difference between online news and social media blogs. Moreover, there are variations in the speed of information diffusion because information closely related to a person boosts communication between individuals. We believe that users' standard of evaluation changes based on the type of information. Likewise, the speed of information diffusion changes based on the level of proximity. Therefore, the purpose of this study is to examine the differences in information diffusion based on the type of media. Information is then segmented, and we examine how information diffusion differs based on the type of information. This study used the Bass diffusion model, which has been frequently used because it has higher explanatory power than other models, explaining the diffusion of a market through an innovation effect and an imitation effect. This model has also been applied in many other studies related to information diffusion. The innovation effect measures the early-stage impact, while the imitation effect measures the impact of word of mouth at the later stage. According to Mahajan et al. (2000), the innovation effect is emphasized by usefulness and ease of use, and the imitation effect is emphasized by subjective norm and word of mouth. Also, according to Lee et al. (2011), the innovation effect is emphasized by mass communication. According to Moore and Benbasat (1996), the innovation effect is emphasized by relative advantage. This is because the imitation effect is driven by within-group influences, while the innovation effect is driven by the innovation of the product or service. Therefore, our study compared online news and social media blogs to examine the differences between media. We also chose different types of information, including entertainment-related information ("Psy Gentleman"), current affairs news ("Earthquake in Sichuan, China"), and product-related information ("Galaxy S4"), in order to examine the variations in information diffusion. We considered that users' information proximity changes based on the type of information. Hence, we chose the three types of information mentioned above, which have different levels of proximity from the users' standpoint, in order to examine the flow of information diffusion. The first conclusion of this study is that different media have a similar effect on information diffusion, even when the types of media of the information providers are different. Information diffusion was distinguished only by differences in the proximity of information. Second, information diffusion differs based on the type of information. From the standpoint of users, product and entertainment related information has a high imitation effect because of word of mouth. On the other hand, imitation effect dominates innovation effect on current affairs news. From the results of this study, changes in the flow of information diffusion were examined and can be applied to practical use. This study has some limitations, and those limitations provide opportunities and suggestions for future research. The differences in information diffusion according to media and proximity presented here are difficult to generalize into theory because of the small sample size. Therefore, if further studies increase the sample size and media diversity, the differences in information diffusion according to media type and information proximity could be understood in more detail.
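
The innovation and imitation effects of the Bass diffusion model referred to in this abstract can be illustrated with a minimal discrete-time sketch. In the standard formulation, new adoptions in a period are $n_t = \left(p + q\,\frac{N_{t-1}}{m}\right)(m - N_{t-1})$, where $p$ is the innovation coefficient, $q$ the imitation coefficient, $m$ the market potential, and $N_{t-1}$ cumulative adoptions so far. The function name and the parameter values below are illustrative assumptions, not estimates from the study.

```python
def bass_adoptions(p, q, m, periods):
    """Discrete Bass model: per-period (innovation, imitation) adoptions.

    p: innovation coefficient, q: imitation coefficient, m: market potential.
    """
    cumulative = 0.0
    history = []
    for _ in range(periods):
        remaining = m - cumulative
        innovation = p * remaining                   # adoption driven by external influence
        imitation = q * cumulative / m * remaining   # adoption driven by word of mouth
        history.append((innovation, imitation))
        cumulative += innovation + imitation
    return history

# Hypothetical parameters: imitation stronger than innovation.
history = bass_adoptions(p=0.03, q=0.38, m=1000.0, periods=20)
totals = [a + b for a, b in history]
peak_period = max(range(len(totals)), key=totals.__getitem__)
```

With $q > p$, adoptions rise to a later peak as word of mouth builds; with $q = 0$, adoptions are highest in the first period and decline afterwards.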

    A Study of Guidelines for Genetic Counseling in Preimplantation Genetic Diagnosis (PGD) (착상전 유전진단을 위한 유전상담 현황과 지침개발을 위한 기초 연구)

    • Kim, Min-Jee;Lee, Hyoung-Song;Kang, Inn-Soo;Jeong, Seon-Yong;Kim, Hyon-J.
      • Journal of Genetic Medicine
      • /
      • v.7 no.2
      • /
      • pp.125-132
      • /
      • 2010
    • Purpose: Preimplantation genetic diagnosis (PGD), also known as embryo screening, is a pre-pregnancy technique used to identify genetic defects in embryos created through in vitro fertilization. PGD is considered a means of prenatal diagnosis of genetic abnormalities. PGD is used when one or both genetic parents have a known genetic abnormality; testing is performed on an embryo to determine if it also carries the genetic abnormality. The main advantage of PGD is the avoidance of selective pregnancy termination, as it imparts a high likelihood that the baby will be free of the disease under consideration. The application of PGD to genetic practices, reproductive medicine, and genetic counseling is becoming the key component of fertility practice because of the need to develop a custom PGD design for each couple. Materials and Methods: In this study, a survey on the contents of genetic counseling in PGD was carried out via direct contact or e-mail with the patients and specialists who had experienced PGD during the three months from February to April 2010. Results: A total of 91 persons, including 60 patients (49 of whom had a chromosomal disorder and 11 of whom had a single gene disorder) and 31 PGD specialists, responded to the survey. Analysis of the survey results revealed that all respondents were well aware of the importance of genetic counseling in all steps of PGD, including planning, operation, and follow-up. The patient group responded that the possibility of unexpected results (51.7%), genetic risk assessment and recurrence risk (46.7%), the reproduction options (46.7%), the procedure and limitations of PGD (43.3%), and information on PGD technology (35.0%) should be included in genetic counseling. In detail, 51.7% of patients wanted to be counseled on the possibility of unexpected results and the recurrence risk, while 46.7% wanted to know their reproduction options. Approximately 96.7% of specialists replied that a non-M.D. genetic counselor is necessary for effective and systematic genetic counseling in PGD because it is difficult for physicians to offer satisfying information to patients due to a lack of counseling time and of specific knowledge of the disorders. Conclusions: The information from the survey provides important insight into the overall present situation of genetic counseling for PGD in Korea. The survey results demonstrated that there is a general awareness that genetic counseling is essential for PGD, suggesting that appropriate genetic counseling may play an important role in the success of PGD. The establishment of genetic counseling guidelines for PGD may contribute to better planning and management strategies for PGD.

    A Case Study on the Effective Liquid Manure Treatment System in Pig Farms (양돈농가의 돈분뇨 액비화 처리 우수사례 실태조사)

    • Kim, Soo-Ryang;Jeon, Sang-Joon;Hong, In-Gi;Kim, Dong-Kyun;Lee, Myung-Gyu
      • Journal of Animal Environmental Science
      • /
      • v.18 no.2
      • /
      • pp.99-110
      • /
      • 2012
    • The purpose of this study is to collect basic data for establishing standard administrative processes for liquid fertilizer treatment. From this survey we identified the key points of each step through a case of an effective liquid manure treatment system at a pig farm. The system is divided into six steps: 1. piggery slurry management, 2. solid-liquid separation, 3. liquid fertilizer treatment (aeration), 4. liquid fertilizer treatment (microorganism, recirculation and internal return), 5. liquid fertilizer treatment (completion), 6. land application. Going forward, a standardization process for liquid manure treatment technologies needs to be developed based on this six-step process.

    Development of Information Extraction System from Multi Source Unstructured Documents for Knowledge Base Expansion (지식베이스 확장을 위한 멀티소스 비정형 문서에서의 정보 추출 시스템의 개발)

    • Choi, Hyunseung;Kim, Mintae;Kim, Wooju;Shin, Dongwook;Lee, Yong Hun
      • Journal of Intelligence and Information Systems
      • /
      • v.24 no.4
      • /
      • pp.111-136
      • /
      • 2018
    • In this paper, we propose a methodology for extracting answer information for queries from various types of unstructured documents collected from multiple sources on the web in order to expand a knowledge base. The proposed methodology is divided into the following steps. 1) Collect relevant documents from Wikipedia, Naver encyclopedia, and Naver news for queries separated into "subject-predicate" form, and classify the suitable documents. 2) Determine whether each sentence is suitable for extracting information and derive the confidence. 3) Based on the predicate feature, extract the information from the suitable sentence and derive the overall confidence of the information extraction result. To evaluate the performance of the information extraction system, we selected 400 queries from the artificial intelligence speaker of SK Telecom. The proposed model was confirmed to show a higher performance index than the baseline model. The contribution of this study is that we developed a sequence tagging model based on bi-directional LSTM-CRF using the predicate feature of the query, and with it a robust model that can maintain high recall even on various types of unstructured documents collected from multiple sources. The problem of information extraction for knowledge base expansion should take into account the heterogeneous characteristics of source-specific document types. The proposed methodology proved to extract information effectively from various types of unstructured documents compared to the baseline model. Previous research has the limitation that performance is poor when extracting information from a document type that differs from the training data. In addition, this study can prevent unnecessary information extraction attempts on documents that do not include the answer information, through a process that predicts the suitability of documents and sentences for information extraction before the extraction step. It is meaningful that we provide a method by which precision can be maintained even in an actual web environment. The information extraction problem for knowledge base expansion cannot guarantee that a document includes the correct answer, because it targets unstructured documents existing on the real web. When question answering is performed on the real web, previous machine reading comprehension studies have the limitation of low precision, because they frequently attempt to extract an answer even from a document that contains no correct answer. The policy of predicting the suitability of documents and sentences for information extraction is meaningful in that it contributes to maintaining the performance of information extraction even in a real web environment. The limitations of this study and future research directions are as follows. The first is a problem related to data preprocessing. In this study, the unit of knowledge extraction is determined through morphological analysis based on the open source KoNLPy Python package, and the information extraction result can be incorrect when morphological analysis is not performed properly. To enhance the performance of information extraction, it is necessary to develop an advanced morpheme analyzer. The second is the problem of entity ambiguity. The information extraction system of this study cannot distinguish identical names that refer to different entities. If several people with the same name appear in the news, the system may not extract information about the one intended by the query. In future research, it is necessary to take measures to distinguish people with the same name. The third is a problem of the evaluation query data. In this study, we selected 400 user queries collected from SK Telecom's interactive artificial intelligence speaker to evaluate the performance of the information extraction system. We developed an evaluation data set using 2,800 documents (400 questions * 7 articles per question (1 Wikipedia, 3 Naver encyclopedia, 3 Naver news)) by judging whether a correct answer is included or not. To ensure the external validity of the study, it is desirable to use more queries to determine the performance of the system. This is a costly activity that must be done manually. Future research needs to evaluate the system on more queries. It is also necessary to develop a Korean benchmark data set for information extraction systems that answer queries from multi-source web documents, in order to build an environment in which the results can be evaluated more objectively.
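
The three-step flow described in this abstract (document suitability, sentence suitability, then extraction with an overall confidence) can be sketched as a gated pipeline. Everything below is a hypothetical illustration: the scoring functions are toy keyword and regex stand-ins for the paper's classifiers and bi-directional LSTM-CRF tagger, the thresholds are arbitrary, and combining the three confidences by multiplication is an assumption, since the abstract does not say how the overall confidence is derived.

```python
import re

def extract_answer(documents, doc_score, sent_score, extract,
                   doc_min=0.5, sent_min=0.5):
    """Return (answer, confidence) for the best candidate, or None if nothing passes the gates."""
    best = None
    for doc in documents:
        d = doc_score(doc)
        if d < doc_min:                  # gate 1: skip unsuitable documents
            continue
        for sent in doc.split(". "):
            s = sent_score(sent)
            if s < sent_min:             # gate 2: skip unsuitable sentences
                continue
            answer, e = extract(sent)    # step 3: extract from a suitable sentence
            if answer is None:
                continue
            conf = d * s * e             # assumed combination rule
            if best is None or conf > best[1]:
                best = (answer, conf)
    return best

# Toy stand-ins for a "subject-predicate" query (hypothetical).
SUBJECT, PREDICATE = "Foo Tower", "height"

def toy_doc_score(doc):
    return 1.0 if SUBJECT in doc else 0.0

def toy_sent_score(sent):
    return 0.9 if PREDICATE in sent else 0.1

def toy_extract(sent):
    match = re.search(r"\d+", sent)
    return (match.group(), 0.8) if match else (None, 0.0)

docs = ["Foo Tower opened recently. The height of Foo Tower is 123 m",
        "Bar Bridge is long"]
result = extract_answer(docs, toy_doc_score, toy_sent_score, toy_extract)
no_answer = extract_answer(["Bar Bridge is long"], toy_doc_score,
                           toy_sent_score, toy_extract)
```

The point of the gates is that a document or sentence judged unsuitable never reaches the extractor, so the system can return no answer instead of a low-precision guess.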

