Construction of Event Networks from Large News Data Using Text Mining Techniques (텍스트 마이닝 기법을 적용한 뉴스 데이터에서의 사건 네트워크 구축)
-
- Journal of Intelligence and Information Systems
- /
- v.24 no.1
- /
- pp.183-203
- /
- 2018
News articles are the most suitable medium for examining the events occurring at home and abroad. Especially, as the development of information and communication technology has brought various kinds of online news media, the news about the events occurring in society has increased greatly. So automatically summarizing key events from massive amounts of news data will help users to look at many of the events at a glance. In addition, if we build and provide an event network based on the relevance of events, it will be able to greatly help the reader in understanding the current events. In this study, we propose a method for extracting event networks from large news text data. To this end, we first collected Korean political and social articles from March 2016 to March 2017, and integrated the synonyms by leaving only meaningful words through preprocessing using NPMI and Word2Vec. Latent Dirichlet allocation (LDA) topic modeling was used to calculate the subject distribution by date and to find the peak of the subject distribution and to detect the event. A total of 32 topics were extracted from the topic modeling, and the point of occurrence of the event was deduced by looking at the point at which each subject distribution surged. As a result, a total of 85 events were detected, but the final 16 events were filtered and presented using the Gaussian smoothing technique. We also calculated the relevance score between events detected to construct the event network. Using the cosine coefficient between the co-occurred events, we calculated the relevance between the events and connected the events to construct the event network. Finally, we set up the event network by setting each event to each vertex and the relevance score between events to the vertices connecting the vertices. The event network constructed in our methods helped us to sort out major events in the political and social fields in Korea that occurred in the last one year in chronological order and at the same time identify which events are related to certain events. Our approach differs from existing event detection methods in that LDA topic modeling makes it possible to easily analyze large amounts of data and to identify the relevance of events that were difficult to detect in existing event detection. We applied various text mining techniques and Word2vec technique in the text preprocessing to improve the accuracy of the extraction of proper nouns and synthetic nouns, which have been difficult in analyzing existing Korean texts, can be found. In this study, the detection and network configuration techniques of the event have the following advantages in practical application. First, LDA topic modeling, which is unsupervised learning, can easily analyze subject and topic words and distribution from huge amount of data. Also, by using the date information of the collected news articles, it is possible to express the distribution by topic in a time series. Second, we can find out the connection of events in the form of present and summarized form by calculating relevance score and constructing event network by using simultaneous occurrence of topics that are difficult to grasp in existing event detection. It can be seen from the fact that the inter-event relevance-based event network proposed in this study was actually constructed in order of occurrence time. It is also possible to identify what happened as a starting point for a series of events through the event network. The limitation of this study is that the characteristics of LDA topic modeling have different results according to the initial parameters and the number of subjects, and the subject and event name of the analysis result should be given by the subjective judgment of the researcher. Also, since each topic is assumed to be exclusive and independent, it does not take into account the relevance between themes. Subsequent studies need to calculate the relevance between events that are not covered in this study or those that belong to the same subject.
This study aimed at contributing to the improvement of cropping systems after finding out the effects of excrements and components of crop root influence on other crops as well as themselves. The following forage crops suitable for our country were selected for the present study. Aqueous extracts of fresh roots, aqueous extracts of rotting roots and aqueous solutions of excrements of red clover, orchard grass and brome grass were studied for the effects influencing the germination and growth of seedlings of red clover, ladino clover, lespedeza, soybean, orchard grass, Italian ryegrass, brome grass, barley, wheat, sorghum, corn and Hog-millet. In view of the possibility that the organic acid might be closely related to the excrements and components of crop root connected with soil sickness, the acid components of three species of roots were analysed by paper chromatography and gas chromatography method. The following results were obtained: 1. Effects of Aqueous Extracts of Fresh Roots : Aqueous extracts of red clover: The extracts inhibited the growth of seedlings of the ladino clover and lespedeza and also inhibited the development of most crops except that of sorghum among the Graminaceae. Aqueous extracts of orchard grass: The extracts promoted the seedlings growth of red clover and soybean, while it inhibited the germination and growth of orchard grass. There were no noticeable effects influencing other crops while it inhibited the growth of barley and Hog-millet. Aqueous extracts of brome grass: There was no effect on Italian ryegrass but there was an inhibiting effect on the other crops. 2. Effects of Aqueous Extracts of Rotting Roots : Aqueous extracts of red clover: The extracts promoted the seedling growth of red clover. But it reflected the inhibiting effects on other crops except sorghum. Aqueous extracts of orchard grass: The extracts promoted the growth of red clover, ladino clover, soybean and sorghun, while it inhibited the germination and rooting of barley and Hog-millet. Aqueous extracts of brome grass: The extracts gave the promotive effects to the growth of red clover, soybean and sorghum, but caused inhibiting effects on orchard grass, brome grass, barley and Hog-millet. 3. Effects of Aqueous Solutions of Excrements : The aqueous solution of excrements of red clover reflected the inhibition effects to the growth of Graminaceae, while the aqueous solutions of excrements of orchard grass and Italian ryegrass caused the promotive effects on the growth of red clover. 4. Results of Organic Acid Analysis : The oxalic acid, citric acid, tartaric acid, malonic acid, malic acid and succinic acid were included in the roots of red clover as unvolatile organic acid, and in the orchard grass and brome grass there were included the oxalic acid, citric acid, tartaric acid and malic acid. And formic acid was confirmed in the red clover, orchard grass and brome grass as volatile organic acid. In consideration of the results mentioned in above the effects of excrements and components of roots found in this studies may be summarized as follows. 1) The red clover generally gave a disadvantageous effect on the Graminaceae. Such trend was considered chiefly caused by the presence of many organic acids, namely oxalic, citric, tartaric, malonic, malic, succinic and formic acid. 2) The orchard grass generally gave an advantageous effect on the Leguminosae. This may be due to a few kinds of organic acid contained in the root, namely oxalic, citric, tartaric, malic and formic acid. Furthermore a certain of promotive materials for growth was noted. 3) As long as the root of brome grass are not rotten, it gave a disadvantageous effect on the Leguminosae and Graminaceae. This may be due to the fact that several unidentified volatile organic acid were also included besides the confirmed organic acid, namely oxalic, citric, tartaric, malic and formic acid. 5. Effects of Components in Roots to the Soil Sickness : 1) It was considered that the cause of alleged red clover's soil sickness did not result from the toxic components of the roots. 2) It was recognized that the toxic components of roots might be the cause of soil sickness in case the orchard grass and brome grass were put into the long-term single cropping. 6. Effects of Rooted Components to the Companion Crops in the Cropping System : a) In case of aqueous extracts of fresh roots and aqueous excrements (Inter cropping and mixed cropping) : 1) Advantageous combinations : Orchard grass->Red clover, Soybean, Italian ryegrass->Red clover, 2) Disadvantageous combinations : Red clover->Ladino clover, Lespedeza, Orchard grass, Italian ryegrass, Fescue Ky-31, Brome grass, Barley, Wheat, Corn and Hog.millet, Orchard grass->Lespedeza, Orchard grass, Barley and Hog-millet, Brome grass->Red clover, Ladino clover, Lespedeza, Soybean, Orchard grass, Brome grass, Barley, Wheat, Sorghum, Corn and Hog-millet, 3) Harmless combinations : Red clover->Red clover, Soybean and Sorghum, Orchard grass->Ladino clover, Italian ryegrass, Brome grass, Wheat, Sorghum and Corn, Brome grass->Italian ryegrass, b) In case of aquecus extracts of rotting roots(After cropping) : 1) Advantageous combinations : Red clover->Red clover and Sorghum, Orchard grass->Red clover, Ladino clover, Soybean, Sorghum, and Corn, Brome grass->Red clover, Soybean and Sorghum, 2) Disadvantageous combinations : Red clover->Lespedeza, Orchard grass, Italian ryegrass, Brome grass, Barley, Wheat, and Hog-millet Orchard grass->Barley and Hog-millet, Brome grass->Orchard grass, Brome grass, Barley and Hog-millet, 3) Harmless combinations : Red clover->Ladino clover, Soybean and Corn, Orchard grass->Lespedeza, Orchard grass, Italian ryegrass, Brome grass and Wheat Brome gass->Ladino clover, Lespedeza, Italian ryegrass and Wheat.
The incidence of globally infectious and pathogenic diseases such as H1N1 (swine flu) and Avian Influenza (AI) has recently increased. An infectious disease is a pathogen-caused disease, which can be passed from the infected person to the susceptible host. Pathogens of infectious diseases, which are bacillus, spirochaeta, rickettsia, virus, fungus, and parasite, etc., cause various symptoms such as respiratory disease, gastrointestinal disease, liver disease, and acute febrile illness. They can be spread through various means such as food, water, insect, breathing and contact with other persons. Recently, most countries around the world use a mathematical model to predict and prepare for the spread of infectious diseases. In a modern society, however, infectious diseases are spread in a fast and complicated manner because of rapid development of transportation (both ground and underground). Therefore, we do not have enough time to predict the fast spreading and complicated infectious diseases. Therefore, new system, which can prevent the spread of infectious diseases by predicting its pathway, needs to be developed. In this study, to solve this kind of problem, an integrated monitoring system, which can track and predict the pathway of infectious diseases for its realtime monitoring and control, is developed. This system is implemented based on the conventional mathematical model called by 'Susceptible-Infectious-Recovered (SIR) Model.' The proposed model has characteristics that both inter- and intra-city modes of transportation to express interpersonal contact (i.e., migration flow) are considered. They include the means of transportation such as bus, train, car and airplane. Also, modified real data according to the geographical characteristics of Korea are employed to reflect realistic circumstances of possible disease spreading in Korea. We can predict where and when vaccination needs to be performed by parameters control in this model. The simulation includes several assumptions and scenarios. Using the data of Statistics Korea, five major cities, which are assumed to have the most population migration have been chosen; Seoul, Incheon (Incheon International Airport), Gangneung, Pyeongchang and Wonju. It was assumed that the cities were connected in one network, and infectious disease was spread through denoted transportation methods only. In terms of traffic volume, daily traffic volume was obtained from Korean Statistical Information Service (KOSIS). In addition, the population of each city was acquired from Statistics Korea. Moreover, data on H1N1 (swine flu) were provided by Korea Centers for Disease Control and Prevention, and air transport statistics were obtained from Aeronautical Information Portal System. As mentioned above, daily traffic volume, population statistics, H1N1 (swine flu) and air transport statistics data have been adjusted in consideration of the current conditions in Korea and several realistic assumptions and scenarios. Three scenarios (occurrence of H1N1 in Incheon International Airport, not-vaccinated in all cities and vaccinated in Seoul and Pyeongchang respectively) were simulated, and the number of days taken for the number of the infected to reach its peak and proportion of Infectious (I) were compared. According to the simulation, the number of days was the fastest in Seoul with 37 days and the slowest in Pyeongchang with 43 days when vaccination was not considered. In terms of the proportion of I, Seoul was the highest while Pyeongchang was the lowest. When they were vaccinated in Seoul, the number of days taken for the number of the infected to reach at its peak was the fastest in Seoul with 37 days and the slowest in Pyeongchang with 43 days. In terms of the proportion of I, Gangneung was the highest while Pyeongchang was the lowest. When they were vaccinated in Pyeongchang, the number of days was the fastest in Seoul with 37 days and the slowest in Pyeongchang with 43 days. In terms of the proportion of I, Gangneung was the highest while Pyeongchang was the lowest. Based on the results above, it has been confirmed that H1N1, upon the first occurrence, is proportionally spread by the traffic volume in each city. Because the infection pathway is different by the traffic volume in each city, therefore, it is possible to come up with a preventive measurement against infectious disease by tracking and predicting its pathway through the analysis of traffic volume.
The wall shear stress in the vicinity of end-to end anastomoses under steady flow conditions was measured using a flush-mounted hot-film anemometer(FMHFA) probe. The experimental measurements were in good agreement with numerical results except in flow with low Reynolds numbers. The wall shear stress increased proximal to the anastomosis in flow from the Penrose tubing (simulating an artery) to the PTFE: graft. In flow from the PTFE graft to the Penrose tubing, low wall shear stress was observed distal to the anastomosis. Abnormal distributions of wall shear stress in the vicinity of the anastomosis, resulting from the compliance mismatch between the graft and the host artery, might be an important factor of ANFH formation and the graft failure. The present study suggests a correlation between regions of the low wall shear stress and the development of anastomotic neointimal fibrous hyperplasia(ANPH) in end-to-end anastomoses. 30523 T00401030523 ^x Air pressure decay(APD) rate and ultrafiltration rate(UFR) tests were performed on new and saline rinsed dialyzers as well as those roused in patients several times. C-DAK 4000 (Cordis Dow) and CF IS-11 (Baxter Travenol) reused dialyzers obtained from the dialysis clinic were used in the present study. The new dialyzers exhibited a relatively flat APD, whereas saline rinsed and reused dialyzers showed considerable amount of decay. C-DAH dialyzers had a larger APD(11.70
The wall shear stress in the vicinity of end-to end anastomoses under steady flow conditions was measured using a flush-mounted hot-film anemometer(FMHFA) probe. The experimental measurements were in good agreement with numerical results except in flow with low Reynolds numbers. The wall shear stress increased proximal to the anastomosis in flow from the Penrose tubing (simulating an artery) to the PTFE: graft. In flow from the PTFE graft to the Penrose tubing, low wall shear stress was observed distal to the anastomosis. Abnormal distributions of wall shear stress in the vicinity of the anastomosis, resulting from the compliance mismatch between the graft and the host artery, might be an important factor of ANFH formation and the graft failure. The present study suggests a correlation between regions of the low wall shear stress and the development of anastomotic neointimal fibrous hyperplasia(ANPH) in end-to-end anastomoses. 30523 T00401030523 ^x Air pressure decay(APD) rate and ultrafiltration rate(UFR) tests were performed on new and saline rinsed dialyzers as well as those roused in patients several times. C-DAK 4000 (Cordis Dow) and CF IS-11 (Baxter Travenol) reused dialyzers obtained from the dialysis clinic were used in the present study. The new dialyzers exhibited a relatively flat APD, whereas saline rinsed and reused dialyzers showed considerable amount of decay. C-DAH dialyzers had a larger APD(11.70