• Title/Summary/Keyword: Synthetic Data

Search Result 1,437, Processing Time 0.034 seconds

Construction of Event Networks from Large News Data Using Text Mining Techniques (텍스트 마이닝 기법을 적용한 뉴스 데이터에서의 사건 네트워크 구축)

  • Lee, Minchul;Kim, Hea-Jin
    • Journal of Intelligence and Information Systems
    • /
    • v.24 no.1
    • /
    • pp.183-203
    • /
    • 2018
  • News articles are the most suitable medium for examining the events occurring at home and abroad. Especially, as the development of information and communication technology has brought various kinds of online news media, the news about the events occurring in society has increased greatly. So automatically summarizing key events from massive amounts of news data will help users to look at many of the events at a glance. In addition, if we build and provide an event network based on the relevance of events, it will be able to greatly help the reader in understanding the current events. In this study, we propose a method for extracting event networks from large news text data. To this end, we first collected Korean political and social articles from March 2016 to March 2017, and integrated the synonyms by leaving only meaningful words through preprocessing using NPMI and Word2Vec. Latent Dirichlet allocation (LDA) topic modeling was used to calculate the subject distribution by date and to find the peak of the subject distribution and to detect the event. A total of 32 topics were extracted from the topic modeling, and the point of occurrence of the event was deduced by looking at the point at which each subject distribution surged. As a result, a total of 85 events were detected, but the final 16 events were filtered and presented using the Gaussian smoothing technique. We also calculated the relevance score between events detected to construct the event network. Using the cosine coefficient between the co-occurred events, we calculated the relevance between the events and connected the events to construct the event network. Finally, we set up the event network by setting each event to each vertex and the relevance score between events to the vertices connecting the vertices. The event network constructed in our methods helped us to sort out major events in the political and social fields in Korea that occurred in the last one year in chronological order and at the same time identify which events are related to certain events. Our approach differs from existing event detection methods in that LDA topic modeling makes it possible to easily analyze large amounts of data and to identify the relevance of events that were difficult to detect in existing event detection. We applied various text mining techniques and Word2vec technique in the text preprocessing to improve the accuracy of the extraction of proper nouns and synthetic nouns, which have been difficult in analyzing existing Korean texts, can be found. In this study, the detection and network configuration techniques of the event have the following advantages in practical application. First, LDA topic modeling, which is unsupervised learning, can easily analyze subject and topic words and distribution from huge amount of data. Also, by using the date information of the collected news articles, it is possible to express the distribution by topic in a time series. Second, we can find out the connection of events in the form of present and summarized form by calculating relevance score and constructing event network by using simultaneous occurrence of topics that are difficult to grasp in existing event detection. It can be seen from the fact that the inter-event relevance-based event network proposed in this study was actually constructed in order of occurrence time. It is also possible to identify what happened as a starting point for a series of events through the event network. The limitation of this study is that the characteristics of LDA topic modeling have different results according to the initial parameters and the number of subjects, and the subject and event name of the analysis result should be given by the subjective judgment of the researcher. Also, since each topic is assumed to be exclusive and independent, it does not take into account the relevance between themes. Subsequent studies need to calculate the relevance between events that are not covered in this study or those that belong to the same subject.

Regional Analysis of Forest Eire Occurrence Factors in Kangwon Province (강원도 지역 산불발생인자의 지역별 유형화)

  • 이시영;한상열;안상현;오정수;조명희;김명수
    • Korean Journal of Agricultural and Forest Meteorology
    • /
    • v.3 no.3
    • /
    • pp.135-142
    • /
    • 2001
  • This study attempts to categorizes the factors of forest fire occurrences based on regional meteorologic data and general forest no characteristics of 18 cities and guns in Kangwon province. lo accomplish this goal, some statistical analyses such as analysis of variance, correspondence analysis and multidimensional scaling were adopted. To reveal the forest fires pattern of study region, a categorization process was conducted by employing the quantification approach which modified and quantified the metric-data of fire occurrence dates. Also, The fire occurrence similarity was compared by using multidimensional scaling for each study region. The major results are summarized as follows: It was found that the meteorological factors emerged as different to each region are average and maximum temperature, minimum dew point temperature and average and maximum wind speed. In the result of correspondence analysis representing relationships between fire causes and study regions, Kangrung is caused by arsonist, Chulwon, Hwachen and Yanggu caused by military factor, Sokcho and Chunchen caused by the debris burning, and Samchuk caused by general man-caused fires, respectively. Finally, the forest fire occurrence pattern of this study regions were divided into five areas such as, group I including Samchuk, Kangryung, Chunchen, Wonju, Hongchen and Hhoingsung, group II including Donghae, Taebaek, Yangyang and Pyongchang, group III including Jungsun, Chulwon and Whachen, group Ⅵ including Gosung, Injae and Yanggu, and group V including Shokcho and Youngwol.

  • PDF

Surface Change Detection in the March 5Youth Mine Using Sentinel-1 Interferometric SAR Coherence Imagery (Sentinel-1 InSAR 긴밀도 영상을 이용한 3월5일청년광산의 지표 변화 탐지)

  • Moon, Jihyun;Kim, Geunyoung;Lee, Hoonyol
    • Korean Journal of Remote Sensing
    • /
    • v.37 no.3
    • /
    • pp.531-542
    • /
    • 2021
  • Open-pit mines require constant monitoring as they can cause surface changes and environmental disturbances. In open-pit mines, there is little vegetation at the mining site and can be monitored using InSAR (Interferometric Synthetic Aperture Radar) coherence imageries. In this study, activities occurring in mine were analyzed by applying the recently developed InSAR coherence-based NDAI (Normalized Difference Activity Index). The March 5 Youth Mine is a North Korean mine whose development has been expanded since 2008. NDAI analysis was performed with InSAR coherence imageries obtained using Sentinel-1 SAR images taken at 12-day intervals in the March 5 Youth Mine. First, the area where the elevation decreased by about 75.24 m and increased by about 9.85 m over the 14 years from 2000 was defined as the mining site and the tailings piles. Then, the NDAI images were used for time series analysis at various time intervals. Over the entire period (2017-2019), average mining activity was relatively active at the center of the mining area. In order to find out more detailed changes in the surface activity of the mine, the time interval was reduced and the activity was observed over a 1-year period. In 2017, we analyzed changes in mining operations before and after artificial earthquakes based on seismic data and NDAI images. After the large-scale blasting that occurred on 30 April 2017, activity was detected west of the mining area. It is estimated that the size of the mining area was enlarged by two blasts on 30 September 2017. The time-averaged NDAI images used to perform detailed time-series analysis were generated over a period of 1 year and 4 months, and then composited into RGB images. Annual analysis of activity confirmed an active region in the northeast of the mining area in 2018 and found the characteristic activity of the expansion of tailings piles in 2019. Time series analysis using NDAI was able to detect random surface changes in open-pit mines that are difficult to identify with optical images. Especially in areas where in situ data is not available, remote sensing can effectively perform mining activity analysis.

Tectonic Structures and Hydrocarbon Potential in the Central Bransfield Basin, Antarctica (남극 브랜스필드 해협 중앙분지의 지체구조 및 석유부존 가능성)

  • Huh Sik;Kim Yeadong;Cheong Dae-Kyo;Jin Young Keun;Nam Sang Heon
    • The Korean Journal of Petroleum Geology
    • /
    • v.5 no.1_2 s.6
    • /
    • pp.9-15
    • /
    • 1997
  • The study area is located in the Central Bransfield Basin, Antarctica. To analyze the morphology of seafloor, structure of basement, and seismic stratigraphy of the sedimentary layers, we have acquired, processed, and interpreted the multi-channel seismic data. The northwest-southeastern back-arc extension dramatically changes seafloor morphology, volcanic and fault distribution, and basin structure along the spreading ridges. The northern continental shelf shows a narrow, steep topography. In contrast, the continental shelf or slope in the south, which is connected to the Antarctic Peninsula, has a gentle gradient. Volcanic activities resulted in the formation of large volcanos and basement highs near the spreading center, and small-scale volcanic diapirs on the shelf. A very long, continuous normal fault characterizes the northern shelf, whereas several basinward synthetic faults probably detach into the master fault in the south. Four transfer faults, the northwest-southeastern deep-parallel structures, controlled the complex distributions of the volcanos, normal faults, depocenters, and possibly hydrocarbon provinces in the study area. They have also deformed the basement structure and depositional pattern. Even though the Bransfield Basin was believed to be formed in the Late Cenozoic (about 4 Ma), the hydrocarbon potential may be very high due to thick sediment accumulation, high organic contents, high heat flow resulted from the active tectonics, and adequate traps.

  • PDF

The Effect of 15% Carbamide Peroxide on the Surface Roughness and Staining of Esthetic Restoratives (15% Carbamide Peroxide가 심미수복재의 표면조도와 착색에 미치는 영향)

  • Kim, Soo-Hwa;Choi, Hye-Sook;Roh, Jj-Yeon;Kim, Kwang-Mahn
    • Journal of dental hygiene science
    • /
    • v.13 no.2
    • /
    • pp.165-173
    • /
    • 2013
  • The purpose of this study was to evaluate the surface change after 15% carbamide peroxide home bleaching to various restorative materials (composite resin [CR], resin modified glass ionomer [RMGI] and glass ionomer [GI]) and to observe the effect of surface condition of the materials on re-staining. Three esthetic restorative materials (Filtek Z250, 3M, USA; Fuji II LC, GC, Japan; Fuji II, GC, Japan) were used in this study. Twenty specimens per material group were made and divided into two groups (bleached and control). The specimens were immersed in coffee after applying bleaching agent. The color change and surface roughness were measured before and after bleaching and after immersion in coffee. The data were analyzed with SPSS 18.0. The results were as follows: 1. The color of all experiment groups was significantly changed after bleaching (p<0.05). RMGI was the greatest value of ${\Delta}E^*$ and ${\Delta}L^*$. GI and CR groups were in ordering (p<0.05). The ${\Delta}a^*$ value was decreased GI, RMGI and CR. RMGI was only significantly decreased in ${\Delta}b^*$ value (p<0.05). 2. The surface roughness before and after bleaching was significantly different on CR, RMGI and GI (p<0.05). 3. After staining with coffee, the value of ${\Delta}E^*$ was increased in GI, RMGI and CR, furthermore GI and RMGI showed significant difference in the bleaching groups (p<0.05). The ${\Delta}L^*$ value of GI and RMGI was significantly decreased. 4. The change of surface roughness after staining was not significantly different in all groups (p>0.05). The maintenance of color stability in esthetic restorations is one of the most important properties. Tooth whitening is for the aesthetic. Therefore, dental professionals should notice to patients about re-staining after tooth whitening. They should give an instruction that how to prevent and which kinds of agents could be stained.

L-band SAR-derived Sea Surface Wind Retrieval off the East Coast of Korea and Error Characteristics (L밴드 인공위성 SAR를 이용한 동해 연안 해상풍 산출 및 오차 특성)

  • Kim, Tae-Sung;Park, Kyung-Ae;Choi, Won-Moon;Hong, Sungwook;Choi, Byoung-Cheol;Shin, Inchul;Kim, Kyung-Ryul
    • Korean Journal of Remote Sensing
    • /
    • v.28 no.5
    • /
    • pp.477-487
    • /
    • 2012
  • Sea surface winds in the sea off the east coast of Korea were derived from L-band ALOS (Advanced Land Observing Satellite) PALSAR (Phased Array type L-band Synthetic Aperture Radar) data and their characteristics of errors were analyzed. We could retrieve high-resolution wind vectors off the east coast of Korea including the coastal region, which has been substantially unavailable from satellite scatterometers. Retrieved SAR-wind speeds showed a good agreement with in-situ buoy measurement by showing relatively small an root-mean-square (RMS) error of 0.67 m/s. Comparisons of the wind vectors from SAR and scatterometer presented RMS errors of 2.16 m/s and $19.24^{\circ}$, 3.62 m/s and $28.02^{\circ}$ for L-band GMF (Geophysical Model Function) algorithm 2009 and 2007, respectively, which tended to be somewhat higher than the expected limit of satellite scatterometer winds errors. L-band SAR-derived wind field exhibited the characteristic dependence on wind direction and incidence angle. The previous version (L-band GMF 2007) revealed large errors at small incidence angles of less than $21^{\circ}$. By contrast, the L-band GMF 2009, which improved the effect of incidence angle on the model function by considering a quadratic function instead of a linear relationship, greatly enhanced the quality of wind speed from 6.80 m/s to 1.14 m/s at small incident angles. This study addressed that the causes of wind retrieval errors should be intensively studied for diverse applications of L-band SAR-derived winds, especially in terms of the effects of wind direction and incidence angle, and other potential error sources.

Empirical Analysis of Consumer Behavior on the Internet Shopping Mall Choice from the Schema Perspective: Comparison Between Bricks & Clicks and Pure-Player Shopping Mall (스키마 관점에서 살펴본 인터넷 쇼핑몰 선택에 대한 소비자행동의 이해: Bricks & Clicks와 Pure-Player 인터넷 쇼핑몰 비교를 중심으로)

  • Chung, Nam-Ho;Lee, Kun-Chang
    • Asia pacific journal of information systems
    • /
    • v.17 no.4
    • /
    • pp.165-186
    • /
    • 2007
  • With the advent of a wide variety of Internet shopping malls, consumers can choose a best appealing shopping mall from among the Bricks-and-Clicks and Pure-Player malls. Pure-Players launched their operation grandiosely with the early stage of Internet use in 1995. However, after the burst of Dot-com company bubbles in 1997, Pure-Players introduce various types of business models to meet potential needs of consumers. While Pure-Players suffer skeptical views from market analysts as well as consumers, traditional offline companies learned important lessons from Dot-com companies collapse phenomena, and expanded their business channels into online in the name of Bricks-and-Clicks. Nowadays, Bricks-and-Clicks successfully establish in the market as one of reliable business partners among consumers. Therefore, it is no surprise that recent competitions between Bricks-and Clicks and Pure-Players become fiercer than ever to attract potential customers to their websites. In this situation, consumers can choose a shopping mall to their best satisfaction. Consumers can enjoy both offline and online options for shopping because Bricks-and Clicks provide both offline and online channels to consumers, which is compared with Pure-Players offering only online channel. Offline channel is unique in providing consumers with chances to touch and feel target products and services. Meanwhile, online channel is considered very viable and convenient shopping options for consumers. In this respect, it is easily assumed that consumers will show different online shopping behavior when they have to choose either Bricks-and-Clicks mall or Pure-Player mall for the sake of shopping. Remaining research issue in this case is how much consumers' schema would influence online shopping behavior between Bricks-and-Clicks and Pure-Players. Basically, schema is a framework for synthetic information recognition that individual consumers have and is very characteristic in that it focuses not on fragmentary facts but on the combination of various causes affecting results. Consumers' schema is closely represented by trust, structural assurance, and perceived relative advantage towards a specific type of shopping mall. In literature, there exist a lot of studies comparing Bricks-and-Clicks and Pure-Players. However, there is no study to pursue the analysis of consumer behaviors comparing Bricks-and Clicks and Pure-Players from the schema perspective. Therefore, this study aims to investigate this research gap. Empirical analysis is adopted by garnering valid questionnaires from 514 Internet shopping mall users. 237 were mainly using Bricks-and-Clicks for shopping, while 277 were found to visit Pure-Players for shopping. PLS was applied to analyze the survey data to verify the proposed research hypotheses. Findings from the empirical test results are as follows. First, consumers perceive more trust and relative advantage in Pure-Players, comparing with Bricks-and-Clicks. This result is against widely-accepted perception that Bricks-and-Clicks would be perceived by consumers as more trustworthy and relatively advantageous because they have offline reputation and stores. Therefore, it becomes more obvious that Internet is becoming daily necessaries, and consumers increasingly feel very comfortable in using the Internet for their own personal purposes. Second, consumers have firm faith in transaction safety, regardless Bricks-and-Clicks and Pure-Players. This seems due to the fact that most of shopping malls showing dubious transaction safety have no place in the market. In a nutshell, empirical results tell us that Pure-Players will grow very much in the future, to the extent that consumers perceive no difference in comparison with Bricks-and-Clicks. Besides, consumers' schema accumulated through trust and perceived relative advantage plays crucial role in determining consumer behavior.

Inhibitory Effect of Hot-Water Extract of Paeonia japonica on Oxidative Stress and Identification of Its Active Components (백작약 열수추출물의 산화적 스트레스 억제효과 및 유효성분 동정)

  • Jeong, Ill-Yun;Lee, Joo-Sang;Oh, Heon;Jung, U-Hee;Park, Hae-Ran;Jo, Sung-Kee
    • Journal of the Korean Society of Food Science and Nutrition
    • /
    • v.32 no.5
    • /
    • pp.739-744
    • /
    • 2003
  • This study was carried out to investigate the antioxidative activity and to identify the active components of hot-water extract of Paeoniajaponica (PJ), which was a main ingredient of a herb mixture preparation recently established as a potent candidate of radioprotector in our laboratory. The water extract was fractionated with CHCl$_3$, EtOAc and n-BuOH. The extract and its fractions showed very low activity in hydroxyl radical scavenging test. In lipid peroxidation test, the extract, EtOAc and water fractions showed moderate inhibition with the ratio above 50%. In DPPH radical scavenging test, the extract, EtOAc and water fraction showed high activity with the ratio above 80%, especially. EtOAc fraction scavenged the radicals as much as synthetic antioxidant (BHA), even at low concentration. It is suggested that mai or partition for antioxidative activity of Paeonia japonica was EtOAc fraction. Subsequently, two active compounds (PJE021-1 and JE024-1) from EtOAc fraction were isolated by using MCI gel and silica gel column chromatography The two compounds inhibited remarkedly the $H_2O$$_2$-induced DNA damage in human peripheral blood lymphocytes, measured by single-cell gel electrophoresis (SCGE). PJE021-1 protected the cells to almost negative control level, dose-dependently. PJE024-1 exhibited a potent inhibition with the ratio of 71% at even low concentration (0.5 $\mu\textrm{g}$/$m\ell$). Finally, their chemical structures were identified as gallic acid (PJE021-1) and (+)-catechin (PJE024-1), respectively, on the basis of the speculation of spectral and physical data.

Acute toxicity of some pesticides on five Korean native Cladocerans (한국산 물벼룩에 대한 수종 농약의 급성독성)

  • Kim, Byung-Seok;Park, Yoen-Ki;Park, Kyung-Hun;Jeong, Mi-Hye;You, Are-Sun;Yang, Yu-Jung;Shin, Jin-Sup;Kim, Jin-Hwa;Yoon, Seong-Myeong;Ahn, Young-Joon
    • The Korean Journal of Pesticide Science
    • /
    • v.11 no.4
    • /
    • pp.261-267
    • /
    • 2007
  • The acute toxicity of several pesticides on 4 Korean water flea was investigated to develop a new standard species used for ecological risk assessment of pesticide. Four Korean freshwater cladocerans, Daphnia obtusa, Daphnia sp., Moina macrocopa and Simocepholus vetulus were exposed to five different pesticides during 48 hours to compare their sensitivity with a standard test species, Daphnia magna, endorsed formally by the major international organizations. The synthetic pyrethroid, fenpropathrin was the most toxic pesticide to cladocerans. Diazinon, carbofuran, iprodione and myclobutanil were in the order of their toxicity to cladocerans tested. There was no consistent difference in sensitivity to five pesticides for four Korean cladocerans tested. In conclusion, the ecological risk assessment using single species toxicity referred to base set data should not be enough to protect to every species in the field environment.

A Model for the $3_{10}$/$\alpha$ Helix Transitions of $\alpha$-Aminoisobutyric Acid-Alanine Oligopeptide ($\alpha$-아미노이소부틸산-알라닌 올리고 펩티드의 $3_{10}$/$\alpha$ 나선 전이에 관한 모형)

  • Kim, Yeong Gu;Park, Hyeong Seok
    • Journal of the Korean Chemical Society
    • /
    • v.38 no.10
    • /
    • pp.710-718
    • /
    • 1994
  • We suggest a statistical thermodynamic theory for the conformational transition of a synthetic alanine (Ala), ${\alpha}$-aminoisobutyric acid (Aib) alternative oligopeptide, Buo-(Ala-Aib)$_n$-oMe, where the terminal groups Buo and oMe stand for t-butoxy and methoxy, respectively. Pure Aib homo-oligomers have always been found to adopt $3_{10}$ helical conformations, while polyalanine has always $\alpha$ helical conformation. In an organic solvent (e.g. $CD_3$CN) it shows that the length for the $3_{10}$/${\alpha}$ helix transitions of Buo-(Ala-Aib)$_n$-oMe, is 8 at room temperature. In an aqueous solution oligopeptide has always coil conformation at room temperature. In an organic solution, helical structures of the oligopeptide are more stable than coil structure, so we studied the $$3_{10}/\alpha$ helix transitions, considering coiled-conformations, coiled and $3_{10}$ helical conformations, and coiled and $\alpha$ helical conformations by using the zipper model. We determined the values of parameters ($\sigma_A$, $\sigma_T$, $\xi_A$, $\xi_T$) from the relating published data; $\sigma_A$ = 0.00011, $\sigma_T$ = 0.0060, $\xi_A$ = 10.1, $\xi_T$ = 3.90. The distributions of $\alpha$ helical length can be N-2, N-3, N-4, ${\cdots}$, 3, 2, 1 (N = 2n) while those of $3_{10}$ helical length, N-1, N-2, N-3, N-4, ${\cdots}$, 3, 2, 1.

  • PDF