• Title/Summary/Keyword: index clustering

Citizen Sentiment Analysis of the Social Disaster by Using Opinion Mining (오피니언 마이닝 기법을 이용한 사회적 재난의 시민 감성도 분석)

  • Seo, Min Song;Yoo, Hwan Hee
    • Journal of Korean Society for Geospatial Information Science
    • /
    • v.25 no.1
    • /
    • pp.37-46
    • /
    • 2017
  • Recently, disasters caused by social factors have occurred frequently in Korea. It is difficult to predict which crisis will happen next, which raises citizens' concern. In this study, we developed a program that acquires tweet data using Tweepy, a Python-based library, for social disasters such as 'Nonspecific motive crimes' and the 'Oxy' products case. After natural language processing, these data were used to evaluate the psychological trauma and anxiety of citizens through text clustering analysis and opinion mining analysis in R Studio. In the 'Oxy' case, the Sewol ferry accident and the continued sale of Oxy products had the highest similarity; in the 'Nonspecific motive crimes' case, the government's coping measures against unexpected incidents such as the screen door 'incident', the Sewol ferry accident, and the 'Nonspecific motive crime' due to misogyny in Busan had the highest similarity. In addition, the average citizen sentiment score for Nonspecific motive crimes was more negative than that for the Oxy case by 11.61%p. Therefore, the findings are expected to be used to predict the mental health of citizens and to help prevent future accidents.

Application of Spatial Autocorrelation for the Spatial Distribution Pattern Analysis of Marine Environment - Case of Gwangyang Bay - (해양환경 공간분포 패턴 분석을 위한 공간자기상관 적용 연구 - 광양만을 사례 지역으로 -)

  • Choi, Hyun-Woo;Kim, Kye-Hyun;Lee, Chul-Yong
    • Journal of the Korean Association of Geographic Information Studies
    • /
    • v.10 no.4
    • /
    • pp.60-74
    • /
    • 2007
  • For quantitative analysis of the spatio-temporal distribution pattern of the marine environment, spatial autocorrelation statistics in both global and local aspects were applied to observed data from Gwangyang Bay in the South Sea of Korea. Global indexes such as Moran's I and General G were used to understand the environmental distribution pattern over the whole study area. LISAs (local indicators of spatial association) such as local Moran's I ($I_i$) and $G_i{^*}$ were used to find similarity between a target feature and its neighboring features and to detect hot spots and/or cold spots. Additionally, a significance test on clustered patterns was carried out using Z-scores. The statistical results quantitatively showed variations of spatial patterns throughout the year. General water quality, nutrients, chlorophyll-a and phytoplankton all had strongly clustered patterns in summer. When the global indexes showed a strongly clustered pattern, a front region with a negative $I_i$, which indicates strong spatial variation, was observed. Also, when the global indexes showed a random pattern, hot spots and/or cold spots were found in small local regions with the local index $G_i{^*}$. Therefore, global indexes were useful for observing the strength and time series variations of clustered patterns over the whole study area, and local indexes were useful for tracing the locations of hot spots and/or cold spots. Quantifying both the spatial distribution pattern and the clustering characteristics may play an important role in understanding the marine environment in depth and in finding the reasons for spatial patterns.
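
As an illustration of the global index named in the abstract above, the following minimal sketch computes Moran's I for a handful of observations. The function name `morans_i`, the binary adjacency weights, and the sample values are assumptions made for illustration; they are not taken from the paper.

```python
def morans_i(values, weights):
    """Global Moran's I for observations `values` and a square weight matrix.

    weights[i][j] is the spatial weight between locations i and j
    (hypothetical binary adjacency here; the diagonal is zero).
    """
    n = len(values)
    mean = sum(values) / n
    dev = [v - mean for v in values]          # deviations from the mean
    w_sum = sum(sum(row) for row in weights)  # sum of all weights
    cross = sum(weights[i][j] * dev[i] * dev[j]
                for i in range(n) for j in range(n))
    variance_term = sum(d * d for d in dev)
    return (n / w_sum) * cross / variance_term


# Four stations along a line; each station neighbors the next one.
chain = [
    [0, 1, 0, 0],
    [1, 0, 1, 0],
    [0, 1, 0, 1],
    [0, 0, 1, 0],
]
clustered = morans_i([1, 1, 5, 5], chain)    # similar values sit together
alternating = morans_i([1, 5, 1, 5], chain)  # neighbors always differ
print(clustered, alternating)
```

Positive values indicate that similar observations tend to be neighbors, and negative values indicate that neighbors tend to differ. The sketch stops at the index itself and does not include the Z-score significance test.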

EPR : Enhanced Parallel R-tree Indexing Method for Geographic Information System (EPR : 지리 정보 시스템을 위한 향상된 병렬 R-tree 색인 기법)

  • Lee, Chun-Geun;Kim, Jeong-Won;Kim, Yeong-Ju;Jeong, Gi-Dong
    • The Transactions of the Korea Information Processing Society
    • /
    • v.6 no.9
    • /
    • pp.2294-2304
    • /
    • 1999
  • The purpose of this paper is to improve query processing performance in GIS (Geographic Information System) by enhancing I/O performance through parallel I/O and efficient disk access. By packing adjacent spatial data, which are very likely to be referenced together, into one block or contiguous disk blocks, the number of disk accesses and the disk access overhead for query processing can be decreased, which eventually reduces I/O time. We therefore propose the EPR (Enhanced Parallel R-tree) indexing method, which integrates the parallel I/O method of the previous Parallel R-tree method with a packing-based clustering method. The major characteristics of the EPR method are as follows. First, it arranges spatial data in order of proximity by using the Hilbert space-filling curve and builds a packed R-tree in a bottom-up manner. Second, with packing-based clustering, in which the arranged spatial data are clustered into contiguous disk blocks, it generates spatial data clusters. Third, it distributes EPR index nodes and spatial data clusters over multiple disks through round-robin striping. Experimental results show that the EPR method achieves gains of up to 30% or more over the PR method in query processing speed. In particular, the larger the disk blocks and the smaller the spatial data objects, the better the query processing performance of the EPR method.
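
The three steps listed in the abstract above (Hilbert ordering, packing into blocks, round-robin striping) can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: points are integer grid cells, `hilbert_index` uses the standard iterative cell-to-curve mapping, and the names `pack_and_stripe`, `block_capacity`, and `num_disks` are hypothetical. Building the R-tree levels above the packed blocks is omitted.

```python
def hilbert_index(n, x, y):
    """Position of cell (x, y) along a Hilbert curve over an n-by-n grid.

    n must be a power of two and 0 <= x, y < n.
    """
    d = 0
    s = n // 2
    while s > 0:
        rx = 1 if (x & s) else 0
        ry = 1 if (y & s) else 0
        d += s * s * ((3 * rx) ^ ry)
        # Rotate/flip the quadrant so the curve stays continuous.
        if ry == 0:
            if rx == 1:
                x, y = n - 1 - x, n - 1 - y
            x, y = y, x
        s //= 2
    return d


def pack_and_stripe(points, grid_size, block_capacity, num_disks):
    """Order points along the curve, pack them into blocks, stripe the blocks."""
    ordered = sorted(points, key=lambda p: hilbert_index(grid_size, p[0], p[1]))
    # Packing: consecutive (spatially close) points share one block.
    blocks = [ordered[i:i + block_capacity]
              for i in range(0, len(ordered), block_capacity)]
    # Round-robin striping: block k goes to disk k mod num_disks.
    disks = [[] for _ in range(num_disks)]
    for k, block in enumerate(blocks):
        disks[k % num_disks].append(block)
    return blocks, disks


blocks, disks = pack_and_stripe([(1, 0), (0, 0), (1, 1), (0, 1)],
                                grid_size=2, block_capacity=2, num_disks=2)
print(blocks)
print(disks)
```

Because neighboring positions on the curve are neighboring cells on the grid, each block holds spatially close points, and striping lets a range query that touches several blocks read them from different disks in parallel.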

Analysis of Roadkill Hotspot According to the Spatial Clustering Methods (공간 군집지역 탐색방법에 따른 로드킬 다발구간 분석)

  • Song, Euigeun;Seo, Hyunjin;Kim, Kyungmin;Woo, Donggul;Park, Taejin;Choi, Taeyoung
    • Journal of Environmental Impact Assessment
    • /
    • v.28 no.6
    • /
    • pp.580-591
    • /
    • 2019
  • This study analyzed roadkill hotspots in Yeongju-si, Mungyeong-si, Andong-si and Cheongsong-gun to compare spatial cluster detection methods for selecting roadkill hotspots. The local spatial autocorrelation index, the Getis-Ord Gi* statistic, was calculated for different units of analysis, yielding hotspot areas of 9% at 300 m and 14% at 1 km on the basis of the total road area. Among the 1 km hotspot areas, the highest Z-score was found in the 28th National Road section on the border between Yecheon-gun and Yeongju-si. For the kernel density method, both general kernel density estimation and network kernel density estimation were performed; both made it easier to visualize roadkill hotspots than the district unit analysis, but they were limited in that statistically significant priorities were difficult to determine. As a result, local hotspot areas were found to differ according to the cluster analysis method, and the area commonly identified as needing reduction measures was the hotspot on the 28th National Road through Yeongju-si and Yecheon-gun. The results of this study can be used as basic data when identifying roadkill hotspots and establishing measures to reduce roadkill.
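
To make the Gi* statistic mentioned above concrete, here is a minimal sketch of the standard Getis-Ord Gi* Z-score, in which each location's own value is included in its neighborhood. The function name, the binary weights, and the roadkill counts are hypothetical; the study's actual analysis units and weighting scheme are not given in the abstract.

```python
import math


def getis_ord_gi_star(values, weights):
    """Gi* Z-score for every location.

    weights[i][j] is the weight of location j in the neighborhood of i;
    weights[i][i] is nonzero because Gi* includes the location itself.
    A row whose weights cover every location equally has zero variance
    and is not handled by this sketch.
    """
    n = len(values)
    mean = sum(values) / n
    s = math.sqrt(sum(v * v for v in values) / n - mean * mean)
    scores = []
    for i in range(n):
        row = weights[i]
        w_sum = sum(row)
        w_sq_sum = sum(w * w for w in row)
        local_sum = sum(row[j] * values[j] for j in range(n))
        numerator = local_sum - mean * w_sum
        denominator = s * math.sqrt((n * w_sq_sum - w_sum ** 2) / (n - 1))
        scores.append(numerator / denominator)
    return scores


# Hypothetical roadkill counts on five consecutive road segments.
counts = [0, 0, 0, 10, 10]
# Each segment's neighborhood is itself plus the adjacent segments.
neighbors = [
    [1, 1, 0, 0, 0],
    [1, 1, 1, 0, 0],
    [0, 1, 1, 1, 0],
    [0, 0, 1, 1, 1],
    [0, 0, 0, 1, 1],
]
z_scores = getis_ord_gi_star(counts, neighbors)
print([round(z, 3) for z in z_scores])
```

Large positive scores mark candidate hotspots and large negative scores mark cold spots; changing the neighborhood definition (for example 300 m versus 1 km units) changes which segments stand out.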

A News Video Mining based on Multi-modal Approach and Text Mining (멀티모달 방법론과 텍스트 마이닝 기반의 뉴스 비디오 마이닝)

  • Lee, Han-Sung;Im, Young-Hee;Yu, Jae-Hak;Oh, Seung-Geun;Park, Dai-Hee
    • Journal of KIISE:Databases
    • /
    • v.37 no.3
    • /
    • pp.127-136
    • /
    • 2010
  • With the rapid growth of information and computer communication technologies, the number of digital documents, including multimedia data, has recently exploded. In particular, news video databases and news video mining have become the subject of extensive research aimed at developing effective and efficient tools for the manipulation and analysis of news videos, because of their information richness. However, much of this research focuses on browsing, retrieval and summarization of news videos. To date, work on discovering and analyzing the plentiful latent semantic knowledge in news videos is at a relatively early stage. In this paper, we propose a news video mining system based on a multi-modal approach and text mining, which uses the visual-textual information of news video clips and their scripts. The proposed system automatically and systematically constructs a taxonomy of news video stories with a hierarchical clustering algorithm, which is one of the text mining methods. Then, it analyzes the topics of news video stories from multiple angles by means of a time-cluster trend graph, a weighted cluster growth index, and network analysis. To demonstrate the validity of our approach, we analyzed news videos on "The Second Summit of South and North Korea in 2007".

Spatial Influence on Acupoints Network Derived from the Chapter on Acupuncture & Moxibustion in "Beijiqianjinyaofang" ("비급천금요방(備急千金要方)" 침구편(鍼灸篇)으로 구성한 경혈(經穴) 네트워크에 공간적 위치 변수가 미치는 영향)

  • Kim, Min-Uk;Yang, Seung-Bum;Ahn, Seong-Hoon;Sohn, In-Chul;Kim, Jae-Hyo
    • Korean Journal of Acupuncture
    • /
    • v.29 no.3
    • /
    • pp.431-440
    • /
    • 2012
  • Objectives : Recently, network science has become a very popular topic in various scientific fields, and many studies have reported that it gives meaningful results when studying the characteristics of a complex system. In this study, based on network theory, we built an acupoints network using data on combined acupoints that appear in "Beijiqianjinyaofang". We focused on finding the distinctive roles of remote and local combinations in the network. Furthermore, we aimed to identify the possibility of numerical and quantitative applications in acupuncture research. Methods : Based on examples of combined acupoints in "Beijiqianjinyaofang", the network consisted of 291 nodes and 2,431 links. The spatial distances between combined acupoints were calculated with a human dummy model. We removed the links step by step for three cases (remote, local, and random) and observed the characteristic changes by calculating path lengths, similarity indices, and clustering coefficients. Cluster analysis was also carried out. Results : The network had a small number of remote links and a large number of local links. These two kinds of links had distinct characteristics. Whereas the local links formed clusters of nearby nodes, remote links served to increase the correlation between the clusters. Conclusions : These results suggest that the acupoints network increases the connectivity between the distal parts and the trunk of the human body and enables various combinations of acupoints. This finding showed that the mechanism of combined acupoints could be interpreted meaningfully by applying network theory in acupuncture research.
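
The link-removal procedure described in the Methods above relies on recomputing network measures after each removal. The sketch below shows two of those measures, the average clustering coefficient and the average shortest path length, on a tiny hypothetical network. The node labels, helper names, and the single removed link are assumptions for illustration; the remote/local/random selection of links and the similarity indices are not reproduced.

```python
from collections import deque
from itertools import combinations


def build_graph(links):
    """Undirected adjacency sets from a list of (node, node) links."""
    graph = {}
    for a, b in links:
        graph.setdefault(a, set()).add(b)
        graph.setdefault(b, set()).add(a)
    return graph


def average_clustering(graph):
    """Mean local clustering coefficient (nodes with degree < 2 count as 0)."""
    total = 0.0
    for nbrs in graph.values():
        k = len(nbrs)
        if k < 2:
            continue
        linked = sum(1 for u, v in combinations(nbrs, 2) if v in graph[u])
        total += 2.0 * linked / (k * (k - 1))
    return total / len(graph)


def average_path_length(graph):
    """Mean shortest path length over all reachable node pairs (BFS)."""
    total, pairs = 0, 0
    for source in graph:
        dist = {source: 0}
        queue = deque([source])
        while queue:
            u = queue.popleft()
            for v in graph[u]:
                if v not in dist:
                    dist[v] = dist[u] + 1
                    queue.append(v)
        total += sum(dist.values())
        pairs += len(dist) - 1
    return total / pairs


# Hypothetical network: a triangle of nearby points plus one extra link.
links = [("A", "B"), ("A", "C"), ("B", "C"), ("C", "D")]
before = build_graph(links)
after = build_graph([link for link in links if link != ("A", "B")])

print(average_clustering(before), average_path_length(before))
print(average_clustering(after), average_path_length(after))
```

Removing the A-B link breaks the only triangle, so the clustering coefficient drops to zero while the average path length grows, which is the kind of change the study tracks as links are removed step by step.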

Encapsulation and optical properties of Er3+ ions for planar optical amplifiers via sol-gel process (졸-겔법을 이용한 광증폭기의 Er 이온 캡슐화 및 광학적 특성)

  • Kim, Joo-Hyeun;Seok, Sang-Il;Ahn, Bok-Yeop
    • Proceedings of the Materials Research Society of Korea Conference
    • /
    • 2003.11a
    • /
    • pp.135-135
    • /
    • 2003
  • The fast evolution in the field of optical communication systems demands powerful optical information treatment. These functions can be performed by integrated optical systems. A key component of such systems is the erbium doped waveguide amplifier (EDWA). The intra-4f radiative transition of Er at 1.5 $\mu\textrm{m}$ is particularly interesting because this wavelength is standard in optical telecommunications. The fabrication of waveguide amplifiers for integrated optics using the sol-gel process has received increasing attention. The potential advantages of lower cost, due to less capital equipment, and easy processing make this process an attractive alternative to conventional technologies such as flame hydrolysis deposition, ion exchange and chemical vapor deposition. In addition, the sol-gel process has been found to be extremely suitable for controlling the composition and the refractive index, which are directly related to optical properties. The main drawback of such an amplifier with respect to the EDWA is the need for a much higher Er3+ concentration to compensate for the smaller interaction length. However, high Er doping may result in non-radiative relaxation caused by clustering of Er ions and co-operative upconversion. To solve this problem, we investigate the possibility of avoiding short Er-Er distances by encapsulating Er3+ ions in hosts such as organic-inorganic hybrid materials. For the inorganic-organic hybrid sols, methacryloxypropyltrimethoxysilane (MPTS), zirconyl chloride octahydrate and erbium(III) chloride hexahydrate were used as starting materials, followed by a conventional sol-gel process. TEM showed that nano sols with a core/shell topology were formed, depending on the Zr/Er mole ratio. The surface roughness of the coatings on Si substrates was investigated by AFM as a function of the Zr/Er ratio. The local environment and vibrational properties of Er3+ ions were studied using Near-IR, FT-IR, and UV/Vis spectroscopy. Nano hybrid coatings derived from polymer and encapsulated Er gave good luminescence at 1.55 $\mu\textrm{m}$.

Clustered Segment Index for Efficient Approximate Searching on the Secondary Structure of Protein Sequences (클러스터 세그먼트 인덱스를 이용한 단백질 이차 구조의 효율적인 유사 검색)

  • Seo, Min-Koo;Park, Sang-Hyun;Won, Jung-Im
    • Journal of KIISE:Databases
    • /
    • v.33 no.3
    • /
    • pp.251-260
    • /
    • 2006
  • Homology searching on the primary structure (i.e., amino acid arrangement) of protein sequences is an essential part of predicting the functions and evolutionary histories of proteins. However, proteins that are evolutionarily distant do not conserve their amino acid residue arrangements, even though they preserve their structures. Therefore, homology searching on the secondary structure of proteins is quite important for finding distant homology. In this paper, we propose an indexing scheme for efficient approximate searching on the secondary structure of protein sequences, which can be easily implemented in an RDBMS. Exploiting the concepts of clustering and lookahead, the proposed indexing scheme processes three types of secondary structure queries (i.e., exact match, range match, and wildcard match) very quickly. To evaluate the performance of the proposed method, we conducted extensive experiments using a set of actual protein sequences. The proposed Clustered Segment Index (CSI) proved to be up to 6.3 times faster than the existing indexing methods in exact match, 3.3 times faster in range match, and 1.5 times faster in wildcard match.

Combining Ego-centric Network Analysis and Dynamic Citation Network Analysis to Topic Modeling for Characterizing Research Trends (자아 중심 네트워크 분석과 동적 인용 네트워크를 활용한 토픽모델링 기반 연구동향 분석에 관한 연구)

  • Yu, So-Young
    • Journal of the Korean Society for Information Management
    • /
    • v.32 no.1
    • /
    • pp.153-169
    • /
    • 2015
  • This study suggested and examined a combined approach that uses ego-centric network analysis and dynamic citation network analysis to refine the results of LDA-based topic modeling. Two datasets were constructed by collecting Web of Science bibliographic records on White LED, and topic modeling was performed with a different number of topics set for each dataset. The top keywords assigned to multiple topics were re-assigned to one specific topic by applying an ego-centric network analysis algorithm. Topical cohesion was strongest, with the best values of the internal clustering evaluation indices, when topic modeling was applied to the dataset extracted by SPLC network analysis with the number of topics corresponding to the lowest perplexity. Furthermore, the results demonstrate the possibility of developing the suggested approach into a method for multi-faceted research trend detection.

Development Tendency of Altmetrics Research: Using Social Network Analysis and Co-word Analysis (소셜네트워크 분석과 Co-word 분석을 사용한 Altmetric 연구 개발동향)

  • Lee, Hyun-Chang;Li, Jiapei;Shin, Seong-Yoon
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.21 no.11
    • /
    • pp.2089-2094
    • /
    • 2017
  • Altmetrics are measurement indexes and quantitative data that complement traditional citation-based indicators. Altmetrics research has acquired greater importance in the past few years, partly because it complements traditional bibliometrics. This paper aims to reveal the research status and trends in altmetrics research. A total of 187 articles from 2005 to 2017 were obtained and analyzed, illustrating a steady rise (S-mode) in altmetrics research since 2005. Using social network analysis and co-word analysis, an author cooperation network and a keyword co-occurrence network were developed. The core scientists and eight international research groups were identified, showing that researchers in this field have a low degree of cooperation. Four topics of altmetrics research were discovered by hierarchical clustering. The results can be useful for further research on altmetrics.