• Title/Summary/Keyword: Data Clustering


A Spatial Statistical Approach to Migration Studies: Exploring the Spatial Heterogeneity in Place-Specific Distance Parameters (인구이동 연구에 대한 공간통계학적 접근: 장소특수적 거리 패러미터의 추출과 공간적 패턴 분석)

  • Lee, Sang-Il
    • Journal of the Korean association of regional geographers
    • /
    • v.7 no.3
    • /
    • pp.107-120
    • /
    • 2001
  • This study provides a reliable procedure for calibrating a set of place-specific distance parameters and applies it to U.S. interstate migration flows between 1985 and 1990. It attempts to conform to recent advances in quantitative geography that are characterized by the integration of exploratory spatial data analysis (ESDA) and local statistics. ESDA aims to detect spatial clustering and heterogeneity by visualizing and exploring spatial patterns. A local statistic is a statistically processed value assigned to each location, as opposed to a global statistic, which captures only an average trend across the whole study region. Whereas a global distance parameter estimates an average level of the friction of distance, place-specific distance parameters capture spatially varying effects of distance. The study shows that a Poisson regression with an adequately specified design matrix yields a set of either origin- or destination-specific distance parameters. A case study demonstrates that the proposed model is a reliable device for measuring a spatial dimension of migration, and that place-specific distance parameters are spatially heterogeneous as well as spatially clustered.

Analysis of Utilization Characteristics, Health Behaviors and Health Management Level of Participants in Private Health Examination in a General Hospital (일개 종합병원의 민간 건강검진 수검자의 검진이용 특성, 건강행태 및 건강관리 수준 분석)

  • Kim, Yoo-Mi;Park, Jong-Ho;Kim, Won-Joong
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.14 no.1
    • /
    • pp.301-311
    • /
    • 2013
  • This study analyzes the utilization characteristics, health behaviors, and health management level of private health examination recipients in one general hospital. For the 20,696 participants examined in 2011 at the health examination center of a general hospital in Dae-Jeon, we analyzed 150,501 private health examination records covering the 11 years from 2001 to 2011. K-means clustering with z-score standardization was used to classify the private health examination groups. Logistic regression, decision tree, and neural network analyses were used to build a classification model for periodic versus non-periodic private health examination. Among new examinees, 1,000 people with a high probability of becoming non-periodic examinees were selected as the target group for customer management. The examinees were categorized into new, periodic, and non-periodic groups. New participants included more people aged 30~39 than other age groups and more people suspected of having renal disease. Periodic participants included more men and more people suspected of having hyperlipidemia. Non-periodic participants included more smokers and sedentary people, and more people suspected of having anemia and diabetes mellitus. In the decision tree, the variables related to non-periodic participation were sex, age, residence, exercise, anemia, hyperlipidemia, diabetes mellitus, obesity, and liver disease. In particular, 71.4% of non-periodic participants were female, non-anemic, non-exercising, and suspected of obesity. Operating a customized customer management program for private health examinations should contribute to the efficiency of health examination centers.
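The clustering step described in this abstract (z-score standardization followed by K-means) can be sketched roughly as follows. This is a minimal illustration, not the authors' procedure: the two input variables, the sample values, the choice of k = 2, and the deterministic farthest-point initialization are all assumptions made for the example.

```python
import math

def zscore_columns(rows):
    """Standardize each column to mean 0 and population standard deviation 1."""
    columns = list(zip(*rows))
    means = [sum(col) / len(col) for col in columns]
    stds = []
    for col, mean in zip(columns, means):
        std = math.sqrt(sum((v - mean) ** 2 for v in col) / len(col))
        stds.append(std if std > 0 else 1.0)  # constant column: avoid division by zero
    return [[(v - m) / s for v, m, s in zip(row, means, stds)] for row in rows]

def squared_distance(p, q):
    return sum((a - b) ** 2 for a, b in zip(p, q))

def kmeans(points, k, max_iter=100):
    """Plain Lloyd iterations; returns (labels, centers)."""
    # Deterministic farthest-point initialization (an assumption of this sketch).
    centers = [list(points[0])]
    while len(centers) < k:
        farthest = max(points, key=lambda p: min(squared_distance(p, c) for c in centers))
        centers.append(list(farthest))
    labels = None
    for _ in range(max_iter):
        # Assign every point to its nearest center.
        new_labels = [min(range(k), key=lambda c: squared_distance(p, centers[c]))
                      for p in points]
        if new_labels == labels:
            break  # assignments stopped changing
        labels = new_labels
        # Move each center to the mean of its members.
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:
                centers[c] = [sum(col) / len(members) for col in zip(*members)]
    return labels, centers

# Hypothetical examinees: [number of examinations in the period, age]
examinees = [[1, 30], [1, 32], [2, 31], [10, 60], [11, 62], [12, 61]]
standardized = zscore_columns(examinees)
labels, centers = kmeans(standardized, k=2)
print(labels)  # -> [0, 0, 0, 1, 1, 1]
```

Standardizing first keeps a variable with a large numeric range (here, age) from dominating the Euclidean distance used by K-means.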

Growing Environment Characteristics and Vegetation Structure of Lonicera harae, Medicinal Plant (약용식물 길마가지나무 자생지의 생육환경특성과 식생구조)

  • Son, Yonghwan;Park, Sunghyuk;Jeong, Daehui;Cho, Hyejung;Son, Hojun;Jeon, Kwonseok
    • Korean Journal of Plant Resources
    • /
    • v.34 no.4
    • /
    • pp.297-310
    • /
    • 2021
  • Lonicera harae is a shrub species in the family Caprifoliaceae, mostly distributed in East Asia. So far, research on this species has been insufficient compared with that on Lonicera japonica, which belongs to the same genus, and domestic native plants deserve more attention. Therefore, this study aims to provide baseline data for cultivation and utilization by examining the growth environment and vegetation structure of its natural habitats. Lonicera harae is found throughout the Korean Peninsula. Its natural habitats are forests, valleys, and lowland areas of the southern region. This study examined 24 quadrats in 11 regions, including Gwangju, Wanju, and Namhae. The habitats were located 8 to 483 m above sea level, typically at about 173 m, and the slope ranged from 5 to 25 degrees, with an average of 8.5 degrees. The recorded plants were classified into a total of 229 taxa comprising 80 families, 166 genera, 198 species, 3 subspecies, 24 varieties, and 4 forms. Cluster analysis divided the vegetation into three groups: Robinia pseudoacacia, Zelkova serrata, and Larix kaempferi. Species diversity was 1.399, and dominance and evenness were 0.978 and 0.022, respectively.

Exploring the contextual factors of episodic memory: dissociating distinct social, behavioral, and intentional episodic encoding from spatio-temporal contexts based on medial temporal lobe-cortical networks (일화기억을 구성하는 맥락 요소에 대한 탐구: 시공간적 맥락과 구분되는 사회적, 행동적, 의도적 맥락의 내측두엽-대뇌피질 네트워크 특징을 중심으로)

  • Park, Jonghyun;Nah, Yoonjin;Yu, Sumin;Lee, Seung-Koo;Han, Sanghoon
    • Korean Journal of Cognitive Science
    • /
    • v.33 no.2
    • /
    • pp.109-133
    • /
    • 2022
  • Episodic memory consists of a core event and the associated contexts. Although the role of the hippocampus and its neighboring regions in contextual representations during encoding has become increasingly evident, it remains unclear how these regions handle context-specific information other than spatio-temporal contexts. Using high-resolution functional MRI, we explored the patterns of involvement of the medial temporal lobe (MTL) and cortical regions during the encoding of various types of contextual information (i.e., the journalism principle 5W1H): "Who did it?," "Why did it happen?," "What happened?," "When did it happen?," "Where did it happen?," and "How did it happen?" Participants answered six different contextual questions while looking at simple experimental events consisting of two faces with one object on the screen. The MTL was divided into sub-regions by hierarchical clustering of resting-state data. General linear model analyses revealed stronger activation of MTL sub-regions, the prefrontal cortex (PFC), and the inferior parietal lobule (IPL) during social (Who), behavioral (How), and intentional (Why) contextual processing than during spatio-temporal (Where/When) contextual processing. To further investigate the functional networks involved in this dissociation of contextual encoding, a multivariate pattern analysis was conducted using the task-based connectivity links between the hippocampal subfields and PFC/IPL as features. Social, behavioral, and intentional contextual processing were each successfully classified from spatio-temporal contextual processing. Thus, specific contexts in episodic memory, namely social, behavioral, and intentional contexts, involve functional connectivity patterns that are distinct from those for spatio-temporal contextual memory.

High-Risk Area for Human Infection with Avian Influenza Based on Novel Risk Assessment Matrix (위험 매트릭스(Risk Matrix)를 활용한 조류인플루엔자 인체감염증 위험지역 평가)

  • Sung-dae Park;Dae-sung Yoo
    • Korean Journal of Poultry Science
    • /
    • v.50 no.1
    • /
    • pp.41-50
    • /
    • 2023
  • Over the last decade, avian influenza (AI) has been considered an emerging disease that could become the next pandemic, particularly in countries like South Korea that experience continuous animal outbreaks. In this situation, risk assessment is needed to prevent and prepare for human infection with AI. We therefore developed a risk assessment matrix for identifying high-risk areas of human infection with AI in South Korea, based on the notion that risk is the product of hazard and vulnerability. In this matrix, the number of highly pathogenic avian influenza (HPAI) outbreaks in poultry farms was taken as the hazard, and the number of poultry-associated production facilities as the vulnerability. The average number of HPAI outbreaks in poultry farms in each of the 229 municipalities, which forms the hazard axis of the matrix, was predicted using negative binomial regression on nationwide outbreak data from 2003 to 2018. Each of the two components was classified into five groups using the K-means clustering algorithm, and the two were multiplied to produce an area-specific risk level for human infection. As a result, Naju-si, Jeongeup-si, and Namwon-si were categorized as high-risk areas for human infection with AI. These findings should contribute to designing policies that minimize the socio-economic damage of human infection.
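The matrix construction in this abstract (each axis binned into five groups by K-means, then multiplied) can be sketched as below. This is only an illustration under stated assumptions: the function names, the sample values, the evenly spaced initialization, and the scoring of groups as 1 to 5 in order of cluster center are hypothetical and not taken from the paper.

```python
def kmeans_1d_classes(values, k, max_iter=100):
    """Cluster scalars with 1-D k-means; return classes 1..k ordered by cluster center."""
    ordered = sorted(values)
    n = len(ordered)
    # Start centers at evenly spaced positions of the sorted data (a choice for this sketch).
    centers = [ordered[int((i + 0.5) * n / k)] for i in range(k)]
    labels = None
    for _ in range(max_iter):
        new_labels = [min(range(k), key=lambda c: abs(v - centers[c])) for v in values]
        if new_labels == labels:
            break  # assignments stopped changing
        labels = new_labels
        for c in range(k):
            members = [v for v, lab in zip(values, labels) if lab == c]
            if members:
                centers[c] = sum(members) / len(members)
    # Rank clusters so that class 1 has the lowest center and class k the highest.
    by_center = sorted(range(k), key=lambda c: centers[c])
    rank = {c: r + 1 for r, c in enumerate(by_center)}
    return [rank[lab] for lab in labels]

def risk_levels(hazard, vulnerability, k=5):
    """Risk score per area = hazard class x vulnerability class."""
    hazard_class = kmeans_1d_classes(hazard, k)
    vulnerability_class = kmeans_1d_classes(vulnerability, k)
    return [h * v for h, v in zip(hazard_class, vulnerability_class)]

# Hypothetical inputs for ten municipalities.
predicted_outbreaks = [0.1, 0.2, 1.0, 1.1, 2.0, 2.1, 3.0, 3.1, 4.0, 4.1]  # hazard axis
facility_counts = [50, 48, 5, 4, 30, 31, 12, 11, 20, 21]                  # vulnerability axis
risk = risk_levels(predicted_outbreaks, facility_counts)
print(risk)  # -> [5, 5, 2, 2, 12, 12, 8, 8, 15, 15]
```

With five classes per axis the score ranges from 1 to 25, so the highest-risk areas are those that rank high on both hazard and vulnerability.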

A study on the weight control behavior according to cluster types of the motivation to use social media among university students in the Jeonbuk area (전북지역 대학생의 소셜미디어 이용동기 유형에 따른 체중조절 행태 연구)

  • Jiyoon Lee;Sung Suk Chung;Jeong Ok Rho
    • Journal of Nutrition and Health
    • /
    • v.56 no.2
    • /
    • pp.203-216
    • /
    • 2023
  • Purpose: This study examines weight control behavior according to university students' motives for using social media. Methods: The participants were 447 university students in the Jeonbuk area. Collected data were analyzed using factor analysis, cluster analysis, analysis of variance, and χ² tests with SPSS v. 26.0. Considering the motives for using social media, we investigated the usage of social media, dietary behavior related to social media, and weight control behavior. Results: Using the K-clustering method, the motives for using social media were categorized into three clusters: cluster 1 was the interest-centered group, cluster 2 was the multipurpose information-seeking group, and cluster 3 was the relationship-centered group. Among the various social media sites, YouTube (86.8%), Instagram (76.1%), and Facebook (61.1%) were the most visited by the subjects. The dietary behavior related to social media in cluster 2 was significantly higher than in clusters 1 and 3 (p < 0.001). Clusters 1 and 2 showed significantly higher dissatisfaction with their weight (p < 0.05) and consequent interest in weight control than cluster 3 (p < 0.001). Cluster 2 used weight control-related information from social media significantly more than the other clusters (p < 0.05). Weight control experiences in clusters 1 and 2 were significantly higher than in cluster 3 (p < 0.001). Conclusion: Differences in dietary behavior related to social media and in weight control behavior were observed between cluster types of motivation to use social media. Based on the usage motives of university students and their behaviors, we propose that educational programs for weight control be conducted using social media.

Eco-environmental assessment in the Sembilan Archipelago, Indonesia: its relation to the abundance of humphead wrasse and coral reef fish composition

  • Amran Ronny Syam;Mujiyanto;Arip Rahman;Imam Taukhid;Masayu Rahmia Anwar Putri;Andri Warsa;Lismining Pujiyani Astuti;Sri Endah Purnamaningtyas;Didik Wahju Hendro Tjahjo;Yosmaniar;Umi Chodrijah;Dini Purbani;Adriani Sri Nastiti;Ngurah Nyoman Wiadnyana;Krismono;Sri Turni Hartati;Mahiswara;Safar Dody;Murdinah;Husnah;Ulung Jantama Wisha
    • Fisheries and Aquatic Sciences
    • /
    • v.26 no.12
    • /
    • pp.738-751
    • /
    • 2023
  • The Sembilan Archipelago is famous for its great biodiversity, in which the humphead wrasse (Cheilinus undulatus), locally named Napoleon fish, is the primary, economically important commodity; currently, environmental degradation is occurring due to anthropogenic activities. This study aimed to examine the eco-environmental parameters and assess their influence on the abundance of humphead wrasse and the composition of other coral reef fish in the Sembilan Archipelago. Direct field monitoring was performed using a visual census along a transect of approximately 1 km. Coral cover data collection and assessment were also carried out. A coastal water quality index (CWQI) was used to assess the water quality status. Furthermore, statistical analyses [hierarchical clustering, Pearson's correlation, principal component analysis (PCA), and canonical correspondence analysis (CCA)] were performed to examine the correlations among eco-environmental parameters. The Napoleon fish was found only at stations 1 and 2, with a density of about 3.8 Ind/ha, aligning with the dominant composition of the family Serranidae (covering more than 15% of the total community) and coinciding with higher coral mortality and lower reef fish abundance. The coral reef conditions were generally ideal for supporting marine life, with living coral cover above 50% at all stations. Based on the CWQI, water quality in the study area is categorized as good to excellent. Of the 60 parameter values examined, phytoplankton abundance, Napoleon fish, and temperature are highly correlated, with a correlation coefficient greater than 0.7, and statistically significant (F < 0.05). Although the adaptation of reef fish to water quality parameters varies greatly, the most influential parameters in shaping their composition in the study area are living corals, nitrites, ammonia, larval abundance, and temperature.

Estimation of GARCH Models and Performance Analysis of Volatility Trading System using Support Vector Regression (Support Vector Regression을 이용한 GARCH 모형의 추정과 투자전략의 성과분석)

  • Kim, Sun Woong;Choi, Heung Sik
    • Journal of Intelligence and Information Systems
    • /
    • v.23 no.2
    • /
    • pp.107-122
    • /
    • 2017
  • Volatility in stock market returns is a measure of investment risk. It plays a central role in portfolio optimization, asset pricing, and risk management, as well as in most theoretical financial models. Engle (1982) presented a pioneering paper on stock market volatility that explains the time-varying characteristics embedded in stock market return volatility. His model, Autoregressive Conditional Heteroscedasticity (ARCH), was generalized by Bollerslev (1986) as the GARCH model. Empirical studies have shown that GARCH models describe well the fat-tailed return distributions and the volatility clustering phenomenon that appear in stock prices. The parameters of GARCH models are generally estimated by maximum likelihood estimation (MLE) based on the standard normal density. However, since Black Monday in 1987, stock market prices have become very complex and noisy. Recent studies have started to apply artificial intelligence approaches to estimating the GARCH parameters as a substitute for MLE. This paper presents an SVR-based GARCH process and compares it with the MLE-based GARCH process for estimating the parameters of GARCH models, which are known to forecast stock market volatility well. The kernel functions used in the SVR estimation process are linear, polynomial, and radial. We analyzed the suggested models with the KOSPI 200 Index, which consists of 200 blue-chip stocks listed on the Korea Exchange. We sampled KOSPI 200 daily closing values from 2010 to 2015, giving 1,487 daily observations. We used 1,187 days to train the suggested GARCH models and the remaining 300 days as testing data. First, symmetric and asymmetric GARCH models were estimated by MLE. We forecasted KOSPI 200 Index return volatility, and the MSE metric shows better results for asymmetric GARCH models such as E-GARCH and GJR-GARCH. This is consistent with the documented non-normal return distribution characteristics of fat tails and leptokurtosis. 
Compared with the MLE estimation process, SVR-based GARCH models perform better in KOSPI 200 Index return volatility forecasting. The polynomial kernel function shows exceptionally low forecasting accuracy. We suggest an Intelligent Volatility Trading System (IVTS) that utilizes the forecasted volatility. The IVTS entry rules are as follows. If tomorrow's volatility is forecasted to increase, buy volatility today. If tomorrow's volatility is forecasted to decrease, sell volatility today. If the forecasted volatility direction does not change, hold the existing buy or sell position. IVTS is assumed to buy and sell historical volatility values. This is somewhat unrealistic because historical volatility values themselves cannot be traded. However, our simulation results are meaningful because the Korea Exchange introduced volatility futures contracts in November 2014. The trading systems with SVR-based GARCH models show higher returns than those with MLE-based GARCH models in the testing period. The percentage of profitable trades ranges from 47.5% to 50.0% for MLE-based GARCH IVTS models and from 51.8% to 59.7% for SVR-based GARCH IVTS models. MLE-based symmetric S-GARCH shows a +150.2% return and SVR-based symmetric S-GARCH shows a +526.4% return. MLE-based asymmetric E-GARCH shows a -72% return and SVR-based asymmetric E-GARCH shows a +245.6% return. MLE-based asymmetric GJR-GARCH shows a -98.7% return and SVR-based asymmetric GJR-GARCH shows a +126.3% return. The linear kernel function shows higher trading returns than the radial kernel function. The best performance of the SVR-based IVTS is +526.4% and that of the MLE-based IVTS is +150.2%. The SVR-based GARCH IVTS shows a higher trading frequency. This study has some limitations. Our models are based solely on SVR. Other artificial intelligence models should be explored in search of better performance. We do not consider costs incurred in the trading process, including brokerage commissions and slippage costs. 
IVTS trading performance is unrealistic since we use historical volatility values as trading objects. Accurate forecasting of stock market volatility is essential in real trading as well as in asset pricing models. Further studies on other machine learning-based GARCH models can provide better information for stock market investors.
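The IVTS entry rules stated in this abstract can be sketched as a small simulation. This is one possible reading, not the authors' implementation: the function names, the sample series, the flat starting position, and the profit definition (position times the next-day change in volatility, with no trading costs) are assumptions made for illustration.

```python
def ivts_positions(current_vol, forecast_next_vol):
    """Return +1 (long volatility), -1 (short volatility) or 0 (flat) for each day."""
    positions = []
    position = 0  # assumed to start flat
    for today, forecast in zip(current_vol, forecast_next_vol):
        if forecast > today:
            position = 1    # volatility forecasted to rise: buy (or stay long)
        elif forecast < today:
            position = -1   # volatility forecasted to fall: sell (or stay short)
        # otherwise: no change in direction, hold the existing position
        positions.append(position)
    return positions

def simulate(realized_vol, forecast_next_vol):
    """Daily profit = position held over the day x realized change in volatility."""
    positions = ivts_positions(realized_vol[:-1], forecast_next_vol)
    changes = [nxt - cur for cur, nxt in zip(realized_vol, realized_vol[1:])]
    pnl = [pos * chg for pos, chg in zip(positions, changes)]
    return positions, pnl

# Hypothetical series: realized volatility for 5 days and the 4 one-day-ahead forecasts.
realized = [10.0, 12.0, 12.0, 9.0, 10.0]
forecasts = [11.0, 12.0, 10.0, 9.5]  # forecasts[t] is made on day t for day t + 1
positions, pnl = simulate(realized, forecasts)
print(positions)  # -> [1, 1, -1, 1]
print(pnl)        # -> [2.0, 0.0, 3.0, 1.0]
```

As in the abstract, the sketch trades the volatility values themselves and ignores commissions and slippage, so its profits overstate what a real strategy on volatility futures would earn.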

Effects of Customers' Relationship Networks on Organizational Performance: Focusing on Facebook Fan Page (고객 간 관계 네트워크가 조직성과에 미치는 영향: 페이스북 기업 팬페이지를 중심으로)

  • Jeon, Su-Hyeon;Kwahk, Kee-Young
    • Journal of Intelligence and Information Systems
    • /
    • v.22 no.2
    • /
    • pp.57-79
    • /
    • 2016
  • The number of users of social network services (SNS), one of the social media channels, is rising. Following this trend, more companies have taken an interest in this networking platform and started to invest in it. SNS has received much attention as a tool for spreading and expanding the messages that a company wants to deliver to its customers, and it has been recognized as an important channel for relationship marketing. Today's radically changing media environment makes it possible for companies to approach their customers in various ways. In particular, the rapidly developing social network service provides an environment in which customers can freely talk about products. For companies, it also works as a channel that delivers customized information to customers. To succeed in the online environment, companies need not only to build relationships with customers but also to focus on the relationships among customers. In response to an online environment shaped by the continuous development of technology, companies have tirelessly devised novel marketing strategies. In particular, as one-to-one marketing has become available, it has become more important for companies to maintain relationship marketing with their customers. Among the many SNS, Facebook, which many companies use as a communication channel, provides a fan page service that supports each company's business. A Facebook fan page is a platform on which events, information, and announcements can be shared with customers using text, videos, and pictures. Companies open their own fan pages to publicize themselves and their businesses. Such a page functions as a company website and also has the characteristics of a brand community, such as a blog. 
As Facebook has become a major communication medium with customers, companies recognize its importance as an effective marketing channel, but they still need to investigate how their use of Facebook affects business performance. Although the Facebook fan page has great potential because it also functions as a community among users, which other platforms do not, analysis of companies' Facebook fan pages as communities remains incomplete. This study explores the relationships among customers through the network of Facebook fan page users. Previous studies on company Facebook fan pages focused on finding effective operational directions by analyzing how companies use them. In contrast, this study applies social network analysis to derive structural variables of the network that can measure customer commitment, and empirically investigates the influence of the network's structural characteristics on companies' business performance. For each company's Facebook fan page, the network of users who engaged in communication with the company is extracted as a one-mode, undirected, binary network in which users are the nodes and their relationships through marketing activities are the links. From this network, the study derives structural variables that can explain the commitment of customers who pressed "like," commented on, or shared the company's Facebook marketing messages, by calculating density, global clustering coefficient, mean geodesic distance, and diameter. Using companies' historical performance measures, such as net income and Tobin's Q, as outcome variables, the study investigates the influence on business performance. 
For this purpose, network data were collected for 54 KOSPI-listed companies that posted more than 100 articles on their Facebook fan pages during the data collection period, and the network indicators of each company were derived. The performance indicators were calculated from the values posted on the DART website of the Financial Supervisory Service. From an academic perspective, this study suggests a new approach, based on social network analysis, to researchers who study the business use of social media channels. From a practical perspective, it proposes more substantive marketing performance measures to companies that conduct marketing through social media, and the network indicators are expected to provide a foundation for establishing smart business strategies.
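The four structural variables named in this abstract (density, global clustering coefficient, mean geodesic distance, diameter) can be computed for an undirected binary network as sketched below. The tiny edge list is hypothetical, and the handling of unreachable pairs (they are simply ignored) is an assumption of this example rather than something the abstract specifies.

```python
from collections import deque
from itertools import combinations

def build_graph(edges):
    """Adjacency sets for an undirected binary network; self-loops are skipped."""
    adj = {}
    for u, v in edges:
        if u == v:
            continue
        adj.setdefault(u, set()).add(v)
        adj.setdefault(v, set()).add(u)
    return adj

def density(adj):
    """Observed links divided by the n(n-1)/2 possible links."""
    n = len(adj)
    m = sum(len(neighbors) for neighbors in adj.values()) / 2
    return 2 * m / (n * (n - 1)) if n > 1 else 0.0

def global_clustering(adj):
    """Share of connected triples that are closed (3 x triangles / triples)."""
    closed = triples = 0
    for neighbors in adj.values():
        for u, w in combinations(neighbors, 2):
            triples += 1
            if w in adj[u]:
                closed += 1
    return closed / triples if triples else 0.0

def shortest_path_lengths(adj, source):
    """Breadth-first search distances from source to every reachable node."""
    dist = {source: 0}
    queue = deque([source])
    while queue:
        node = queue.popleft()
        for neighbor in adj[node]:
            if neighbor not in dist:
                dist[neighbor] = dist[node] + 1
                queue.append(neighbor)
    return dist

def geodesic_summary(adj):
    """Mean geodesic distance and diameter over reachable pairs of distinct nodes."""
    lengths = []
    for source in adj:
        dist = shortest_path_lengths(adj, source)
        lengths.extend(d for node, d in dist.items() if node != source)
    if not lengths:
        return 0.0, 0
    return sum(lengths) / len(lengths), max(lengths)

# Hypothetical fan-page network: users a, b, c interact with each other; d only with c.
fan_network = build_graph([("a", "b"), ("b", "c"), ("a", "c"), ("c", "d")])
mean_distance, diameter = geodesic_summary(fan_network)
print(density(fan_network), global_clustering(fan_network), mean_distance, diameter)
```

For this example the density is 4 links out of 6 possible, the clustering coefficient is 3 closed triples out of 5, the mean geodesic distance is 8/6, and the diameter is 2.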

Characterization of Traits Related to Grain Shape in Korean Rice Varieties (국내 육성 벼 품종 입형 관련 특성 분석)

  • Lee, Chang-Min;Lee, Keon-Mi;Baek, Man-Kee;Kim, Woo-Jae;Suh, Jung-Pil;Jeong, Oh-Young;Cho, Young-Chan;Park, Hyun-Su;Kim, Suk-Man
    • KOREAN JOURNAL OF CROP SCIENCE
    • /
    • v.65 no.3
    • /
    • pp.199-213
    • /
    • 2020
  • Grain size and shape are two important components contributing to rice yield and quality. To analyze traits related to grain shape, a total of 272 varieties derived from japonica, japonica black, and Tongil-type rice accessions in Korea were evaluated in this study. The traits grain length (GL), grain width (GW), grain thickness (GT), length-to-width ratio (RLW), and 1000-grain weight (TGW) were measured with 10 replications. Genes related to grain shape (GW2, GS3, qGL3, qSW5, GS5, TGW6, GW7, and GW8) were validated in the accessions using specific DNA marker sets. K-means clustering of the accessions based on phenotypic data revealed three groups: group 1 was classified by GW and GT and included most of the japonica type; group 2 was classified by RLW and GL, was of medium size, and had a half spindle shape; and group 3 was classified by TGW, was of long size, and had a semi-round shape. In validation tests using the marker sets, both gw2 and tgw6 were validated in less than 1% of the tested accessions, and two allelic types, qgl3 and gw8, were verified only in Tongil-type accessions. For GW8 and GW2, no different amplicons were amplified in any japonica or Tongil-type accessions, respectively. To suggest representative grain-shape gene combinations for each ecotype, the allelic combinations were evaluated by PCR analysis. Cj1 and 2 in japonica (Cj1-7), Cj_b1 and 2 in japonica-black (Cj_b1-3), and CT3 in Tongil-type (CT1-13) turned out to be the dominant combinations in their respective ecotypes. In addition, the results revealed that introgression of four genes (gw2, gs3, qSW5, and GS5) would expand the diversity of grain shape in Korean japonica varieties. The gene combination information could be used in practice to understand or enhance grain shape in japonica rice breeding programs.