• Title/Summary/Keyword: Multidimensional Scaling(MDS)

Search Result 125, Processing Time 0.019 seconds

User-Perspective Issue Clustering Using Multi-Layered Two-Mode Network Analysis (다계층 이원 네트워크를 활용한 사용자 관점의 이슈 클러스터링)

  • Kim, Jieun;Kim, Namgyu;Cho, Yoonho
    • Journal of Intelligence and Information Systems
    • /
    • v.20 no.2
    • /
    • pp.93-107
    • /
    • 2014
  • In this paper, we report what we have observed with regard to user-perspective issue clustering based on multi-layered two-mode network analysis. This work is significant in the context of data collection by companies about customer needs. Most companies have failed to uncover such needs for products or services properly in terms of demographic data such as age, income levels, and purchase history. Because of excessive reliance on limited internal data, most recommendation systems do not provide decision makers with appropriate business information for current business circumstances. However, part of the problem is the increasing regulation of personal data gathering and privacy. This makes demographic or transaction data collection more difficult, and is a significant hurdle for traditional recommendation approaches because these systems demand a great deal of personal data or transaction logs. Our motivation for presenting this paper to academia is our strong belief, and evidence, that most customers' requirements for products can be effectively and efficiently analyzed from unstructured textual data such as Internet news text. In order to derive users' requirements from textual data obtained online, the proposed approach in this paper attempts to construct double two-mode networks, such as a user-news network and news-issue network, and to integrate these into one quasi-network as the input for issue clustering. One of the contributions of this research is the development of a methodology utilizing enormous amounts of unstructured textual data for user-oriented issue clustering by leveraging existing text mining and social network analysis. In order to build multi-layered two-mode networks of news logs, we need some tools such as text mining and topic analysis. We used not only SAS Enterprise Miner 12.1, which provides a text miner module and cluster module for textual data analysis, but also NetMiner 4 for network visualization and analysis. Our approach for user-perspective issue clustering is composed of six main phases: crawling, topic analysis, access pattern analysis, network merging, network conversion, and clustering. In the first phase, we collect visit logs for news sites by crawler. After gathering unstructured news article data, the topic analysis phase extracts issues from each news article in order to build an article-news network. For simplicity, 100 topics are extracted from 13,652 articles. In the third phase, a user-article network is constructed with access patterns derived from web transaction logs. The double two-mode networks are then merged into a quasi-network of user-issue. Finally, in the user-oriented issue-clustering phase, we classify issues through structural equivalence, and compare these with the clustering results from statistical tools and network analysis. An experiment with a large dataset was performed to build a multi-layer two-mode network. After that, we compared the results of issue clustering from SAS with that of network analysis. The experimental dataset was from a web site ranking site, and the biggest portal site in Korea. The sample dataset contains 150 million transaction logs and 13,652 news articles of 5,000 panels over one year. User-article and article-issue networks are constructed and merged into a user-issue quasi-network using Netminer. Our issue-clustering results applied the Partitioning Around Medoids (PAM) algorithm and Multidimensional Scaling (MDS), and are consistent with the results from SAS clustering. In spite of extensive efforts to provide user information with recommendation systems, most projects are successful only when companies have sufficient data about users and transactions. Our proposed methodology, user-perspective issue clustering, can provide practical support to decision-making in companies because it enhances user-related data from unstructured textual data. To overcome the problem of insufficient data from traditional approaches, our methodology infers customers' real interests by utilizing web transaction logs. In addition, we suggest topic analysis and issue clustering as a practical means of issue identification.

Characteristics of Benthic Environment and Polychaete Communities of Gamak Bay, Korea (가막만의 저서환경과 다모류군집 특성)

  • Yoon, Sang-Pil;Jung, Rae-Hong;Kim, Youn-Jung;Kim, Seong-Soo;Lee, Jae-Seong;Park, Jong-Soo;Lee, Won-Chan;Choi, Woo-Jung
    • The Sea:JOURNAL OF THE KOREAN SOCIETY OF OCEANOGRAPHY
    • /
    • v.12 no.4
    • /
    • pp.287-304
    • /
    • 2007
  • This study was carried out to investigate spatio-temporal variations of benthic environment and macrobenthic polychaete communities in Gamak Bay where excessive organic matters from untreated sewage effluents and fish and shellfish farming activities have been accumulated in certain regions. Such environmental variables as sediment composition, organic content, acid volatile sulfide and dissolved oxygen content etc. were measured and polychaete specimens were taken in September 1999 and February 2000. In September 1999, organic contents were up to three times higher in the northwestern part of the bay and the area closed to Gukdong harbor than in the rest of the bay. In particular, benthic environment of the northwestern part of the bay was extremely deteriorated by the occurrence of hypoxia and highly concentrated sulfide. Of 28 stations investigated in summer, 5 stations located in the northwestern part were azoic. In the rest of the stations, a total of 119 polychaete species were sampled with a mean density of 900 $ind./m^2$. While species richness was higher in the entrance and central part of the bay where water exchanges with open sea were relatively active, density was higher in the moderately enriched stations neighbouring Gukdong harbor and fish farms where such potential indicators of organic pollution as Aphelochaeta monilaris, Lumbrineris longifolia were largely dominant. In February 2000, a total of 81 polychaete species appeared with a mean density of 2,802 $ind./m^2$ from 12 stations sampled in winter. Azoic areas were recolonized by Capitella capitata, Pseudopolydora paucibranchiata and Dipolydora socialis which showed maximum density in the innermost station. On the other hand, the rest of the stations were dominated by Euchone alicaudata and Praxillella affinis. The configuration of the stations sampled in summer and winter on the multidimensional scaling plots reflected the position of each stations within the bay in which there were great differences in organic content, sulfide concentration and oxygen content rather than in sediment composition. Therefore, heterogeneities in polychaete community structure in Gamak Bay were primarily influenced by the degree of the disturbance depending on the position within the bay.

Impact of Fish Farming on Macrobenthic Polychaete Communities (해상 가두리 양식이 저서 다모류군집에 미치는 영향)

  • Jung, Rae-Hong;Yoon, Sang-Pil;Kwon, Jung-No;Lee, Jae-Seong;Lee, Won-Chan;Koo, Jun-Ho;Kim, Youn-Jung;Oh, Hyun-Taik;Hong, Sok-Jin;Park, Sung-Eun
    • The Sea:JOURNAL OF THE KOREAN SOCIETY OF OCEANOGRAPHY
    • /
    • v.12 no.3
    • /
    • pp.159-169
    • /
    • 2007
  • Excessive input of organic matters from fish cage farms to the coastal waters has been considered as one of the major factors disturbing their benthic ecosystem. Sediment samples were taken from around the two fish cage zones (A and B) in Tongyeong coast in June and August 2003, to evaluate the ecological impacts of fish cage farming activity on the macrobenthic polychaete communities. Polychaete accounted for $81{\sim}87%$ of the total macrofauna individuals from each of the sampling stations. The number of species, abundance, diversity and dominant species of polychaete were rapidly changed with the distance from the fish cages. Within 10 m from the fish cages, Capitella capitata, which is a bio-indicator for the highly enriched sediments, was a dominant species and the lowest diversity was recorded. In particular, the maximum density (${\sim}18,410\;ind.m^2$) of C. capitata was found at Farm A where fish cages were more densely established within a semi-enclosed bay system. The sampling zone between 10 m and 15 m showed a rapid decrease of C. capitata with a rapid increase of the numbers of species, implying that this zone may be an ecotone point from a highly to a slightly enriched area. In the sampling zone between 15 m and 60 m, a transitional zone, which represents slightly enriched condition before normal one, was observed with additional increase and maintenance of the number of species and density of polychaete. In addition, the potential bio-indicators of organic enrichment, such as Lumbrineris longifolia and Aphelochaeta monilaris were the predominant species in the sampling zone. Multidimensional scaling (MDS) ordination plots and k-dominance curves confirmed the above results on the gradual changes in the macrobenthic polychaete communities. Our findings suggest that the magnitude of impact of fish cage farming activity on polychaete communities is probably governed by a distance from fish cage, density of fish cage and geomorphological characteristics around fish cage farm.

Perceptional Change of a New Product, DMB Phone

  • Kim, Ju-Young;Ko, Deok-Im
    • Journal of Global Scholars of Marketing Science
    • /
    • v.18 no.3
    • /
    • pp.59-88
    • /
    • 2008
  • Digital Convergence means integration between industry, technology, and contents, and in marketing, it usually comes with creation of new types of product and service under the base of digital technology as digitalization progress in electro-communication industries including telecommunication, home appliance, and computer industries. One can see digital convergence not only in instruments such as PC, AV appliances, cellular phone, but also in contents, network, service that are required in production, modification, distribution, re-production of information. Convergence in contents started around 1990. Convergence in network and service begins as broadcasting and telecommunication integrates and DMB(digital multimedia broadcasting), born in May, 2005 is the symbolic icon in this trend. There are some positive and negative expectations about DMB. The reason why two opposite expectations exist is that DMB does not come out from customer's need but from technology development. Therefore, customers might have hard time to interpret the real meaning of DMB. Time is quite critical to a high tech product, like DMB because another product with same function from different technology can replace the existing product within short period of time. If DMB does not positioning well to customer's mind quickly, another products like Wibro, IPTV, or HSPDA could replace it before it even spreads out. Therefore, positioning strategy is critical for success of DMB product. To make correct positioning strategy, one needs to understand how consumer interprets DMB and how consumer's interpretation can be changed via communication strategy. In this study, we try to investigate how consumer perceives a new product, like DMB and how AD strategy change consumer's perception. More specifically, the paper segment consumers into sub-groups based on their DMB perceptions and compare their characteristics in order to understand how they perceive DMB. And, expose them different printed ADs that have messages guiding consumer think DMB in specific ways, either cellular phone or personal TV. Research Question 1: Segment consumers according to perceptions about DMB and compare characteristics of segmentations. Research Question 2: Compare perceptions about DMB after AD that induces categorization of DMB in direction for each segment. If one understand and predict a direction in which consumer perceive a new product, firm can select target customers easily. We segment consumers according to their perception and analyze characteristics in order to find some variables that can influence perceptions, like prior experience, usage, or habit. And then, marketing people can use this variables to identify target customers and predict their perceptions. If one knows how customer's perception is changed via AD message, communication strategy could be constructed properly. Specially, information from segmented customers helps to develop efficient AD strategy for segment who has prior perception. Research framework consists of two measurements and one treatment, O1 X O2. First observation is for collecting information about consumer's perception and their characteristics. Based on first observation, the paper segment consumers into two groups, one group perceives DMB similar to Cellular phone and the other group perceives DMB similar to TV. And compare characteristics of two segments in order to find reason why they perceive DMB differently. Next, we expose two kinds of AD to subjects. One AD describes DMB as Cellular phone and the other Ad describes DMB as personal TV. When two ADs are exposed to subjects, consumers don't know their prior perception of DMB, in other words, which subject belongs 'similar-to-Cellular phone' segment or 'similar-to-TV' segment? However, we analyze the AD's effect differently for each segment. In research design, final observation is for investigating AD effect. Perception before AD is compared with perception after AD. Comparisons are made for each segment and for each AD. For the segment who perceives DMB similar to TV, AD that describes DMB as cellular phone could change the prior perception. And AD that describes DMB as personal TV, could enforce the prior perception. For data collection, subjects are selected from undergraduate students because they have basic knowledge about most digital equipments and have open attitude about a new product and media. Total number of subjects is 240. In order to measure perception about DMB, we use indirect measurement, comparison with other similar digital products. To select similar digital products, we pre-survey students and then finally select PDA, Car-TV, Cellular Phone, MP3 player, TV, and PSP. Quasi experiment is done at several classes under instructor's allowance. After brief introduction, prior knowledge, awareness, and usage about DMB as well as other digital instruments is asked and their similarities and perceived characteristics are measured. And then, two kinds of manipulated color-printed AD are distributed and similarities and perceived characteristics for DMB are re-measured. Finally purchase intension, AD attitude, manipulation check, and demographic variables are asked. Subjects are given small gift for participation. Stimuli are color-printed advertising. Their actual size is A4 and made after several pre-test from AD professionals and students. As results, consumers are segmented into two subgroups based on their perceptions of DMB. Similarity measure between DMB and cellular phone and similarity measure between DMB and TV are used to classify consumers. If subject whose first measure is less than the second measure, she is classified into segment A and segment A is characterized as they perceive DMB like TV. Otherwise, they are classified as segment B, who perceives DMB like cellular phone. Discriminant analysis on these groups with their characteristics of usage and attitude shows that Segment A knows much about DMB and uses a lot of digital instrument. Segment B, who thinks DMB as cellular phone doesn't know well about DMB and not familiar with other digital instruments. So, consumers with higher knowledge perceive DMB similar to TV because launching DMB advertising lead consumer think DMB as TV. Consumers with less interest on digital products don't know well about DMB AD and then think DMB as cellular phone. In order to investigate perceptions of DMB as well as other digital instruments, we apply Proxscal analysis, Multidimensional Scaling technique at SPSS statistical package. At first step, subjects are presented 21 pairs of 7 digital instruments and evaluate similarity judgments on 7 point scale. And for each segment, their similarity judgments are averaged and similarity matrix is made. Secondly, Proxscal analysis of segment A and B are done. At third stage, get similarity judgment between DMB and other digital instruments after AD exposure. Lastly, similarity judgments of group A-1, A-2, B-1, and B-2 are named as 'after DMB' and put them into matrix made at the first stage. Then apply Proxscal analysis on these matrixes and check the positional difference of DMB and after DMB. The results show that map of segment A, who perceives DMB similar as TV, shows that DMB position closer to TV than to Cellular phone as expected. Map of segment B, who perceive DMB similar as cellular phone shows that DMB position closer to Cellular phone than to TV as expected. Stress value and R-square is acceptable. And, change results after stimuli, manipulated Advertising show that AD makes DMB perception bent toward Cellular phone when Cellular phone-like AD is exposed, and that DMB positioning move towards Car-TV which is more personalized one when TV-like AD is exposed. It is true for both segment, A and B, consistently. Furthermore, the paper apply correspondence analysis to the same data and find almost the same results. The paper answers two main research questions. The first one is that perception about a new product is made mainly from prior experience. And the second one is that AD is effective in changing and enforcing perception. In addition to above, we extend perception change to purchase intention. Purchase intention is high when AD enforces original perception. AD that shows DMB like TV makes worst intention. This paper has limitations and issues to be pursed in near future. Methodologically, current methodology can't provide statistical test on the perceptual change, since classical MDS models, like Proxscal and correspondence analysis are not probability models. So, a new probability MDS model for testing hypothesis about configuration needs to be developed. Next, advertising message needs to be developed more rigorously from theoretical and managerial perspective. Also experimental procedure could be improved for more realistic data collection. For example, web-based experiment and real product stimuli and multimedia presentation could be employed. Or, one can display products together in simulated shop. In addition, demand and social desirability threats of internal validity could influence on the results. In order to handle the threats, results of the model-intended advertising and other "pseudo" advertising could be compared. Furthermore, one can try various level of innovativeness in order to check whether it make any different results (cf. Moon 2006). In addition, if one can create hypothetical product that is really innovative and new for research, it helps to make a vacant impression status and then to study how to form impression in more rigorous way.

  • PDF

Temporal and Spatial Distribution of Benthic Polychaetous Communities in Seomjin River Estuary (섬진강 하구역 저서다모류군집의 시·공간 분포)

  • Kang, Sung Hyo;Lee, Jung Ho;Park, Sung Wan;Shin, Hyun Chool
    • The Sea:JOURNAL OF THE KOREAN SOCIETY OF OCEANOGRAPHY
    • /
    • v.19 no.4
    • /
    • pp.243-255
    • /
    • 2014
  • This study was investigated to estimate the relations between benthic environments and benthic polychaetous community from April 2012 to February 2013. Twenty four stations were selected sequentially with Seomjin River Estuary from the northern part of Gwangyang Bay. The study area could be divided into three characteristic zones based on salinity, water temperature, dissolved oxygen and pH such as Saline Water Zone (SWZ), Brackish Water Zone (BWZ), and Fresh Water Zone (FWZ). Salinity was above 30.0 psu in SWZ, drastically decreased toward inland in BWZ, and nearly zero psu in FWZ. SWZ showed its specific environmental characters like that water temperature fluctuated with little seasonal change and DO showed the lowest values among three zones, and pH maintained as consistent value without seasonal fluctuation. In FWZ, on the other hand, water temperature showed high seasonal fluctuation, DO showed the highest values among three zones, and pH fluctuated greatly. In sedimentary environment, mud, sand and sand/gravel were found as dominant sedimentary deposits in SWZ, BWZ and FWZ, respectively. Organic matter content and AVS in surface sediment were high in SWZ, while Chl-a content high in FWZ. This study area showed a marked environmental difference between FWZ and SWZ as follows: FWZ has coarse sediment and low salinity, low organic matter content, low AVS in FWZ but SWZ has fine sediment and high salinity, high organic matter content and AVS. Species number and mean density of benthic polychaete community was highest in Saline Water Zone (SWZ), drastically decreased in Brackish Water Zone (BWZ), and lowest in Fresh Water Zone (FWZ). Dominant polychates above 5.0% of individual numbers were 6 taxa. Lumbrineris longifolia, Prionospio cirrifera, Tharyx sp. occurred as main dominant species of all study periods, and Hediste sp., Praxillella affinis, Tylorrhynchus sp. dominantly occurred at some seasons. Inhabiting areas of dominant species were separated characteristically. Representative species in SWZ were Lumbrineris longifolia, Tharyx sp., Mediomastus sp.. Wide-appearing species between SWZ and BWZ were Prionospio cirrifera, Heteromastus filiformis, Aricidea sp.. Characteristic species in FWZ were Tylorrhynchus sp. and Hediste sp.. As the results of cluster analysis and nMDS based on the species composition of polychaetous community, unique station groups were established in SWZ and FWZ. Stations in BWZ were sub-divided into several groups with season. Pearson's correlation analysis and PCA between benthic environments and ecological characteristics of polychaetous community showed that salinity, sediment composition, organic content and dissolved oxygen played a role to determine the temporal and spatial distribution of the ecological characteristics as species number, mean density, abundance of main species, and ecological indices.