• Title/Summary/Keyword: multiple correspondence analysis

Search Result 60, Processing Time 0.031 seconds

Relationship between Plant Species Covers and Soil Chemical Properties in Poorly Controlled Waste Landfill Sites

  • Kim, Kee-Dae;Lee, Eun-Ju
    • Journal of Ecology and Environment
    • /
    • v.30 no.1
    • /
    • pp.39-47
    • /
    • 2007
  • The relationships between the cover of herbaceous species and 15 soil chemical properties (organic carbon contents, total N, available P, exchangeable K, Na, Ca and Mg, HCl-extractable Cd, Cr, Cu, Fe, Mn, Ni, Pb and Zn) in nine poorly controlled waste landfill sites in Korea were examined by correlation analysis and multiple regression equations. Species showed different patterns of correlation between their cover values and soil chemical properties. The cover of Ambrosia artemisiifolia var. elatior, Aster subulatus var. sandwicensis and Erechtites hieracifolia were negatively correlated with the contents of Fe, Mn and Ni within landfill soils. Total cover of all species in quadrats was positively correlated with the contents of Cd and negatively correlated with the contents of Mn and Fe from stepwise regression analysis with 15 soil properties. Canonical correspondence analysis demonstrated that the distribution of native and exotic plants on poorly controlled landfills was significantly influenced by the contents of Na and Ca in soils, respectively.

A Study on the Location of Game-themed Music by PC Game Genre : Focusing on types of music, structural elements of music, and images (PC게임 장르에 따른 게임 테마음악의 위치화 연구 : 음악종류, 음악의 구조적 요소, 이미지를 중심으로)

  • Park, Kwan-Ik;Hwang, Kyung-Ho;Lee, Hyung-Seok
    • Journal of Korea Game Society
    • /
    • v.20 no.2
    • /
    • pp.75-90
    • /
    • 2020
  • The purpose of this study is to identify the functional role of music, which is one of the main elements that make up PC games. To this end, theme music by genre of PC game was divided through analysis and then the relationship between variables was analyzed by visually positioning the distance between 'music type', 'structural element of music' and 'image' using multiple correspondence matching analysis. As a result, it was confirmed that there was a difference in location between the types of music by game genre, the structural elements of music, and the characteristics of the image.

A Study of Important Attribute Which Gives an Effect to Golf Club Membership Purchase (골프 회원권 구입에 영향을 미치는 중요속성에 관한 연구)

  • Ko, Ho-Seok;Kang, In-Won
    • Journal of Global Scholars of Marketing Science
    • /
    • v.15 no.1
    • /
    • pp.105-120
    • /
    • 2005
  • The research regarding the important attribute which gives an effect to golf club membership purchase. 1990's as the golf boom as percentage shows the rapid increase of the golf course visitor and it grows, from 2001 are passing over ten million people golf course visitors. When purchasing golf club membership, it is important from the research which it sees the attributes which it thinks probably are what, it grasped and the scope of research did Seoul and the metropolitan area golf user in the object and it executed an empirical.analysis. In order to raise the reality and a reliability of research also it executed the collection and a Question investigation and a empirical analysis of various statistical data. The result of research a frequency analysis, descriptive analysis, factor analysis, multiple regression analysis and a correspondence analysis used the SPSS/PC+ 10.0 which and is a statistics package program. The results show that four dimensions of image were derived from 14 important attributes, using a factor analysis. A correspondence analysis indicates that statistically significant relationships existed between some of Golf club membership factors with respect to the demographic variables of income and Golf frequency. A multiple regression analyses also indicate that most of Golf club membership factors had great impacts on visitors' recommend.

  • PDF

The Relationship among Brand Equity, Corporate personality, Attachment and Consumer behavior (관광목적지의 브랜드자산, 자아일치성, 애착 및 행동의도간의 관계)

  • Seo, Kyung-Do;Lee, Jung-Eun
    • Journal of Digital Convergence
    • /
    • v.11 no.10
    • /
    • pp.313-320
    • /
    • 2013
  • In this study of tourism destination brand equity and self-consistent interstellar significant attachment relationships and brand equity and behavioral intention was to determine the effect on the relationship. First, the tourism destination brand equity of the self-consistent sex tourists will have a significant impact is to test the hypothesis of a multiple regression analysis was conducted brand equity, loyalty, self image and gender matched only indicates the relationship was significant. Second, self Correspondence tourist destination tourists will have a significant effect on attachment. In order to verify the hypothesis that the multiple regression analysis was conducted for self-Correspondence attachment was significantly related shows. Third, the attachment of tourist destinations for travelers of action also will have a significant impact on. Hypotheses multiple regression analysis was conducted to attachment behavior also shows significant relationship was about.

A Bibliometric Approach for Department-Level Disciplinary Analysis and Science Mapping of Research Output Using Multiple Classification Schemes

  • Gautam, Pitambar
    • Journal of Contemporary Eastern Asia
    • /
    • v.18 no.1
    • /
    • pp.7-29
    • /
    • 2019
  • This study describes an approach for comparative bibliometric analysis of scientific publications related to (i) individual or several departments comprising a university, and (ii) broader integrated subject areas using multiple disciplinary schemes. It uses a custom dataset of scientific publications (ca. 15,000 articles and reviews, published during 2009-2013, and recorded in the Web of Science Core Collections) with author affiliations to the research departments, dedicated to science, technology, engineering, mathematics, and medicine (STEMM), of a comprehensive university. The dataset was subjected, at first, to the department level and discipline level analyses using the newly available KAKEN-L3 classification (based on MEXT/JSPS Grants-in-Aid system), hierarchical clustering, correspondence analysis to decipher the major departmental and disciplinary clusters, and visualization of the department-discipline relationships using two-dimensional stacked bar diagrams. The next step involved the creation of subsets covering integrated subject areas and a comparative analysis of departmental contributions to a specific area (medical, health and life science) using several disciplinary schemes: Essential Science Indicators (ESI) 22 research fields, SCOPUS 27 subject areas, OECD Frascati 38 subordinate research fields, and KAKEN-L3 66 subject categories. To illustrate the effective use of the science mapping techniques, the same subset for medical, health and life science area was subjected to network analyses for co-occurrences of keywords, bibliographic coupling of the publication sources, and co-citation of sources in the reference lists. The science mapping approach demonstrates the ways to extract information on the prolific research themes, the most frequently used journals for publishing research findings, and the knowledge base underlying the research activities covered by the publications concerned.

Sensibility Preference of Eco-Friendly Fabric Products and Trust Reliability (친환경 섬유의류 제품의 감성 선호도와 신뢰도 조사 연구)

  • Na, Young-Joo;Kim, Hyo-Won
    • Fashion & Textile Research Journal
    • /
    • v.14 no.3
    • /
    • pp.430-437
    • /
    • 2012
  • This study analyzed the sensibility of eco-friendly fabrics for college students and investigated their attitude on environmental problems, trust reliability onto eco-apparel products, and their purchase state. We tested 6 eco-friendly fabrics (recycled polyester, organic cotton, green tea, charcoal, bamboo, and nettle) through a survey using the Likert scale of 12 polar sensibility words. Most fabrics showed feelings that were smooth, natural, female, and country these were followed by fashion, cheap, functional, sustainable, warm, and vintage. In addition, nettle fabric showed 'rough' feeling, and recycled polyester fabric showed an 'artificial' feeling. Correspondence analysis showed the distance and direction between fabric types and sensibility words with a 2D diagram where the X axis was named with 'Soft <-> Hard' and Y axis was with 'Environmental <-> Manmade' to represent the relationship between fabric types and the sensibility words. According to the results of the multiple regression analysis, the cognition level of the consumer for environmental problems was found to be the most influential variable on the loyalty purchase of eco-friendly products; however, the trust reliability level of consumer onto eco-friendly apparel products was found to be the most influential variable on the conditional purchase of eco-friendly apparel products.

A Methodology for Automatic Multi-Categorization of Single-Categorized Documents (단일 카테고리 문서의 다중 카테고리 자동확장 방법론)

  • Hong, Jin-Sung;Kim, Namgyu;Lee, Sangwon
    • Journal of Intelligence and Information Systems
    • /
    • v.20 no.3
    • /
    • pp.77-92
    • /
    • 2014
  • Recently, numerous documents including unstructured data and text have been created due to the rapid increase in the usage of social media and the Internet. Each document is usually provided with a specific category for the convenience of the users. In the past, the categorization was performed manually. However, in the case of manual categorization, not only can the accuracy of the categorization be not guaranteed but the categorization also requires a large amount of time and huge costs. Many studies have been conducted towards the automatic creation of categories to solve the limitations of manual categorization. Unfortunately, most of these methods cannot be applied to categorizing complex documents with multiple topics because the methods work by assuming that one document can be categorized into one category only. In order to overcome this limitation, some studies have attempted to categorize each document into multiple categories. However, they are also limited in that their learning process involves training using a multi-categorized document set. These methods therefore cannot be applied to multi-categorization of most documents unless multi-categorized training sets are provided. To overcome the limitation of the requirement of a multi-categorized training set by traditional multi-categorization algorithms, we propose a new methodology that can extend a category of a single-categorized document to multiple categorizes by analyzing relationships among categories, topics, and documents. First, we attempt to find the relationship between documents and topics by using the result of topic analysis for single-categorized documents. Second, we construct a correspondence table between topics and categories by investigating the relationship between them. Finally, we calculate the matching scores for each document to multiple categories. The results imply that a document can be classified into a certain category if and only if the matching score is higher than the predefined threshold. For example, we can classify a certain document into three categories that have larger matching scores than the predefined threshold. The main contribution of our study is that our methodology can improve the applicability of traditional multi-category classifiers by generating multi-categorized documents from single-categorized documents. Additionally, we propose a module for verifying the accuracy of the proposed methodology. For performance evaluation, we performed intensive experiments with news articles. News articles are clearly categorized based on the theme, whereas the use of vulgar language and slang is smaller than other usual text document. We collected news articles from July 2012 to June 2013. The articles exhibit large variations in terms of the number of types of categories. This is because readers have different levels of interest in each category. Additionally, the result is also attributed to the differences in the frequency of the events in each category. In order to minimize the distortion of the result from the number of articles in different categories, we extracted 3,000 articles equally from each of the eight categories. Therefore, the total number of articles used in our experiments was 24,000. The eight categories were "IT Science," "Economy," "Society," "Life and Culture," "World," "Sports," "Entertainment," and "Politics." By using the news articles that we collected, we calculated the document/category correspondence scores by utilizing topic/category and document/topics correspondence scores. The document/category correspondence score can be said to indicate the degree of correspondence of each document to a certain category. As a result, we could present two additional categories for each of the 23,089 documents. Precision, recall, and F-score were revealed to be 0.605, 0.629, and 0.617 respectively when only the top 1 predicted category was evaluated, whereas they were revealed to be 0.838, 0.290, and 0.431 when the top 1 - 3 predicted categories were considered. It was very interesting to find a large variation between the scores of the eight categories on precision, recall, and F-score.

Standardizing and Visualizing Descriptive Summaries of Election Survey Data (선거 여론조사 자료의 표준적 요약과 시각화)

  • Huh, Myung-Hoe;Lee, Yong-Goo
    • The Korean Journal of Applied Statistics
    • /
    • v.21 no.5
    • /
    • pp.845-854
    • /
    • 2008
  • Survey reports of election opinions consist of numerous cross-tabulations between socio-demographic variables and political opinions including preferred candidates. Since socio-demographic variables are related each other, duplicate interpretations arise. The aim of this study is twofold: The first is to separate the effects of socio variables such as education, occupation and income from the effects of demographic variables such as region, sex and age. The second is the visualization of multiple cross-tabulations in low-dimensional space by extended doubling technique of correspondence analysis. Survey researchers may get some help from this study to present their survey results more lucidly and visually.

Preliminary Inspection Prediction Model to select the on-Site Inspected Foreign Food Facility using Multiple Correspondence Analysis (차원축소를 활용한 해외제조업체 대상 사전점검 예측 모형에 관한 연구)

  • Hae Jin Park;Jae Suk Choi;Sang Goo Cho
    • Journal of Intelligence and Information Systems
    • /
    • v.29 no.1
    • /
    • pp.121-142
    • /
    • 2023
  • As the number and weight of imported food are steadily increasing, safety management of imported food to prevent food safety accidents is becoming more important. The Ministry of Food and Drug Safety conducts on-site inspections of foreign food facilities before customs clearance as well as import inspection at the customs clearance stage. However, a data-based safety management plan for imported food is needed due to time, cost, and limited resources. In this study, we tried to increase the efficiency of the on-site inspection by preparing a machine learning prediction model that pre-selects the companies that are expected to fail before the on-site inspection. Basic information of 303,272 foreign food facilities and processing businesses collected in the Integrated Food Safety Information Network and 1,689 cases of on-site inspection information data collected from 2019 to April 2022 were collected. After preprocessing the data of foreign food facilities, only the data subject to on-site inspection were extracted using the foreign food facility_code. As a result, it consisted of a total of 1,689 data and 103 variables. For 103 variables, variables that were '0' were removed based on the Theil-U index, and after reducing by applying Multiple Correspondence Analysis, 49 characteristic variables were finally derived. We build eight different models and perform hyperparameter tuning through 5-fold cross validation. Then, the performance of the generated models are evaluated. The research purpose of selecting companies subject to on-site inspection is to maximize the recall, which is the probability of judging nonconforming companies as nonconforming. As a result of applying various algorithms of machine learning, the Random Forest model with the highest Recall_macro, AUROC, Average PR, F1-score, and Balanced Accuracy was evaluated as the best model. Finally, we apply Kernal SHAP (SHapley Additive exPlanations) to present the selection reason for nonconforming facilities of individual instances, and discuss applicability to the on-site inspection facility selection system. Based on the results of this study, it is expected that it will contribute to the efficient operation of limited resources such as manpower and budget by establishing an imported food management system through a data-based scientific risk management model.

Analysis of Location Patterns for Protected Horticulture (시설원예의 입지유형 분석)

  • 황한철;이남호;전우정;남상운;홍성구
    • Magazine of the Korean Society of Agricultural Engineers
    • /
    • v.40 no.2
    • /
    • pp.92-101
    • /
    • 1998
  • Location patterns of protected horticulture were analyzed using a multiple correspondence analysis(MCA). The analysis could be used in evaluating location suitability of protected horticulture. The location factors of the protected horticulture for MCA include land category, size of protected horticulture, land slope, topography, effictive soil depth, irrigation and drainage condition, distance from roads, and so forth. The results showed that there were three different location patterns of protected horticulture. The first pattern was characterized by their nearness to villages. The facilities of this pattern were mainly located near to residential area. The second pattern was of those found in plain area. The facilities of this pattern were large in scale and located in paddy field far from residential area. The facilities of the last pattern were small in scale and located on nonpaddy fields. They were mostly found in hilly or mountainous area.

  • PDF