• Title/Summary/Keyword: Statistical Correspondence

Search Result 96, Processing Time 0.023 seconds

Pre-Evaluation for Prediction Accuracy by Using the Customer's Ratings in Collaborative Filtering (협업필터링에서 고객의 평가치를 이용한 선호도 예측의 사전평가에 관한 연구)

  • Lee, Seok-Jun;Kim, Sun-Ok
    • Asia pacific journal of information systems
    • /
    • v.17 no.4
    • /
    • pp.187-206
    • /
    • 2007
  • The development of computer and information technology has been combined with the information superhighway internet infrastructure, so information widely spreads not only in special fields but also in the daily lives of people. Information ubiquity influences the traditional way of transaction, and leads a new E-commerce which distinguishes from the existing E-commerce. Not only goods as physical but also service as non-physical come into E-commerce. As the scale of E-Commerce is being enlarged as well. It keeps people from finding information they want. Recommender systems are now becoming the main tools for E-Commerce to mitigate the information overload. Recommender systems can be defined as systems for suggesting some Items(goods or service) considering customers' interests or tastes. They are being used by E-commerce web sites to suggest products to their customers who want to find something for them and to provide them with information to help them decide which to purchase. There are several approaches of recommending goods to customer in recommender system but in this study, the main subject is focused on collaborative filtering technique. This study presents a possibility of pre-evaluation for the prediction performance of customer's preference in collaborative filtering before the process of customer's preference prediction. Pre-evaluation for the prediction performance of each customer having low performance is classified by using the statistical features of ratings rated by each customer is conducted before the prediction process. In this study, MovieLens 100K dataset is used to analyze the accuracy of classification. The classification criteria are set by using the training sets divided 80% from the 100K dataset. In the process of classification, the customers are divided into two groups, classified group and non classified group. To compare the prediction performance of classified group and non classified group, the prediction process runs the 20% test set through the Neighborhood Based Collaborative Filtering Algorithm and Correspondence Mean Algorithm. The prediction errors from those prediction algorithm are allocated to each customer and compared with each user's error. Research hypothesis : Two research hypotheses are formulated in this study to test the accuracy of the classification criterion as follows. Hypothesis 1: The estimation accuracy of groups classified according to the standard deviation of each user's ratings has significant difference. To test the Hypothesis 1, the standard deviation is calculated for each user in training set which is divided 80% from MovieLens 100K dataset. Four groups are classified according to the quartile of the each user's standard deviations. It is compared to test the estimation errors of each group which results from test set are significantly different. Hypothesis 2: The estimation accuracy of groups that are classified according to the distribution of each user's ratings have significant differences. To test the Hypothesis 2, the distributions of each user's ratings are compared with the distribution of ratings of all customers in training set which is divided 80% from MovieLens 100K dataset. It assumes that the customers whose ratings' distribution are different from that of all customers would have low performance, so six types of different distributions are set to be compared. The test groups are classified into fit group or non-fit group according to the each type of different distribution assumed. The degrees in accordance with each type of distribution and each customer's distributions are tested by the test of ${\chi}^2$ goodness-of-fit and classified two groups for testing the difference of the mean of errors. Also, the degree of goodness-of-fit with the distribution of each user's ratings and the average distribution of the ratings in the training set are closely related to the prediction errors from those prediction algorithms. Through this study, the customers who have lower performance of prediction than the rest in the system are classified by those two criteria, which are set by statistical features of customers ratings in the training set, before the prediction process.

Diagnostic Accuracy and Evaluation of Myocardial Viability by Cardiac Magnetic Resonance Imaging in Acute Myocardial Infarction: A Comparison with Thallium-201 Myocardial SPECT (급성심근경색증에서의 심장자기공명영상술의 진단 정확도와 심근 생존력 평가: TI-201 심근관류 SPECT와의 비교)

  • Kim Hye-seon;Park Dong Woo;Kim Yongsoo;Kim Young-sun;Choi Yo Won;Jeon Seok Chul;Seo Heung Suk;Hahm Chang Kok;Kim Soon Kil;Ahn You hern;Choi Yoon Young;Park Choong-Ki
    • Investigative Magnetic Resonance Imaging
    • /
    • v.7 no.2
    • /
    • pp.100-107
    • /
    • 2003
  • Purpose : To assess the usefulness of cardiac MR imaging (MRI) in the diagnosis of acute myocardial infarction and in the assessment of myocardial viability in comparision with T1-201 SPECT. Materials and Methods : We retrospectively studied 17 patients who complained of chest pain and dyspnea with cardiac MRI . The patients were evaluated for the presence or absence of high signal intensity on T2-weighted image (T2wI), abnormal wall motion on 2D-FIESTA, perfusion defect on Gd-DTPA enhanced T1WI, and delayed myocardial enhancement on 15-minutes delay Gd-DTPA enhanced T1WI. The results were correlated with the images on T1-201 SPECT, taken at rest and stress, through which reversibility of perfusion defect was assessed. Results : Both cardiac MRI and T1-201 SPECT proved to be useful methods for diagnosing acute myocardial infarction. In order of decreasing correspondence, T2WI, T1-201 SPECT, delayed enhancement study, and wall motion images all showed significant statistical correlation with the clinical diagnosis of myocardial infarction. Perfusion MRI, on the other hand, showed no significant statistical difference was found between T1-201 SPECT and cardiac MRI. The results on T2WI showed high accordance with those on Tl-201 SPECT, while delayed myocardial enhancement and wall motion studies showed no agreement with Tl-201 SPECT. Conclusion : Cardiac MRI is useful method for diagnosis of acute myocardiac infarction. With respect to the assessment of myocardial viability, the results obtained on cardiac MRI showed high agreement with those on Tl-201 SPECT. However, further study is necessary at this point for standardization and establishment of the methods for assessing myocardial viability on cardiac MRI.

  • PDF

Eco-environmental assessment in the Sembilan Archipelago, Indonesia: its relation to the abundance of humphead wrasse and coral reef fish composition

  • Amran Ronny Syam;Mujiyanto;Arip Rahman;Imam Taukhid;Masayu Rahmia Anwar Putri;Andri Warsa;Lismining Pujiyani Astuti;Sri Endah Purnamaningtyas;Didik Wahju Hendro Tjahjo;Yosmaniar;Umi Chodrijah;Dini Purbani;Adriani Sri Nastiti;Ngurah Nyoman Wiadnyana;Krismono;Sri Turni Hartati;Mahiswara;Safar Dody;Murdinah;Husnah;Ulung Jantama Wisha
    • Fisheries and Aquatic Sciences
    • /
    • v.26 no.12
    • /
    • pp.738-751
    • /
    • 2023
  • The Sembilan Archipelago is famous for its great biodiversity, in which the humphead wrasse (Cheilinus undulatus) (locally named Napoleon fish) is the primary commodity (economically important), and currently, the environmental degradation occurs due to anthropogenic activities. This study aimed to examine the eco-environmental parameters and assess their influence on the abundance of humphead wrasse and other coral reef fish compositions in the Sembilan Archipelago. Direct field monitoring was performed using a visual census throughout an approximately one km transect. Coral cover data collection and assessment were also carried out. A coastal water quality index (CWQI) was used to assess the water quality status. Furthermore, statistical-based analyses [hierarchical clustering, Pearson's correlation, principal component analysis (PCA), and canonical correspondence analysis (CCA)] were performed to examine the correlation between eco-environmental parameters. The Napoleon fish was only found at stations 1 and 2, with a density of about 3.8 Ind/ha, aligning with the dominant composition of the family Serranidae (covering more than 15% of the total community) and coinciding with the higher coral mortality and lower reef fish abundance. The coral reef conditions were generally ideal for supporting marine life, with a living coral percentage of about > 50% in all stations. Based on CWQI, the study area is categorized as good and excellent water quality. Of the 60 parameter values examined, the phytoplankton abundance, Napoleon fish, and temperature are highly correlated, with a correlation coefficient value greater than 0.7, and statistically significant (F < 0.05). Although the adaptation of reef fish to water quality parameters varies greatly, the most influential parameters in shaping their composition in the study area are living corals, nitrites, ammonia, larval abundance, and temperature.

The Structure of Plant Community in Jungdaesa-Birobong Area, Odaesan National Park (오대산국립공원 중대사-비로봉 구간 식물군집구조)

  • Han, Bong-ho;Choi, Jin-woo;Noh, Tai-hwan;Kim, Dong-wook
    • Korean Journal of Environment and Ecology
    • /
    • v.29 no.5
    • /
    • pp.764-776
    • /
    • 2015
  • This study aims to identify the structure of the plant community, and the ecological succession sere and the change in the forest ecosystem in Jungdaesa-Birobong area, Odaesan National Park_(i._e., located at high altitudes(over 1,000m)). It seeks to offer the basic data for the planning of vegetation management. In order to verify the status of the forest vegetation between Jungdaesa-Birobong, seventeen plots(size is $20m{\times}20m$) were set up as research sites at high altitudes. Importance value, distribution by diameter at breast height(DBH), the growth volume and age of the sample trees, similarity index and species diversity index of each survey plot were analysed. According to the results of DCA(Detrended Correspondence Analysis), one of the multivariate statistical techniques. It was found that the plant communities were classified into five groups: community I_(Quercus mongolica-Tilia amurensis community), community II_(Q. mongolica-Deciduous broad-leaved community), community III_(Q. mongolica-Pinus koraiensis community), community IV_(Abies holophylla-Q. mongolica community) and community V_(A. holophylla-Deciduous broad-leaved community). Community I which is dominated by Quercus mongolica and Deciduous broad-leaved communities is located at an altitude of over 1,300 meters(ranging from 1,335m to 1,495m), the community IV and V which are dominated by Abies holophylla are located at an altitude of under 1,200 meters(ranging from 1,115m to 1,175m) and the community II and III which include the main species of Quercus mongolica, Pinus koraiensis and Abies holophylla are located at an altitude of between 1,160 meters and 1,300 meters. The results showed that Quercus mongolica tends to have a higher importance value of woody species at a higher altitude while Abies holophylla tends to have higher importance value at a lower altitude. For the importance value woody species and -DBH class distribution, the communites I, II and III are expected to continuously maintain the present status. Whereas, for the influence of communities IV and V, Q. mongolica is predicted to be weakened. The age of sample trees was between 85 and 161; the average age was 123. The index of Shannon's Species diversity (H') showed heterogeneity was found among community I_(i._e., located at high altitude) and communities IV and V_(i._e., located at low altitude). As a results of analysing the index of Shannon's Species diversity (H': unit: $400m^2$), community III showed the highest diversity intex with 1.1109 followed by community II with 1.0475, community I with 1.0125, community IV with 0.9918 and community V with 0.8686. This study verified that the index of Shannon's species was significantly different by plant communities. For instance, when comparing the index of Shannon's species diversity in Quercus mongolica communities of this study and that of past relevant research, the value of index is very similar. However, the diversity index for the community which is dominated by Abies holophylla showed lower value when compared to the results from past relevant research.

Changes in Feed Value of Barley and Pea by Different Seeding Rates and Cutting Dates in Mixed Sowing Cultivation (보리와 완두의 혼파재배에서 혼파비율과 예취시기에 따른 사료가치의 변화)

  • Oh, Tae-Seok;Kim, Chang-Ho;Lee, Hyo-Won
    • KOREAN JOURNAL OF CROP SCIENCE
    • /
    • v.54 no.3
    • /
    • pp.279-286
    • /
    • 2009
  • This study carried out to find out feed value of barley plus pea mixture with different ratio and cutting date to got basic information when introduced the mixture as new cropping system in middle part of Korean peninsular. Dry matter (DM) yield increased as barley seeding rate was higher and showed the highest yield in the plots with barley 85% plus 15% ratio when harvested on May 16. There was no different in crude protein, available protein and digestible protein cutting on April 25 in every mixture, but the content increased with higher pea mixture rate after May 2. The content of acid detergent fiber (ADF) and neutral detergent fiber (NDF) increase coincided with higher barley rate and late cutting dates. But relative feed value (RFV) resulted in opposite trend. Higher pea ratio influenced increased content of total digestible nuterients (TDN), but decreased before May 9 cutting and increased after the next cutting regime. There was no statistical difference in P and Mg between sowing rate, but Ca increased at higher pea ratio and P, Ca, K decreased in all plots as harvests were delayed. The content of estimated net energy (ENE), net energy maintenance (NEM) and net energy gain (NEG) significantly increased with higher pea rate and earlier cutting. But net energy lactation (NEL) was no significant differences between seeding rates and cutting dates. In conclusion, mineral yield such as P, Ca, K and Mg showed the highest yield at barley plus pea ratio of 75 : 25 and energy yield of ENE, NEL, NEM, NEG and TDN was the highest at 85 to 15 mixture plots and DM yield, TDN yield, mineral yield such as P, Ca, K and Mg and energy yield of ENE, NEL, NEM, NEG were the highest on each treatment cutting on May 16.

Forest Vegetation Structure in Maruguem (the Ridge Line) Area of Gitdaebaegibong to Jukryeong, Baekdudaegan (백두대간(깃대배기봉-죽령 구간) 마루금 주변의 산림식생구조)

  • Song, Ju Hyeon;Yun, Chung Weon
    • Journal of Korean Society of Forest Science
    • /
    • v.108 no.2
    • /
    • pp.147-167
    • /
    • 2019
  • This study was conducted to analyze forest vegetation structure in the Marugeum (Ridge) area of Gitdaebaegibong to Jukryeong, Baekdudaegan. Data were collected in 298 quadrates through a Braun-Blanquet vegetation survey from April, 2018 to October, 2018. Forest vegetation was classified into 13 vegetation units. A Quercus mongolica community was divided into Morus bombycis, Filipendula glaberrima, Fraxinus sieboldiana, Prunus maackii unit and Q. mongolica typical unit. The M. bombycis unit was further classified into a Deutzia glabrata group and M. bombycis typical group. The F. glaberrima unit was subdivided into a Veratrum oxysepalum group, Arundinella hirta group, and F. glaberrima typical group. The F. sieboldiana unit was divided into a Pinus densiflora group, Larix kaempferi group, and F. sieboliana typical group. The relationship between vegetation units and environmental factors was studied through coincidence analysis and CCA. The F. glaberrima unit (VU 6~8) was distributed by elevation above 1,200 m and other vegetation units were distributed below 1,200 m. Results of the CCA analysis showed that the F. glaberrima unit distribution is positively correlated with elevation. As a result of species diversity, the F. glaberrima unit was higher than other vegetation units. A similarity index analysis revealed that the F. sieboldiana unit (VU 9~11) was relatively homogeneous, and the M. bombycis unit (VU 1~5) and A. girta group (VU 7) were relatively heterogeneous. A detrended correspondence analysis determined that the distance between the statistical axes of the M. bombycis and F. glaberrima units was the greatest, which is consistent with the analysis of the similarity index. As a result of interspecific correlation of major woody plants, hydrophilic species were positively correlated, and a negative correlation was found between Q. mongolica and intolerant species such as P. densiflora and L. kaempferi.