빅 데이터 분석을 위한 맵리듀스 알고리즘의 최근 연구 동향

  • Published : 2014.01.17

Abstract

Keywords

References

  1. J. B. MacQueen. Some Methods for classification and Analysis of Multivariate Observations. Proceedings of 5th Berkeley Symposium on Mathematical Statistics and Probability, Vol. 1, pp.281-297. 1967.
  2. Rakesh Agrawal, Johannes Gehrke, Dimitrios Gunopulos, Prabhakar Raghavan. Automatic subspace clustering of high dimensional data for data mining applications. ln SIGMOD, 1998.
  3. Martin Ester, Hans-Peter Kriegel, Jorg Sander, Xiaowei Xu. A density-based algorithm for discovering clusters in large spatial databases with noise. In KDD, 1996.
  4. Mihael Ankerst, Markus M. Breunig, Hans-Peter Kriegel, Jorg Sander. OPTICS: Ordering Points To Identify the Clustering Structure. In STGMOD, 1999.
  5. 장지훈, 박형민, 심규석, 맵리듀스 프레임웍을 이용한 서브스페이스클러스터링의 구현, 한국정보과학회학술발표눈문집, 제35권, 제2호, pp.98-103, 2008.
  6. Yaobin He, Haoyu Tan, Wuman Luo, Huajian Mao, Di Ma, Shengzhong Feng, Jianpillg Fan. MR-dbscan: An efficient parallel density-based clustering algorithm using MapReduce. In ICPADS, 2011.
  7. Younghoon Kim, Kyuseok Shim, Min-Soeng Kim, June Sup Lee, DBCURE-MR: An efficient density based clustering algorithm for large data using MapReduce, To appear to Information Systems.
  8. Tianyang Sun, Chengchun Shu, Feng Li, Haiyan Yu, Lili Ma, Yitong Fang. An efficient hierarchical clustering method for large datasets with Map-Reduce. In PDCAT,2009.
  9. Meghana Deodhar, Joydeep Ghosh. A framework for simultaneous co-clustering and learning from complex data. In KDD, 2007.
  10. Meghana Deodhar, Clinton Jones, and Joydeep Ghosh. Parallel Simultaneous Co-clustering and Learning with Map-Reduce. In IEEE International Conference on Granular Computing, 2010.
  11. Jiawei Han, Jian Pei, and Yiwen Yin. Mining frequent patterns without candidate generation. In SlGMOD, 2000.
  12. Jiawei Han, Jian Pei, Yiwen Yin. Mining frequent patterns without candidate generation. In SIGMOD, 2000.
  13. Biswanath Panda, Joshua. S. Herbach, Sugato Basu, and Roberto J. Bayardo. PLANET: Massively Parallel Learning of Tree Ensembles with MapReduce. In VLDB, 2012.
  14. 김영훈, 심규석,맵리듀스를 이용한 아다부스트 알고리즘,Telecommunication Reviews(SK Telecom), " 제21권, 제3호, 2011
  15. Chao Liu, Hung-Chih Yang, Jinliang Fan, Li-Wei He, and Yi-Min Wang. Distributed nonnegative matrix factorization for web-scale dyadic data analysis on mapreduce. In WWW, 2010.
  16. U Kang, Brendan Meeder, and Christos Faloutsos. Spectral analysis for billion-scale graphs: Discoveries and implementation. In PAKDD, 2011.
  17. U Kang, Charalampos E. Tsourakakis, and Christos Faloutsos. PEGASUS: Mining Peta-scale Graphs. Knowledge and Infonnation Systems, Vol. 27, No.2, pp. 303-325,2011. https://doi.org/10.1007/s10115-010-0305-0
  18. Siddharth Suri, and Sergei Vassilvitskii. Counting triangles and the curse of the last reducer. In WWW, 2011.
  19. Ha-Myung Park, and Chin-Wan Chung. An efficient MapReduce algorithm for counting triangles in a very large graph. In ClKM, 2013.
  20. 정우환, 김영훈, 심규석, 그래프의 삼각형 개수 계산을 위한 맵리듀스 알고리즘, 한국정보과학회 학술발표논문집, 제39권, 제2호, pp.16-18, 2012.
  21. James W. Demmel. Applied numerical linear algebra, SIAM, 1997.
  22. Abhinandan Das, Mayur Datar, Ashutosh Garg, and Shyam Rajaram. Google news personalization: scalable online collaborative filtering. In WWW, 2007.
  23. Younghoon Kim, and Kyuseok Shim. TWITOBI: A Recommendation System for Twitter Using Probabilistic Modeling. In ICDM, 2011.
  24. Yi Wang, Hongjie Bai, Matt Stanton, Wen-Yen Chen, and Edward Y. Chang. PLDA: Parallel latent dirichlet allocation for large-scale applications. In AAlM, 2009.
  25. Ke Zhai, Jordan Boyd-Graber, Nima Asadi, and Mohamad Alkhouja. Mr. LDA: A flexible large scale topic modeling package using variational inference in MapReduce. In WWW, 2012.
  26. Huanhuan Cao, Daxin Jiang, Jian Pei, Enhong Chen, and Hang Li . Towards Context-Aware Search by Learning A Very Large Variable Length Hidden Markov Model from Search Logs. In WWW, 2009.
  27. Alper Okcan, Mirek Riedewald. Processing theta-joins using MapReduce. In SIGMOD, 2011.
  28. Foto N. Atrati, Jeffrey D. Ullman. Optimizing joins in a Map-Reduce environment. In VLDB, 2009.
  29. Xiaofei Zhang, Lei Chen, Min Wang. Efficient Multiway Theta-loin Processing Using MapReduce. In VLDB, 2012.
  30. Sai Wu, Feng Li, Sharad Mehrotra, Beng Chin Ooi. Query optimization for massively parallel data processing. In SOCC, 2011.
  31. Rares Vernica, Michael J. Carey, Chen Li. Efficient parallel set-similarity joins using MapReduce. In SIGMOD, 2010.
  32. Ahmed Metwally, Christos Faloutsos. V-SMART-Join: A scalable MapReduce framework for all-pair similarity joins of multisets and vectors. In VLDB, 2012.
  33. Younghoon Kim, Kyuseok Shim. Parallel top-k similarity join algorithms using MapReduce. In ICDE, 2012.
  34. Ranieri Baraglia. Gianmarco De Francisci Morales, Claudio Lucchese. Document similarity self-join with MapReduce. In ICDM, 2010.
  35. Tamer Elsayed, Jimmy J. Lin, Douglas W. Oard. Pairwise document similarity in large collections with MapReduce. In HLT, 2008.
  36. Yoonjae Park, Jun-Ki Min, and Kyuseok Shim. Parallel Computation of Skyline and Reverse Skyline Queries Using MapReduce. Proceedings of the VLDB Endowment, Vol. 6, No. 14, 2013.
  37. Jeffrey Jestes, Ke Yi, and Feifei Li. Building Wavelet Histograms on Large Data in MapReduce. Proceedings of the VLDB Endowment, Vol. 5, No.2, pp.109-1 02, 2011.
  38. Yufei Tao, Wenqing Lin and Xiaokui Xiao. Minimal MapReduce algorithms. In SIGMOD, 2013.