DOI QR코드

DOI QR Code

UX Analysis for Mobile Devices Using MapReduce on Distributed Data Processing Platform

MapReduce 분산 데이터처리 플랫폼에 기반한 모바일 디바이스 UX 분석

  • 김성숙 (안양대학교 컴퓨터공학과) ;
  • 김성규 (안양대학교 컴퓨터공학과)
  • Received : 2013.07.16
  • Accepted : 2013.08.12
  • Published : 2013.09.30

Abstract

As the concept of web characteristics represented by openness and mind sharing grows more and more popular, device log data generated by both users and developers have become increasingly complicated. For such reasons, a log data processing mechanism that automatically produces meaningful data set from large amount of log records have become necessary for mobile device UX(User eXperience) analysis. In this paper, we define the attributes of to-be-analyzed log data that reflect the characteristics of a mobile device and collect real log data from mobile device users. Along with the MapReduce programming paradigm in Hadoop platform, we have performed a mobile device User eXperience analysis in a distributed processing environment using the collected real log data. We have then demonstrated the effectiveness of the proposed analysis mechanism by applying the various combinations of Map and Reduce steps to produce a simple data schema from the large amount of complex log records.

웹의 특징인 개방과 공유의 사고방식이 더욱 일반화 되면서 개발자 뿐 만 아니라 사용자가 직접 발생시키는 데이터도 복합적으로 늘어나고 있는 실정이다. 이러한 상황에서 모바일 디바이스 User eXperience(UX) 분석에서 다른 무엇보다도 디바이스에 기록되는 대용량의 로그 기록에서 필요한 데이터들을 자동으로 요약 정리해 주는 기법이 필요하다. 이에, 본 논문에서는 분석하고자 하는 모바일 디바이스 특성에 맞게 사전에 로그 데이터 속성에 대한 정의를 먼저하고, 직접 이를 반영한 사용자의 로그를 수집하여 저장하였다. 또한, 발생되는 대용량의 로그 기록에 기초한 UX를 분석하고자 다양한 로그 데이터 타입을 설정 및 처리할 수 있는 Hadoop(하둡)에서 제공하는 MapReduce 기법을 활용하여 데이터를 분산 처리하였다. 이를 통해, Map과 Reduce의 다양한 조합으로 대용량의 모바일 디바이스에서 발생되는 로그 데이터 셋에서 복잡한 스키마를 단순화시켜 분산 데이터 처리 환경에 맞게 UX 분석 방안을 제시하였다.

Keywords

References

  1. N. Laptev, K. Zeng, and C. Zaniolo, "Very Fast Estimation for Result and Accuracy of Big Data Analytics: the EARL System," in Proceedings of the 29th IEEE International Conference on Data Engineering, Brisbane, 2013, pp.1296-1299.
  2. A. Balmin, T. Kaldewey, and S. Tata, "Clydesdale: Structured Data Processing on Hadoop," in Proceedings of the ACM SIGMOD International Conference on Management of Data, 2012, pp.705-708.
  3. D. Miner and A. Shook, "MapReduce Design Patterns," O'Reilly Media, ch. 2, pp.13-32, 2012.
  4. Y. S. Kim, "Secrets of Seductive UX/UI Design," Wiki Books, ch. 1, 2013.
  5. S. S. Kim, and S. G. Kim, "On Designing Cloud Computing Systems for Analysing Smart Device's UX," in Proceedings of the 2013 Korea Computer Congress, 2013, pp.303-305.
  6. S. Ghemawat, H. Gobioff, et al., "The Google file system," in Proceedings of the 19th ACM symposium on Operating systems principles, 2003, pp.29-43.
  7. J. Dean and S. Ghemawat, "MapReduce: Simplified Data Processing on Large Clusters," in Proceedings of the 6th Symposium on Operating System Design and Implementation, SanFrancisco, 2004, pp.137-150.
  8. T. White, "Hadoop: the definitive guide," O'Reilly Media, 2009.
  9. J. Dean and S. Ghemawat, "MapReduce: A Flexible Data Processing Tool," Communications of the ACM, Vol.53, No.1, pp.72-77, 2010.
  10. M. Sethi, N. Sachindran, and S. Raghavan, "SASH: Enabling Continuous Incremental Analytic Workflows on Hadoop," in Proceedings of the IEEE 29th International Conference on Data Engineering, Brisbane, 2013, pp.1219-1230.
  11. F. Wang, J. Qiu, J. Yang, B. Dong, X. Li, and Y. Li, "Hadoop high availability through metadata replication," in Proceedings of the 1st International Workshop on Cloud Data Management, New York, 2009, pp.37-44.
  12. M. Y. Eltabakh, Y. Tian, F. Ozcan, R. Gemulla, A. Krettek, J. McPherson, "CoHadoop: flexible data placement and its exploitation in Hadoop," The VLDB Endowment, Vol.4, No.9, pp.575-585, 2011. https://doi.org/10.14778/2002938.2002943
  13. A. Abouzeid, K. B. Pawllikowski, D. Abadi, A. Silbeschatz, and A. Rasin, "HadoopDB: An Architectural Hybrid of MapReduce and DBMS Technologies for Analytical Workloads," The VLDB Endowment, Vol.2, No.1, pp.922-933, 2009. https://doi.org/10.14778/1687627.1687731
  14. Y. He, R. Lee, Y. Huai, Z. Shao, et al., "RCFile: A fast and space-efficient data placement structure in MapReducebased warehouse systems," in Proceedings of the 27th IEEE International Conference on Data Engineering, Hannover, 2011, pp.1199-1208.
  15. Y. Lin, D. Agrawal, C. Chen, B. C. Ooi, S. Wu, "Llama: leveraging columnar storage for scalable join processing in the MapReduce framework," in Proceedings of the 2011 ACM SIGMOD International Conference on Management of data, New York, 2011, pp.961-972.
  16. KakaoTalk Enjoy free one-to-one or group chats anywhere in the world [Internet], http://www.kakao.com/talk/ko
  17. B. Albert, T. Tullis and D. Tedesco, "Beyond The Usability Lab: Conducting Large-Scale User Experience Studies," Morgan Kaufmann Pub., ch. 1, pp.5-16, 2010.