DOI QR코드

DOI QR Code

A Study on Reducing Data Obesity through Optimized Data Modeling in Research Support Database

연구지원 데이터베이스에서 최적화된 데이터모델링을 통한 데이터 비만도 개선에 관한 연구

  • Kim, Hee-Wan (Division of Computer-Mechatronics, Shamyook University)
  • 김희완 (삼육대학교 컴퓨터.메카트로닉스공학부)
  • Received : 2018.01.31
  • Accepted : 2018.02.09
  • Published : 2018.02.28

Abstract

The formal data used in the business is managed in a table form without normalization due to lack of understanding and application of data modeling. If the balance of the database design is destroyed, it affects the speed of response to the data query, and the data obesity becomes high. In this paper, it is investigated how data obesity improved through database design through optimized data modeling. The data query path was clearly visualized by square design through data modeling based on the relationship between object (data) and object, from the radial and task - oriented isolation design where data obesity is excessive. In terms of data obesity, the obesity degree of the current research support database was 57.2%, but it was 16.2% in the new research support database, and the data obesity degree was reducd by 40.5%. In addition, by minimizing redundancy of data, the database has been improved to ensure the accuracy and integrity of the data.

현업에서 사용하고 있는 정형 데이터는 데이터모델링에 대한 이해부족 및 적용의 미흡으로 정규화되지 않은 채로 테이블 형태로 관리되고 있는 현실이다. 데이터베이스 설계의 균형이 파괴되면 데이터 질의에 대한 응답속도에 영향을 미치며, 데이터 비만도가 높아지게 된다. 본 논문에서는 최적화된 데이터모델링을 통한 데이터베이스 설계를 통하여 데이터 비만도가 어떻게 개선되었는지를 연구하였다. 데이터 비만도가 과다하게 나타나는 방사형 및 업무 중심의 고립형 설계에서 객체(데이터)와 객체간의 관계 중심의 데이터모델링을 통한 정방형 설계를 함으로 데이터 질의 경로가 선명하게 가시화되었다. 데이터비만도 면에서도 기존의 연구지원 데이터베이스의 비만도는 57.2%였으나, 새로운 연구지원 데이터베이스에서는 16.2%로 나타나 데이터 비만도가 40.5%가 개선되었으며, 데이터의 중복을 최소화함으로써 데이터의 정확성과 무결성이 보장되는 데이터베이스로 개선되었다.

Keywords

References

  1. C. W. Fisher, B. R. Kingma, "Criticality of data quality as exemplified in two disasters", Information Systems, Vol. 39, pp.109-116, 2010.
  2. [DB] Elements of a good data model, http://blog.daum.net/fmddn/1787002, 2012.12.17.
  3. C. B. Cinzia Cappiello, C. Francalanci, A. Maurino, "Methodologies for data quality assessment and improvement", ACM Computing Surveys Vol. 41, No. 3, p. 52, 2009.
  4. D. Katz, M. Bommaroti, J. Zelner, "The data deluge", The Economist, Mar 1, 2010
  5. Min-Kyu Lee, "Data Performance Case Study through Removing of Data Duplicate Relationships", Soongsil University, 2010.
  6. T. Shanker, M. Richtel, "Data overload can be deadly", The New York Times, Jan 16, 2011.
  7. Richard Y. Wang, Henry B. Kon, and Stuart E. Madnick, "Data quality requirements analysis and modeling", Proceedings of IEEE Ninth International Conference on Data Engineering, pp. 670 - 677. April 1993.
  8. I. Davies, P. Green, M. Rosemann, M. Indulska, and S. Galo, "How do practitioners use conceptual modeling in practice?", Data and Knowledge Engineering, Vol 58, pp. 358-380, 2006. https://doi.org/10.1016/j.datak.2005.07.007
  9. "Practical Projects Application of Data Model Normalization / De-normalization", http://blog.naver.com/jooyong3/40035951092.
  10. Ji-Ho So, Young-Ju Jeon, "Design and Construction of Integrated Database for Contents Development of Pulse Analysis System", The Journal of The Institute of Internet, Broadcasting and Communication(JIIBC), Vol. 17, No. 5, pp. 137-142, Oct 2017. DOI: https://doi.org/10.7236/JIIBC.2005.5.2.56.
  11. In-Hwan Jung, Young-Ung Kim, "ER_Modeler: A Logical Database Design Tool based on Entity-Relationship Model", The Journal of The Institute of Internet, Broadcasting and Communication(JIIBC), Vol. 11, No. 5, pp. 11-17, Oct 2011. DOI: https://doi.org/10.7236/JIIBC.2005.5.2.56.
  12. Hye-Kyung Rhee, Hee-Wan Kim, "A Study on Negligence of Data Modeling Fundamentals at the University Job Information System", Journal of the Korea Society of Computer and Information, Vol. 19, No. 8, pp.139-150, Aug 2014. https://doi.org/10.9708/jksci.2014.19.8.139
  13. Kook-Hee Lee, "A Study on the Database Quality Assessment", Korea Database Agency, Dec, 1995.
  14. James Martin, Information Engineering, Vol. 1, Vol.2, Vol.3, Prentice-Hall, 1989.
  15. Barker, R, CASE *Method: Entity Relationship Modelling, Wokingham: Addison-Wesley, 1990.
  16. Peter Chen, "The Entity Relationship Model-Toward a Unified View of Data", ACM, Vol.1, No.1, 1976.