Browse > Article
http://dx.doi.org/10.3745/KIPSTD.2007.14-D.7.829

Developing dirty data cleansing service between SOA-based services  

Ji, Eun-Mi (이화여자대학교 컴퓨터학과)
Choi, Byoung-Ju (이화여자대학교 컴퓨터학과)
Lee, Jung-Won (아주대학교 정보통신대학 전자공학부)
Abstract
Dirty Data Cleansing technique so far have aimed to integrate large amount of data from various sources and manage data quality resided in DB so that it enables to extract meaningful information. Prompt response to varying environment is required in order to persistently survive in rapidly changing business environment and the age of limitless competition. As system requirement is recently getting complexed, Service Oriented Architecture is proliferated for the purpose of integration and implementation of massive distributed system. Therefore, SOA necessarily needs Data Exchange among services through Data Cleansing Technique. In this paper, we executed quality management of XML data which is transmitted through events between services while they are integrated as a sole system. As a result, we developed Dirty Data Cleansing Service based on SOA as focusing on data cleansing between interactive services rather than cleansing based on detection of data error in DB already integrated.
Keywords
Data Quality; Data Cleansing; SOA:Service Oriented Architecture;
Citations & Related Records
연도 인용수 순위
  • Reference
1 P. Krogdahl, G. Luef, and C. Steindl, 'Service-Oriented Agility: An initial analysis for the Use of Agile methods for SOA development,' In Proceedings of the 2005 IEEE International Conference on Service Computing(SCC '05). Vol.2, pp.93-100, July, 2005   DOI
2 M. Hernandez, R. Miller, and L. Hass, 'Schema Mappings as Query Discovery,' In Proceedings of Intl. Conf. VLDB, 2001
3 이경하, 이규철, '웹 서비스의 표준화 동향과 발전 방향', 한국정보과학회 데이터베이스 연구회지, 제19권 제1호, pp.80-87, March, 2003
4 M. P. Papazoglou and D. Georgakopoulos, 'Service-Oriented Computing,' Communication of the ACM, Vol.46, No.10, pp.25-28, Oct., 2003
5 지은미, 최병주, 이정원, 'SOA에서의 오류 데이터 정제 서비스 개발', 정보처리학회 2007년도 춘계학술발표대회 논문집(상) 우수논문, 제14권 제1호, pp.649-652, 2007
6 Theodore Johnson, and Tamraparni Dasu, 'Data Quality and Data Cleaning,' Tutorials of 10th SIGKDD, Aug., 2004
7 T. Dasu, T. Johnson, S. Muthukrishnan, V. Shkapenyuk, 'Mining Data Structure; Or, How to Build a Data Quality Browser,' In Proceedings of SIGMOD Conf., pp. 240-251, 2002   DOI
8 SLAAM, www.slaam.co.kr
9 ZipIt, www.sujiewon.co.kr
10 The AscentialTM Enterprise Integration Suite, www.ascential.com
11 HummingBird, www.hummingbird.com
12 Ortiz Jr., Sixto; 'Getting on Board the Enterprise Service Bus,' Published by the IEEE computer Society, pp.15-17, 2007   DOI   ScienceOn
13 Won Kim, Byoung-Ju Choi, Eui-Kyeoung Hong, Soo-Kyoung Kim, Doheon Lee, 'A Taxonomy of Dirty Data,' The Data Mining and Knowledge Discovery Journal, Vol.7 No.1, pp.81-99, 2003   DOI   ScienceOn
14 G. Shankaranarayanan and Y. Cai, 'A Web Services Application for the Data Quality Management in the B2B Networked Environment,' In Proceedings of 38th Hawaii International Conference on System Sciences, IEEE, 2005   DOI
15 M. M. Breunig, H.-P. Kriegel, R. Ng, J. Sander, 'LOF: Identifying Density-Based Local Outliers,' In Proceedings of SIGMOD Conf., 2000   DOI   ScienceOn
16 MonArch, www.00db.co.kr
17 J. W. Lee, E. Y. Moon, and B. J. Choi, 'Data cleansing for Service-Oriented Architecture,' Springer-Verlag, Lecture Notes in Computer Science Vol 3590, pp.87-97, 2005   DOI   ScienceOn
18 M. Lee, H Lu, T Ling, and Y. Ko., 'Cleansing Data for Mining and Warehousing,' In Proceedings of 10th DEXA, 1999   DOI
19 M. Hernandez and S. Stolfo, 'Real-world data is dirty: data cleansing and the merge/purge problem,' Data Mining and Knowledge Discovery, Vol.2, No.1, pp.9-37, 1998   DOI   ScienceOn