Browse > Article
http://dx.doi.org/10.5392/JKCA.2017.17.11.389

Semantic Clustering Model for Analytical Classification of Documents in Cloud Environment  

Kim, Young Soo (배재대학교 사이버보안학과)
Lee, Byoung Yup (배재대학교 사이버보안학과)
Publication Information
Abstract
Recently semantic web document is produced and added in repository in a cloud computing environment and requires an intelligent semantic agent for analytical classification of documents and information retrieval. The traditional methods of information retrieval uses keyword for query and delivers a document list returned by the search. Users carry a heavy workload for examination of contents because a former method of the information retrieval don't provide a lot of semantic similarity information. To solve these problems, we suggest a key word frequency and concept matching based semantic clustering model using hadoop and NoSQL to improve classification accuracy of the similarity. Implementation of our suggested technique in a cloud computing environment offers the ability to classify and discover similar document with improved accuracy of the classification. This suggested model is expected to be use in the semantic web retrieval system construction that can make it more flexible in retrieving proper document.
Keywords
Cloud; Semantic; Keyword Frequency; Concept Matching; Clustering;
Citations & Related Records
Times Cited By KSCI : 5  (Citation Analysis)
연도 인용수 순위
1 김영수, 문형진, 조혜선, 김병익, 이진해, 이진우, 이병엽, "계층적침해자원기반의 침해사고 구성 및 유형 분석," 한국콘텐츠학회논문지, 제16권, 제11호, pp.139-153, 2016.   DOI
2 김영수, "보안 인텔리전트 유형 분류를 위한 다중 프로파일링 앙상블 모델," 한국콘텐츠학회논문지, Vol.17, No.3, pp.231-237, 2017.   DOI
3 이태휘, 임동혁, "맵리듀스에서의 구조적 RDF 데이터 변경 탐지 기법," 정보처리학회논문지, Vol.3, No.8, pp.293-298, 2014,   DOI
4 심준, 이홍철, "검색 키워드 확장을 이용한 온톨로지 자동 생성 시스템 개발," 한국산학기술학회논문지, Vol.10, No.6, pp.1220-1228, 2009.   DOI
5 배우정, 이현영, 박인철, 이용석, "개념 그래프의 트리 표현," 한국정보과학회 학술발표논문집, Vol.25(1B), pp.393-395, 1998.
6 안윤선, 김윤희, "군집분석을 이용한 하이브리드 클라우드 컴퓨팅 환경에서의 시맨틱 클라우드 자원 추천 서비스 기법," 정보처리학회논문지, Vol.,4 No.9, pp.283-288, 2015.   DOI
7 P. Mell and T. Grance, "The NIST definition of cloud computing," National Institute of Standards and Tchnology, Vol.53, No.6, p.50, 2009.
8 C. N. Hoefer and G. Karagiannis, Taxonomy of cloud computing services. In: GLOBECOM Workshops (GC Wkshps), 2010 IEEE. pp.1345-1350, IEEE 2010.
9 P. Bhaskar, J. Admela, K. Dimitrios, and G. Yves, "Architectural Requirements for Cloud Computing Systems:An Enterprise Cloud Approach," J. Grid Computing, Vol.9, No.1, pp.3-26, 2011.   DOI
10 Wei-Tek Tsai, Xin Sun, and Janaka Balasooriya, "Service-Oriented Cloud Computing Architecture," 2010 Seventh International Conference on Information Technology, 2010.
11 H. Rijgersberg, M. Wigham, and J. T. Top, "How semantics can improve engineering processes: A case of unitsof measure and quantities," Advanced Engineering Informatics, Vol.25, No.2, pp.276-287, 2011.   DOI
12 P. Shvaiko and J. Euzenat, Ontology Matching: State art and Future Challenges, pp.1-15, IEEE 2013.
13 R. P. Padliy, M. R. Patra, and S. C. Satapthy, RDBMS to NoSQL: Reviewing some next-generation non-relational databse's. Int J. Adv. Eng. Sci Techno1, Vol.11, pp.15-30, 2011.
14 K. Saruladha, G. Aghila, and B. A. Sathiya, "Comparative Analysis of Ontology and Schema Matching Systems," International Journal of Computer Application, Vol.34, No.8, pp.14-21, 2011.
15 A. Ismail and M. Joy, Semantic searches for extracting similarities in a content management system. Proceedings the IEEE International Conference on Semantic Technology and Information Retrieval, Putrajaya, pp.113-118, June 28-29, 2011,
16 N. Leavitt, Will NoSQL databases live up to their promises. computer, Vol.43, pp.12-14, 2010.
17 R. Priyadarshini and Latha Tamilselvan, "Document clustering based on keyword frequency and concept matching technique in Hadoop," International Journal of Scientific & Engineering Research, Vol.5, Issue 5, May, 2014.
18 David Sanchez, Montserrat Batet, David Isern, and Aida Valls, "Ontology-based semantic similarity: A new feature-based approach," Journal of Expert systems with applications, Elseveir, No.39, pp.7718-7728, 2012.
19 J. H. Hwang and K. H. Ryu, "A weighted common structure based clustering technique for XML documents," Elsevier Publication, 2010.
20 B. Drakshayani and E V Prasad, "Text Document Clustering based on Semantics," International Journal of Computer Applications, pp.0975-8887, Vol.45, No.4, May 2012.
21 R. Priyadarshini, Latha Tamilselvan, "Document Based Semantic CMS in Cloud," Information Technology Journal, Vol.13, pp.217-230, February 07, 2014.   DOI