Content based data search using semantic annotation

Kim, Byung-Gon;Oh, Sung-Kyun;

doi:10.9728/dcs.2011.12.4.429

Journal of Digital Contents Society (디지털콘텐츠학회 논문지)

Volume 12 Issue 4
/
Pages.429-436
/
2011
/
1598-2009(pISSN)
/
2287-738X(eISSN)

Digital Contents Society (한국디지털콘텐츠학회)

DOI QR Code

Content based data search using semantic annotation

시맨틱 주석을 이용한 내용 기반 데이터 검색

김병곤 (부천대학 e-비즈니스과) ;
오성균 (서일대학 소프트웨어과)

Received : 2011.10.21
Accepted : 2011.12.04
Published : 2011.12.31

https://doi.org/10.9728/dcs.2011.12.4.429 Citation PDF KSCI

Download PDF

⟨ Previous Next ⟩

Abstract

Various documents, images, videos and other materials on the web has been increasing rapidly. Efficient search of those things has become an important topic. From keyword-based search, internet search has been transformed to semantic search which finds the implications and the relations between data elements. Many annotation processing systems manipulating the metadata for semantic search have been proposed. However, annotation data generated by different methods and forms are difficult to process integrated search between those systems. In this study, in order to resolve this problem, we categorized levels of many annotation documents, and we proposed the method to measure the similarity between the annotation documents. Similarity measure between annotation documents can be used for searching similar or related documents, images, and videos regardless of the forms of the source data.

인터넷검색의 대상이 되는 각종 문서, 이미지, 동영상 등의 자료가 늘어날수록 이에 대한 효율적인 검색의 문제가 중요시되고 있다. 효율적인 검색의 관점은 초기의 키워드 중심의 검색에서 자료가 지니는 의미적인 요소들을 종합적으로 판단하여 이들의 연관성을 찾아 검색하는 의미적 검색의 방향으로 진행되고 있다. 이에 따라, 각종 자료에 대한 의미적 검색을 위하여 메타데이터 처리를 위한 시맨틱 주석을 생성, 운영하는 시스템들이 연구되어 왔다. 그러나, 동일한 종류의 자료에 대한 주석 위주로 진행되었고, 각기 다른 방법과 형태로 생성된 주석 데이터 간에는 호환적인 검색이나 처리가 어렵다. 본 연구에서는 이 문제를 해결하기 위하여 다양한 주석문서를 내용분석에 따라 단계별 형태로 분류하고, 상이한 종류의 자료 간에도 검색이 가능하도록 문서간의 유사도를 측정하는 방법을 제시하였다. 주석문서간의 유사도 측정은 소스문서와 유사도가 높은 주석문서를 검색하여 결과적으로 자료의 종류나 형태에 상관없이 가장 유사한 내용을 지니는 문서나 이미지, 동영상 등을 검색하는데 사용할 수 있다.

Keywords

References

Siegfried Handschuh and Steffen Staab, editors. Annotation for the Semantic Web. IOS Press, 2003.
Lawrence Reeve and Hyoil Han, The Survey of Semantic Annotation Platforms, The 20th Annual ACM Symposium on Applied Computing (ACM SAC) 2005, Santa Fe, New Mexico, 2005
"MnM: Ontology Driven Semi-Automatic and Automatic Support for Semantic Markup", Maria Vargas-Vera, Enrico Motta, John Domingue, Mattia Lanzoni, Arthur Stutt and Fabio Ciravegna, The 13th International Conference on Knowledge Engineering and Management (EKAW 2002), Springer Verlag, 2002
PLUS Group, "Simple HTML Ontology Extenstions" http://www.cs.umd.edu/projects/plus/SHOE, 2002.
Popov, B., Kiryakov, A., Kirilov, A., Manov, D., Ognyanoff, D. and Goranov, M., KIM - Semantic Annotation Platform in 2nd International Semantic Web Conference (ISWC2003), 2003, pp 834-849.
The Dublin Core Metadata Initiative, Dublin Core Metadata Element Set, Ver. 1.1: Reference Description. http://dublincore.org/documents/2010/10/11/dces/
Visual Resources Association Data Standards Committee, VRA Core Categories, Version 3.0. http://www.vraweb.org/about/index.html
Information Technology - Multimedia Content Description Interface (MPEG-7). Standard No. ISO/IEC 15938:2001, International Organization for standardization(ISO), 2001.
SF. Chang, T. Sikora, and A. Puri. Overview of the MPEG-7 standard. IEEE Trans. on Circuits and Systems for Video Technology, 11(6) , June 2001, pp 688-695 https://doi.org/10.1109/76.927421
Stephan Bloehdorn, Kosmas Petridis, Carsten Saathoff, Nikos Simou, Vassilis Tzouvaras, Yannis Avrithis, Siegfried Handschuh, Yiannis Kompatsiaris, Steffen Staab, Michael G. Strintzis, "Semantic Annotation of Images and Videos for Multimedia Analysis", The Semantic Web: Research and Applications: Proceedings of the Second European Semantic Web Conference, ESWC 2005, pp 592-607
Image Annotation on the Semantic Web, W3C Working Draft 22 March 2006 http://www.w3.org/TR/2006/WD-swbp-image-annotation-20060322/