http://dx.doi.org/10.13089/JKIISC.2010.20.6.43

A Study on Extracting the Document Text for Unallocated Areas of Data Fragments  

Yoo, Byeong-Yeong (Digital Forensics Research Center, Korea University)
Park, Jung-Heum (Digital Forensics Research Center, Korea University)
Bang, Je-Wan (Digital Forensics Research Center, Korea University)
Lee, Sang-Jin (Digital Forensics Research Center, Korea University)
Abstract
Examining unallocated space is valuable in a forensic investigation because it can reveal deleted data. File carving can recover files that are stored contiguously and completely in unallocated space, but it cannot recover data that is noncontiguous or incomplete. Such data fragments still need to be analyzed because they can contain large amounts of information. The text of Microsoft Word, Excel, PowerPoint, and PDF documents is stored either compressed or in a format-specific structure. When part of such a document survives in an unallocated data fragment, its text can be extracted by exploiting that document format. In this paper, we propose a method for extracting the text of these document types from data fragments in unallocated space.
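The idea of exploiting the document format can be illustrated with a minimal sketch. It is not the authors' method, which the abstract does not detail. It assumes that the fragment contains the beginning of a compressed stream, that Office Open XML parts are raw DEFLATE data inside a ZIP container, and that PDF content streams use FlateDecode (zlib-wrapped DEFLATE). The names `inflate_prefix` and `find_text` are hypothetical, and the brute-force offset scan is chosen for clarity rather than speed.

```python
import re
import zlib
from typing import Optional, Tuple

# Text runs in Office Open XML: <w:t> (Word) and <a:t> (DrawingML, e.g. PowerPoint).
OOXML_TEXT = re.compile(rb"<(?:w|a):t(?:\s[^>]*)?>([^<]*)</(?:w|a):t>")
# Simple PDF literal strings shown with the Tj operator (no escapes or nesting).
PDF_TEXT = re.compile(rb"\(([^()\\]*)\)\s*Tj")


def inflate_prefix(data: bytes, wbits: int, chunk: int = 256) -> bytes:
    """Inflate as much of `data` as possible, keeping output produced before an error."""
    decomp = zlib.decompressobj(wbits)
    out = bytearray()
    try:
        for i in range(0, len(data), chunk):
            out += decomp.decompress(data[i:i + chunk])
            if decomp.eof:  # end of the compressed stream reached
                break
    except zlib.error:
        pass  # corrupt or truncated data: keep what was decoded so far
    return bytes(out)


def find_text(fragment: bytes) -> Optional[Tuple[int, str]]:
    """Return (offset, text) for the first offset whose inflated data holds document text."""
    for offset in range(len(fragment)):
        # -15: raw DEFLATE (ZIP members); 15: zlib-wrapped DEFLATE (PDF FlateDecode).
        for wbits in (-15, 15):
            plain = inflate_prefix(fragment[offset:], wbits)
            runs = OOXML_TEXT.findall(plain)
            if runs:
                return offset, b"".join(runs).decode("utf-8", errors="replace")
            shown = PDF_TEXT.findall(plain)
            if shown:
                return offset, b" ".join(shown).decode("latin-1")
    return None
```

A fragment that starts in the middle of a compressed stream cannot be handled this way, because DEFLATE decoding depends on block headers and earlier output; that case needs additional analysis beyond this sketch.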
Keywords
Digital Forensics; File Carving; Data Fragment; Text Extraction