DOI QR코드

DOI QR Code

Forgery Detection Scheme Using Enhanced Markov Model and LBP Texture Operator in Low Quality Images

저품질 이미지에서 확장된 마르코프 모델과 LBP 텍스처 연산자를 이용한 위조 검출 기법

  • Received : 2021.09.24
  • Accepted : 2021.11.05
  • Published : 2021.12.31

Abstract

Image forensic is performed to check image limpidness. In this paper, a robust scheme is discussed to detect median filtering in low quality images. Detection of median filtering assists in overall image forensic. Improved spatial statistical features are extracted from the image to classify pristine and median filtered images. Image array data is rescaled to enhance the spatial statistical information. Features are extracted using Markov model on enhanced spatial statistics. Multiple difference arrays are considered in different directions for robust feature set. Further, texture operator features are combined to increase the detection accuracy and SVM binary classifier is applied to train the classification model. Experimental results are promising for images of low quality JPEG compression.

본 논문에서는 저품질 이미지에 적용된 미디언 필터링를 검출하는 기법을 제안하고자 한다. 이러한 미디언 필터링검출은 이미지 포렌식 기법에 사용되고 있는 것으로 제안된 방법에서는 원본 이미지와 미디언 필터링된 이미지를 구분하기 위하여 공간 영역에서 통계적 특징 정보를 추출하고 확장시킨다. 확장된 특징 정보는 마르코프 모델을 사용하고 강인한 특징 집합을 생성하기 위하여 다중 방향 배열을 사용한다. 제안된 방법에서는 검출 정확도를 높이기 위하여 텍스처 연산자를 사용하고 SVM 분류기를 통하여 분류 모델을 훈련시킨다. 실험 결과에서는 JPEG 압축을 사용한 저품질 이미지에서 제안한 방법의 우수함을 보인다.

Keywords

I. Introduction

Digital images are very popular on social media platforms such as WhatsApp, Facebook, Instagram and Twitter. Image is more informative and requires less memory in comparison with other information representation alternatives. In many countries, images can be shared easily in contrast to video files, where speed of the internet is slow. The wide availability of digital images makes it vulnerable. Images can be manipulated and shared by anyone using mobile applications and software. The main motive of fake images is to spread some wrong information to gain political, social, business advantages. Sometimes, fake images are also created for fun. As shown in Fig. 1, the left photo is a fake image. White eyebrows are created to show that cat is angry in the fake photo. The right photo is real in which no changes are done. If the purpose of the fake photo is fun, it is not a bad thing. In general, the purpose is not good that makes the situation grim and needs a proper attention.

JBBHCB_2021_v31n6_1171_f0001.png 이미지

Fig. 1. Fake and real photo of cat

Several image processing operations are applied in the process of fake image. Some common operations are resampling, rotation, blurring, denoising, contrast enhancement, and filtering and so on. The detection of these operations helps in fake image detection. In this paper, the detection of median filtering operation is focused. Median filter operation is a nonlinear operator and it serves two purposes. It is used for denoising and diminishing the artifacts of other operations like resampling, contrast enhancement, etc. Median filter detection is tough in comparison to Gaussian blurring, and mean filtering. Median filter changes the image statistics in nonlinear way. Several attempts have been made to detect median filtering. As JPEG compression is quite popular that make the problem more challenging, JPEG compression suppresses the median filter artifacts. Existing techniques can be improved further to give better accuracy in compressed images. In this paper, one attempt is made to detect median filtering in a low quality JPEG compression and small size image blocks. The proposed technique is based on manual feature extraction process. In manual methods, a computational requirement is very less in contrast to deep learning methods without sacrificing the effectiveness. In the first phase of manual methods, features need to be extracted from the image. The extracted features provide the internal statistical information. In median filtering operation, image statistics gets modified. The statistical gap between non-filtered and median filtered images can be noticed by extracting the relevant features. In the second phase, a relevant classifier is applied to train the model and testing is performed using trained model.

Several techniques [1]-[9] based on manual feature extraction process are discussed in literature. Kirchner and Fridrich [1] introduced the median filtering detection technique in the first time. Markov model is applied on difference arrays in multiple directions and popularly known as SPAM. Chen et al. [2] proposed global and local feature set (GLF) technique. Markov features and normalized cross correlation features are combined to improve the performance. Agarwal et al. [3] applied the Markov model on the first, second and third order difference arrays in multiple directions to get better results. Peng et al. [4] considered the autoregressive model and Markov model to detect median filtering. Model is applied on median residual and other differences. In similar approach, Gao et al. [5] applied autoregressive model in frequency domain and Markov model in spatial domain. In Gao et al.’s method[6], Markov model and texture features are combined to get better detection accuracy. A local configuration pattern is used as a texture operator. Wang and Gao [7] applied the local quadruple pattern for extracting statistical information in discrete cosine transform (DCT) domain. Peng et al. [8] considered autoregressive modelandthelocalbinarypatterntexture operator. Gupta and Singhal [9] extracted the features in DCT domain after calculating median filter residual, where a value of mean, variance, skewness and kurtosis features are fetched in DCT domain. After analyzing previous papers, twooutcomesareconsideredforimproving the existing techniques. First, Markov model provides the better statistical information then autoregressive model. Second, there is a lot of scope for considering more robust texture operator. As a result, modified difference arrays are considered for extracting improved Markov features and more robust texture operators are utilized for performance boost in this paper.

The proposed scheme is discussed in next section. Effectiveness of the proposed sheme is verified in secccction III using experimental analysis. The conclusion of this paper is discussed in section IV

II. The Proposed Scheme

A widespread popularity of digital images has increased its importance in people’s life. In this paper, a robust forensic technique for detection of median filtering is proposed to preserve the integrity of images. Markov features are extracted in enhanced rescaled domain and combined with robust texture operator features. The general block diagram of feature extraction techniques is shown in Fig. 2. Initially, training and test set are prepared from pristine and median filtered images. Features are extracted using the proposed scheme. SVM classifier is used to create the trained model using training set features. The trained model is used for test set images and images are classified into pristine and median filtered images. The results are represented using accuracy, sensitivity and specificity.

JBBHCB_2021_v31n6_1171_f0002.png 이미지

Fig. 2. Block diagram of feature extraction techniques

Next, the enhanced Markov features extraction process is discussed. Markov features are popular due to their low computational cost and robustness in many classification applications. The second order Markov features is proved its worth in comparison with the first, third and higher order Markov features. In this paper, the second order Markov model is considered. Conventionally, the second order Markov features can be extracted using following equation.

\(K_{(a, b, c)=} \operatorname{Pr}\left(E_{(m, n+2)}=a \mid E_{(m, n+1)}=b, E_{(m, n)}=c\right)\)       (1)

E is the thresholded difference array in a particular direction and \(a, b, c \in\{-H, \cdots, H\}\) and H is the threshold value.

\(\operatorname{Pr}\left(E_{(m, n+2)}=a \mid E_{(m, n+1)}=b, E_{(m, n)}=c\right)=0\) if \(\operatorname{Pr}\left(E_{(m, n+1)}=b, E_{(m, n)}=c\right)=0\). The difference array can be defined of a gray level image G(m,n) as follows.

\(D_{(m, n)}=G_{(m, n+1)}-G_{(m, n)}\)       (2)

, where \(G_{(m, n)} \in\{0,1, \ldots, L-2, L-1\}\) and \(D_{(m, n)} \in\{-L+1,-L+2, \ldots, 0, \ldots, L-2, L-1\}\).

Then the thresholded difference array E(m,n) is calculated.

\(E_{(m, n)}= \begin{cases}-H & \text { if } D_{(m, n)}<-H \\ D_{(m, n)} & \text { if }-H \leq D_{(m, n)} \leq H \\ H & \text { if } D_{(m, n)}>H\end{cases}\)       (3)

However, the thresholding process replaces the D(m,n) elements with threshold value without considering its weight. To gap, G(m,n) elements are rescaled first in smaller domain and generated a new array named as GS(m,n), where \(G S_{(m, n)} \in\{-S,-S+1, \ldots, 0, \ldots, S-1, S\}\). The value of S is decided by experimental analysis. The optimal value is obtained 10 for S. Further, difference array is obtained as follows.

\(D S_{(m, n)}=G S_{(m, n+1)}-G S_{(m, n)}\)       (4)

,where \(G S_{(m, n)} \in\{-10, \ldots, 10\}\) and \(D S_{(m, n)} \in\{-20, \ldots, 20\}\). The enhanced thresholded difference array ES(m,n) is calculated.

\(E S_{(m, n)}= \begin{cases}-H & \text { if } D S_{(m, n)}<-H \\ D S_{(m, n)} & \text { if }-H \leq D S_{(m, n)} \leq H \\ H & \text { if } D S_{(m, n)}>H\end{cases}\)       (5)

Further, the second order Markov model is applied to extract features as follows.

\(K S_{(a, b, c)}-\operatorname{Pr}\left(E S_{(m, n+2)}=a \mid E S_{(m+n+1)}=b, E S_{(m+n)}=c\right)\)       (6)

The proposed process can improve significantly without increasing feature vector size. The features are extracted for horizontal, vertical, diagonal and minor diagonal difference arrays in forward and backward directions. Further, texture operator based features are extracted. Texture operators are effective in many applications like as face recognition, object classification, medical imaging and soon. Some of robust texture operators like LBP[10], LBPRIU [11], COALBP [12], RICLBP[13] are also analyzed in median filtering detection. Simple LBP operator [10]provides local statistical information. LBPRIU [11] considers rotation invariant uniform LBP and gives local statistical information. Co-occurrence between LBP’s are calculated in COALBP [12] to get global information. RICLBP [13] gives global structural statistical and local statistical details among similar type of LBP. The LBP is treated similarly with other LBP if it produces a similar pattern after rotation from one of the rotation angel, 0°, 45°, 90°, 135° and 180°. Further, co-occurrence of the rotation invariant LBPs gives global statistical details. The LBP is derived using Fig. 3. The center pixel is compared either by horizontal and vertical neighbors or diagonal neighbors. The center pixel is compared with neighbors and then replaced with 0, otherwise by 1 if neighbor pixel value is less than center pixel value. Four neighbors and anti-clock wise direction is considered for calculating the LBP.

JBBHCB_2021_v31n6_1171_f0003.png 이미지

Fig. 3. LBP formation

There is a noticeable improvement after combining enhanced Markov features and RICLBP texture operator in the proposed scheme. Experimental results are discussed in next section. Median filters of size 3x3 and 5x5 are considered for experimental analysis. The proposed technique is compared with different texture operators and some median filtering detection techniques.

III. Experimental Results

Median filtering is a nonlinear operation and its detection is challenging in comparison with averaging and Gaussian filtering. The proposed scheme is applied on UCID database [14] as many existing techniques are used. The UCID database has miscellaneous categories containing 1,338 images of natural scenes, objects, peoples, etc. Some images of UCIDdatabase are shown in Fig. 4.

JBBHCB_2021_v31n6_1171_f0004.png 이미지

Fig. 4. Example of UCID database images

It is concluded from previous literature and experimental analysis that detection of median filtering is challenging on low quality and small size images. Therefore, experimental results are displayed only for low JPEG quality factors 30, 50 and70.The small size image blocks with 32x32 and 64x64 pixels are considered. Non-filtered images are created by cropping center block of required size after applying JPEG compression. Median filtered images are created by using following steps: apply median filter on the UCID database images in the first step, compress images using required quality factor in the second step and crop the center block of the compressed median filtered image in the last step. Padding artifacts of median filter will not arise by following these steps. In experiments, 3x3and 5x5 window size are considered for median filtering. Training set contains sixty percent images of both the classes and testing set contains remaining forty percent images. Around fifty training and testing pairs are formed for unbiased performance evaluation. The results are shown in terms of detection accuracy, sensitivity, and specificity in percentage.

\(\begin{gathered} \text { Accuracy }=\left(\frac{T P+T N}{T P+T N+F P+F N}\right) * 100 \\ \text { Sensitivity }=\left(\frac{T P}{T P+F N}\right) * 100 \\ \text { Specificity }=\left(\frac{T N}{T N+F P}\right) * 100 \end{gathered}\)       (7)

where true positive, true negative, false positive and false negative are abbreviated as TP, TN, FP and FN, respectively.

The proposed technique is compared with COALBP, RICLBP, GDCTF, LBP, LBPRIU, and SPAM. In Fig. 5, an average detection accuracy is shown to detect median filter of size 3x3 on block size 64x64. The proposed technique achieves 86.36%, 91.04%, and 93.69% detection accuracy for JPEG compression quality factors 30, 50 and 70 respectively. Even for low quality factor Q=30, the accuracy of the proposed method is 86.36%.

JBBHCB_2021_v31n6_1171_f0005.png 이미지

Fig. 5. Detection accuracy for median filter of size 3x3 and image size 64x64

Sensitivity (SE) and specificity (SP) can be seen in Fig. 6. The sensitivity of each method is higher than its corresponding specificity. It means that a number of median filtered images are incorrectly classified as non-filtered images. The difference between sensitivity and specificity values are the highest in GDCTF technique.

JBBHCB_2021_v31n6_1171_f0006.png 이미지

Fig. 6. Sensitivity and specificity for median filter of size 3x3 and image size 64x64

Now, results are shown in Fig. 7 for median filtering detection of filter size 3x3 on 32x32 pixel blocks. The proposed technique gives 79.38%, 83.34%, and 88.34% detection accuracy for quality factors 30, 50 and 70 respectively. For block size 32x32, the performance gain of the proposed method is comparatively better than block size 64x64. It can be concluded that the proposed scheme gives better results in tough scenario.

JBBHCB_2021_v31n6_1171_f0007.png 이미지

Fig. 7. Detection accuracy for median filter ofsize 3x3 and image size 32x32

The sensitivity and specificity resultsare presented in Fig. 8. The similar behavior is followed by sensitivity and specificity for block size 64x64 in Fig. 6. However, sensitivity and specificity of the proposed scheme are the highest in comparison with other techniques.

JBBHCB_2021_v31n6_1171_f0008.png 이미지

Fig. 8. Sensitivity and specificity for median filter of size 3x3 and image size 32x32

In Table 1, detection accuracy is displayed for image size 32x32 (B_32) and 64x64 (B_64) for 5x5 median filter. JPEG compression Q={30,50,70} is considered. RICLBP performance is comparable with proposed method, where the proposed scheme achieves better results. The performance gain of proposed scheme is better in low quality compression. There is additional gain of 2% in detection accuracy for JPEG Q=30.

Table 1. Detection accuracy for median filter of size 5x5

JBBHCB_2021_v31n6_1171_t0001.png 이미지

Sensitivity and specificity is given in Table 2 for detection of median filtering 5x5 on 32x32 size images with JPEG Q={30,50,70}. Sensitivity is higher than specificity in each scenario. However, there is a wide gap in sensitivity and specificity of GDCTF technique.

Table 2. Sensitivity and specificity for 5x5median filter and 32x32 image

JBBHCB_2021_v31n6_1171_t0002.png 이미지

The difference between sensitivity and specificity values is affected by the compression. As the compression increases, difference between sensitivity and specificity values also increases

The sensitivity and specificity are higher for 64x64 images in comparison with 32x32 images as shown in Table 3

Table 3. Sensitivity and specificity for 5x5median filter and 64x64 image

JBBHCB_2021_v31n6_1171_t0003.png 이미지

The sensitivity is higher than specificity similar with the results of Table 2. However, the difference between sensitivity and specificity values is less in 64x64 images in comparison with 32x32 images. It is due to the fact that large size images contain more statistical information than small size images. Similar pattern is also followed by JPEG compression. As the compression increases (Q decreases), the statistical information gets limited. There is little difference between sensitivity and specificity for JPEG Q=70.

IV.Conclusion

As image forensics assured the integrity of the images, multiple operations have been identified in image forensics. In this paper, a new scheme for detection of median filtering has been proposed. Detection of median filtering has been significant due to its nonlinear behavior. In the proposed scheme, image array was rescaled in optimum domain. The enhanced statistical information has been derived using Markov model. Further, the rotational invariant LBP operator has been applied for additional features. The proposed scheme has given the satisfactory results in experimental analysis, where the experiments were performed on small blocks with low quality compression.

References

  1. M. Kirchner and J. Fridrich, "On detection of median filtering in digital images," Media Forensics and Security II, 754110, pp. 1-12, Jan. 2010.
  2. C. Chen, J. Ni, and J. Huang, "Blind detection of median filtering in digital images: a difference domain based approach," IEEE Transactions on Image Processing, vol. 22, no. 12, pp. 4699-4710, Aug. 2013. https://doi.org/10.1109/TIP.2013.2277814
  3. S. Agarwal, S. Chand, and N. Skarbnik, "SPAM revisited for median filtering detection using higher-order difference," Securirty and Communication Networks, vol. 9, no. 17, pp. 4089-4102, Nov. 2016. https://doi.org/10.1002/sec.1590
  4. A. Peng, S. Luo, H. Zeng, and Y. Wu, "Median filtering forensics using multiple models in residual domain," IEEE Access, vol. 7, pp. 28525-28538, Feb. 2019. https://doi.org/10.1109/access.2019.2897761
  5. H. Gao and T. Gao, "Detection of median filtering based on ARMA model and pixel-pair histogram feature of difference image," Multimedia Tools and Applications, pp. 12551-12567, vol. 79, Jan. 2020. https://doi.org/10.1007/s11042-019-08340-3
  6. H. Gao, T. Gao, and R. Cheng, "Robust detection of median filtering based on data-pair histogram feature and local configuration pattern," Journal of Information Security and Applications, vol. 53, pp. 102506, Aug. 2020. https://doi.org/10.1016/j.jisa.2020.102506
  7. D. Wang and T. Gao, "Filtered image forensics based on frequency domain features," International Conference on Communication Technology, pp. 1208-1212, Jan. 2019.
  8. A. Peng, G. Yu, Y. Wu, Q. Zhang, and X. Kang, "A universal image forensics of smoothing filtering," International Journal of Digital Crime and Forensics, vol. 11, no. 1, pp. 18-28, Jan. 2019. https://doi.org/10.4018/ijdcf.2019010102
  9. A. Gupta and D. Singhal, "A simplistic global median filtering forensics based on frequency domain analysis of image residuals," ACM Transactions on Multimedia Computing, Communications, and Applications, vol. 15, no. 3, Sep. 2019.
  10. T. Ahonen, A. Hadid, and M. Pietikainen, "Face recognition with local binary patterns," European Conference on Computer Vision, pp. 469-481, May. 2004.
  11. G. Zhao, T. Ahonen, J. Matas, and M. Pietikainen, "Rotation-invariant image and video description with local binary pattern features," IEEE Transactions on Image Processing, vol. 21, no 4, Apr. 2012.
  12. R. Nosaka, C.H. Suryanto, and K. Fukui, "Rotation invariant co-occurrence among adjacent LBPs," Asian Conference on Computer Vision, pp. 15-25, Jan. 2012.
  13. R. Nosaka and K. Fukui, "HEp-2 cell classification using rotation invariant co-occurrence among local binary patterns," Pattern Recognition, vol. 47, no. 7, pp. 2428-2436, Jul. 2014. https://doi.org/10.1016/j.patcog.2013.09.018
  14. G. Schaefer and M. Stich, "UCID - an uncompressed colour image database," Proceedings of the SPIE, vol. 5307, pp. 472-480, Dec. 2003.