Adaptive Multi-class Segmentation Model of Aggregate Image Based on Improved Sparrow Search Algorithm

Mengfei Wang;Weixing Wang;Sheng Feng;Limin Li;

doi:10.3837/tiis.2023.02.006

KSII Transactions on Internet and Information Systems (TIIS)

Volume 17 Issue 2
/
Pages.391-411
/
2023
/
1976-7277(pISSN)
/
1976-7277(eISSN)

Korean Society for Internet Information (한국인터넷정보학회)

DOI QR Code

Adaptive Multi-class Segmentation Model of Aggregate Image Based on Improved Sparrow Search Algorithm

Mengfei Wang (School of Information, Chang'an University Xi'an) ;
Weixing Wang (School of Information, Chang'an University Xi'an) ;
Sheng Feng (Computer Science and Engineering, Shaoxing University) ;
Limin Li (School of Electrical and Electronic Engineering, Wenzhou University)

Received : 2022.08.18
Accepted : 2023.01.27
Published : 2023.02.28

https://doi.org/10.3837/tiis.2023.02.006 Citation PDF HTML

Download PDF

⟨ Previous Next ⟩

Abstract

Aggregates play the skeleton and supporting role in the construction field, high-precision measurement and high-efficiency analysis of aggregates are frequently employed to evaluate the project quality. Aiming at the unbalanced operation time and segmentation accuracy for multi-class segmentation algorithms of aggregate images, a Chaotic Sparrow Search Algorithm (CSSA) is put forward to optimize it. In this algorithm, the chaotic map is combined with the sinusoidal dynamic weight and the elite mutation strategies; and it is firstly proposed to promote the SSA's optimization accuracy and stability without reducing the SSA's speed. The CSSA is utilized to optimize the popular multi-class segmentation algorithm-Multiple Entropy Thresholding (MET). By taking three METs as objective functions, i.e., Kapur Entropy, Minimum-cross Entropy and Renyi Entropy, the CSSA is implemented to quickly and automatically calculate the extreme value of the function and get the corresponding correct thresholds. The image adaptive multi-class segmentation model is called CSSA-MET. In order to comprehensively evaluate it, a new parameter I based on the segmentation accuracy and processing speed is constructed. The results reveal that the CSSA outperforms the other seven methods of optimization performance, as well as the quality evaluation of aggregate images segmented by the CSSA-MET, and the speed and accuracy are balanced. In particular, the highest I value can be obtained when the CSSA is applied to optimize the Renyi Entropy, which indicates that this combination is more suitable for segmenting the aggregate images.

Keywords

1. Introduction

Aggregate is the main raw material of concrete, and the geometric characteristics of aggregates determine the mechanical properties of concrete [1]. Affected by aggregate source, rock type, crushing method and grinding degree, it is difficult to accurately and quickly measure the rough surface texture, edge shape, particle size and other characteristics of aggregate [2]. Image processing techniques can be utilized to assist feature detection of aggregates. However, it is more difficult to detect aggregates than other particles because the aggregate image is very noisy, and the aggregates overlap or touch each other.

Multi-class segmentation is an important in image segmentation algorithms, which can simultaneously segment multiple different features of an aggregated image [3]. Common methods are: Thresholding, Region merging and split, Clustering, and Semantic segmentation. They employ distinct color blocks to differentiate locations based on image discontinuities, color or grayscale similarity, texture, and other characteristics [4]. The Region-growing [5] is typically effective at segmenting smooth areas of aggregate images, and yet the algorithm suffers from severe under-segmentation when aggregate particles are adhesion. The Watershed segmentation [6] can be applied for the images of densely packed aggregate, but there is over-segmentation at arris of polygonal aggregates. The Clustering algorithm [7] is good at segmenting the aggregate images with clear edges, but the aggregate overlapping problem cannot be resolved. The Semantic segmentation [8] might solve the aggregate touching problem; but, the aggregate surface texture will affect the segmentation results if the dataset cannot cover all the situations. However, creating a database of images with various aggregate rough texture features is challenging. The Thresholding is a computationally simple algorithm, and the features of the image are usually at the valleys or peaks of the histogram. This capability can be exploited to differentiate aggregate surface textures and edges [9]. Multi-thresholding (MT) easily separates contacting aggregates compared to Single-thresholding, while preserving details such as surface roughness, grain edges, etc. [10]. Since the MT is not affected by gray-scale similarity, it is more robust. Multiple Entropy Thresholding (MET) [11-12] is a popular method for automatic threshold determination, besides Otsu [13]. The MET is more efficient than Otsu, it determines the thresholds through entropy, and the entropies frequently employed for multi-class image segmentation are Kapur [14], Minimum-cross [15] and Renyi [16]. Therefore, the MET is utilized as one of the primary linkages in the suggested model in this paper. However, when applying the exhaustive method, the MET must test each threshold combination one by one in order to pick the optimal thresholds suitable for image segmentation. As a result, the more thresholds there are, the lower the operational efficiency.

The swarm intelligence optimization method learns the population’s cooperative behavior in order to discover the target, and adopts a distributed iterative convergence strategy to achieve parameter optimization [17]. Compared with the exhaustive method, it can not only greatly reduce the time for MET to determine the threshold, but also does not degrade operating performance when the amount of thresholds grows [18-19]. Currently popular algorithms of this type are: Whale Optimization Algorithm (WOA) [20], Bacterial Foraging Algorithm (BFO), Gray Wolf Optimization (GWO) [21], Artificial Bee Colony Algorithm (ABCO), Particle Swarm Optimization (PSO) [22], Bat Algorithm (BAT), Mayfly Algorithm (MA) [23], Antlion Algorithm (ALO), Butterfly Optimization Algorithm (BOA) and Sparrow Search Algorithm (SSA) [24] etc. The precision, stability, convergence speed of algorithm optimization is affected by the population position, pathfinding and local optima. For example, the population of the GWO is distributed according to grade, resulting in good optimization precision. Whale spiral search in the WOA, which makes the WOA iterate faster. The SSA stores the sparrow’s position in the matrix to avoid repeated searches. And the population is divided into producers, scroungers, and vigilantes, each of which corresponds to two update methods, and the three kinds of sparrows are optimized at the same time, with high efficiency and robustness. But these algorithms have two disadvantages that lead to the performance degradation, that is, incomplete global search and falling into neighborhood optimum. For a more comprehensive search, chaotic map [25] is suggested, and Chen et al. [26] proposed reverse learning. For the local optimal solution, Levy flight [26] is suggested to jump out of the local area, and Liu et al. [25] made a Cauchy-Gauss mutation strategy. These evolutionary strategies are suitable for different optimization algorithms and objective functions; otherwise it is difficult to balance the optimization accuracy and convergence speed. Combining these performances, SSA [24] is currently a better optimization algorithm, and it has played a very good role in the parameter selection in the fields of path planning [27], production forecasting [28] and network selection [29]. Hence, this paper takes the SSA as one of the other main links of the proposed model, and optimizes the MET based on the improved SSA.

The following are the study’s primary contributions:

1. An adaptive multi-class segmentation model for aggregate images, CSSA-MET, is suggested. CSSA can quickly and accurately help three METs to determine the thresholds and to improve the segmentation efficiency.

2. A Chaotic SSA is suggested, in which the chaotic map is combined with the sinusoidal dynamic weight and the elite mutation strategies, and it is firstly studied to promote the SSA’s optimization accuracy and stability without reducing the SSA’s speed. On the benchmark function tests, the CSSA outperforms the other seven similar algorithms.

3. Numerous aggregate image segmentation experiments demonstrate the feasibility and effectiveness of the CSSA-MET for segmenting aggregate images. To evaluate all methods comprehensively, a new parameter I based on segmentation accuracy and processing speed is constructed. According to the results, the CSSA optimized Renyi Entropy performs best for segmenting aggregated particles images.

The remainder of this work is presented: The Section 2 includes the MET and SSA. The CSSA-MET is explained in full in Section 3. Section 4 contains the CSSA and CSSA-MET tests. Finally, this investigation is summarized in Section 5.

2. Preliminaries

In this section, three METs are presented in Subsection 2.1, and SSA is introduced in Subsection 2.2.

2.1 Multiple Entropy Thresholding

The MET is a multi-class image segmentation algorithm. Its principle is to find a set of values to make the total information entropy of the image reach the extreme value. These values are the segmentation thresholds. Divide the histogram utilizing thresholds, then translate the result to each pixel in the image and assign closest gray value per pixel. The MET determines the thresholds through entropy, and the entropies frequently employed are Kapur [14], Minimum-cross [15] and Renyi [16], which calculate different amounts of information.

Assuming that the image size is 𝑀 × 𝑁, and the image is grayscaled into 0~𝐺 levels, when the gray-scale value is 𝑖 the number of pixels is 𝑛_𝑖, and the probability 𝑖 occurrence is 𝑃_𝑖 = 𝑛_𝑖/(𝑀 × 𝑁). The gray-scale value range of the 𝑘-th area is [𝑔_𝑘−1, 𝑔_𝑘], 0 ≤ 𝑔_𝑘−1 ≤ 𝑔_𝑘 ≤ 𝐿. Then the average gray value of this area is 𝑢_𝑘, and the probability sum of the gray-scale values in this region is 𝜔_𝑘 = Σ_{𝑖=𝑔𝑘−1}^𝑔𝑘 𝑃_𝑖 . The information of this areas is 𝐻₁, 𝐻₂, ⋯, and 𝐻_𝐾+1 respectively, then the information total is 𝐸 = 𝐻₁ + 𝐻₂ + ⋯ +𝐻_𝐾+1. Calculate the thresholds 𝑔_{(1,2,⋯,𝐾)} that make the 𝐸 reach the maximum or minimum value, as shown in Fig. 1. The mathematical expression of MET calculation thresholds is shown in Table 1.

E1KOBZ_2023_v17n2_391_f0001.png 이미지

Fig. 1. Schematic diagram of MET determination thresholds.

Table 1. The thresholds determination methods of three METs

E1KOBZ_2023_v17n2_391_t0001.png 이미지

Table 2 shows the differences in their segmented aggregate images when 𝐾 = 2. It can be observed that the histogram fluctuates smoothly in some gray-scale intervals due to the aggregate’s gray scale, but it is rich in extreme points, which are typical features of the aggregate image, such as edges and surface rough texture. When 𝐾 is small, the accuracy is low, and as 𝐾 increases, the MET detects more features.

Table 2. The results of the MET splitting two kinds of aggregate particles when 𝐾 = 2

E1KOBZ_2023_v17n2_391_t0002.png 이미지

The dark aggregate image’s brightness values are concentrated in dark area, and the histogram has only one clear valley. Kapur and Renyi Entropy detected the more brighter part details, while Minimum-cross Entropy identified the more darker part features. However, bright aggregate image’s brightness values are concentrated in bright parts, and the histogram has no obvious valley. At this time, the focus of the three METs detection results is just the opposite. The major cause of this disparity is variance in the histogram. Since there are many extreme points in the histogram and their distance is close, even with only close thresholds, there can be significant differences in results. As a result, the optimization techniques directly affect the quality of image segmentation.

2.2 Sparrow Search Algorithm

The SSA [24] is a recent optimization algorithm for mimicking sparrow behavior and has better convergence accuracy, speed and robustness. The SSA stores each sparrow’s location 𝑥_𝑖,𝑗in each dimension in a matrix, 𝑖 ∈ [1, 𝑛], 𝑗 ∈ [1, 𝑑], 𝑛 is the number of sparrows and 𝑑 is the dimension.

Sparrows are divided into producers, scroungers and vigilantes. Producers are the core of the team, determine the direction of population movement, and are also the key to global search. The scroungers affect local convergence, and vigilante can react quickly when in danger. Each sparrow corresponds to two position update formulas, and these three kinds of sparrows update their positions respectively according to Table 3.

Table 3. Sparrow location update method

E1KOBZ_2023_v17n2_391_t0003.png 이미지

Fig. 2 left shows the optimization principle of the SSA. It is clear that the performance of the SSA is closely connected to sparrow’s population dispersion, optimization pathway, and localized optimum.

E1KOBZ_2023_v17n2_391_f0002.png 이미지

Fig. 2. SSA and CSSA optimization principle diagrams.

To boost the SSA’s effectiveness, Chen et al. [26] added Tent map, dynamic parameters, and Levy flight (CDLSSA) to the SSA, with increased accuracy but slower speed. Liu et al. [25] advocated a Cubic map and adaptive weight to optimize the SSA (CASSA). The speed of this algorithm was fast, but the accuracy was not greatly improved. At present, there is no single method that provides the optimal configuration of speed and precision.

3. Methods

This section the three evolution strategies of the CSSA are introduced in detail in Subsection 3.1, and the flow of the CSSA-MET is shown in Subsection 3.2.

3.1 CSSA

Chaotic Sparrow Search Algorithm (CSSA) makes targeted improvements to the three deficiencies of the SSA, corresponding to three evolutionary strategies. The optimization principle diagram of the CSSA is illustrated in Fig. 2 right. The initial sparrow positions are more evenly spread out, the optimization path is wider, and sparrows confined to a local area can successfully leap away from it.

3.1.1 Chaotic map

Chaotic system is a definite nonlinear system, which has the characteristics of good uniformity, high randomness, fast speed and low cost. Therefore, chaos maps are often used to change the state of the system. The initial position of the sparrows in the SSA is stochastic, and if the sparrows gather, it hinders the worldwide search. As a result, we suggested to adopt a chaotic map at initialization. To scatter the sparrows so that they are evenly distributed globally. Piecewise map [30] is a chaotic map with high precision and good stability, which can be described as (1).

\(\begin{aligned} x^{\prime}= \begin{cases} x(k) / P &\mbox P>x \geq 0 \\ 1-(x(k)-0.5) /(P-0.5) &\mbox 0.5>x \geq P \\ 1+(x(k)-0.5) /(P-0.5) &\mbox 1-P>x \geq 0.5, & \mbox P \in(0,1) \neq 0.5 \\ (1-x(k)) / P &\mbox 1>x \geq 1-P \end{cases} \end{aligned}\) (1)

The chaotic sequences are shown in Fig. 3. The ordinate is the frequency of occurrence of 𝑥, which represents the uniformity of 𝑥, which is used in the SSA to represent the uniformity of the mapped sparrow position. Piecewise map has significant randomization, and this chaos sequence is most uniform when 𝑃 = 0.4.

E1KOBZ_2023_v17n2_391_f0003.png 이미지

Fig. 3. Sequences of 𝑃 = 0.4, 0.6, 0.9 after Piecewise chaotic map when 𝑥(1) = 0.1.

Since the range [0, 1] of the chaotic map, it is required to translate into the target space’s limits [𝑙b, 𝑢b] during the CSSA initialization phase.

3.1.2 Sinusoidal dynamic weight

Instance 𝑦 = 𝑒^−𝑥 has an impact on the producers. The traveling distance quickly shrunk as 𝑡 rises. It lowers the capacity to do global searches, easier to enter the neighborhood optimum, which decreases the optimizing accuracy.

Therefore, this paper proposes a sinusoidal dynamic weight for adjusting the search range for the first time, which can be expressed as (2).

\(\begin{aligned} w= \begin{cases}\frac{1}{2} (w_{\max }-\sin (\frac{t \cdot \pi}{T_{\max }}-\frac{T_{\max }}{2}) & \mbox t<\frac{T_{\max }}{2} \\ W_{\min }+\sin (\frac{t \cdot \pi}{2 \cdot T_{\max }}) & \mbox t \geq \frac{T_{\max }}{2} \end{cases} \end{aligned}\) (2)

Where, 𝑤_max and 𝑤_min are the initial and later weight respectively. After many experiments, when 𝑤_max = 1 and 𝑤_min = −1, the convergence effect is the best.

In the early iteration, 𝑤 is large, population is scattered. In the later iteration, 𝑤 is small, which helps the sparrow to converge. Introduce (2) into the position update of the producers, then becomes (3).

\(\begin{aligned} x_{i, j}^{t+1}= \begin{cases} x_{i, j}^{t} \cdot \exp (-\frac{i}{\alpha \cdot T_{\max }}) \cdot w & \mbox R_{2} < ST \\ x_{i,j}^{t} + Q \cdot L & \mbox R_2 \geq ST \end{cases} \end{aligned}\) (3)

3.1.3 Elite mutation

The sparrows are trapped in a local optimum, and if it doesn’t get out early, it will cause better solutions to be missed.

Therefore, an elite mutation that executes swiftly is put forward. Only at finish for every iteration, sort individual fitness values, select an elite sparrow with the best fitness value, and update its position according to (4).

𝑥′_best= (𝑢b − 𝑙b) ∙ randn + 𝑙b (4)

Where, 𝑥′_bestis the mutated position of the elite sparrow.

When the SSA becomes stalled inside the nearby region, it can easily leap out and continue to global search to increase the algorithm’s performance. Even if this sparrow had converged to the global optimal solution, only changing the search path of one sparrow will not affect the final convergence result.

3.2 CSSA-MET

This paper proposes an adaptive multi-class segmentation model the CSSA-MET for aggregate images, which is composed of a swarm intelligence optimization algorithm CSSA and a multi-class segmentation algorithm MET. The flowchart of the CSSA-MET is shown in Fig. 4, and the purple font in the figure is the innovation point.

E1KOBZ_2023_v17n2_391_f0004.png 이미지

Fig. 4. Flowchart of adaptive multi-class segmentation model CSSA-MET.

4. Experiments

The benchmark function experiment for the optimization algorithm CSSA evaluation is in Subsection 4.1, and the evaluation experiment for the aggregate image adaptive multi-class segmentation model CSSA-MET is in Subsection 4.2.

Our studies are carried out with a computer outfitted with an Intel (R) Core (TM) i5-10400F @2.90 GHz CPU, 16 GB RAM and a 64-bit Win-10 operating system.

4.1 Performance of Optimization Algorithm CSSA

Benchmark functions are used to assess the performance of optimization algorithms. The capacity to locally converge is tested by the unimodal function, while the ability to globally explore is tested by the multimodal function.

The pertinent expressions are displayed in Table 4, and their versions are shown in Fig. 5 at the dimension (D) is 2.

Table 4. Benchmark functions

E1KOBZ_2023_v17n2_391_t0004.png 이미지

E1KOBZ_2023_v17n2_391_f0005.png 이미지

Fig. 5. 2-D representations of benchmark functions 𝐹₁~𝐹₆.

The CSSA is contrasted to the WOA [20], GWO [21], PSO [22], MA [23], SSA [24], CASSA [25], and CDLSSA [26]. The parameters are set in Table 5.

Table 5. Parameters of the optimization algorithms

E1KOBZ_2023_v17n2_391_t0005.png 이미지

The accuracy, stability, and speed of the optimization method are taken into consideration while choosing the average value, standard deviation and time-consuming as assessment measures. Since the population’s starting positions are stochastic, the average of 60 optimisation tests is utillized as the final evaluation result, these are provided in Table 6.

Table 6. Accuracy, stability and speed evaluation of eight optimization algorithms

E1KOBZ_2023_v17n2_391_t0006.png 이미지

On the uni-modal and multi-modal functions in Table 6, the accuracy (Avg) and stability (SD) of the CSSA are always optimal. Although the speed (T/s) of the CSSA is not optimal, it is also close to the optimal value. The CASSA has weak stability, whereas the CDLSSA has good accuracy and stability, but its speed is twice that of the CSSA, and only the CSSA achieves the optimal combination of speed and precision.

The iteration curves of the eight optimization methods are displayed in Fig. 6 for evaluation of the beneficial effects of these three evolutionary strategies on the CSSA. The first is chaotic mapping, which is reflected in the iteration curves of 𝐹₃~𝐹₆. The population’s starting position is more uniform, which makes the initial value of the CSSA better. The second is the dynamic weight, it is reflected at 𝐹₁~𝐹₂, 𝐹₆, and search range is wider, so that the optimal solution of the CSSA is continuously updated, and it converges to the global optimum in later iterations. Finally, the elite mutation is reflected in some broken lines on 𝐹₁, 𝐹₃, 𝐹₄ and 𝐹₆. When they fall into a local optimum, they can effectively escape and improve the solution’s accuracy.

E1KOBZ_2023_v17n2_391_f0006.png 이미지

Fig. 6. Convergence curves of eight optimization algorithms on 𝐹₁~𝐹₆.

Wilcoxon rank-sum test [31] is utilized to compare the significant distinction between two samples. This sample size in Table 6 is small. In order to avoid unreliable results, samples with 𝐷 = 2,60 were added to the test. Table 7 displays the results. Bold words in the table indicate significant differences (𝑃 − value ≤ 0.05).

Table 7. Wilcoxon test of CSSA vs. other optimization algorithms

E1KOBZ_2023_v17n2_391_t0007.png 이미지

It can be seen that there are significant differences between the CSSA and WOA, GWO, PSO, and MA, except for T of the WOA. This shows that the CSSA is far superior to these four algorithms in accuracy (Avg) and stability (SD), and the CSSA is similar to the WOA in speed (T). Similarly, CSSA is much better than SSA and CASSA in stability (SD), and much better than CDLSSA in speed (T).

In addition, it can also be seen that the SSA, CDLSSA, CASSA and CSSA are similar (no bold). In terms of accuracy and stability, the CDLSSA is the best, followed by the CASSA, and finally the SSA. In terms of speed, the CASSA is the best, followed by the SSA, and the CDLSSA is the worst. Combining these conclusions Table 6 shows that the CSSA has maximum accuracy, stability and also has a faster convergence speed.

4.2 Evaluation of Segmentation Model CSSA-MET

The MET is utilized as the optimization algorithm’s objective function, and they are merged with the CSSA and SSA one by one to form six techniques, including three CSSA-METs and three SSA-METs. Simultaneously, Fuzzy C-means (FCM) is compared.

The 100+ aggregate image tests are obtained in the Key Laboratory of Road Construction Technology and Equipment, and Fig. 7 displays five of them.

E1KOBZ_2023_v17n2_391_f0007.png 이미지

Fig. 7. Five aggregate images and their histograms.

From the point of view of the histogram, they contain abundant glitch-like extreme points, and these extreme points are close in distance, which are the characteristics of aggregates. When the surface texture is rougher, although the threshold is similar, it might lead to a decrease in roughness, so the optimization algorithm’s effectiveness will affect the segmentation precision.

From the perspective of aggregate characteristics, these particles have different characteristics such as shape, color, size, edge and surface rough texture. The features are used to evaluate gravel and pebbles, size aggregates, calculate sanding time, and detect parent rock type. During image processing, the quantity 𝐾 of thresholds is decided upon in accordance with needs. 𝐾 = 2~6 in relevant literatures, Table 7 displays partial results of the CSSA-MET segmented aggregate images when 𝐾 = {2,4,6}.

Because in Subsection 4.1, the CSSA only iterates 200 times on the 30-dimensional function to complete the convergence. After many experiments, only 100 iterations of the CSSA-MET efficiently segment images. The remaining parameters are identical to those used in the prior section, and just 100 repetitions are required to reduce segmentation time while maintaining accuracy.

From a subjective point of view, when 𝐾 = 2, the FCM segmentation effect is the best, second the Renyi. The MET’s precision continues to improve at 𝐾 = 4, while the FCM becomes increasingly unstable. When 𝐾 = 6 , Renyi segmented contact or overlapping aggregate particles performed well, second the Kapur, while the Minimum-cross brightness is higher. The grey-scale value divergences are severe even though the FCM’s edges are visible.

Table 8. Partial results of CSSA-MET segmentation of aggregate images

E1KOBZ_2023_v17n2_391_t0008.png 이미지

Overall, the CSSA-MET outperforms the SSA-MET on the segmentation of aggregate images because the CSSA can obtain more accurate thresholds. The criterion for measuring optimization method’s accuracy is whether optimized objective function can obtain a better fitness value. Table 9 displays the fitness values and corresponding thresholds obtained by the CSSA and SSA at 𝐾 = 6, the bolder of the two is superior. The higher the fitness value of Kapur and Renyi in Table 1, the better, while Minimum-cross is the opposite. The CSSA:SSA ratio for obtaining optimal values is 9:0 (Table 9), demonstrating that the CSSA must have superior capabilities. Compared with the SSA-MET, the CSSA-MET has a more accurate threshold for segmenting images.

Table 9. Thresholds determined by CSSA-MET and their corresponding fitness values

E1KOBZ_2023_v17n2_391_t0009.png 이미지

As the image size in Table 8 is too small to see the difference between the CSSA-MET and SSA-MET segmentation results, the local regions of (c) are intercepted when 𝐾 = 6. They are aggregate surface roughness, texture and edge, respectively, and each local image contains at least two main features. Table 10 shows their segmentation results. It can be seen that the segmented images of the CSSA-MET contain more detailed features, while the results of the SSA-MET lose a lot of specifics, particularly roughness.

Table 10. Partial local results for CSSA-MET segmentation of aggregate images

E1KOBZ_2023_v17n2_391_t0010.png 이미지

Combining Table 10 with the data of (c) in Table 9, it can be seen that even if the thresholds are not significantly different, it can cause a large difference in results. This demonstrates that the CSSA’s effectiveness is critical for segmentation models.

The above is a subjective evaluation, and the image segmentation algorithm also needs a comprehensive objective evaluation. The similarity between the segmentation result and the ground-truth must be compared in order to assess the segmented image’s reliability. Since the eye couldn’t mark the true under each 𝐾, the segmentation results are contrasted to the original images.

The segmentation results are contrasted with the original image since the eye was unable to distinguish the real under each K.

The Peak Signal to Noise Ratio (PSNR), Structure Similarity (SSIM), Feature Similarity (FSIM) and the average time (T) to segment an image are made as assessment criteria. Relevant expressions are shown in Table 11.

Table 11. Image quality evaluation index

E1KOBZ_2023_v17n2_391_t0011.png 이미지

Greater information and smaller noise levels are indicated by a higher PSNR score; SSIM ∈ [0, 1], the more similar the segmented picture is to the original image, the higher the SSIM value; FSIM ∈ [0, 1], the tinier the feature difference between the image before and after segmentation, the larger the FSIM value. Therefore, the higher the values of these three parameters, the better the segmentation results. The average values of sixty tests are utilized as the ultimate results because of the unpredictability of the optimization algorithm population’s beginning position. The statistical data of the PSNR, SSIM and FSIM are shown in Tables 12-14. The bold font in these tables is the value of the superior assessment criterion between the CSSA and SSA

From an optimization perspective, when the CSSA is compared to the SSA, the better PSNR values ratio is 30:0 (Table 12), the better SSIM values ratio is 25:7 (Table 13), and the better FSIM values ratio is 27:4 (Table 14). This means the capabilities of the CSSA to optimize the MET has an advantage in aggregate image segmentation. And as the number of thresholds increases, these benefits are grown. Especially in terms of the PSNR values, the CSSA-MET is better than the SSA-MET, which shows that the CSSA-MET can segment more details, and is suitable for all kinds of aggregate images, with high robustness.

Table 12. PSNR values

E1KOBZ_2023_v17n2_391_t0012.png 이미지

Table 13. SSIM values

E1KOBZ_2023_v17n2_391_t0013.png 이미지

Table 14. FSIM values

E1KOBZ_2023_v17n2_391_t0014.png 이미지

From the threshold perspective, when 𝐾 = 2, the PSNR values of the FCM are the maximum, but under the influence of the gray-scale values of the clustering centers and neighborhood pixels, the FCM performs worse and worse when 𝐾 = 4, 6. On the contrary, the performance of the CSSA-Renyi Entropy has been outstanding, achieving the best PSNR values at 𝐾 = 4, 6, following Kapur, although Minimum-cross’s results are slightly worse, but also better than the FCM. Similarly, on the SSIM and FSIM values, the CSSA-MET has achieved better results, and the FCM is similar to the above conclusion. These are the difference between the three METs in aggregate image segmentation.

Table 15 shows the standard deviation (SD) of the CSSA-MET and SSA-MET in 60 experiments, which can measure the stability of the algorithm, and the SD value is inversely proportional to the stability. Generally, the stability is the highest when 𝐾 = 2. As the K value increases, the SD value increases and the stability decreases. The figure of merit ratio of the CSSA-MET and SSA-MET is 19:8, indicating that the CSSA-MET has better stability.

Table 15. SD values

E1KOBZ_2023_v17n2_391_t0015.png 이미지

The average time-consuming of image segmentation of each algorithm is statistically calculated in Table 16. The ratio of the better T values of the CSSA-MET to the SSA-MET is 5:4. the CSSA’s three techniques did not affect the SSA’s efficiency, and the CSSA was occasionally faster than the SSA. The key cause is the aggregate histogram’s particularity, it has a lot of extreme points, resulting in a lot of local optimum in the optimization process. Elite mutation allows the SSA to easily slip into these locals, but the CSSA can jump out of them in time. As a result, the CSSA-MET is better suited for segmenting aggregate images.

Table 16. T values

E1KOBZ_2023_v17n2_391_t0016.png 이미지

For the same segmentation method, it is not always optimal in the four parameters, which brings trouble to the practical evaluation of the algorithms.

The segmentation models’ line graphs for the four evaluation parameters are displayed in Fig. 8. A method cannot achieve the optimal values on all parameters at the same time, therefore, these four indicators are normalized and fused into a new weight parameter 𝐼, 𝐼 ∈ [0, 1]. The algorithm’s advantage increases with increasing I value. The I value can be calculated by (5), and the I value statistics of all algorithms are summarized in Table 17.

E1KOBZ_2023_v17n2_391_f0008.png 이미지

Fig. 8. The segment methods’ line graph for the four assessment criteria. The abscissa corresponds to the CSSA-Kapur, SSA-Kapur, CSSA-Minimum-cross, SSA-Minimum-cross, CSSA-Renyi, SSA-Renyi, and FCM from left to right.

\(\begin{aligned}I=\sum \sum \frac{\mid \overline{I_{K, P}}-\text { worst }_{K, P} \mid}{N_{K, P} \cdot \Delta I_{K, P}}\end{aligned}\) (5)

Table 17. I values

E1KOBZ_2023_v17n2_391_t0017.png 이미지

Where, P = PSNR, SSIM, FSIM, T are the evaluation parameters, 𝐾 = 2, 4, 6 , ∆𝐼_𝐾,𝑃 = |best_𝐾,𝑃 − worst_𝐾,𝑃|, worst is the worst value, best is the optimal value, and 𝑁_𝐾,𝑃 is the number of (𝐾, 𝑃). In this study, 𝑁_𝐾,𝑃 = 3 × 4 = 12.

In Subsection 4.1, the Wilcoxon rank sum test showed that the SSA, CASSA, CDLSSA and CSSA were similar. Therefore, the comprehensive evaluation results of the CASS-MET and CDLSSA-MET are added in Table 17.

In Table 17, from a horizontal perspective, the I value of the CSSA-MET is always higher. Followed by the CASSA-MET, the CDLSSA-MET is affected by the T value, the comprehensive evaluation parameters are low, and the result of the SSA-MET is not very ideal. Vertically, the I value of Renyi Entropy is always higher. This is followed by Kapur Entropy and finally by Minimum-cross Entropy.

On the whole, the CSSA-Renyi Entropy has the strongest realizability for segmenting aggregate images. The result of the CSSA-Renyi Entropy segmenting aggregate image (e) when 𝐾 = 6 and the segmentation histogram are shown in Fig. 10. It can be seen that the threshold selection is reasonable, and the surface texture and edge of the aggregate are well preserved.

E1KOBZ_2023_v17n2_391_f0009.png 이미지

Fig. 9. CSSA-Renyi Entropy segmentation result and segmentation histogram of aggregate image (e) when 𝐾 = 6.

5. Conclusion

This paper proposes an adaptive multi-class image segmentation model for multi-aggregate images based on the Chaotic SSA, named CSSA-MET, it aims to overcome the shortcoming of MET’s unbalanced running time and accuracy in multi-class segmentation of aggregate images. Firstly, the CSSA combines Chaotic map, sinusoidal dynamic weights and elite mutation to improve the performance of the SSA. Benchmark function experiment and Wilcoxon rank-sum test verify better optimization ability of the CSSA. Then, the CSSA is utilized to quickly determine the correct MET thresholds. In the various segmentation experiments, the CSSA-MET is proved that can segment aggregate images in more aggregate details. And finally, a normalized weight parameter I is proposed to evaluate the performance of segmented images, which integrates PSNR, SSIM, FSIM and T, and all five indicators show that the CSSA-MET is more effective than the SSA-MET and FCM, and among the three METs, the CSSA optimized Renyi Entropy performs is the best, achieving the best balance between speed and accuracy, and effectively retaining rough surface texture and edge features.

In future work, we will explore the image segmentation focus of each method and try to fuse them to achieve parallel classification of multiple aggregate features. In addition, the three evolutionary strategies, the CASSA and CASSA-MET models in this paper can be used in similar fields and have broad application prospects.

References

Y. Zhou, H. Jin, Wang B, "Modeling and mechanical influence of meso-scale concrete considering actual aggregate shapes," Construction and Building Materials, vol. 228, pp. 116785, Dec. 2019.
K. Ma, X. Huang, "The morphological characteristics of brick-concrete recycled coarse aggregate based on the digital image processing technique," Journal of Building Engineering, vol. 44, pp.1032192, Dec.
Y. Wang, M. Shabaninejad, R. Armstrong, et al., "Deep neural networks for improving physical accuracy of 2D and 3D multi-mineral segmentation of rock micro-CT images," Applied Soft Computing, vol. 104, pp. 107185, Jun.
R. Grewal, S. Kasana, "Hyperspectral image segmentation: a comprehensive survey," Multimed Tools Appl., vol. 2022, pp.1-54, Oct. 2022.
Z. Cheng and J. Wang, "Improved region growing method for image segmentation of three-phase materials," Powder Technol., vol. 368, pp. 80-89, May, 2020. https://doi.org/10.1016/j.powtec.2020.04.032
Q. Guo, Y. Wang, and S. Yang, et al., "A method of blasted rock image segmentation based on improved watershed algorithm," Sci. Rep., vol. 12, Article no.7143, May, 2022.
Y. Wang, W. Tu, and H. Li, "Fragmentation calculation method for blast muck piles in open-pit copper mines based on three-dimensional laser point cloud data," Int. J. Appl. Earth Obs., vol. 100, pp. 102338, Aug.
Y. Liu, Z. Zhang, X. Liu, et al, "Efficient image segmentation based on deep learning for mineral image classification," Advanced Powder Technology, vol. 32, no. 10, pp. 3885-3903, Oct.
Y. Ju, H. F. Sun, and M. X. Xing, et al., "Numerical analysis of the failure process of soil-rock mixtures through computed tomography and PFC3D models," Int. J. Coal Sci. Technol., vol. 5, no. 2, pp. 126-141, Jan. 2018. https://doi.org/10.1007/s40789-018-0194-5
X. Liu, J. Yan, and X. Zhang, et al., "Numerical upscaling of multi-mineral digital rocks: Electrical conductivities of tight sandstones," J. Petrol. Sci. Eng., vol. 201, pp. 108530, Jun. 2021.
D. Zhao, L. Liu, F. Yu, et al., "Chaotic random spare ant colony optimization for multi-threshold image segmentation of 2D Kapur entropy," Knowledge-Based Systems, vol. 216, pp. 106510, Mar. 2021.
H. Gill, B. Khehra, A. Singh, et al., "Teaching-learning-based optimization algorithm to minimize cross entropy for Selecting multilevel threshold values," Egyptian Informatics Journal, vol. 20, no. 1, pp. 11-25. Mar. 2019. https://doi.org/10.1016/j.eij.2018.03.006
C. Huang, X. Li, Y. Wen, "An OTSU image segmentation based on fruitfly optimization algorithm," Alexandria Engineering Journal, vol. 60, no. 1, pp. 183-188, Feb. 2021. https://doi.org/10.1016/j.aej.2020.06.054
P. Upadhyay, J. Chhabra, "Kapur's entropy based optimal multilevel image segmentation using crow search algorithm," Applied soft computing, vol. 97, pp. 105522, Dec. 2020.
R. Chakraborty, R. Sushil, M. Garg, "An Improved PSO-Based Multilevel Image Segmentation Technique Using Minimum Cross-Entropy Thresholding," Arab J Sci Eng., vol. 44, pp. 3005- 3020, Jun. 2019. https://doi.org/10.1007/s13369-018-3400-2
L. Peng, D. Zhang, "An adaptive Levy flight firefly algorithm for multilevel image thresholding based on Renyi entropy," The Journal of Supercomputing, vol. 78, no. 5, pp. 6875-6896, Oct. 2022. https://doi.org/10.1007/s11227-021-04150-3
J. Tang, G. Liu and Q. Pan, "A Review on Representative Swarm Intelligence Algorithms for Solving Optimization Problems: Applications and Trends," IEEE/CAA Journal of Automatica Sinica, vol. 8, no. 10, pp. 1627-1643, Oct. 2021. https://doi.org/10.1109/JAS.2021.1004129
E. Houssein, E. Helmy, D. Oliva, et al., "Multi-level thresholding image segmentation based on nature-inspired optimization algorithms: a comprehensive review," Metaheuristics in Machine Learning: Theory and Applications, vol. 2021, pp. 239-265, Jul. 2021.
L. Ren, A. Heidari, Z. Cai, et al., "Gaussian kernel probability-driven slime mould algorithm with new movement mechanism for multi-level image segmentation," Measurement, vol. 192, pp. 110884, Mar. 2022.
S. Mirjalili and A. Lewis, "The whale optimization algorithm," Adv. Eng. Softw., vol. 95, pp. 51-67, May, 2016. https://doi.org/10.1016/j.advengsoft.2016.01.008
S. Mirjalili, S. M. Mirjalili, and A. Lewis, "Grey wolf optimizer," Adv. Eng. Softw., vol. 69, pp. 46-61, Mar. 2014. https://doi.org/10.1016/j.advengsoft.2013.12.007
J. Kennedy and R. Eberhart, "Particle swarm optimization," in Proc. of ICNN'95-International Conference on Neural Networks, Perth, WA, Australia, pp. 1942-1948, 1995.
K. Zervoudakis and S. Tsafarakis, "A mayfly optimization algorithm," Comput. Ind. Eng., vol. 145, pp. 106559, Jul. 2020.
J. K. Xue and B. Shen, "A novel swarm intelligence optimization approach: sparrow search algorithm," Syst. Sci. Control Eng., vol. 8, no. 1, pp. 22-34, Dec. 2020. https://doi.org/10.1080/21642583.2019.1708830
G. Liu, C. Shu, and Z. Liang, et al., "A modified sparrow search algorithm with application in 3d route planning for UAV," Sensors, vol. 21, no. 4, pp. 1224, Feb. 2021.
X. Chen, X. Huang, and D. Zhu, et al., "Research on chaotic flying sparrow search algorithm," Journal of Physics: Conference Series, vol. 1848, no. 1, pp. 012044, Jan. 2021.
Z. Zhang, R. He, K. Yang, "A bioinspired path planning approach for mobile robots based on improved sparrow search algorithm," Adv. Manuf., vol. 10, pp. 114-130, Aug. 2022. https://doi.org/10.1007/s40436-021-00366-x
X. Li, X. Ma, F. Xiao, et al., "Time-series production forecasting method based on the integration of Bidirectional Gated Recurrent Unit (Bi-GRU) network and Sparrow Search Algorithm (SSA)," Journal of Petroleum Science and Engineering, vol. 208, pp. 109309, Jan. 2022.
P. Kathiroli, K. Selvadurai, "Energy efficient cluster head selection using improved Sparrow Search Algorithm in Wireless Sensor Networks," Journal of King Saud University-Computer and Information Sciences, vol. 34, no. 10, pp. 8564-8575, Nov. 2022. https://doi.org/10.1016/j.jksuci.2021.08.031
A. Tutueva, E. Nepomuceno, A. Karimov, et al., "Adaptive chaotic maps and their application to pseudo-random numbers generation," Chaos, Solitons and Fractals, vol. 133, pp. 109615, Apr. 2020.
R. Barros, J. Hidalgo, D. Lima Cabral, "Wilcoxon rank sum test drift detector," Neurocomputing, vol. 275, pp. 1954-1963, Jan. 2018. https://doi.org/10.1016/j.neucom.2017.10.051

KSII Transactions on Internet and Information Systems (TIIS)

Adaptive Multi-class Segmentation Model of Aggregate Image Based on Improved Sparrow Search Algorithm

Abstract

Keywords

1. Introduction

2. Preliminaries

2.1 Multiple Entropy Thresholding

2.2 Sparrow Search Algorithm

3. Methods

3.1 CSSA

3.1.1 Chaotic map

3.1.2 Sinusoidal dynamic weight

3.1.3 Elite mutation

3.2 CSSA-MET

4. Experiments

4.1 Performance of Optimization Algorithm CSSA

4.2 Evaluation of Segmentation Model CSSA-MET

5. Conclusion

References

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)