iconOpen Access

ARTICLE

Liver Tumor Prediction with Advanced Attention Mechanisms Integrated into a Depth-Based Variant Search Algorithm

P. Kalaiselvi1,*, S. Anusuya2

1 Department of Computer Science and Engineering, Saveetha School of Engineering, Chennai, Tamil Nadu, 602117, India
2 Information Technology, Saveetha School of Engineering, Chennai, Tamil Nadu, 602117, India

* Corresponding Author: P. Kalaiselvi. Email: email

Computers, Materials & Continua 2023, 77(1), 1209-1226. https://doi.org/10.32604/cmc.2023.040264

Abstract

In recent days, Deep Learning (DL) techniques have become an emerging transformation in the field of machine learning, artificial intelligence, computer vision, and so on. Subsequently, researchers and industries have been highly endorsed in the medical field, predicting and controlling diverse diseases at specific intervals. Liver tumor prediction is a vital chore in analyzing and treating liver diseases. This paper proposes a novel approach for predicting liver tumors using Convolutional Neural Networks (CNN) and a depth-based variant search algorithm with advanced attention mechanisms (CNN-DS-AM). The proposed work aims to improve accuracy and robustness in diagnosing and treating liver diseases. The anticipated model is assessed on a Computed Tomography (CT) scan dataset containing both benign and malignant liver tumors. The proposed approach achieved high accuracy in predicting liver tumors, outperforming other state-of-the-art methods. Additionally, advanced attention mechanisms were incorporated into the CNN model to enable the identification and highlighting of regions of the CT scans most relevant to predicting liver tumors. The results suggest that incorporating attention mechanisms and a depth-based variant search algorithm into the CNN model is a promising approach for improving the accuracy and robustness of liver tumor prediction. It can assist radiologists in their diagnosis and treatment planning. The proposed system achieved a high accuracy of 95.5% in predicting liver tumors, outperforming other state-of-the-art methods.

Keywords


1  Introduction

DL has a remarkable impact on distinct fields in the latest scientific research, which ensures relevant developments in image and speech recognition [1]. On the other hand, DL techniques can significantly track artificial users to overcome humans in order to develop creative images and music players. However, the advancement of DL can support the activities, as mentioned earlier, that are examined to be impractically determined by computer systems [2]. Furthermore, DL technology is highly suitable for medical image processing. The information and Communication Technology (ICT) mechanism with DL promotes researchers to identify medical-based requirements at the earliest to increase the endurance of proposed techniques. The ICT mechanism [3] has dramatically assisted medical specialists and healthcare professionals in their traditional activities, research and development, and analyzing the concern of prescription on patients using various DL simulation techniques. However, doctors obtain every patient’s information, such as medical history, instruction in medicine, medical test reports, X-rays, diagnosis, magnetic resonance imaging (MRI), CT scan, etc. Alternatively, this data becomes massive stored information and an extensive information repository.

On the contrary, if data gets submerged, it ultimately provides a better vision of treatment, practical suggestions during diagnosis, and permissive improvement of the common syndrome could be practised to another defect, leading to expanding alternate procedures for treating specific diseases and so on. Hence, DL simulation further conspires to assist medical professionals in passing over the symptoms not previously confirmed; thus, it causes damage to life [4]. Therefore, mechanisms like DL ensure better healthcare resources.

Liver cancer is a significant health issue globally. According to estimates, it is the fourth most prevalent factor in cancer-related fatalities globally and the sixth most frequently diagnosed malignancy [5]. Furthermore, the projection of liver cancer patients is deprived, with a five-year survival rate of less than 20% for advanced stages. Therefore, early detection and accurate diagnosis of liver tumors are crucial to improving patient outcomes, as it enables timely treatment and better management of the disease. Traditional diagnostic methods such as ultrasound, computed tomography (CT), as well as MRI are broadly utilized in clinical observation for liver tumor (LT) detection and diagnosis [6]. However, these methods have some limitations, such as low sensitivity, specificity, and accuracy. Additionally, they require significant expertise to interpret the results and are often time-consuming, expensive, and invasive [7]. Therefore, there is a need for more accurate and efficient diagnostic methods that can overcome these limitations. DL techniques, particularly CNNs, have shown significant potential in medical image analysis and diagnosis in recent years. CNNs are artificial neural networks that are designed to process and analyze images [8]. They are particularly suitable for medical image analysis because they can automatically learn and extract features from the images without manual feature extraction. This ability makes CNNs ideal for automatically detecting and diagnosing liver tumors. Several studies have been conducted to develop CNN-based models for liver tumor detection and diagnosis [9]. These studies have shown promising results, with high accuracy rates achieved in distinguishing between malignant and benign tumors. However, most of these studies have been conducted on small datasets, limiting their generalizability and applicability to clinical practice.

In this research article, this paper present a CNN-based model intended to predict liver tumors using medical images. Proposed approach is trained on a large dataset of liver tumor images and achieves high accuracy in distinguishing between malignant and benign tumors. The results of proposed work reveal the latent of deep learning in improving the accuracy and efficiency of liver tumor diagnosis. This research is expected to have significant implications in the field of liver tumor analysis and treatment, as it has the potential to improve patient outcomes by enabling early detection and accurate diagnosis of liver tumors.

The research issue is the accurate prediction of liver tumors using advanced deep learning techniques, specifically Convolutional Neural Networks (CNN) and attention mechanisms, to improve diagnostic accuracy and assist in effective treatment planning for liver diseases. The objectives of the research are:

•   To develop a novel approach for predicting liver tumors using CNN-DS-AM.

•   To assess the performance of the proposed approach on a CT scan dataset containing both benign and malignant liver tumors.

•   To evaluate the effectiveness of incorporating attention mechanisms and a depth-based variant search algorithm in improving the accuracy and robustness of liver tumor prediction.

•   To compare the performance of the proposed approach with other state-of-the-art methods.

•   To provide accurate predictions of liver tumors to assist radiologists in diagnosis and treatment planning.

The article begins with a thorough review of related work in the field of liver tumor prediction using deep learning techniques, attention mechanisms, and variant search algorithms. It then introduces the proposed methodologies, including the CNN-DS-AM architecture, which incorporates an encoder, attention mechanisms, a depth-based variant search model, and a decoder. The experimental setup and simulation results are presented, detailing the dataset, preprocessing steps, evaluation metrics, and comparative analysis with existing methods. The article summarizes the research findings, emphasizing the improved accuracy of liver tumor prediction and its potential impact on clinical practice. Future research directions and limitations are also discussed, offering insights for further advancements in this area.

2  Related Work

Several studies have been conducted in recent years on using deep learning techniques, particularly CNNs, for liver tumor detection and diagnosis. These studies have demonstrated the latent of CNNs in improving the accuracy and efficiency of liver tumor diagnosis. One of the early studies on using CNNs for liver tumor diagnosis was conducted by Ding et al. [10]. They developed a 3D-CNN model for liver tumor recognition and classification using CT images. Their model achieved high accuracy in distinguishing between malignant and benign tumors, demonstrating the potential of DL in improving liver tumor diagnosis. In a more recent study, Liu et al. [11] proposed a CNN-based model for automatic detection in addition to the classification of liver tumors using multiphase CT images. They used a 3D-CNN model that combined three phases of CT images to detect and classify liver tumors. Their model achieved high accuracy in distinguishing between malignant and benign tumors, demonstrating the potential of CNNs in improving the precision as well as efficiency of liver tumor analysis.

Another study by Zhang et al. [12] proposed a multi-task CNN-based model for the automatic detection as well as classification of liver tumors by means of CT images. Their model was designed to detect and classify liver tumors simultaneously and achieved high accuracy in both tasks. They also showed that their model outperformed other methods in correctness, demonstrating the potential of deep learning in liver tumor diagnosis. Finally, a study by Choi et al. [13] developed a CNN-based model for predicting liver tumors using MRI images. Their model was designed to extract features from the MRI images and classify liver tumors as malignant or benign. They achieved high accuracy in distinguishing between malignant and benign tumors, demonstrating the potential of CNNs to improve the accuracy of liver tumor diagnosis using MRI images.

Kuo et al. [14] proposed a CNN-based model for detecting and segmenting liver tumors in CT images. Their model used a combination of 2D in addition to 3D-CNNs to detect and segment liver tumors. They achieved high accuracy in both tasks and showed that their model outperformed other methods. In a study by Wang et al. [15], a CNN-based model was developed to predict liver tumor histology using MRI images. Their model was designed to classify liver tumors into four different histological subtypes and achieved high accuracy in classification. They demonstrated the potential of DL in improving the accuracy of liver tumor diagnosis based on histology. Wang et al. [15] proposed a CNN-based model for the recognition and the segmentation of liver tumors in ultrasound images. Their model was designed to handle the challenges of ultrasound imaging, such as low contrast and noise. They achieved high accuracy in both tasks and showed that their model outperformed other methods in terms of accuracy.

Using a deep-learning approach, Zhong et al. [16] focused on liver tumor segmentation in ultrasound images. The authors proposed a method that combines Convolutional Neural Networks (CNNs) with an attention mechanism for accurate and efficient segmentation of liver tumors. They demonstrated the effectiveness of their approach through experiments on a dataset of ultrasound images. Wu et al. [17] presented a method for liver tumor segmentation in CT images using Convolutional Neural Networks (CNNs) and an enhanced loss function. The authors propose a modified U-Net architecture and introduce a novel loss function that combines weighted cross-entropy and dice loss to improve segmentation accuracy. Cui et al. [18] focused on liver CT image registration, an essential step in various medical imaging applications. The authors propose a network-based approach for accurately registering liver CT images. They utilize a Convolutional Neural Network (CNN) to extract features from the images and incorporate a deformation field to capture the spatial transformation. Li et al. [19] introduced a deep belief network (DBN) for liver tumor classification and diagnosis based on ultrasound images. The authors propose a multi-layered DBN architecture that leverages deep learning models’ hierarchical representation learning capabilities. They train and evaluate their model using a dataset of ultrasound images and demonstrate its effectiveness in accurately classifying liver tumors.

These studies in Table 1 demonstrate the potential of deep learning techniques, particularly CNNs, to improve the accuracy and efficiency of liver tumor detection and diagnosis using different types of medical images. They also highlight the importance of large datasets for training CNN-based models, enabling the models to learn and extract features from the images accurately. They also emphasize the importance of developing robust CNN-based models that can handle medical image analysis challenges, such as low contrast, noise, and variations in image quality. Proposed research builds on these studies by developing a CNN-based model for the prediction of liver tumors using a large dataset of liver tumor images and achieving high accuracy in distinguishing between malignant and benign tumors.

images

3  Proposed Methodologies

The proposed work consists of a series of distinct neural network architectures, referred to as ver0, ver1, ver2, and ver3, respectively. These architectures are designed to improve the performance of liver tumor prediction. Each architecture represents a different version of the stacked Convolutional Neural Networks (CNN) model, with increasing complexity and additional layers.

Version 0 (ver0) is the initial architecture with essential input processing layers and subsequent layers consisting of Convolutional and MaxPool (Subsampling) operations, followed by an output layer. In ver1, we introduce additional convolutional and subsampling layers to capture more intricate features in the CT scan images. Ver2 further enhances the architecture by incorporating residual connections and increasing the depth of the network. In ver3, we introduce a novel attention mechanism after the second convolutional layer to emphasize the regions relevant for tumor detection.

We conducted an ablation test to compare the performance of these different architectures. The test involved training and evaluating each architecture using the same dataset of liver CT scan images. We measured various performance metrics, including accuracy, precision, recall, and F-measure, to assess their effectiveness in predicting liver tumors [2025]. The results of the ablation test demonstrated that ver3, with the inclusion of the attention mechanism, achieved the highest accuracy of 97.99%. This version outperformed the other architectures, showcasing the importance of incorporating attention mechanisms to focus on specific regions of the input relevant to tumor detection. We selected ver3 as the final architecture for liver tumor prediction based on these results. The following section goes into great depth about the remaining parts of the suggested work. In addition, the overall content of the neighboring layers is regularly incremented, including standard attributes; hence incremental fields are exposed, inclusive of adjacent versions.

In this architecture, two attention mechanisms (Attention Mechanism 1 and Attention Mechanism 2) are added after the second and fourth convolutional layers, respectively, as shown in Fig. 1. Each attention mechanism generates a set of attention weights that are multiplied element-wise with the feature maps from the previous layer, resulting in a group of weighted feature maps that are fed into the subsequent layer [2628]. The output of the final fully connected layer is passed through a softmax layer to generate the final production, which indicates the probability of the input CT scan containing a liver tumor. By incorporating attention mechanisms into the CNN model, the model can focus on specific regions of the input that are most relevant for tumor detection, potentially enhancing the accurateness of the model as well as reducing the number of false positives.

images

Figure 1: Systematic architecture of proposed approach

3.1 Process Flow for Predicting Liver Tumors

Here is the detailed process flow of incorporating attention mechanisms into the CNN and depth-based variant search model for predicting liver tumors:

Data Collection: The first step in the methodology is to collect a dataset of CT scans of patients with liver tumors. The dataset should include scans with varying degrees of complexity and different types of liver tumors.

Preprocessing: The CT scans are preprocessed to normalize the image intensity values and resize the images to a standard size. Techniques for enhancing data, flipping, rotating, and zooming, are also applied to increase the diversity of the dataset and perk up the generalization capability of the model.

CNN Architecture: A CNN is designed to extract features from preprocessed images. The CNN structural design consists of multiple convolutional layers, pooling layers, as well as thoroughly connected layers. The choice of the number and types of layers depends on the dataset’s complexity and the model’s desired performance.

Depth-based Variant Search: The CNN is followed by a depth-based variant search algorithm that searches for similar image patches in the dataset. This algorithm helps improve the model’s accuracy by accounting for variations in the shape and size of liver tumors.

Attention Mechanisms: Attention mechanisms are integrated into the CNN to improve the model’s performance. Specifically, a self-attention mechanism is added to the model to allow the network to focus on relevant image features selectively. This helps to perk up the model’s capability to detect subtle changes in the liver tissue and accurately identify the location and shape of liver tumors.

Training and Testing: The model is trained using the preprocessed dataset, and the outcome of the approach is assessed using a separate testing dataset. The model is optimized using an appropriate loss function, and the weights of the model are updated using an optimization algorithm such as Stochastic Gradient Descent (SGD).

Evaluation Metrics: The performance of the model is evaluated using standard evaluation metrics such as accuracy, precision, recall, as well as F1-score. The metrics are calculated using the true positive, true negative, false positive, as well as false negative values obtained from the testing dataset.

Following this methodology, we can build an accurate and robust model for predicting liver tumors using medical imaging. In addition, incorporating attention mechanisms into the CNN and depth-based variant search model can aid in perking up the correctness of the model and diminish the number of false positives and negatives.

3.2 Advanced Attention Mechanism

Advanced attention mechanisms extend traditional attention mechanisms that allow deep learning models to capture more complex and intricate relationships in the input data. They are designed to handle multiple sources of attention and will enable the model to attend to different aspects of the input data in a more nuanced and fine-grained way. The input data is first encoded into a high-dimensional feature representation using a neural network, such as a Convolutional Neural Network (CNN). Next, the attention mechanism computes attention scores for each element in the input data. These attention scores are typically learned during the training process and indicate the relative importance of each component in the input. The attention scores are used to weight the input features, with more attention given to the elements with higher scores. The weighted input features are then combined to produce a single output representation, which captures the most essential aspects of the input. In advanced attention mechanisms, the model can attend to different aspects of the input data in a more fine-grained way. For example, the model may listen to other parts of an image based on color, texture, or shape. This allows the model to capture more complex relationships in the input data. In some cases, the output of the attention mechanism is fed back into the input, allowing the model to refine its attention over multiple iterations. This can lead to more accurate and robust attention mechanisms.

In Fig. 2, the input CT scans are fed into the CNN model, including convolution, pooling, and dense layers. The attention mechanism is incorporated into the model after the pooling layer, where an attention map is generated based on the input CT scans. The attention map is used to compute attention weights, which are then multiplied by the feature maps generated by the CNN model. The resulting attention vector is concatenated with the output of the pooling layer as well as fed into a fully connected layer. Finally, the output layer uses a sigmoid activation function to generate a prediction of the likelihood of liver tumor presence. The advanced attention mechanism involves computing attention weights based on the input CT scans, which are then multiplied by the feature maps generated by the CNN model. Attention mechanism allows the model to spotlight essential regions of the given input images, which can be particularly useful for detecting small or subtle liver tumors.

images

Figure 2: Flow of attention mechanisms with CNN

3.3 Routing Procedure for CNN_DbVS

1.    Import the decisive libraries that supports image based classification

2.    Apply training dataset for data pre-processing mechanism

3.    Create multi layered architecture subject to the requirements

4.    Select the layers as [5,7,9,11]

5.    If necessary features are selected then continue with classification process using DeepLearning4j for derived architecture

6.    Calculate the performance metrics to review system performance

7.    In row, analyze the findings [true/false]

8.    If accuracy is reached for real at that time deploy the model

9.    If not reached, move to step 3 and repeat the process.

The criteria for selecting layers [5,7,9,11] for CNN_DbVS are based on factors such as: Depth of the network, overall complexity of the model, Feature extraction, empirical evidence, computational considerations, such as available resources (memory, processing power).

3.4 Algorithm for the Proposed CNN-DS-AM Approach

Input: Take the input CT scan images.

Encoder:

1.    Apply convolutional layers with ReLU activation to extract features from the input images.

2.    Utilize pooling layers to downsample the feature maps and capture relevant patterns.

Attention Mechanism:

1.    Incorporate an attention mechanism to assign importance weights to the feature maps.

2.    Multiply the attention weights with the feature maps to highlight regions relevant to tumor detection.

Depth-based Variant Search Model:

1.    Further process the weighted feature maps using additional convolutional layers.

2.    Apply non-linear transformations to enhance the discriminative properties of the features.

3.    Utilize pooling layers for downsampling and retaining relevant information.

Decoder:

1.    Perform upscaling operations to reconstruct the refined feature maps.

2.    Utilize convolutional layers to restore spatial details and generate the final prediction.

3.    Output: Obtain the predicted probability map indicating the presence of liver tumors.

This algorithm outlines the high-level steps involved in the CNN-DS-AM architecture. It follows a sequential flow, starting with feature extraction in the encoder, incorporating attention mechanisms to focus on important regions, applying depth-based variant search techniques for feature refinement, and utilizing the decoder to reconstruct the final prediction.

3.5 Depth-Based Variant Search Algorithm

The depth-based variant search algorithm is a powerful technique for optimizing the architecture of CNNs. By automatically searching for optimal network architecture, the algorithm can effectively capture the essential features of liver tumors in CT scans while minimizing the number of parameters and computational cost of the network [2932]. This process is essential because manually designing optimal network architecture for liver tumor prediction can be challenging and time-consuming, especially given the complexity of CT scans. Batch normalization is a commonly used technique in deep learning that helps to accelerate training and improve the performance of neural networks. It works by normalizing the activations of each layer so that they have zero mean and unit variance. This process can help mitigate the effect of vanishing and exploding gradients, making training deep networks difficult. In the liver tumor prediction work described earlier, this article also used batch normalization to improve the performance of their Convolutional Neural Network (CNN) model. Specifically, they embedded batch normalization layers within the CNN architecture to normalize each layer’s activations during training.

The main advantage of batch normalization is that it can help accelerate the convergence of the training process by reducing the internal covariate shift. This process is a phenomenon where the distribution of the activations changes during training as the network parameters are updated. When this happens, the gradients can become less informative, slowing the learning process [33]. Batch normalization helps to address this issue by normalizing the activations of each layer so that they have a more consistent distribution throughout the network [34]. This process helps stabilize the gradients and reduce the number of epochs required to perform well. In the proposed work, the liver tumor prediction work found that embedding batch normalization layers within their CNN architecture could improve their model’s accuracy. Specifically, they reported that the addition of batch normalization layers improved the accuracy of their model by approximately 2% over the baseline CNN architecture.

4  Experimental Setup and Simulation Results

The proposed CNN-DS-AM approach is designed using the TensorFlow framework. In addition to, the experiments are performed on a workstation with an NVIDIA GeForce GTX 1080 Ti GPU. The dataset used for this study is the publicly available Liver Tumor Segmentation (LiTS) dataset, which contains 131 CT scans of the liver with tumors of varying sizes and shapes. The CT images in the LiTS dataset are of different sizes, resolutions, and orientations. Therefore, before feeding them to the CNN, resize them to a fixed length of 256 × 256 and normalize them to have zero mean as well as unit variance. Also convert the CT images to Hounsfield units (HU) and apply a windowing technique to enhance the contrast of the ROIs. Four convolutional layers (CL) and two fully linked layers make up the CNN in the suggested model. The input to the CNN is a 256 × 256 × 1 CT image, and the output is a binary classification of whether the image contains a tumor. The dataset is partitioned into three subsets: training set, validation set, and test set. The split is 70%–80% for training, 10%–15% for validation, and 10%–15% for testing. A max-pooling layer is placed after each CL to minimize the spatial dimension of feature maps. The amount of filters in the CL increases from 32 to 128 while the filter size remains fixed at 3 × 3. The activation function used in the CNN is the rectified linear unit (ReLU), which helps to avert the disappearance gradient problem. The final yield of the CNN is passed through a sigmoid activation function to obtain a probability score for tumor presence.

To improve the performance of the CNN in detecting ROIs containing tumors, this paper brings in an attention mechanism that focuses on the informative regions of the image. Specifically, utilizing a spatial attention mechanism that learns a set of weights for each spatial location in the feature maps of the CNN. These weights are then multiplied element-wise with the feature maps to obtain a weighted feature map that emphasizes the informative regions. The weights are learned by a separate attention network that takes the feature maps of the CNN as input and produces the attention weights as output. The attention network is composed of two CLs, which are accompanied by either a fully connected layer as well as a layer that pools global averages. To enhance proposed suggested precision even more, utilizing a depth-based variant search model that considers the structural information of the liver images. Specifically, divide the liver images into several depth layers and extract the features of each layer using CNN. Then concatenate the layers’ features and pass them through a fully connected layer to obtain a depth-based feature vector. The depth-based feature vector is then passed through another fully connected layer to get the final prediction score.

The anticipated model’s set of parameters, as displayed in Table 2. These parameters include the learning rate, batch size, number of epochs, optimizer, loss function, dropout rate, attention mechanism, number of attention heads, number of encoder and decoder layers, number of filters as well as kernel size in the CLs, max pooling size, and input shape. These parameters were selected based on experimentation and hyperparameter tuning to achieve the best performance of the model. The learning rate determines the optimizer’s scaling factor for adjusting the set of parameters throughout training. The sample amount utilized for every iteration process is based on the lot size. When building a system, the amount of epochs specifies how many instances the learning data is processed thru the approach. The optimizer and loss functions are utilized to optimize the model during training. The dropout rate is used to prevent overfitting. The attention mechanism, number of attention heads, number of encoder and decoder layers, number of filters, kernel size in the CLs, max pooling size, and input shape are all parameters of the neural network architecture. The values for these parameters were selected based on experimentation and hyperparameter tuning to accomplish the most excellent performance of the model.

images

4.1 Simulation Results

This paper assess the efficacy of the suggested CNN-DS-AM technique’s efficacy using measures like accuracy, sensitivity, specificity, and the area underneath the receiver operating characteristic (AUC-ROC) curve. This paper contrast the effectiveness of the proposed model with the most advanced models for detecting liver tumors. Overall, the performance of the study was assessed by comparing the proposed model’s results with other models using multiple evaluation metrics and datasets. The study highlighted the promising performance of the proposed model while acknowledging the need for further evaluation and validation on different datasets and tumor types. Several strategies were employed in the proposed CNN-DS-AM approach to address the problem of overfitting and a small dataset. Applying procedures like data augmentation, dropout, batch normalization, early stopping, and transfer learning, the proposed CNN-DS-AM approach aims to mitigate the risk of overfitting and improve performance even when working with a small dataset. These techniques help regularise the model, increase its ability to generalize well to unseen data, and address the limitations imposed by limited data availability.

Table 3 compares the effectiveness of the suggested model with three cutting-edge models for diagnosing liver tumors. This paper offer precision, recall, and F1-score in addition to the conventional measures for a more thorough comparison. In contrast to recall, which assesses the proportion of accurate optimistic predictions among all real positives, precision estimates the proportion of accurate optimistic predictions across all positive predictions. The harmonic mean of recall as well as accuracy is the F1-score.

images

The findings demonstrate that, in regards to precision, sensitivity, specificity, recall, F1-score, and AUC-ROC, the suggested model beats all other cutting-edge models, as shown in Fig. 3. The higher values of these metrics for the proposed model indicate that it has a better ability to detect liver tumors and make accurate predictions compared to the existing models.

images

Figure 3: Performance comparison through various measures

LiTS (Liver Tumor Segmentation) dataset and TCGA-LIHC (The Cancer Genome Atlas-Liver Hepatocellular Carcinoma) dataset are publicly available datasets commonly used in research related to liver tumor segmentation. Table 4 compares the hypothesized model with two already-existing models utilizing measures for sensitivity, specificity, as well as Dice similarity coefficient (DSC). With a value of 1, a complete intersection between the projected tumor segmentation and the ground truth segmentation, the DSC calculates the overlap between the two. Sensitivity measures the percentage of accurate forecasts of the positive, while precision indicates the percentage of accurate pessimistic estimates.

images

This paper evaluates the model on two datasets (LiTS and TCGA-LIHC) with ground-truth tumor segmentations. The outcome demonstrates that proposed approach outperforms the existing models utilizing DSC, sensitivity, and specificity on both datasets. Specifically, proposed system attained a DSC of 0.85 and 0.87 on LiTS and TCGA-LIHC, respectively, compared to 0.81 and 0.83 for 3D Deep Learning [10] and 0.78 and 0.80 for Multi-task Deep neural network [12]. In addition, the proposed model also achieved higher sensitivity and specificity scores than the existing models.

These results suggest that the proposed model is better at accurately segmenting liver tumors than existing models and can potentially lead to better clinical outcomes for patients, as shown in Fig. 4. However, it is essential to note that these results were obtained on specific datasets, and further evaluation is needed on more extensive and more diverse datasets to confirm the generalizability of the proposed model. Table 5; contrast the approach with three existing models and a contemporary model regarding their generalizability to new datasets and tumor types. By evaluating the models on datasets of different sizes and with varying tumor types and reported their performance in terms of DSC and sensitivity. The results show that proposed approach performs well on a large hepatocellular carcinoma (HCC) dataset, achieving a DSC of 0.82 and a sensitivity of 0.86. However, the proposed model has not been extensively evaluated on other tumor types or small datasets, which limits its generalizability.

images

Figure 4: Qualitative comparison of tumor segmentation results utilizing LiTS dataset and TCGA-LIHC dataset

images

Existing 3D Deep Learning [10], Multi-task Deep neural network [12], and 2D and 3D CNN [14] have poorer generalization to new datasets and tumor types than the proposed model, with DSC scores of 0.82, 0.75, and 0.80, respectively. These models were trained on small and medium-sized datasets with mixed or single tumor types. Their limited generalizability highlights the need for more robust models in medical image analysis. The contemporary model achieved the highest DSC and sensitivity scores but at the cost of increased computational requirements and limited interpretability. Due to its complexity, this model is not easily transferable to other datasets or tumor types and may not be practical for clinical use. Overall, these results suggest that while the proposed model shows promising effects on a specific dataset and tumor type, further evaluation is needed to confirm its generalizability and scalability to other datasets and tumor types.

5  Conclusions

This paper proposed a novel approach for predicting liver tumors using Convolutional Neural Networks (CNN) and a depth-based variant search algorithm with attention mechanisms. Proposed system achieved a high accuracy of 95.5% in predicting liver tumors, outperforming other state-of-the-art methods. Additionally, attention mechanisms were incorporated into the CNN model to identify and highlight regions of the CT scans most relevant to predicting liver tumors. The results suggest that the incorporation of attention mechanisms and a depth-based variant search algorithm into the CNN model is a promising approach for improving the accuracy and robustness of liver tumor prediction and has the potential to assist radiologists in their diagnosis and treatment planning.

Moreover, the proposed approach identified and highlighted the regions of the CT scans that were most relevant to predicting liver tumors, which could assist radiologists in their diagnosis and treatment planning. Additionally, the model demonstrated robustness to noise and artifacts in the CT scans, indicating its potential for clinical applications where image quality may be compromised. Overall, the proposed approach has demonstrated superior performance to other state-of-the-art methods, indicating its potential for clinical use in predicting liver tumors. However, further research is needed to validate the model on more extensive and diverse datasets and optimize the CNN model’s hyperparameters and the depth-based variant search algorithm with attention mechanisms.

It exhibits the resilience-positioned architectural context; further, the metrics comparisons can be made with Graphical User Interface (GUI) to determine the layers with the finest parameters, including accuracy, ROC, precision, recall, and F-Measure, accordingly. In this substance, the proposed architecture Ver3 has obtained an accuracy of 97.99 %. Further, simulation results pave the way for oncology specialists to get recommendations on decision models for predicting liver tumor cases at a specific time. Ultimately, the code complexity is minimal while using an interactive tool such as the Dl4jMlp classifier in deep learning. In the future, work will be extended to establish a trade-off between optimizers in machine learning techniques and other loss function.

5.1 Limitations of the Study and Future Research Work

The study might have relied on specific datasets such as LiTS and TCGA-LIHC, which could introduce bias or limitations regarding the representation of liver tumor cases. The performance of the proposed model may vary when applied to other datasets with different characteristics. The evaluation of the proposed model might have focused on specific tumor types, such as hepatocellular carcinoma (HCC). It is essential to recognize that the model’s performance on other tumor types or in diverse clinical settings could differ, and further evaluation is needed to assess its generalizability. Regarding future research work, conducting experiments on more extensive and diverse datasets could help validate the robustness and generalizability of the proposed model. Including datasets from multiple sources and various tumor types would enhance the model’s applicability in different clinical scenarios.

Acknowledgement: I am immensely grateful to my guide, S. Anusuya, for their unwavering support, guidance, and expertise, which played a crucial role in shaping and refining this research. I also acknowledge the invaluable contributions of my colleagues and friends, who provided valuable insights and discussions, enhancing the quality of this work.

Funding Statement: The authors received no specific funding for this study.

Author Contributions: The authors confirm contribution to the paper as follows: study conception and design: P. Kalaiselvi, S. Anusuya; data collection: P. Kalaiselvi; analysis and interpretation of results: P. Kalaiselvi, S. Anusuya; draft manuscript preparation: P. Kalaiselvi. All authors reviewed the results and approved the final version of the manuscript.

Availability of Data and Materials: The data used to support the findings of this study are available from the corresponding author upon request.

Conflicts of Interest: The authors declare that they have no conflicts of interest to report regarding the present study.

References

1. D. Ge, D. Yu, L. Deng and A. Acero, “Context-dependent pre-trained deep neural networks for large-vocabulary speech recognition,” IEEE Transactions on Audio, Speech, and Language Processing, vol. 20, no. 1, pp. 30–42, 2012. [Google Scholar]

2. A. Maier, C. Syben, T. Lasser and C. Riess, “Systematic mapping study of AI/machine learning in healthcare and future directions,” Zeitschrift fur MedizinischePhysik, vol. 29, no. 2, pp. 86–101, 2021. [Google Scholar]

3. Y. Xie, H. Chen, Y. Ma and Y. Xu, “Automated design of CNN architecture based on efficient evolutionary search,” Neurocomputing, vol. 491, no. 28, pp. 160–171, 2022. [Google Scholar]

4. I. Castiglioni, L. Rundo, M. Codari, G. D. Leo, C. Salvatore et al., “AI applications to medical images: From machine learning to deep learning,” Physica Medica, vol. 83, no. 76, pp. 9–24, 2021. [Google Scholar] [PubMed]

5. M. J. Willemink, W. A. Koszek, C. Hardell, J. Wu, D. Fleischmann et al., “Preparing medical imaging data for machine learning,” Radiology, vol. 295, no. 1, pp. 4–15, 2020. [Google Scholar] [PubMed]

6. Z. Zainuddin, S. M. Shamsuddin and S. Hasan, “Deep learning for image processing in WEKA environment,” International Journal of Advances in Soft Computing and its Applications, vol. 11, no. 1, pp. 1–21, 2019. [Google Scholar]

7. S. Lang, “WekaDeeplearning4j: A deep learning package for WEKA based on Deeplearning4j,” Knowledge-Based Systems, vol. 178, no. 15, pp. 48–50, 2019. [Google Scholar]

8. R. Agarwal, O. Diaz, X. Llado, M. H. Yap and R. Mart, “Automatic mass detection in mammograms using deep convolutional neural networks,” Journal of Medical Imaging, vol. 6, no. 3, pp. 031409, 2019. [Google Scholar] [PubMed]

9. L. Srinivasan, D. Selvaraj, D. Dhinakaran and T. P. Anish, “IoT-based solution for paraplegic sufferer to send signals to physician via internet,” SSRG International Journal of Electrical and Electronics Engineering, vol. 10, no. 1, pp. 41–52, 2023. [Google Scholar]

10. J. Ding, X. Li, H. Liu, H. Qin and H. Hao, “3D deep learning for detecting multiple types of liver lesions in CT images,” IEEE Transactions on Cybernetics, vol. 48, no. 1, pp. 289–297, 2017. [Google Scholar]

11. F. Liu, Z. Zhou, H. Jang and D. J. Foran, “Multi-phase liver tumor segmentation using 3D fully convolutional networks,” IEEE Journal of Biomedical and Health Informatics, vol. 23, no. 1, pp. 343–352, 2019. [Google Scholar]

12. L. Zhang, W. Lin, J. Liang and Q. Li, “Multi-task deep neural network for automatic liver tumor detection and classification,” Computerized Medical Imaging and Graphics, vol. 78, pp. 101688, 2020. [Google Scholar]

13. J. Choi, J. Kim and J. Lee, “Convolutional neural network-based MRI liver tumor classification using multiscale-ROI with attention mechanism,” Applied Sciences, vol. 10, no. 9, pp. 3142, 2020. [Google Scholar]

14. C. H. Kuo, Y. Tsai and Y. H. Yeh, “A combination of 2D and 3D convolutional neural networks for liver tumor detection and segmentation in CT images,” Scientific Reports, vol. 10, no. 1, pp. 1–10, 2020. [Google Scholar]

15. X. Wang, Y. Yang, Z. Xu, J. Wang and X. Lin, “Liver tumor histological classification using convolutional neural networks based on MRI images,” Magnetic Resonance Imaging, vol. 70, pp. 136–143, 2020. [Google Scholar]

16. L. Zhong, Y. Zheng, D. Dong, L. Zhang and J. Tian, “Liver tumor segmentation in ultrasound images using a deep learning approach,” IEEE Transactions on Ultrasonics, Ferroelectrics, and Frequency Control, vol. 66, no. 10, pp. 1706–1719, 2019. [Google Scholar]

17. Z. Wu, Y. Zhang, J. Zhang, Y. Wan and Q. Feng, “Liver tumor segmentation in CT images using convolutional neural networks and an enhanced loss function,” IEEE Access, vol. 8, pp. 77757–77765, 2020. [Google Scholar]

18. Z. Cui, J. Xu, X. Li, S. Liang and W. Wang, “Network for liver CT image registration,” Journal of X-Ray Science and Technology, vol. 28, no. 6, pp. 1001–1012, 2020. [Google Scholar]

19. J. Li, J. Tian and Z. Liang, “Deep belief network for liver tumor classification and diagnosis using ultrasound images,” Journal of Medical Systems, vol. 43, no. 2, pp. 32, 2019. [Google Scholar]

20. G. Gomathy, P. Kalaiselvi, D. Selvaraj, D. Dhinakaran, D. Arul Kumar et al., “Automatic waste management based on IoT using a wireless sensor network,” in 2022 Int. Conf. on Edge Computing and Applications (ICECAA), Namakkal, India, pp. 629–634, 2022. https://doi.org/10.1109/ICECAA55415.2022.9936351 [Google Scholar] [CrossRef]

21. D. Dhinakaran, D. Selvaraj, S. M. Sankar, S. Pavithra and R. Boomika, “Assistive system for the blind with voice output based on optical character recognition,” in Int. Conf. on Innovative Computing and Communications, Delhi, India, vol. 492, pp. 1–8, 2023. [Google Scholar]

22. K. Sudharson, A. M. Sermakani, V. Parthipan, D. Dhinakaran, G. E. Petchiammal et al., “Hybrid deep learning neural system for brain tumor detection,” in 2022 2nd Int. Conf. on Intelligent Technologies (CONIT), Hubli, India, pp. 1–6, 2022. [Google Scholar]

23. X. Liao, X. Li, P. Liang, Y. Wang, S. Zhang et al., “Computer-aided diagnosis of liver tumors using deep learning and contrast-enhanced CT images,” Journal of Digital Imaging, vol. 33, no. 4, pp. 1014–1021, 2020. [Google Scholar]

24. D. Dhinakaran, “Protection of data privacy from vulnerability using two-fish technique with Apriori algorithm in data mining,” The Journal of Supercomputing, vol. 78, no. 16, pp. 17559–17593, 2022. [Google Scholar]

25. M. R. Khanna, S. P. Panimalar and K. Sudharson, “Secure android location tracking application with privacy enhanced technique,” in 2022 Fifth Int. Conf. on Computational Intelligence and Communication Technologies (CCICT), Ghaziabad, India, pp. 223–229, 2022. [Google Scholar]

26. P. Hu, D. Wang and Y. Zhang, “Liver tumor segmentation in CT images with adversarial learning,” Journal of Digital Imaging, vol. 34, no. 4, pp. 859–872, 2021. [Google Scholar]

27. P. M. Joe Prathap, D. Selvaraj, D. Arul Kumar and B. Murugeshwari, “Mining privacy-preserving association rules based on parallel processing in cloud computing,” International Journal of Engineering Trends and Technology, vol. 70, no. 3, pp. 284–294, 2022. [Google Scholar]

28. L. Xie, Y. Zhou, X. Chen, D. Wang and X. Jia, “Liver tumor segmentation based on convolutional neural network with hard sample mining and 3D refinement,” Computer Methods and Programs in Biomedicine, vol. 178, pp. 1–11, 2019. [Google Scholar]

29. D. Arul Kumar, S. Dinesh and K. Srikanth, “Recommendation system for research studies based on GCR,” in 2022 Int. Mobile and Embedded Technology Conf. (MECON), Noida, India, pp. 61–65, 2022. [Google Scholar]

30. D. Dhinakaran and P. M. Joe Prathap, “Preserving data confidentiality in association rule mining using data share allocator algorithm,” Intelligent Automation & Soft Computing, vol. 33, no. 3, pp. 1877–1892, 2022. [Google Scholar]

31. J. Catherine Bel, C. Esther, Z. Sen, D. Tamizhmalar, D. Dhinakaran et al., “Trustworthy cloud storage data protection based on blockchain technology,” in 2022 Int. Conf. on Edge Computing and Applications (ICECAA), Namakkal, India, pp. 538–543, 2022. https://doi.org/10.1109/ICECAA55415.2022.9936299 [Google Scholar] [CrossRef]

32. D. Selvaraj, S. M. Udhaya Sankar, D. Dhinakaran and T. P. Anish, “Outsourced analysis of encrypted graphs in the cloud with privacy protection,” SSRG International Journal of Electrical and Electronics Engineering, vol. 10, no. 1, pp. 53–62, 2023. https://doi.org/10.14445/23488379/IJEEE-V10I1P105 [Google Scholar] [CrossRef]

33. G. K. Monica, D. Dhinakaran, K. Haritha, K. Kohila and U. Priyadharshini, “MEMS based sensor robot for immobilized persons,” in 2023 Int. Conf. on Innovative Data Communication Technologies and Application (ICIDCA), Uttarakhand, India, pp. 924–929, 2023. [Google Scholar]

34. S. M. Udhaya Sankar, N. J. Kumar, D. Dhinakaran, S. S. Kamalesh and R. Abenesh, “Machine learning system for indolence perception,” in 2023 Int. Conf. on Innovative Data Communication Technologies and Application (ICIDCA), Uttarakhand, India, pp. 55–60, 2023. [Google Scholar]


Cite This Article

APA Style
Kalaiselvi, P., Anusuya, S. (2023). Liver tumor prediction with advanced attention mechanisms integrated into a depth-based variant search algorithm. Computers, Materials & Continua, 77(1), 1209-1226. https://doi.org/10.32604/cmc.2023.040264
Vancouver Style
Kalaiselvi P, Anusuya S. Liver tumor prediction with advanced attention mechanisms integrated into a depth-based variant search algorithm. Comput Mater Contin. 2023;77(1):1209-1226 https://doi.org/10.32604/cmc.2023.040264
IEEE Style
P. Kalaiselvi and S. Anusuya, “Liver Tumor Prediction with Advanced Attention Mechanisms Integrated into a Depth-Based Variant Search Algorithm,” Comput. Mater. Contin., vol. 77, no. 1, pp. 1209-1226, 2023. https://doi.org/10.32604/cmc.2023.040264


cc Copyright © 2023 The Author(s). Published by Tech Science Press.
This work is licensed under a Creative Commons Attribution 4.0 International License , which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
  • 424

    View

  • 256

    Download

  • 1

    Like

Share Link