Introduction
Brain tumors stand as one of the leading causes of death in the modern world. These tumors can manifest in various regions of the brain, often remaining asymptomatic until later stages of life. Symptoms of brain disease encompass a wide array of issues, including personality changes, memory difficulties, communication impairments, hearing or speech challenges, chronic migraines, and even vision loss [
1]. Notable examples of brain tumors include meningiomas, gliomas, pituitary adenomas, and acoustic neuromas. According to medical observations, meningiomas, gliomas, and pituitary tumors account for approximately 15%, 45%, and 15% of all brain tumors, respectively. A brain tumor can have long-lasting psychological effects on the patient. These tumors originate from primary abnormalities in the brain or central spine tissue that disrupt normal brain function. Brain tumors are classified into two main categories: benign and malignant. Benign tumors grow slowly and are non-cancerous; they are relatively rare and do not metastasize. In contrast, malignant brain tumors contain cancerous cells, typically originating in one region of the brain before swiftly spreading to other areas of the brain and spinal cord [
2]. Malignant tumors pose a significant health risk. The World Health Organization (WHO) classifies brain tumors into four grades based on their behavior within the brain: grades 1 and 2 are considered low-grade or benign tumors, while grades 3 and 4 are categorized as high-grade or malignant tumors. Several diagnostic methods, such as CT scanning and EEG, are available for detecting brain tumors, but magnetic resonance imaging (MRI) is the most reliable and widely utilized. MRI generates detailed internal images of the body’s organs by employing strong magnetic fields and radio waves [
3]. Essentially, CT or MRI scans can distinguish the affected brain region due to the tumor from the healthy tissue. Biopsies, clinical tests that extract brain cells, can be conducted as a prelude to cerebral surgery. Precision is paramount in measuring tumor cells or arriving at accurate diagnoses. The emergence of machine learning (ML) presents an opportunity to assist radiologists in furnishing precise disease status information [
4]. The proliferation of novel technologies, particularly artificial intelligence and ML, has left an indelible mark on the medical field, equipping various medical departments, including medical imaging, with indispensable tools to enhance their operations. As MRI images are processed to aid radiologists in decision making, a diverse array of automated learning strategies is employed for classification and segmentation purposes. While supervised methods for classifying brain tumors hold immense promise, they demand specialized expertise to optimize the feature extraction and selection techniques [
5]. In navigating and analyzing vast datasets, expert medical professionals benefit from the support of machine assistance. Furthermore, the failure to accurately identify life-threatening tumors could potentially result in treatment delays for patients. The utilization of deep-learning (DL) techniques in detecting brain tumors and extracting meaningful insights from data patterns has a longstanding history. DL’s capability to classify and model brain cancers is widely recognized [
6]. Effectively treating brain tumors hinges on early and precise disease diagnosis. Decisions regarding treatment methods are influenced by factors such as the tumor’s pathological type, grade, and stage at diagnosis. Neuro-oncologists have harnessed computer-aided diagnostic (CAD) tools for various purposes, including tumor detection, categorization, and grading within the realm of neurology [
7].
A glioma is a type of tumor that originates in brain tissue, distinct from nerve cells or blood vessels. In contrast, meningiomas develop from the protective membranes that envelop the brain and central nervous system, while pituitary tumors grow within the confines of the skull. Among these three tumor types, meningiomas are relatively rare and generally benign. Conversely, gliomas constitute the most prevalent form of malignant brain tumors. Even though pituitary tumors may be benign, they can still give rise to significant medical complications [
8]. Brain tumors rank as a leading cause of mortality worldwide. Research underscores the significance of early and accurate identification, coupled with prompt treatment, in improving survival rates for patients with cancerous tumors. In certain instances, healthcare professionals may encounter the need to differentiate between strokes and tumors. Hence, the early detection of brain tumors assumes pivotal importance for providing effective care and potentially extending the affected individual’s lifespan [
9]. Convolutional neural networks (CNNs), distinguished by their multi-layered architecture and high diagnostic accuracy when provided with ample input images, currently stand as a highly effective approach in image processing. Neural networks, including auto-encoders, an unsupervised learning technique, are harnessed for representation learning [
10]. Magnetic resonance imaging (MRI) emerges as an exceptional tool for obtaining clear and detailed visualizations within the human body. Unlike X-rays or CT scans that involve ionizing radiation, MRI offers significantly enhanced contrast between various soft tissues. Moreover, MRI technology furnishes detailed images from multiple angles, providing radiologists with abundant data on human soft-tissue anatomy [
11]. The aim of this paper is to introduce three fully automatic CNN models designed for the multi-classification of brain tumors, utilizing publicly available datasets. To the best of the authors’ knowledge, this represents the first endeavor in multi-classifying brain tumors from MRI images using CNNs, wherein nearly all the hyperparameters are automatically tuned through the grid search optimizer. The rest of this paper is organized as follows:
Introduction Section: this section provides a comprehensive overview of various tumor types and their diagnostic methods;
Related work Section: in this section, we delve into recent articles, examining their methods, outcomes, and applications;
Materials and methods Section: here, we detail the utilization of datasets and describe the proposed model architectures;
Experimental study Section: this section centers on a comparative analysis of the accuracies achieved by our proposed method and other state-of-the-art approaches;
Conclusions Section: this section offers the concluding remarks and insights related to our proposed model.
The author’s goal was to devise a classification approach that is notably more accurate, cost-effective, and self-training, utilizing an extensive collection of authentic datasets rather than augmented data. The customized VGG-16 (Visual Geometry Group) architecture was employed to classify 10,153 MRI images into three distinct classes (glioma, meningioma, and pituitary). The network demonstrated a remarkable performance, achieving an overall accuracy of 99.5% and precision rates of 99.4% for gliomas, 96.7% for meningiomas, and 100% for pituitaries [
12]. The proposed model’s efficacy was assessed using three CNN models: AlexNet, Visual Geometry Group (VGG)-16, and VGG-19. AlexNet achieved a peak detection accuracy of 99.55% using 349 images sourced from the Reference Image Database to Evaluate Response (RIDER) neuro MRI database. For brain tumor localization, employing 804 3D MRIs from the Brain Tumor Segmentation (BraTS) 2013 database, a Dice score of 0.87 was achieved [
13]. In the investigation of brain tumor categorization, an array of deep- and machine-learning techniques, including softmax, Random Forest, Support Vector Machine (SVM), K-Nearest Neighbors, and the ensemble method, were employed. These outcomes were compared with existing methods. Notably, the Inception-v3 model exhibited the highest performance, attaining a test accuracy of 94.34%. This advancement holds the potential to establish a prominent role in clinical applications for brain tumor analysis [
14]. An effective approach was proposed for categorizing brain MRIs into four classes: normal and three forms of malignant brain tumors (glioblastoma, sarcoma, and metastatic bronchogenic carcinoma). The method integrates the discrete wavelet transform (DWT) with a deep neural network (DNN). Employing a deep neural network classifier, one of the DL designs, a dataset of 66 brain MRIs was classified into the specified categories. The integration of DWT, a powerful feature extraction technique, principal component analysis (PCA), and the classifier yielded commendable performances across all evaluation metrics [
15]. The author introduced a strategy involving a CNN to distinguish brain tumors from 2D MRI scans of the brain. This initial separation is subsequently followed by the application of conventional classifiers and DL techniques. In addition, an SVM classifier, along with various activation algorithms, such as softmax, RMSProp, and sigmoid, were employed to validate and cross-check the proposed approach. The implementation of the author’s suggested solution was executed using TensorFlow and Keras in the Python programming language, chosen for its robust capabilities in expediting tasks. The achieved accuracy rate for the CNN model stood at an impressive 99.74% [
16]. This paper presents a brain tumor classification approach employing open-access datasets and CNN techniques. The methodology utilizes open-access datasets to classify tissue as either tumor or non-tumor through a distinctive framework that combines discrete cosine transform-based image fusion, CNN super-resolution, and a classifier. Employing super-resolution and the ResNet50 architecture, the framework attained an impressive accuracy of 98.14% [
17].
A novel approach for dimensionality reduction is proposed, utilizing the Grey Wolf Optimizer (GWO) and rough-set theory. This method identifies relevant features from extracted images, distinguishing between high-grade (HG) and low-grade (LG) glioblastoma multiforme (GBM) while accommodating feature correlation constraints to eliminate redundant attributes. Additionally, the article introduces a dynamic architecture for multilevel layer modeling in a Faster R-CNN (MLL-CNN) approach. This is achieved using a feature weight factor and a relative description model to construct selected features, thereby streamlining the processing and classifying of long-tailed files. This advancement leads to improved training accuracies for CNNs. The findings illustrate that the overall survival prediction for GBM brain growth achieves a higher accuracy of 95% and a lower error rate of 2.3% [
18]. The work involves the classification of 253 high-resolution brain MR images into normal and pathological classes. To efficiently and accurately train deep neural models, MR images were scaled, cropped, pre-processed, and enhanced. The Lu-Net model is compared against LeNet and VGG-16 using five statistical metrics: precision, recall, specificity, F-score, and accuracy. The CNN models were trained on enhanced images and validated on 50 sets of untrained data. LeNet, VGG-16, and the proposed approach achieved accuracy rates of 88%, 90%, and 98%, respectively [
19]. MIDNet18 outperformed AlexNet in categorizing brain tumor medical images. The proposed MIDNet18 model demonstrated effective learning, achieving a binary classification accuracy exceeding 98%, which is statistically significant (independent-sample
t-test,
p < 0.05). MIDNet18 excelled across all the performance indicators for the dataset used in this study [
20].
The objective of this study was to facilitate accurate early-stage diagnoses by medical professionals. Three DL architectures—AlexNet, GoogLeNet, and ResNet50—were employed to identify brain tumor images. Among them, the ResNet50 architecture demonstrated the highest accuracy rates. The experimental results yielded an accuracy of 85.71%, with the potential for further enhancement in future research [
21]. In the realm of Alzheimer’s disease diagnosis, the CNN approach was utilized to detect patients using MRSI and supplementary MRI data. High Matthews Correlation Coefficient (MCC) scores were achieved, with area-under-the-curve values of 0.87 and 0.91 for MRSI and MRI, respectively. A comparative analysis highlighted the superiority of Partial Least Squares and Support Vector Machines. The proposed system automatically selected critical spectral regions for diagnosis, corroborating findings with literature biomarkers [
22]. CNNs, ML pipelines inspired by biological neural processes, have been extensively studied. The author’s approach involved first acquiring an understanding of CNNs, followed by a literature search for a segmentation pipeline applicable to brain tumor segmentation. Additionally, the potential future role of CNNs in radiology was explored. The application of CNNs was demonstrated in predicting survival and medication responses through analyses of the brain tumor shape, texture, and signal intensity [
23]. In this paper, the state-of-the-art object detection framework YOLO (You Only Look Once) was employed to identify and classify brain tumors using DL. YOLOv5, a revolutionary object detection algorithm, stood out for its computational efficiency. The RSNA-MICCAI brain tumor radiogenomics classification BraTS 2021 dataset served as the basis. YOLOv5 achieved an 88% precision rate [
24]. The primary aim of this method is to classify brain images as healthy or tumorous using test MRI data. MRI-based brain tumor research offers superior internal imaging compared to CT scans. The approach involves denoising MRI images with an anisotropic diffusion filter, segmenting using morphological operations, and classifying via a five-layer CNN-based hybrid technique, outperforming other methods. The developed model, utilizing the publicly available KAGGLE brain MRI database, achieved an accuracy rate of 88.1% [
25]. The adoption of AI-powered computer systems can assist doctors in making more accurate diagnoses. In this research, we developed a brain tumor diagnostic system based on CNN technology, utilizing Ranger optimization and the extensive pre-processing of data from the EfficientNetv2 architecture [
26]. This research introduces a novel topology for a parallel deep CNN (PDCNN) designed to extract both global and local features from two parallel stages. Overfitting is addressed through the utilization of dropout regularization and batch normalization. Unlike conventional CNNs that collect features randomly without considering local and global contexts, our proposed PDCNN architecture aims to capture a comprehensive range of features [
27]. This study focuses on the classification of meningiomas, gliomas, and pituitary tumors using MRI imaging. The Dual VGG-16 CNN, equipped with a proprietary CNN architecture, constitutes the DCTN mode [
28]. The importance of the early detection of brain tumors cannot be overstated. Biopsies of brain tumors, the gold standard for diagnosis, are only possible during life-altering brain surgery. Methods based on computational intelligence can aid in the diagnosis and categorization of brain tumors [
29]. The author employed a DL model to classify MRI scans into glioma and normal categories, preceded by the extraction of scan information. Convolutional recurrent neural networks (CRNNs) were utilized for generating the classifications. This suggested method significantly improved the categorization of brain images within a specified input dataset [
30]. The network was trained and tested using BraTS2019 data. The approach was evaluated using the Dice similarity coefficient (DSC), sensitivity (Sen), specificity (Spec), and Hausdorff distance (HD). The DSCs for the entire tumor, tumor core, and enhancing tumor were 0.934, 0.911, and 0.851, respectively. The subregion Sen values were 0.922, 0.911, and 0.867. The Spec and HD scores were 1.000, 1.000, and 3.224, 2.990, 2.844, respectively [
31]. The cancer region segmentation from brain images is achieved using Deep K-Net, a hybrid approach that combines K-Net and utilizes Deep Joint Segmentation with Ruzicka similarity. The K-Net is trained using a Driving Training Taylor (DTT) algorithm. The DTT algorithm optimizes the Shepard CNN (ShCNN) for classification [
32].
The author provided an overview of contemporary computer-aided detection methods that utilize WCE images as input, distinguishing them as either diseased/abnormal or disease-free/normal. We conducted an evaluation of approaches designed for the detection of tumors, polyps, and ulcers, as these three conditions are categorized similarly. Furthermore, because general abnormalities and bleeding within the GI tract could be indicative of these disorders, we made an effort to shed light on the research conducted for the identification of abnormalities and bleeding within WCE images [
33]. Author have included several research studies, each accompanied by detailed descriptions of their techniques, findings, and conclusions. Additionally, we provide a discussion and comparison of previous review articles, which serves as a reference point for the current survey, while also highlighting its limitations [
34]. To enhance feature extraction, our proposed deep CNN model introduces an innovative approach by incorporating multiple convolutional kernels with varying window widths within the same hidden layer. This architecture is designed to be lightweight, consisting of 16 convolutional layers, 2 fully connected layers (FCN), and a softmax layer serving as the output layer. The activation function employed in the first 15 layers is MISH, followed by the Rectified Linear Unit (ReLU) activation function. This combination not only facilitates profound information propagation but also offers self-regularized, smoothly non-monotonic characteristics, while effectively mitigating saturation issues during training. The authors present a comprehensive set of experimental results, comparing our model’s performance against benchmarks like the MICCAI 2015 challenge and other publicly available datasets. Our findings demonstrate that the proposed model excels in terms of accuracy, sensitivity, the F1-score, the F2-score, and the Dice coefficient [
35].
Acknowledgements
Not applicable.
Human Participants Research Checklist:
Complete the following if your study involved human participants or human participants’ data. These questions should be addressed for prospective and retrospective studies.
1. Did you obtain ethics approval for this study?
• If yes, please upload (file type “Other”) the original approval document you received from your ethics committee. If the original document is in another language, please also provide an English translation.
Response: N/A
• If you did not obtain ethical approval, please explain why this was not required below.
2. If you prospectively recruited human participants for the study for example, you conducted a clinical trial, distributed questionnaires, or obtained tissues, data or samples for the purposes of this study, please report in the Methods:
i. the day, month and year of the start and end of the recruitment period for this study.
ii. whether participants provided informed consent, and if so, what type was obtained (for instance, written or verbal, and if verbal, how it was documented and witnessed). If your study included minors, state whether you obtained consent from parents or guardians. If the need for consent was waived by the ethics committee, please include this information.
Response: N/A
3. If you are reporting a retrospective study of medical records or archived samples, please report in the Methods section:
i. the day, month and year when the data were accessed for research purposes
ii. whether authors had access to information that could identify individual participants during or after data collection
Response: N/A
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.