 Research
 Open access
 Published:
Bearing capacity prediction of the concrete pile using tunned ANFIS system
Journal of Engineering and Applied Science volume 71, Article number: 39 (2024)
Abstract
The design process for pile foundations necessitates meticulous deliberation of the calculation pertaining to the bearing capacity of the piles. The primary objective of this work was to investigate the potential use of Coot bird optimization (\({\text{CBO}}\)) techniques in predicting the loadbearing capacity of concretedriven piles. Despite the availability of several suggested models, the investigation of Coot bird optimization (\({\text{CBO}}\)) for estimating the pilecarrying capacity has been somewhat neglected in this research. This work presents and validates a unique approach that combines the Coot bird optimization (\({\text{CBO}}\)) model with the Multilayered perceptron (\({\text{MLP}}\)) neural network and adaptive neurofuzzy inference system (\({\text{ANFIS}}\)). The findings of 472 different driven pile static load tests were put in a database. The proposed framework's building, validation, and testing stages were each accomplished utilizing the training set (70%), validation set (15%), and testing set (15%) of the dataset, respectively. According to the findings, \({{\text{MLP}}}_{{\text{CBO}}}\) and \({{\text{ANFIS}}}_{{\text{CBO}}}\) both offer remarkable possibilities for accurately predicting the pilebearing capacity of a given structure. The \({R}^{2}\) values for \({{\text{ANFIS}}}_{{\text{CBO}}}\) during the training stage were 0.9874, while during the validating stage, they were 0.9785, and during the testing stage they were 0.987. After considering various kinds of performance studies and contrasting them with existing literature, it has been concluded that the \({{\text{ANFIS}}}_{{\text{CBO}}}\) model provides a more appropriate calculation of the bearing capacity of concretedriven piles.
Introduction
To sustain superstructures with enormous loads or those situated on unstable ground, deep foundations are a typical and essential form of foundation [1]. Driven piles composed of wood, steel, precast concrete, and composite are another cost and qualityefficient alternative to drilled shafts. The axial pilebearing capability is considered the most significant factor when designing a pile foundation. As a result, many theoretical and empirical geotechnical research has focused on calculating this value.
The pilebearing capability may be assessed using five primary techniques: static analysis, dynamic analysis, dynamic testing, pile load test, and insitu testing [2,3,4,5]. It is often suggested to use the essential depth idea in design recommendations according to static analysis. The crucial depth, nevertheless, is an idealization that defies physical principles and lacks theoretical or trustworthy empirical evidence.
The hammerpilesoil system is subject to dynamic analysis using wave mechanics as a foundation. Uncertainties in calculating bearing capability are brought on by the hammer impact effect's ambiguity and also by variations in soil power due to environmental factors at the time of pile drive and loading. The measurement of velocity and force close to the pile head during driving is the foundation of dynamic testing techniques. Nevertheless, a skilled individual is able to interpret the evaluations. The capability estimate is also not known until the pile is driven, which is a significant restriction [6]. The fullscale pile settlement under a static load is measured in the field in a pile load test, which is thought to convey the most precise findings. This approach, nevertheless takes a lot of time and money [7]. So, it is crucial to create a straightforward, costeffective, and precise procedure.
Since the 1970s, in situ test techniques for measuring soil parameters have progressed quickly. This advancement is being accompanied by an increase in the estimation of pilebearing capability using data from insitu tests. Standard penetration test (\({\text{SPT}}\)), cone penetration test (\({\text{CPT}}\)), flat dilatometer test (\({\text{DMT}}\)), pressuremeter test (\({\text{PMT}}\)), plate loading test (\({\text{PLT}}\)), dynamic probing test (\({\text{DP}}\)), pressin and screwon probe test (\({\text{SS}}\)), and field vane test (\({\text{FVT}}\)) are a few examples of typical tests. In an effort to assess the material properties, every test employs several loading techniques to quantify the related soil reaction. The \({\text{SPT}}\) is often utilized to forecast the piles’ bearing capability between these insitu test data [8, 9].
In the research, many \({\text{SPT}}\) databased approaches for calculating pilebearing capacities have been developed. Direct and indirect techniques are the two categories into which they fall. Because of their simplicity in calculation, the direct techniques are preferred by field engineers. For instance, \({\text{SPT}}\) direct approaches for sandy or clayed soil were presented in [10,11,12,13,14,15]. The researchers used the limited component approach to assess the pile for a case investigation in Iran [15] and contrasted the results with four distinct \({\text{SPT}}\) direct techniques to arrive at an accurate forecast of the pile's bearing capability. Nevertheless, every one of these experimental formulations have certain shortcomings, based on [4]. As a result, scholars have started searching for various techniques to forecast pilebearing capability using \({\text{SPT}}\) data. Utilizing machine learning techniques may be successful, according to earlier research [16].
Machine learning (\({\text{ML}}\)), a subset of artificial intelligence that simulates the human brain's functioning, is capable of inferring novel information nonlinearly from past data via adaptive learning [17,18,19,20,21,22,23,24]. Additionally, as learning data increases, the machine learning (\({\text{ML}}\))based models’ efficiency may be enhanced progressively, keeping them current with the strict precision demands for complicated engineering challenges [25,26,27,28,29,30]. Numerous studies have shown how well \({\text{ML}}\)based models work in solving issues associated to civil engineering, such as forecasting the mechanical characteristics (compressive/tensile strength/shear) of hardened concrete [31,32,33,34], the ultimate bond strength of corroded reinforcement and surrounding concrete [35, 36], the bearing capability of piles [37, 38], the pulling capability of ground anchors [39,40,41], and others.
Artificial neural networks (\({\text{ANN}}\)), in particular, have been widely employed in \({\text{ML}}\)based models to forecast pilebearing capability. Previous attempts in this manner may be seen in [42, 43], which use \({\text{ANN}}\) with error back propagation. Utilizing results from 50 dynamic load tests performed on prefabricated concrete piles, [44] combines \({\text{ANN}}\) with genetic algorithm (\({\text{GA}}\)), with the weights of \({\text{ANN}}\) being adjusted by \({\text{GA}}\). Identical methods are suggested in [45], where the \({\text{ANN}}\) connection weights are optimized using particle swarm optimization (\({\text{PSO}}\)) along with \({\text{GA}}\). When using \({\text{ANN}}\) to forecast the piles’ bearing capability, \({\text{GA}}\) is also utilized to identify the most crucial characteristics in the unprocessed dataset [3]. Other \({\text{ML}}\)based approaches than \({\text{ANN}}\) have also been taken into account; for example, Samui [46] employs support vector machine (\({\text{SVM}}\)), Pham et al. [2] explore random forest, and Chen et al. [38] investigated the neuroimperialism and neurogenetic approaches.
Many studies employed a number of machine learning techniques to successfully measure the foundation's bearing capabilities. For the purpose of predicting the bearing capacity of the thinwalled foundation, \({\text{ANN}}\) models were developed utilizing a total of 150 specimens, and an adaptive neurofuzzy inference system (\({\text{ANFIS}}\)) was developed as well [47]. The researchers obtained the coefficient of determination (\({R}^{2}\)) and root mean squared error (\({\text{RMSE}}\)) values for the \({\text{ANFIS}}\) and \({\text{ANN}}\) models, which were determined to be 0.875 and 0.048, and 0.71 and 0.529, accordingly. In an alternative study, the determination of pile bearing capability included the use of 296 sets of data and the application of Gaussian process regression analysis, resulting in an \({R}^{2}\) value of 0.84 [48]. In their study, Kulkarni et al. [49] used a dataset consisting of 132 data points to elucidate the behavior of rocksocketed piles. This was achieved by using a hybrid approach that combined the genetic algorithm (\(GA\)) technique with artificial neural networks (\({\text{ANN}}\)). The researchers obtained a coefficient of determination (\({R}^{2}\)) value of 0.86 and a root mean square error (\({\text{RMSE}}\)) value of 0.0093. Using this dataset and a hybrid model known as particle swarm optimization (\({\text{PSO}}\))\({\text{ANN}}\), another group of researchers were able to replicate the behavior of rocksocketed piles [50]. Their results showed an \({R}^{2}\) value of 0.918 and an \({\text{RMSE}}\) value of 0.063. The dataset used in previous empirical endeavors to construct models consisted of a total of 472 specimens [3, 51]. Models that were taken into account included a hybrid deep neural network (\({\text{DNN}}\)) with genetic algorithm (\({\text{GA}}\)), as well as a hybrid whale optimization (\({\text{WOA}}\)) technique with extreme gradient boosting (\({\text{XGB}}\)). The \({R}^{2}\) and \({\text{RMSE}}\) values that were generated by the \({\text{GA}}\)\({\text{DNN}}\) model came out to be 0.882 and 109.965, accordingly. The findings for \({\text{WOA}}\)\({\text{XGB}}\) showed significant improvement, with \({R}^{2}\), and \({\text{RMSE}}\) values of 0.97 and 64 during the training step, and 0.94 and 87.03 at the testing step, respectively.
In conclusion, the present study has made some notable contributions, which may be outlined as follows:

In this study, a comprehensive dataset including 472 pile test findings is used to develop and validate machine learning models for the assessment of pilebearing capacity. It is essential to acknowledge that prior research has often concentrated on limited datasets.

The investigation of coot bird optimization (\({\text{CBO}}\)) for assessing the loadbearing capacity of piles has been somewhat neglected in scholarly research, despite the existence of numerous suggested models.

The selection of the appropriate model, which entails the selection of the appropriate hyperparameters, is of the utmost importance. This work presents and validates a unique approach that combines the coot bird optimization (\({\text{CBO}}\)) model the multilayered perceptron (\({\text{MLP}}\)) neural network and the adaptive neurofuzzy inference system (\({\text{ANFIS}}\)).
Methods
Data description and preprocessing
The dataset used for training and evaluating the machine learning approach described in this paper consists of findings of static load tests conducted on driven strengthened concrete piles. The 472 pieces of data that are now included in this collection were obtained for previous research [3]. The adequacy of the data collection for the development and validation of advanced machinelearning models is attributed to its substantial size. In order to evaluate the capacity of the precast piles, hydraulic piledriving machinery is used to drive the piles into the soil layers. The piles have closed tops. Figure 1 illustrates the testing environment, including the data gathering apparatus, as well as the specific testing site. The bearing capacity of piles was determined using a static load test experiment detailed in Fig. 1b. Figure 2 is a graphic representation of the pile structure, which includes the geometrical factors associated with it as well as the location of the soil layers. The following are categories of input parameters: \({In}_{1}\)) pile’s diameter \((mm)\), \({In}_{2}\)) the first soil layer’s thickness that piles embedded \((m)\), \({In}_{3}\)) the second soil layer’s thickness that pile embedded \((m)\), \({In}_{4}\)) the third soil layer’s thickness that piles embedded \((m)\), \({In}_{5}\)) top elevation of pile \((m)\), \({In}_{6}\)) natural ground elevation \((m)\), \({In}_{7}\)) the extra segment pile top’s elevation \((m)\), \({In}_{8}\)) the pile tip’s depth \((m)\), \({In}_{9}\)) the mean \(SPT\) blow count along the pile shaft, and \({In}_{10}\)) the mean \(SPT\) blow count at the pile tip [3]. Table 1 contains an overview of all ten of the conditioning variables that were considered while making a prediction about the dependent parameter (axial pile bearing capacity). These variables are denoted by the notation \({In}_{1}\) through \({In}_{10}\). Along with other pieces of information, this table also provides a statistical analysis of the independent parameters as well as the parameters that are being predicted. The dataset contains 472 pieces of data, and it was partitioned into three distinct subsets: the testing set, which comprises 15% of the dataset (or 71 data), the validation set, which also comprises 15% of the dataset (or 71 data), and the training set, which comprises 70% of the dataset (or 330 data) [3, 51]. The data points are extracted from the bigger database via a standard diffusion and are then picked at random. An investigation using statistics was performed so that the practicability of the input parameters could be evaluated. Additionally, the input and outcome factor distribution diagrams are shown in Fig. 3.
The degree to which the data points in a scatterplot conform to a linear pattern is measured by the Pearson correlation coefficient [52], which provides a numerical value. The primary objective of the Pearson correlation coefficient is to ascertain the degree of association between two variables. The range of the coefficient is from − 1 to + 1. A positive numerical value indicates a positive linear relationship, whereby there is a propensity for both parameters to increase together. A negative number indicates a negative linear relationship, whereby there is a tendency for the other variable to decrease as one parameter increases. Values closer to + 1 or − 1 indicate a powerful linear relationship, while values closer to 0 suggest a weak or negligible linear correlation. The Pearson correlation coefficient is shown here in Fig. 4. The study may have been not capable of effectively capturing the impact of substantial negative or positive Pearson correlation coefficient variables on the findings due to a potential deficiency in powerful technique. It is noteworthy to observe that there are not many deviations from the high association values that exist among the parameters. As a consequence of this, the development of models that make use of these inputs needs to be successful for the purpose of reaching the highest possible accuracy. It is evident that a significant proportion of the association values exhibit high magnitudes. Among the investigated variables, the highest positive value is seen for \({In}_{8}\) and \({In}_{3}\), both having a value of 0.99. The most negative value is seen in \({In}_{5}\) and \({In}_{2}\), measuring 0.96.
Applied optimizer and frameworks
Coot bird optimization (CBO)
The \({\text{COOT}}\) method, developed by Iraj Naruei et al. (2021), is a recently introduced metaheuristic optimization technique that aims to replicate the collective behavior of a swarm of coot birds. In comparison to other optimization methods, including the particle swarm optimization method and the differential evolution method, the \({\text{COOT}}\) method has rapid convergence velocity and great precision in convergence. Furthermore, the \({\text{COOT}}\) method has undergone validation in several engineering applications, including pressure vessel design, welded beam design, stepped cone pulley, and rolling bearing difficulties [53]. Hence, this study employs the \({\text{COOT}}\) method as a solution to address the \({\text{HDEED}}\) issue.
The underlying concept of the \({\text{COOT}}\) method may be described below (Figs. 5 and 6).

(1)
The first placement of the coot population will be established, followed by the computation of the associated fitness levels for each of these populations.
The variable \(Q\) is formed by the process of randomly initializing people. The variable \(d\) reflects the dimension of the problem that has to be solved. Additionally, the variables \({\text{Ub}}\) and \({\text{Lb}}\) represent the highest and lowest bounds of the population positions. The matrix displaying the positions of the coot is shown below.
Here, \({C{\text{pos}}}^{l}\) shows the entire coot position, and \(N\) shows the population’s scale.
The inclusion of the coot's position is included in the fitness function, resulting in the generation of a fitness matrix for the coot population.
Here, \({{\text{fit}}}_{i}^{l}\) shows the fitness value of the \({i}_{th}\) individual at the time of the \({l}_{th}\) iteration.

(2)
Common coot population position update
The current motion of the coot is modeled by simulating the collective motion of the population based on the distinct collective manners shown by the leader and the typical coot individual. Hence, there are four distinct patterns of movement seen on the water’s surface, such as random movement, chain movement, movement guided by the leader of the general population, and optimum movement executed by the leader.
A location is generated at random inside the spatial domain, allowing a common one to afterward update the location. The formula governing the accidental mobility of the individual is as outlined below:
Here, the location of the \({i}_{th}\) the typical individual is denoted as \({\text{Cpos}}(i)\), where \(A\) represents a decreasing integer ranging from 0 to 1, \(R2\) indicates an accidental value between 0 and 1, and \(Q\) indicates an accidental location inside the search area. The following formula determines the values of \(A\) and \(Q\) in the following way:
Here, \(l\) is the iterations’ present number, and \(L\) is the iterations’ highest number that may be performed, respectively.
A distance vector among two individuals updates the location throughout the typical population chain movement. The formula for the chain is outlined below:
In the context of leaderbased location modification, the typical individual is required to modify its location by means of the \({k}_{th}\) leader.
Where \(Lpos(k)\) is the \({k}_{th}\) leader’s location, \(R1\) represents an accidental integer within the range [0,1], and the variable \(k\) is selected in the manner outlined below:
In this equation, \(m\) represents the total number of leaders, and \(i\) indicates the present member of the \({i}_{th}\) coot group.

(3)
Position update of the leader population
As part of the leader’s location modification, it is not uncommon for the leader to abdicate the currently finest location in order to assist in the discovery of a location that is more advantageous. If a leader is looking for a finer location, the ideal location from the prior instant should be assigned to the leader; alternatively, the leader should return to the place they were in before. The following outlines the leader's location:
The variables \(R3\) and \(R4\) denote accidental values within the range of 0 to 1, whereas \(g{\text{best}}\) indicates the present ideal position. The modified position of variable \(B\) is outlined below:
Multilayered perceptron (MLP) neural network
According to [54], \({\text{ANN}}\) models are strong nonlinear modeling techniques that operate by mimicking the functioning of the human brain. They are computer models for information processing. When the input and outcome parameters are mapped into a complicated network, an ANN may assess nonlinear functions. This research utilized a popular \({\text{ANN}}\) with one input level, one outcome level, and at minimum one concealed level that was first put out by [55]. According to [56], the \({\text{ANN}}\) might be represented in the following way:
Here, \({x}_{i}\) stands for the \({i}_{th}\) nodal value, \(N\) for the nodes’ number, in the prior level; \(f\) shows the \({j}_{th}\) nodal value, \({y}_{i}\) shows the \({j}_{th}\) node’s bias, \({b}_{i}\) denotes the activation function, and \({w}_{ji}\) stands for a weightlinking \({x}_{i}\) and \({y}_{i}\).
Numerous investigations have shown that the \({\text{ANN}}\) model can predict complicated nonlinear functions for hydrological data with only one concealed level [57]. According to our research's preliminary findings, the link between the groundwater level and the other hydrologic cycle elements may be roughly approximated by one concealed level. Although optimal practices for finding the volume of the concealed nodes have also been offered, calculating the volume of the concealed nodes is commonly carried out by examination and mistake and is a crucial component of the \({\text{ANN}}\). According to Huang’s (2003) research, the following amount of concealed nodes is required to learn \({N}^{i}\) samples with a little tiny fault:
Here, \({N}^{H}\) shows the concealed nodes’ highest volume, and \({N}^{i}\) and \({N}^{o}\) show the input nodes and outcome nodes. In this work, the concealed nodes’ size was changed from one to \({N}^{H}\), and the ideal size was determined via a process of examination and mistake. For training, the Levenberg–Marquardt (\({\text{LM}}\)) method, one of \({\text{ANN}}\) s’ most effective methods, was employed [58]. The backpropagation method is applied to learn the biases and weights, and the \({\text{CBO}}\) algorithm determines the optimal neuron numbers in each hidden layer (Fig. 7). In this study, the maximum number of each hidden layer is assumed to be 25. According to the literature, two hidden layers are considered for \({\text{MLP}}\) structure to predict the bearing capacity of the concrete pile. MLPs often have hyperparameters (e.g., learning rate, number of layers) that impact their performance. CBO was employed to perform hyperparameter tuning, searching through the hyperparameter space to find the best configuration for the MLP. MLPs may suffer from overfitting, especially with complex architectures. Regularization techniques were optimized using optimization algorithms to find the right balance between model complexity and generalization.
Adaptive neurofuzzy inference system (ANFIS)
The basic model in the present work uses \({\text{ANFIS}}\) as a machine learning method. The Takagi–Sugeno fuzzy system approach, which underpins \({\text{ANFIS}}\), was first suggested by Jang et al. in 1993 [59]. \({\text{ANFIS}}\) is capable of using both fuzzy logic and neural networks in a single frame if one thinks of it as an integrated notion. The \({\text{ANFIS}}\) method's proper structure is chosen based on the input data, membership level, and input and result membership functions. By modifying the level of membership variables in accordance with the allowed rate of error throughout the training phase, input values are able to be closer to observed values. The \({\text{ANFIS}}\) technique makes use of fuzzy logic and neural network learning methods to handle the nonlinear connection between the values of the input and the output. It has great potential for use in time series modeling and categorization problems [60]. Based on the benefits of fuzzy rules, numeric data may be collected from a technically written rule, and the \({\text{ANFIS}}\) is able to easily assess the intricate conversion of mankind's intelligence into fuzzy systems. The fuzzy if–then principles that may be learned to estimate nonlinear functions are used in its inference mechanism. \({\text{ANFIS}}\) is hence regarded as a wellliked estimating tool in engineering domains. \({\text{ANFIS}}\) is hence regarded as a wellliked estimating tool in engineering domains. In this work, fuzzy grouping according to the \({\text{k}}\)means grouping method was done using the fuzzy cluster means (\({\text{FCM}}\)) technique, and the optimal optimization technique for the traditional \({\text{ANFIS}}\) was found using the backpropagation technique [61]. In the present work, the \({\text{ANFIS}}\) model was implemented using fuzzy cmeans (\({\text{FCM}}\)) clustering by collecting the collection of rules required for producing the fuzzy inference system (\({\text{FIS}}\)). Regarding \({\text{ANFIS}}\) designs, a fundamental \({\text{ANFIS}}\) design was created using the initial parameters. The \({\text{CBO}}\) methodology was then used to optimize the developed \({\text{ANFIS}}\) framework. The parameters of the membership function for the proposed system were enhanced by the use of the \({\text{CBO}}\) method in this research. The \({\text{RMSE}}\) index was calculated as a fitness indicator in order to assess the precision of the optimization framework. Finally, a hybridized and optimized \({{\text{ANFIS}}}_{{\text{CBO}}}\) network was defined, whereby the parameters of the network, such as the number of fuzzy words and the maximum number of iterations, were determined.
Indices to effectiveness evaluation
The performance comparison criteria were developed to fulfill the need for a standardized and quantifiable approach to assessing and comparing the overall efficiency of a number of different models. The use of these metrics enables the stakeholders to perform an evaluation of the performance of a number of answers, determine regions in need of development, and make choices based on accurate information. In this study, the following metrics were calculated and included in the analysis: coefficient of determination (\({R}^{2}\)), root mean square error (\({\text{RMSE}}\)), normalized rootmeansquare (\({\text{NRMSE}}\)), relative absolute error (\({\text{RAE}}\)), root relative squared error (\({\text{RRSE}}\)), and performance index (\({\text{PI}}\)). Lower values of \({\text{RMSE}}\), \({\text{RAE}}\), \({\text{RRSE}}\), and \({\text{PI}}\) are indicative of greater accuracy. Furthermore, a greater \({R}^{2}\) value signifies enhanced performance.
In these equations, \({m}_{d}\), \(\overline{m }\), \({z}_{d}\), and \(\overline{z }\) are the observations, the average of the observations, the simulations, and the average of the simulations, respectively. Also, \(D\) depicts the total number of datasets.
Results and discussion
Model development
The determination of the bearing capacity of the concrete piles was conducted utilizing the hybridized \({\text{CBO}}\) models as outlined in this study. The hybrid models denoted as \({{\text{MLP}}}_{{\text{CBO}}}\) and \({{\text{ANFIS}}}_{{\text{CBO}}}\), were used to ascertain the maximum iterations and the count of fuzzy words via the employment of \({\text{MLP}}\) and \({\text{ANFIS}}\) techniques. The bearing capacity of the concrete piles was assessed and computed throughout the training, validation, and testing phases of the developed \({{\text{MLP}}}_{{\text{CBO}}}\) and \({{\text{ANFIS}}}_{{\text{CBO}}}\) techniques, as seen in Fig. 8. The error distribution is shown on the right side of Fig. 8. The metrics \({R}^{2}\), \({\text{RMSE}}\), \({\text{NRMSE}}\), \({\text{RAE}}\), \({\text{RRSE}},\) and \({\text{PI}}\) were used to conduct an analysis of how effective \({{\text{MLP}}}_{{\text{CBO}}}\) and \({{\text{ANFIS}}}_{{\text{CBO}}}\) were in the research of procedure prediction (see Table 2 for more information). Furthermore, the reliability and efficacy of the developed models, namely \({\text{XGB}}\), \({\text{WOA}}{\text{XGB}}\), and \({\text{GA}}{\text{DLNN}}\), were assessed based on the results obtained from the present study, which primarily centered on the performance of these models. The findings were compared to those that were found in the previous research [3] and [51].
Based on the data, it seems that both the \({{\text{MLP}}}_{{\text{CBO}}}\) and \({{\text{ANFIS}}}_{{\text{CBO}}}\) models have substantial potential for accurately predicting the pilebearing capacity. During the training stage, the \({R}^{2}\) values for \({{\text{MLP}}}_{{\text{CBO}}}\) were determined to be 0.954, while during the validating stage, they were 0.9667, and during the testing stage they were 0.9729. in a similar manner, the \({R}^{2}\) values for \({{\text{ANFIS}}}_{{\text{CBO}}}\) during the training, stage were 0.9874, while during the validating stage, they were 0.9785, and during the testing stage they were 0.987. Evaluating the dependability of a model based just on one metric is inadequate. In order to achieve this objective, it is important to conduct a comprehensive examination of a range of measures, including but not limited to \({\text{RSME}}\), \({\text{NRMSE}}\), \({\text{RAE}}\), \({\text{RRSE}}\), and \({\text{PI}}\). When one compares the percentages that are reported for \({{\text{MLP}}}_{{\text{CBO}}}\) and \({{\text{ANFIS}}}_{{\text{CBO}}}\) for a variety of metrics, it is easy to see that \({{\text{ANFIS}}}_{{\text{CBO}}}\) reports a significant drop in comparison to \({{\text{MLP}}}_{{\text{CBO}}}\). The observed drop in performance demonstrates the high level of accuracy shown by the \({{\text{ANFIS}}}_{{\text{CBO}}}\) in estimating the bearing capacity of concrete piles.
In order to assess the reliability of the models, a comprehensive evaluation is conducted by comparing them with existing literature, specifically considering the models \({\text{XGB}}\) [51, 51] \({\text{WOA}}{\text{XGB}}\), and \({\text{GA}}{\text{DLNN}}\) [3]. A comprehensive examination of Table 2 reveals that the \({{\text{ANFIS}}}_{{\text{CBO}}}\) model, as introduced in this study, had favorable results in comparison to previous investigations discussed in the existing literature. The assessment was conducted using consistent metrics for the training, validation, and testing datasets, namely \({R}^{2}\) (coefficient of determination) and \({\text{RMSE}}\) (root mean square error). The best model, \({{\text{ANFIS}}}_{{\text{CBO}}}\), has higher \({R}^{2}\) values and lower \({\text{RMSE}}\) compared to \({\text{XGB}}\) [51, 51]\({\text{WOA}}{\text{XGB}}\), and \({\text{GA}}{\text{DLNN}}\) [3], indicating more precision and strength in its conclusions. For example, according to the results obtained from the \({\text{WOA}}{\text{XGB}}\) model, the coefficient of determination (\({R}^{2}\)) increases from 0.97 to 0.9874 in the training stage and from 0.94 to 0.987 in the testing stage. Additionally, the metrics based on \(RMSE\) error shown a drop from 62.74 to 39.6769 in the training stage and from 87.72 to 38.6543 in the testing stage. A comprehensive comparison between \({{\text{ANFIS}}}_{{\text{CBO}}}\) and \({\text{GA}}{\text{DLNN}}\) [3] may be undertaken, taking into account the improvements seen in the training, validating, and testing data sets. These improvements include an increase in the values of \({R}^{2}\) and a decrease in the values of \({\text{RMSE}}\).
The distribution of the predicted/observed ratio of the algorithms throughout both the training and testing phases can be viewed on the right side of Fig. 8. Demonstrating enhanced efficacy is shown by a limited margin of error, including both a minimum and maximum threshold. The findings indicate that \({{\text{ANFIS}}}_{{\text{CBO}}}\) has higher efficacy in two phases, as shown by a decrease in error variability.
Recommendations for future studies
The focus of the current study was on the installation of driven reinforced concrete piles and their interaction with certain soil conditions. In the next research, it may be investigated how effectively the previously constructed machine learning models function with various soil profiles and pile kinds, such as drilled heaps and wood piles. Investigating ensemble models (such as random forests, gradient boosting, or model stacking techniques) might potentially enhance prediction accuracy even more. This is accomplished by combining a large number of machinelearning models. In subsequent research, field validation on actual construction projects may play a role, with the purpose of determining how useful the suggested models are when applied to actuallife circumstances. There is also the possibility of doing research on the challenges and considerations that go into effective implementation. It is feasible that by performing case studies on a variety of construction projects and contrasting the findings to the standards of the industry, it will be possible to learn a great deal about the effectiveness of machine learning models as well as the potential cost savings they may provide.
Sensitivity analysis
Sensitivity analysis is a technique used in various fields, including finance, engineering, environmental science, and decisionmaking processes [18]. The primary aim of sensitivity analysis is to assess how changes in the input parameters or assumptions of a model affect the output or outcomes. It helps in understanding the robustness and reliability of a model or system under different scenarios. Sensitivity analysis helps in identifying which variables or parameters have a significant impact on the model’s output. By understanding the sensitivity of the model to different inputs, decisionmakers can focus on key factors that drive the results. In the present, the procedure mentioned by Khatti, and Grover was chosen as a sensitivity analysis (\({S}_{a}\))’s method [62]. The results of the \({S}_{a}\) was provided in Fig. 9. As it is clear from this plot, most of the input variables have high impact on the target higher than 0.9444, with some exceptions. The highest value belonged to \(In3\) at 0.9836, while the lowest value was for In4 at 0.7216 of \({S}_{a}\) (Table 3 in Appendix).
Conclusions
The hybridized \({\text{CBO}}\) systems that were given in this study were used in order to evaluate the bearing capacity of the concrete piles. The hybrid models denoted as \({{\text{MLP}}}_{{\text{CBO}}}\) and \({{\text{ANFIS}}}_{{\text{CBO}}}\), in which the \({\text{MLP}}\) and \({\text{ANFIS}}\) techniques were specified for the purpose of model augmentation. Furthermore, the reliability and robustness of the developed models, namely \({\text{XGB}}\), \({\text{WOA}}{\text{XGB}}\), and \({\text{GA}}{\text{DLNN}}\), were assessed based on the results obtained from the present study, which mostly centered on the produced models. The results obtained in this study were compared with the findings reported in previous research studies [3, 51].

1.
Based on the data, it seems that both the \({{\text{MLP}}}_{{\text{CBO}}}\) and the \({{\text{ANFIS}}}_{{\text{CBO}}}\) models have a great lot of potential for accurately forecasting the pilebearing capacity. During the training stage, the \({R}^{2}\) values for \({{\text{MLP}}}_{{\text{CBO}}}\) were determined to be 0.954, while during the validating stage, they were 0.9667, and during the testing stage they were 0.9729. In a similar manner, the \({R}^{2}\) values for \({ANFIS}_{CBO}\) during the training stage were 0.9874, while during the validating stage, they were 0.9785, and during the testing stage they were 0.987.

2.
The comparison of the percentages of \({{\text{MLP}}}_{{\text{CBO}}}\) and \({{\text{ANFIS}}}_{{\text{CBO}}}\) across several errorbased measurements reveals a notable decrease in \({{\text{ANFIS}}}_{{\text{CBO}}}\) in comparison to \({{\text{MLP}}}_{{\text{CBO}}}\). The observed drop in performance demonstrates the high level of accuracy shown by the adaptive neurofuzzy inference system with coot bird optimization (\({{\text{ANFIS}}}_{{\text{CBO}}}\)) in estimating the bearing capacity of concrete piles.

3.
The results obtained from the \({{\text{ANFIS}}}_{{\text{CBO}}}\) model provides higher values of \({R}^{2}\) and lower \({\text{RMSE}}\) compared to the \({\text{XGB}}\) [51, 51]\({\text{WOA}}{\text{XGB}}\), and \({\text{GA}}{\text{DLNN}}\) [3] models, indicating that the \({{\text{ANFIS}}}_{{\text{CBO}}}\) model exhibits more effectiveness and accuracy. The outcomes obtained from the train, validate, and test datasets shown significant enhancements by increasing the \({R}^{2}\) values and decreasing the \({\text{RMSE}}\) values. These outcomes may be considered when conducting a comprehensive comparison between \({{\text{ANFIS}}}_{{\text{CBO}}}\) and \({\text{GA}}{\text{DLNN}}\) [3].

4.
The findings of the distribution of the predicted/observed ratio indicated that \({{\text{ANFIS}}}_{{\text{CBO}}}\) has higher efficacy in two phases, as shown by a decrease in error variability.
Availability of data and materials
Data can be shared upon request.
Abbreviations
 ML:

Machine learning
 GA:

Genetic Algorithm
 ANN:

Artificial neural network
 SVM:

Support Vector Machine
 CBO:

Coot Bird Optimization
 PSO:

Particle Swarm Optimization
 MLP:

Multilayered perceptron
 WOA:

Whale optimization
 ANFIS:

Adaptive NeuroFuzzy Inference System
 XGB:

Extreme gradient boosting
 SPT:

Standard penetration test
 DNN:

Deep neural network
 CPT:

Cone penetration test
 R ^{2} :

Coefficient of determination
 DMT:

Flat dilatometer test
 RMSE:

Root mean square error
 PMT:

Pressuremeter test
 NRMSE:

Normalized rootmeansquare
 PLT:

Plate loading test
 RAE:

Relative absolute error
 DPT:

Dynamic probing test
 RRSE:

Root relative squared error
 SS:

Pressin and screwon probe test
 PI:

Performance index
 FVT:

Field vane test
References
Sarkhani Benemaran R, EsmaeiliFalak M, Katebi H (2022) Physical and numerical modelling of pilestabilised saturated layered slopes. Proc Inst Civ Eng Eng 175:523–538
Pham TA, Ly HB, Tran VQ, Van Giap L, Vu HLT, Duong HAT (2020) Prediction of pile axial bearing capacity using artificial neural network and random forest. Appl Sci 10:1871
Pham TA, Tran VQ, Vu HLT, Ly HB (2020) Design deep neural network architecture using a genetic algorithm for estimation of pile bearing capacity. PLoS One 15:e0243030
Shariatmadari N, Eslami AA, Karim PFM (2008) Bearing capacity of driven piles in sands from SPT–applied to 60 case histories
EsmaeiliFalak M, Katebi H, Vadiati M, Adamowski J (2019) Predicting triaxial compressive strength and Young’s modulus of frozen sand using artificial intelligence methods. J Cold Reg Eng 33:4019007. https://doi.org/10.1061/(ASCE)CR.19435495.0000188
Rausche F, Goble GG, Likins GE Jr (1985) Dynamic determination of pile capacity. J Geotech Eng 111:367–383
Shooshpasha I, Hasanzadeh A, Taghavi A (2013) Prediction of the axial bearing capacity of piles by SPTbased and numerical design methods. Geomate J 4:560–564
Benali A, Nechnech A, Bouafia A (2013) Bored pile capacity by direct SPT methods applied to 40 case histories. Civ Env Res 5:118–122
Bouafia A, Derbala A (2002) Assessment of SPTbased method of pile bearing capacity–analysis of a database. Proc. Int. Work. Found. Des. Codes Soil Investig. View Int. Harmon. PerformanceBased Des 369–74
Aoki N, Velloso D de A (1975) An approximate method to estimate the bearing capacity of piles. Proc., 5th PanAmerican Conf. Soil Mech. Found. Eng., vol. 1, International Society of Soil Mechanics and Geotechnical Engineering Buenos …; p. 367–76.
Bazaraa AR, Kurkur MM (1986) Nvalues used to predict settlements of piles in Egypt. ASCE, Use Situ tests Geotech. Eng., pp 462–474
Briaud JL, Tucker LM (1988) Measured and predicted axial response of 98 piles. J Geotech Eng 114:984–1001
Decourt L (1995) Prediction of loadsettlement relationships for foundations on the basis of standard penetration test. Conf. cycle Intl.“Zeevaert 87–102
Meyerhof GG (1976) Bearing capacity and settlement of pile foundations. J Geotech Eng Div 102:197–228
Shioi Y, Fukui J (2021) Application of Nvalue to design of foundations in Japan. In: Penetration Testing, volume 1. Routledge, p 159–164
Hoang ND, Tran XL, Huynh TC (2022) Prediction of pile bearing capacity using oppositionbased differential flower pollinationoptimized least squares support vector regression (ODFPLSSVR). Adv Civ Eng 1–25. https://doi.org/10.1155/2022/7183700
Raja MNA, Shukla SK, Khan MUA (2022) An intelligent approach for predicting the strength of geosyntheticreinforced subgrade soil. Int J Pavement Eng 23(10):3505–3521
Raja MNA, Jaffar STA, Bardhan A, Shukla SK (2023) Predicting and validating the loadsettlement behavior of largescale geosyntheticreinforced soil abutments using hybrid intelligent modeling. J Rock Mech Geotech Eng 15:773–788
Raja MNA, Shukla SK (2022) An extreme learning machine model for geosyntheticreinforced sandy soil foundations. Proc Inst Civ Eng Eng 175:383–403
Raja MNA, Shukla SK (2021) Multivariate adaptive regression splines model for reinforced soil foundations. Geosynth Int 28(4):368–390
Raja MNA, Shukla SK (2021) Predicting the settlement of geosyntheticreinforced soil foundations using evolutionary artificial intelligence technique. Geotext Geomembranes 49:1280–1293
Esmaeili‐Falak M, Sarkhani Benemaran R (2024) Application of optimization‐based regression analysis for evaluation of frost durability of recycled aggregate concrete. Struct Concr 1–22. https://doi.org/10.1002/suco.202300566
Dawei Y, Bing Z, Bingbing G, Xibo G, Razzaghzadeh B (2023) Predicting the CPTbased pile setup parameters using HHORF and PSORF hybrid models. Struct Eng Mech 86:673–686
Liang R, Bayrami B (2023) Estimation of frost durability of recycled aggregate concrete by hybridized Random Forests algorithms. Steel Compos Struct 49:91–107. https://doi.org/10.12989/scs.2023.49.1.091
Sarkhani Benemaran R, EsmaeiliFalak M, Javadi A (2022) Predicting resilient modulus of flexible pavement foundation using extreme gradient boosting based optimised models. Int J Pavement Eng 1–20. https://doi.org/10.1080/10298436.2022.2095385.
EsmaeiliFalak M, Benemaran RS (2023) Ensemble deep learningbased models to predict the resilient modulus of modified base materials subjected to wetdry cycles. Geomech Eng 32(6):583–600. https://doi.org/10.12989/gae.2023.32.6.583
Sarkhani Benemaran R (2023) Application of extreme gradient boosting method for evaluating the properties of episodic failure of borehole breakout. Geoenergy Sci Eng 211837. https://doi.org/10.1016/j.geoen.2023.211837.
Aghayari Hir M, Zaheri M, Rahimzadeh N (2022) Prediction of rural travel demand by spatial regression and artificial neural network methods (Tabriz County). J Transp Res 20(4):367–386
Shi X, Yu X, EsmaeiliFalak M (2023) Improved arithmetic optimization algorithm and its application to carbon fiber reinforced polymersteel bond strength estimation. Compos Struct 306:116599. https://doi.org/10.1016/j.compstruct.2022.116599
Zhu Y, Huang L, Zhang Z, Bayrami B (2022) Estimation of splitting tensile strength of modified recycled aggregate concrete using hybrid algorithms. Steel Compos Struct 44:389–406. https://doi.org/10.12989/scs.2022.44.3.389
Bui DK, Nguyen T, Chou JS, NguyenXuan H, Ngo TD (2018) A modified firefly algorithmartificial neural network expert system for predicting compressive and tensile strength of highperformance concrete. Constr Build Mater 180:320–333
Hoang ND (2019) Estimating punching shear capacity of steel fibre reinforced concrete slabs using sequential piecewise multiple linear regression and artificial neural network. Measurement 137:58–70
Kurtoglu AE, Gulsan ME, Abdi HA, Kamil MA, Cevik A (2017) Fiber reinforced concrete corbels: modeling shear strength via symbolic regression. Comp and Concr 20:1–10
Benemaran RS, EsmaeiliFalak M (2023) Predicting the Young’s modulus of frozen sand using machine learning approaches: stateoftheart review. Geomech Eng 34:507–527
Hoang ND, Tran XL, Nguyen H (2020) Predicting ultimate bond strength of corroded reinforcement and surrounding concrete using a metaheuristic optimized least squares support vector regression model. Neural Comput Appl 32:7289–7309
Benemaran RS, EsmaeiliFalak M, Kordlar MS (2023) Improvement of recycled aggregate concrete using glass fiber and silica fume. Multiscale Multidiscip Model Exp Des. https://doi.org/10.1007/s41939023003132
Cao MT, Nguyen NM, Wang WC (2022) Using an evolutionary heterogeneous ensemble of artificial neural network and multivariate adaptive regression splines to predict bearing capacity in axial piles. Eng Struct 268:114769
Chen W, Sarir P, Bui XN, Nguyen H, Tahir MM, Jahed AD (2020) Neurogenetic, neuroimperialism and genetic programing models in predicting ultimate bearing capacity of pile. Eng Comput 36:1101–1115
Cheng MY, Cao MT, Tsai PK (2021) Predicting load on ground anchor using a metaheuristic optimized least squares support vector regression model: a Taiwan case study. J Comput Des Eng 8:268–282
Shahin MA, Jaksa MB (2005) Neural network prediction of pullout capacity of marquee ground anchors. Comput Geotech 32:153–163
Moradi G, Hassankhani E, Halabian AM (2022) Experimental and numerical analyses of buried box culverts in trenches using geofoam. Proc Inst Civ Eng Eng 175:311–322
Lee IM, Lee JH (1996) Prediction of pile bearing capacity using artificial neural networks. Comput Geotech 18:189–200. https://doi.org/10.1016/0266352X(95)000278
Teh CI, Wong KS, Goh ATC, Jaritngam S (1997) Prediction of pile capacity using neural networks. J Comput Civ Eng 11:129–138
Momeni E, Nazir R, JahedArmaghani D, Maizir H (2014) Prediction of pile bearing capacity using a hybrid genetic algorithmbased ANN. Measurement 57:122–31. https://doi.org/10.1016/j.measurement.2014.08.007
Murlidhar BR, Sinha RK, Mohamad ET, Sonkar R, Khorami M (2020) The effects of particle swarm optimisation and genetic algorithm on ANN results in predicting pile bearing capacity. Int J Hydromechanics 3(1):69–87
Samui P (2011) Prediction of pile bearing capacity using support vector machine. Int J Geotech Eng 5:95–102
Momeni E, Armaghani DJ, Fatemi SA, Nazir R (2018) Prediction of bearing capacity of thinwalled foundation: a simulation approach. Eng Comput 34:319–327
Momeni E, Dowlatshahi MB, Omidinasab F, Maizir H, Armaghani DJ (2020) Gaussian process regression technique to estimate the pile bearing capacity. Arab J Sci Eng 45:8255–8267
Kulkarni RU, Dewaikar DM (2017) Prediction of interpreted failure loads of rocksocketed piles in Mumbai Region using hybrid artificial neural networks with genetic algorithm. Int J Eng Res 6:365–372
JahedArmaghani D, Shoib RSNSBR, Faizi K, Rashid ASA (2017) Developing a hybrid PSO–ANN model for estimating the ultimate bearing capacity of rocksocketed piles. Neural Comput Appl 28:391–405. https://doi.org/10.1007/s005210152072z
Nguyen H, Cao MT, Tran XL, Tran TH, Hoang ND (2023) A novel whale optimization algorithm optimized XGBoost regression for estimating bearing capacity of concrete piles. Neural Comput Appl 35:3825–3852. https://doi.org/10.1007/s0052102207896w
Benesty J, Chen J, Huang Y, Cohen I (2009) Pearson correlation coefficient. Springer, Noise Reduct. speech Process., pp 1–4
Naruei I, Keynia F (2021) A new optimization method based on COOT bird natural life model. Expert Syst Appl 183:115352
Mokhtarzad M, Eskandari F, Jamshidi Vanjani N, Arabasadi A (2017) Drought forecasting by ANN, ANFIS, and SVM and comparison of the models. Environ Earth Sci 76:1–10
Rumelhart DE, Hinton GE, McClelland JL (1986) A general framework for parallel distributed processing. Parallel Distrib Process Explor Microstruct Cogn 1:26
Anderson JA (1995) An introduction to neural networks. MIT Press, Massachusetts
Coulibaly P, Anctil F, Bobée B (2000) Daily reservoir inflow forecasting using artificial neural networks with stopped training approach. J Hydrol 230:244–257
Maiti S, Tiwari RK (2014) A comparative study of artificial neural networks, Bayesian neural networks and adaptive neurofuzzy inference system in groundwater level prediction. Environ Earth Sci 71:3147–3160
Jang JS (1993) ANFIS: adaptivenetworkbased fuzzy inference system. IEEE Trans Syst Man Cybern 23:665–685
Kisi O, Demir V, Kim S (2017) Estimation of longterm monthly temperatures by three different adaptive neurofuzzy approaches using geographical inputs. J Irrig Drain Eng 143:4017052
Azad A, Farzin S, Kashi H, Sanikhani H, Karami H, Kisi O (2018) Prediction of river flow using hybrid neurofuzzy models. Arab J Geosci 11:1–14
Khatti J, Grover KS (2023) Prediction of compaction parameters of soil using GA and PSO optimized relevance vector machine (RVM). ICTACT J Soft Comput 13(2):2890–2903. https://doi.org/10.21917/ijsc.2023.0409
Acknowledgements
I would like to take this opportunity to acknowledge that there are no individuals or organizations that require acknowledgment for their contributions to this work.
Funding
This research received no specific grant from any funding agency in the public, commercial, or notforprofit sectors.
Author information
Authors and Affiliations
Contributions
All authors contributed to the study’s conception and design. Data collection, simulation, and analysis were performed by WG, J L, and S Ch. The first draft of the manuscript was written by S Ch and all authors commented on previous versions of the manuscript. All authors have read and approved the manuscript.
Corresponding author
Ethics declarations
Competing interests
The authors declare that they have no competing interests.
Additional information
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Appendix
Appendix
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
About this article
Cite this article
Gu, W., Liao, J. & Cheng, S. Bearing capacity prediction of the concrete pile using tunned ANFIS system. J. Eng. Appl. Sci. 71, 39 (2024). https://doi.org/10.1186/s4414702400369y
Received:
Accepted:
Published:
DOI: https://doi.org/10.1186/s4414702400369y