10.8
CiteScore
 
5.3
Impact Factor
Generic selectors
Exact matches only
Search in title
Search in content
Post Type Selectors
Search in posts
Search in pages
Filter by Categories
Corrigendum
Current Issue
Editorial
Erratum
Full Length Article
Full lenth article
Letter to Editor
Original Article
Research article
Retraction notice
Review
Review Article
SPECIAL ISSUE: ENVIRONMENTAL CHEMISTRY
10.8
CiteScore
5.3
Impact Factor
Generic selectors
Exact matches only
Search in title
Search in content
Post Type Selectors
Search in posts
Search in pages
Filter by Categories
Corrigendum
Current Issue
Editorial
Erratum
Full Length Article
Full lenth article
Letter to Editor
Original Article
Research article
Retraction notice
Review
Review Article
SPECIAL ISSUE: ENVIRONMENTAL CHEMISTRY
View/Download PDF

Translate this page into:

Original article
11 2022
:15;
104284
doi:
10.1016/j.arabjc.2022.104284

Development of ANFIS technique for estimation of CO2 solubility in amino acids and study on impact of input parameters

Department of Life Science and Agriculture, Zhoukou Normal University, Zhoukou, Henan 466001, China
Department of Health and Rehabilitation Sciences, College of Applied Medical Sciences, Prince Sattam bin Abdulaziz University, Al Kharj, Saudi Arabia
Department of Physical Therapy, Kasr Al-Aini Hospital, Cairo University, Giza, Egypt
Institute of Pharmacy, Sechenov First Moscow State Medical University, 8 Trubetskaya St., bldg. 2, Moscow 119991, Russian Federation
Laboratory of Food Chemistry, Federal Research Center of Nutrition, Biotechnology and Food Safety, 2/14 Ustyinsky pr, Moscow 109240, Russian Federation
Department of Chemical Engineering and Petroleum Industries, Al-Mustaqbal University College, 51001 Hillah, Babylon, Iraq

⁎Corresponding author. 20041026@zknu.edu.cn (Ying Lai)

Disclaimer:
This article was originally published by Elsevier and was migrated to Scientific Scholar after the change of Publisher.

Peer review under responsibility of King Saud University.

Abstract

ANFIS (Adaptive neuro fuzzy inference system) modeling of CO2 capture using chemical absorbent was carried out in this study to correlate the solubility of CO2 to the solvent and operational parameters. In the ANFIS model, the input parameters including temperature, pressure, and physio-chemical properties of the solvent were considered, while the loading of CO2 in the absorbent was considered as the sole target output to be predicted by the model. Indeed, we developed a machine learning based model for predicting the CO2 loading capacity in amino acid salt solutions as the chemical absorbent of carbon dioxide. This model uses a metaheuristic optimized ANFIS based on a wide range of amino acids. This study's novel part is the use of Differential Evolution (DE) and Firefly Algorithm (FA) metaheuristics in order to solve hyper-parameter tuning of ANFIS as an optimization problem based on differential evolution. Accordingly, the optimized ANFIS model has an R2 score of 0.9520 for the test data and a score of 0.9841 for the training data. This indicates that the proposed model is both general and accurate in terms of its predictions for CO2 loading in amino acid salt solutions. The MAPE and RMSE error rates are also 1.17E-01, respectively, while the MAPE error rate is 1.14E-01.

Keywords

Simulation
CO2 capture
Machine learning
Environmental pollution
Separation
1

1 Introduction

Removal and capture of pollutants from water and air has been investigated recently to meet the global standard target as set out by international organizations. Among different pollutants, carbon dioxide has been on the main focus for controlling the climate change (Cheng, 2021; Liu, 2022; Shan, 2022; Batres, 2021; Schweizer, 2020). Chemical sorption of CO2 in solvents has been the primary process for CO2 capture in chemical industries such as oil and gas industry. In this process, CO2 molecules are transferred from the gas phase to the liquid phase by mass transfer and chemical absorption takes place in the liquid phase for removal of CO2 from gas phase. Indeed, the presence of chemical reaction enhances the removal efficiency drastically in this process (Dhaneesh and Ranganathan, 2022; Fang, 2020; Li, 2022; Pang, 2021). For design of the CO2 capture process based on chemical absorption, other criteria should be taken into account such as the process economic and also regeneration of the solvent for reuse in the process. Furthermore, the loading of CO2 in the solvent is the key parameter which should be observed for design of process and also selecting the proper solvent for the separation process.

CO2 solubility in wide solvents can be determined through experimental measurements which is tedious and costly to be done. On the other hands, computational techniques can be used to estimate the CO2 solubility in solvents as function of process parameters and the absorbent properties, as well. In the modeling techniques, models based on machine learning can be employed to predict the solubility of CO2 in wide solvents and wide range of parameters (Soroush, 2019). The method of machine learning has attracted much attention in the area of CO2 capture in chemical solvents due to its superior accuracy and better fitting compared to similar models (Jin, 2022; Shalaby, 2021; Wang, 2021; Yin, 2022; Zhang, 2022). However, these methods rely on the available data for training and some measurements are required to provide the CO2 solubility data to build the machine learning models. This can be regarded as the disadvantage of these models, as they are not recognized as pure predictive models for CO2 removal using chemical solvents.

Because of a recent development in technology known as machine learning, computers are now capable of learning from data without the need for explicit programming. This is one of the many applications of computer science (ML). One of the primary objectives of machine learning is to develop meta-programs that are able to analyze experimental data and then make use of that data as an input in the construction of models that make use of the same experimental data as an input (El Naqa and Murphy, 2015; Goodfellow et al., 2016; Shang, 2021; Tian, 2022; Alibak, 2022).

An optimal neural network approach to solving the function approximation problem based on neural networks is adaptive network-based fuzzy inference systems (ANFIS). This type of information system provides a mapping relationship between the input data and the output data through the use of crossbred learning methods. This is in order to identify the optimum distribution of membership functions for the input data. In the ANFIS architecture, artificial neural networks (ANNs) are employed in conjunction with fuzzy logic (FLs). The ANFIS modeling process can become more systematic and less dependent on expert knowledge when such a framework is used. In order to construct this system of inference, five layers are used. There are several nodes in every ANFIS layer, and each of these nodes is defined by a node function.

In this research, we have used the ANFIS method, but this method has already been used for the same data set in (Soroush, 2019). The difference of our work is that we have used metaheuristic methods including DE and FA to optimize this model. Therefore, the optimized ANFIS model is different from its raw model and its result is more reliable in terms of over-fitting and generality. The data for the modeling have been collected on the solubility of CO2 in amino acids salt solutions.

2

2 Material and methods

2.1

2.1 Data set of CO2 solubility

The dataset in this research have more than 800 data points that is taken from (Soroush, 2019) for loading CO2 into amino acid salt solutions. In the dataset, there are six input features including T(K), weight (%), PCO2(kPa), Mw_am, Mp C, and Mw_C and the single output is alfa which is the values of CO2 loading. The explanation of the parameters can be found in (Soroush, 2019) in which the data of inputs are related to the operational parameters of the absorption (e.g., P and T) as well as the physio-chemical parameters of the absorbent such as molecular weight and percentage of salt. In fact, the most important parameters have been taken into the model for prediction and description of the CO2 loading. The distribution of variables is shown in Fig. 1. In this figure, the diagonal subplots (with the x-axis and y-axis identical) also show kernel density estimates (KDEs). Similar to histograms, KDE plots show the distribution of observations.

Distributions of variables used in this work.
Fig. 1
Distributions of variables used in this work.

2.2

2.2 Adaptive network-based fuzzy inference system (ANFIS)

The base model employed in this work for correlation of the CO2 solubility data is ANFIS model. The ANFIS network is famous for modeling complex systems using a hybrid neuro-fuzzy approach (Jang and Sun, 1995; Jang, 1993). With ANFIS, a collection of fuzzy If-Then conditions (rules) is used to incorporate a fuzzy system's reasoning style, which is similar to human reasoning. As universal estimators, ANFIS models offer interpretable If-Then rules as well as universal approximation (Azeem, 2012; Sadrmomtazi et al., 2013).

We consider in this system only two inputs × and y alongside with a single output fout to illustrate the procedures of an ANFIS. Fig. 2 shows the ANFIS architecture that was used in this work. Below is a description of the node functions associated with each layer (Ziari, 2016).

Architecture schematics for ANFIS used in correlation of CO2 loading data.
Fig. 2
Architecture schematics for ANFIS used in correlation of CO2 loading data.

Layer 1: In this layer, the nodes are adaptive and can do tasks including as (Azeem, 2012): O 1 , i = μ A i x for i = 1 , i = 2 O 1 , i = μ B i - 2 y for i = 3 , i = 4

In this case, x (or y) represents the node's input. The linguistic label is Ai (or Bj), μ(x)/μ(y) represents the membership function. Most often, a bell-shaped distribution is used with an upper bound of 1 and a lower bound of 0.

Layer 2: The nodes in second layer are fixed nodes, outlined in circles and labeled Π, whose functions will be multiplied by input signals to generate an output signal (Azeem, 2012): O 2 , i = μ A i ( x ) · μ A i ( x ) = w i f o r i = 1 , 2

A rule's firing strength is represented by the output signal wi.

Layer 3: This layer relies on fixed nodes, represented by circles and labeled N, as well as a node function to Calculate the ratio of each node's firing strength to the sum of all rules' firing strengths to normalize firing strength (Azeem, 2012): O 3 , i = w i w i = w i w 1 + w 2 for i = 1 , 2

Layer 4: Adaptive nodes are indicated by a square as a part of this layer, and node functions are defined for each of these nodes (Azeem, 2012): O 3 , i = w ¯ i · f i f o r i = 1 , 2

In this case, f1, f2 are the if-then fuzzy rules as (Azeem, 2012):

  • Rule 1: if (x==A1 & y==B1):

f 1 = p 1 x + q 1 y + r 1
  • Rule 2: if (x==A2 & y==B2):

f 2 = p 2 x + q 2 y + r 2

Here, the parameters set, or consequent parameters, are [pi, qi, ri] (Chen et al., 2010).

Layer 5: There are four fixed nodes in this layer, each indicated by one circle and tagged with the Sigma, with a node function to calculate the total output (Azeem, 2012): O 5 = w ¯ i · f i = f out f o r i = 1 , 2

In ANFIS, the error signal is calculated recursively from the output layer to the input layer using backpropagation gradient descent. The backpropagation learning rule is the same for feedforward neural networks (Topçu and Sarıdemir, 2008; Ramezanianpour, 2004).

2.3

2.3 Model optimization

In order to optimize the ANFIS model hyper-parameters, we have raised the problem as a multi-criteria optimization problem and optimized with two meta-heuristic and nature-inspired methods, which are briefly explained below.

A popular example of bio-inspired algorithm is Differential Evolution (DE) (Das and Suganthan, 2011) which is a stochastic algorithm that solves numerical optimization problems by using evolutionary operators, selection, crossover, and mutation in order to create a population of the algorithm. Using firefly algorithms (FA) (Yang, 2010) by Yang, a method that is based on how flashes communicate with each other and the flashing behavior of fireflies.

Different objectives based on statistical errors and score types are considered here to optimize ANFIS model. These types of errors are defined as follows (Botchkarev, 2018): RMSE = 1 n i = 1 n ( t i - o i ) 2 MAPE = 1 n ( i = 1 n t i - o i t i ) × 100

Also, it is also possible to compute the absolute fraction of variance (R2) based on the following formula (Botchkarev, 2018): R 2 = ( n t i o i - t i o i ) 2 ( n t i 2 - t i 2 ) - ( n a i 2 - ( a i ) 2 )

The n identifies the quantity of data, the t indicates the measured value, and the o specifies the estimated value.

In comparison between experimental and predicted outputs, RMSE and MAPE provide a precise measure of absolute error and total error. The RMSE and MAPE values are low, indicating that the predicted results are approximately equivalent to the experimental data. Moreover, big numbers indicate a large difference between predictions and results (Botchkarev, 1809).

2.4

2.4 Differential evolution

In 1995, differential evolution (DE) was introduced to the public as a population-based stochastic method for global optimization. There have been many developments in DE research during the last 20 years, but there are still new applications emerging (Das and Suganthan, 2011; Chakraborty, 2008). The DE mutation operator initially prioritizes exploration as the process of evolution progresses. The mutation operator tends to be more in favor of exploitation as evolution continues. In this way, DE automatically adjusts the mutation increments to an optimal value depending on where the evolution process currently is. Accordingly, there is no hard and fast rule for the distribution of possible outcomes when it comes to mutation in DE. It has been widely acknowledged as a precise optimization method that is also fast and reliable (Hegerty et al., 2009).

2.5

2.5 Firefly algorithm

Developed on the behavior of fireflies (flashing), Yang (Yang, 2010) proposed Firefly Algorithm which can be viewed as an unconventional heuristic based on swarms for optimization tasks. The method is an iterative technique based on a population of agents (seen as fire flies) who collaborate to find an optimal solution to a given optimization issue. As a result of bioluminescent glowing, the agents in this system can communicate with each other, allowing them to conduct more efficient cost function search compared to random search with standard distribution, assuming that the optimization problem can be regarded as an agent (firefly) that glows in response to its performance in the specified problem setting.

3

3 Results and discussions

The model optimization is the main novelty of this research as mentioned above. The result of this step is shown in Fig. 3. This figure shows that the FA optimization makes the best performance on near to 180 iterations. So, the final ANFIS model is selected at the hyper-parameters of this setting. The future analysis is done based on this model.

Tuning with metaheuristics.
Fig. 3
Tuning with metaheuristics.

Fig. 4 shows the comparison of expected and predicted values by the model developed and tuned in this study. The red points are the test data, and the blue points are the training data, which are randomly separated with a ratio of 2 to 8. The distance of each point from the diagonal line indicates its prediction error. By interpreting this figure, we realize that the obtained model has high generality and accuracy because the test points are not too far apart from the training points. Indeed, the model has been well tuned such that the CO2 loading data can be well estimated using the developed model.

Actual and Estimated Values with ANFIS Model.
Fig. 4
Actual and Estimated Values with ANFIS Model.

The same fact is confirmed by Table 1 because the error values as well as the R2 score in testing and training are close to each other. Indeed, the statistical analysis is reported in Table 1 indicating that the R2 values for both training and validation of the model are greater than 0.95 which confirms the goodness of fitting data points.

Table 1 Final ANFIS Results.
Metric R2 MAPE RMSE
Train 0.9841 1.08E-01 9.10E-02
Test 0.9520 1.17E-01 1.14E-01

Fig. 5 compares the relative importance values of features on target output. Permutation is used to determine the importance of features in this case. If the data are tabular, any fitted estimator can be examined using permutation feature importance. For non-linear estimation methods or opaque methods, it will be beneficial. Permutation feature importance describes the effect of randomly shuffling a single feature value on the model score. Using this method, there is no connection between the feature and the destination. Therefore, if the model's score goes down, it means it is overly dependent on that attribute. It can be also seen that the partial pressure of CO2 in the gas feed plays important role in the amount of CO2 loading in the solvent as predicted by the model. This is in agreement with the physical observation as the mass transfer of CO2 from gas phase to the absorbent phase is controlled and driven by the partial pressure gradient between two phases.

Permutation importance of features.
Fig. 5
Permutation importance of features.

4

4 Conclusion

As part of this investigation, we devised a sophisticated model for estimating the amount of carbon dioxide that can be loaded onto amino acid salt solutions. This model employs a metaheuristic optimized ANFIS that is derived from a diverse selection of amino acids. The innovative aspect of this research is the utilization of the Differential Evolution (DE) and Firefly Algorithm (FA) metaheuristics in order to address the hyper-parameter tuning of ANFIS as an optimization problem that is based on differential evolution. As a consequence of this, the ANFIS model's R2 score for the test data is 0.9520, whereas its score for the training data is 0.9841. This demonstrates that the model that was proposed is both comprehensive and reliable in terms of the predictions it makes. The closeness of scores and errors in testing and training shows the lack of over-fitting of the models, which has not been investigated and analyzed in many similar studies. The RMSE error rate is 1.17E-01 whereas the MAPE error rate is 1.14E-01. Both of these error rates are almost identical to each other.

Acknowledgement

This work was supported by Al-Mustaqbal University College (grant number MUS-E-0122).

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

References

  1. , . Simulation the adsorption capacity of polyvinyl alcohol/carboxymethyl cellulose based hydrogels towards methylene blue in aqueous solutions using cascade correlation neural network (CCNN) technique. J. Clean. Prod.. 2022;337:130509
    [Google Scholar]
  2. Azeem, M.F., Fuzzy inference system: theory and applications. 2012: BoD–Books on Demand.
  3. , . Environmental and climate justice and technological carbon removal. Electric. J.. 2021;34(7):107002
    [Google Scholar]
  4. Botchkarev, A., Performance metrics (error measures) in machine learning regression, forecasting and prognostics: Properties and typology. arXiv preprint arXiv:1809.03006, 2018.
  5. Botchkarev, A., Evaluating performance of regression machine learning models using multiple error metrics in azure machine learning studio. Available at SSRN 3177507, 2018.
  6. Chakraborty, U., Advances in differential evolution. Vol. 143. 2008: Springer.
  7. , , , . Forecasting tourist arrivals by using the adaptive network-based fuzzy inference system. Expert Syst. Appl.. 2010;37(2):1185-1191.
    [Google Scholar]
  8. , . New insights into the effects of methane and oxygen on heat/mass transfer in reactive porous media. Int. Commun. Heat Mass Transfer. 2021;129:105652
    [Google Scholar]
  9. , , . Differential evolution: a survey of the state-of-the-art. IEEE Trans. Evol. Comput.. 2011;15(1):4-31.
    [Google Scholar]
  10. , , . A comprehensive review on the hydrodynamics, mass transfer and chemical absorption of CO2 and modelling aspects of rotating packed bed. Sep. Purif. Technol.. 2022;295:121248
    [Google Scholar]
  11. , , . What is machine learning? In: Machine Learning in Radiation Oncology. Springer; . p. :3-11.
    [Google Scholar]
  12. , . Emission and control of flue gas pollutants in CO2 chemical absorption system – a review. Int. J. Greenhouse Gas Control. 2020;93:102904
    [Google Scholar]
  13. , , , . Machine learning basics. Deep learning. 2016;1(7):98-164.
    [Google Scholar]
  14. 34. Hegerty, B., C.-C. Hung, and K. Kasprak. A comparative study on differential evolution and genetic algorithms for some combinatorial problems. in Proceedings of 8th Mexican International Conference on Artificial Intelligence. 2009.
  15. , . ANFIS: adaptive-network-based fuzzy inference system. IEEE Trans. Syst., Man, Cybernetics. 1993;23(3):665-685.
    [Google Scholar]
  16. , , . Neuro-fuzzy modeling and control. Proc. IEEE. 1995;83(3):378-406.
    [Google Scholar]
  17. , . Computational simulation using machine learning models in prediction of CO2 absorption in environmental applications. J. Mol. Liq.. 2022;358:119159
    [Google Scholar]
  18. , . CFD modeling on the chemical absorption of CO2 in a microporous tube-in-tube microchannel reactor. Fuel. 2022;327:125064
    [Google Scholar]
  19. , . Spatial and temporal distribution characteristics of haze and pollution particles in China based on spatial statistics. Urban Clim.. 2022;41:101031
    [Google Scholar]
  20. , . Mass transfer of chemical absorption of CO2 in a serpentine minichannel. Chem. Eng. J.. 2021;414:128791
    [Google Scholar]
  21. Ramezanianpour, A., M. SOBHANI, and J. Sobhani, Application of network-based neuro-fuzzy system for prediction of the strength of high strength concrete. 2004.
  22. , , , . Modeling compressive strength of EPS lightweight concrete using regression, neural network and ANFIS. Constr. Build. Mater.. 2013;42:205-216.
    [Google Scholar]
  23. , . Integrated climate-change assessment scenarios and carbon dioxide removal. One Earth. 2020;3(2):166-172.
    [Google Scholar]
  24. , . A machine learning approach for modeling and optimization of a CO2 post-combustion capture unit. Energy. 2021;215:119113
    [Google Scholar]
  25. , . Effects of activated carbon on liquefaction resistance of calcareous sand treated with microbially induced calcium carbonate precipitation. Soil Dyn. Earthquake Eng.. 2022;161:107419
    [Google Scholar]
  26. , . Haze prediction model using deep recurrent neural network. Atmosphere. 2021;12(12):1625.
    [Google Scholar]
  27. , . ANFIS modeling for prediction of CO2 solubility in potassium and sodium based amino acid Salt solutions. J. Environ. Chem. Eng.. 2019;7(1):102925
    [Google Scholar]
  28. , . Smog prediction based on the deep belief - BP neural network model (DBN-BP) Urban Clim.. 2022;41:101078
    [Google Scholar]
  29. , , . Prediction of mechanical properties of recycled aggregate concretes containing silica fume using artificial neural networks and fuzzy logic. Comput. Mater. Sci.. 2008;42(1):74-82.
    [Google Scholar]
  30. , . Machine learning-based ionic liquids design and process simulation for CO2 separation from flue gas. Green Energy Environ.. 2021;6(3):432-443.
    [Google Scholar]
  31. , . Firefly algorithm, stochastic test functions and design optimisation. Int. J. Bio-Inspired Comput.. 2010;2(2):78-84.
    [Google Scholar]
  32. , . Multiple machine learning models for prediction of CO2 solubility in potassium and sodium based amino acid salt solutions. Arabian J. Chem.. 2022;15(3):103608
    [Google Scholar]
  33. , . Development of hybrid machine learning model for simulation of chemical reactors in water treatment applications: Absorption in amino acid. Environ. Technol. Innovation. 2022;27:102417
    [Google Scholar]
  34. , . Analysing the accuracy of pavement performance models in the short and long terms: GMDH and ANFIS methods. Road Mater. Pavement Des.. 2016;17(3):619-637.
    [Google Scholar]
Show Sections