Thank you for visiting nature.com. You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.
Advertisement
Scientific Reports volume 16, Article number: 19029 (2026)
245
Metrics details
Accurate short-term photovoltaic (PV) power forecasting at sub-hourly resolutions is essential for maintaining grid reliability and supporting renewable energy integration. This study benchmarks six machine learning models against a persistence baseline using high-resolution data from the Yulara Solar System (327.6 kW) resampled at 5-, 10-, and 15-minute intervals. A comprehensive preprocessing framework was developed, incorporating cyclical temporal encoding, lag and rolling statistical features, interquartile-range-based outlier filtering, and hyperparameter optimization using RandomizedSearchCV with TimeSeriesSplit. The evaluated models include XGBoost, Decision Tree, Linear Regression, Ridge, Lasso, and Elastic Net. Performance was assessed using multiple error metrics (MAE, RMSE, WAPE, SMAPE, and skill score) across training, validation, testing, seasonal subsets, and stress scenarios involving night-time and highly variable irradiance conditions. Results demonstrate that tree-based models substantially outperform linear regressors in both accuracy and robustness. The Decision Tree achieved the best overall performance, with MAE values of 0.33–0.54 kW and RMSE values of 0.82–1.65 kW, corresponding to 97–98% skill-score improvements over persistence forecasting. XGBoost also showed strong and consistent performance across temporal resolutions and seasonal variations. In contrast, linear models exhibited limited capability in capturing nonlinear PV behavior, particularly under highly variable conditions. These findings highlight the suitability of tree-based approaches for accurate, interpretable, and resilient short-term PV forecasting in smart grid applications.
The global transition to renewable energy is a necessary response to climate change, the depletion of fossil fuel resources, and the adverse ecological consequences associated with traditional energy production1. Solar photovoltaic (PV) systems have emerged as a sustainable and cost-effective solution due to their scalability and declining costs2. However, their intermittent output, influenced by meteorological factors such as cloud cover, temperature, and humidity, alongside diurnal cycles, poses challenges for grid integration, energy market participation, and microgrid operation3. Accurate short-term PV forecasting at sub-hourly resolutions (e.g., 5, 10, or 15 min) is essential for grid stability, scheduling optimization, and economic efficiency in applications like grid interconnection and microgrid management4,5. High-resolution (5-minute) forecasts enable rapid real-time control actions, including automatic generation control, frequency regulation, and microgrid islanding decisions, where delays of only a few minutes may compromise system stability6,7. This study leverages operational data from the Yulara Solar System, a real-world 327.6 kW PV installation, to address these challenges using machine learning models.
PV power forecasting is categorized by prediction horizon: ultra-short-term (minutes to hours) for real-time monitoring and control, short-term (days to weeks) for operational planning and trading, and medium- to long-term (months to years) for capacity planning and policy design8. Ultra-short-term forecasting enables rapid adjustments to maintain grid reliability, with accuracy improving at finer resolutions9. A wide range of forecasting approaches have been developed, broadly grouped into physical, statistical, machine learning (ML), deep learning (DL), and hybrid methods10. Statistical models, including regression techniques11, autoregressive integrated moving average (ARIMA) models12, and gray system theory8, are effective in identifying temporal cycles but often fall short when confronted with the nonlinear and stochastic behavior of PV generation. ML techniques, such as support vector regression (SVR)13 and light gradient boosting machine (LightGBM)14,15, have proven more capable of capturing nonlinearities and complex dynamics. DL architectures extend these capabilities further: for example, long short-term memory (LSTM) networks applied to day-ahead forecasts16, convolutional neural networks (CNN) combined with bidirectional LSTM (CNN–BiLSTM)17, and optimized convolutional LSTMs (CLSTMs)18 have demonstrated significant improvements in forecast performance. Hybrid approaches, such as integrating wavelet transforms with quantile regression neural networks (QRNNs) for probabilistic wind speed forecasting, are increasingly being adapted for PV applications as well19. Recent advancements include novel DL architecture like TimesNet20, which have been tailored for complex time-series analysis. Complementary preprocessing techniques, such as empirical mode decomposition (EMD)21, further contribute to reducing errors and improving model reliability22. Additionally, the effectiveness of both ML and DL–based forecasting is closely tied to hyperparameter optimization. Conventional methods such as grid search23 often involve high computational cost, prompting the adoption of more efficient alternatives like Bayesian optimization24.
Despite these advances, most existing studies using raw meteorological and power data face inherent limitations during periods of high volatility, such as rapid cloud passages. Signal decomposition techniques (e.g., variational mode decomposition (VMD), wavelet packet decomposition (WPD)) can separate complex PV signals into more predictable intrinsic modes, improving forecast stability under stress conditions25,26. Moreover, recent attention-based architectures, including hierarchical frequency-aware transformers and hybrid transformer-BiLSTM models27,28, have shown superior capability in capturing multi-scale temporal dependencies. While such DL methods offer high accuracy, they require large datasets, extensive computational resources, and are less interpretable than tree-based models. Therefore, the present study focuses on classical machine learning models (linear, regularised, and tree-based) to establish a clear, reproducible benchmark for sub-hourly PV forecasting under seasonal and stress conditions. DL architectures, including LSTMs and transformers, are deliberately excluded to maintain focus on interpretability, computational efficiency, and practical deployability; they are left for future work. Moreover, DL models typically require substantially larger datasets (e.g., multiple years of high-frequency data from many sites) to generalize well, whereas the single-site Yulara dataset, while large, is not ideal for training deep networks without risk of overfitting.
Although recent advancements in PV power forecasting have been substantial, several important research gaps remain unaddressed. Many studies, such as8,10, primarily emphasize point forecasting or a single prediction horizon, while systematic evaluations of multiple machine learning models at high temporal resolutions (e.g., 5-, 10-, and 15-minute intervals) using actual field data are still relatively limited. Furthermore, the majority of approaches rely on raw data without signal decomposition, making them vulnerable to high-volatility events25,26. Moreover, edge-case scenarios, such as periods of high variability (rapid irradiance fluctuations due to cloud cover) or night-time conditions (zero PV output), are often neglected, despite their importance for ensuring forecast robustness and reliability22. Another limitation lies in the breadth of evaluation frameworks: few studies employ a diverse set of error indicators, such as normalized RMSE (nRMSE), symmetric mean absolute percentage error (SMAPE), or skill scores, which capture different aspects of error magnitude, symmetry, and relative improvement, and visual comparative analyses are rarely incorporated to provide deeper performance insights24. Likewise, preprocessing pipelines frequently omit essential components such as advanced feature engineering and temporal encoding, constraining the adaptability of models to diverse operating conditions10. Equally important, the issue of computational efficiency in hyperparameter optimization remains insufficiently explored. Conventional approaches, such as grid search, tend to be computationally demanding, underscoring the need for more efficient optimization techniques to balance accuracy with scalability21. However, systematic multi-resolution benchmarking of machine learning models under real-world, high-variability scenarios, together with computational efficiency analysis, is still insufficiently addressed in recent literature.
To address these limitations, this study develops a comprehensive framework for short-term PV power forecasting using operational data from the Yulara Solar System (327.6 kW). The dataset includes 14 measured variables, including Active Power, Wind Speed, and Pyranometer_1, and is resampled at 5-, 10-, and 15-minute intervals. First, a robust preprocessing pipeline is developed, incorporating interquartile-range-based outlier detection, cyclical temporal encoding, and lagged and rolling statistical feature engineering. Hyperparameter optimization is performed using RandomizedSearchCV combined with TimeSeriesSplit to improve computational efficiency while preserving temporal consistency. Second, six ML models, namely Decision Tree, XGBoost, Linear Regression, Ridge, Lasso, and Elastic Net, are systematically benchmarked against a persistence baseline. Emphasis is placed on tree-based approaches because of their interpretability and ability to capture nonlinear PV dynamics. Third, extensive model evaluation is conducted using diverse performance metrics, including Mean Absolute Error (MAE), Root Mean Squared Error (RMSE), normalized MAE (nMAE), nRMSE, coefficient of determination (R²), Weighted Absolute Percentage Error (WAPE), SMAPE and skill score. Evaluation is performed across training (70%), validation (15%), and testing (15%) subsets, as well as seasonal divisions and stress scenarios involving night-time and highly variable irradiance conditions. Unlike simulation-based studies, this work relies entirely on real operational field data, thereby enhancing the practical relevance and applicability of the proposed framework. The overall research workflow adopted in this study is illustrated in Fig. 1, which summarizes the main stages, including data acquisition, dataset preparation, preprocessing, model development, forecasting, and performance evaluation. In addition, Table 1 provides a comparative overview of recent solar forecasting studies and highlights the methodological position and contribution of the current work relative to existing literature.
The main contributions of this study are summarized as follows:
A robust and scalable preprocessing framework for short-term PV forecasting is developed, incorporating cyclical temporal encoding, lagged and rolling statistical feature engineering, and effective outlier handling to improve model stability across multiple temporal resolutions.
A comprehensive multi-metric evaluation framework is established using MAE, RMSE, nMAE, nRMSE, WAPE, SMAPE, R², and skill score, combined with walk-forward validation and seasonal analysis to assess forecasting accuracy, temporal generalization, and robustness under real operational conditions.
Focused stress testing is conducted under challenging operating scenarios, including night-time conditions and periods of highly variable irradiance, to evaluate the resilience and reliability of forecasting models for practical deployment.
Seasonal analyses across spring, summer, autumn, and winter are performed to quantify model adaptability under varying environmental and operational conditions, reinforcing the applicability of the proposed methods for smart grid and renewable energy integration systems.
The remainder of this paper is organized as follows. Section 2 describes the methodology, including the dataset, preprocessing procedures, forecasting models, and evaluation strategy. Section 3 presents experimental results and discussion. Finally, Sect. 4 concludes the paper and outlines directions for future research.
Research methodology.
The analysis in this study is based on a solar PV dataset collected from the Yulara Solar System36 located adjacent to Uluru in central Australia. The study focuses on a subsystem with a rated capacity of 327.6 kW. The dataset comprises 964,421 records and includes 15 variables: timestamp, active power, wind speed, ambient temperature (°C), global horizontal radiation, wind direction, daily rainfall, maximum wind speed, atmospheric pressure, hail accumulation, Pyranometer_1, Temperature_Probe_1, Temperature_Probe_2, Current_Phase_Average, and cumulative active energy delivered and received. The timestamp variable was used as the temporal index. The dataset spans from April 2, 2016 (08:00:00) to July 16, 2025 (16:30:00), corresponding to approximately 9.29 years of historical operational data. Although the endpoint extends beyond the current calendar date, it reflects the complete temporal coverage contained in the source dataset.
An initial data integrity assessment identified 12,578 missing intervals at the native 5-minute resolution, representing approximately 1.29% of the expected 976,999-time steps. Several intermittent gaps were observed throughout the dataset, including missing intervals between July 17, 2016, and September 28, 2020. To preserve temporal causality and prevent information leakage, the dataset was first divided chronologically into training (70%), validation (15%), and testing (15%) subsets. Missing timestamps were subsequently handled during the imputation stage described in Sect. 2.2.
Separate experiments were conducted for each target temporal resolution (5, 10, and 15 min). In each case, the raw dataset was first partitioned chronologically, after which each subset was independently resampled to the corresponding temporal resolution using mean aggregation. Following resampling, several feature columns (e.g., ambient temperature and global horizontal radiation) still contain sparse NaN values, particularly near the end of the dataset. These missing values were addressed during the imputation stage described in Sect. 2.2. Table 2 provides a comprehensive overview of dataset dimensions across the chronological splits and independently resampled partitions, highlighting variations introduced by temporal resampling and gap handling within each subset.
The preprocessing framework followed a strict, temporally consistent workflow to ensure reproducibility and prevent information leakage. Operations requiring parameter estimation, such as outlier threshold calculation and feature scaling, were derived exclusively from the training subset and subsequently applied to the validation and testing subsets. In contrast, partition-independent transformations, including temporal feature extraction, lag construction, rolling statistics, clipping, and imputation, were performed separately within each dataset partition using only information available within that partition.
Step (1) Chronological split into training, validation, and test sets.
Immediately after loading the raw dataset, the observations were partitioned chronologically to prevent future information from leaking into the training process. Maintaining the temporal sequence of observations is essential in time-series forecasting because it ensures that the model is trained exclusively on historical data and evaluated using unseen future observations. To preserve methodological rigor, the chronological order of the dataset was maintained throughout all preprocessing and modeling stages.
The dataset was divided into three subsets. The training subset comprised 70% of the observations and was used for model training and parameter estimation. The validation subset accounted for 15% of the data and was used for hyperparameter tuning and performance monitoring during model development. The remaining 15% constituted the testing subset, which was reserved exclusively for final evaluation of predictive performance.
Step (2) – Resampling to standardize temporal resolution.
Following chronological partitioning, each subset (training, validation, and testing) was independently resampled to target resolutions of 5, 10, and 15 min using mean aggregation. This process standardized the temporal index and automatically generated rows corresponding to missing timestamps, resulting in NaN values where observations were unavailable. The aggregation interval represented the only variation across the three sampling schemes. To maintain methodological consistency, the same preprocessing framework, feature engineering procedures, and variable selection strategy were applied across all temporal resolutions.
Step (3) – Outlier detection, clipping, and imputation.
Outlier handling for the Active_Power variable was performed exclusively on the training subset using the interquartile range (IQR) method with a threshold of 1.5 × IQR. Observations located more than 1.5 × IQR below the first quartile or above the third quartile were identified as outliers and subsequently replaced with missing values. To maintain methodological consistency and prevent information leakage, the same outlier thresholds derived from the training subset were applied unchanged to the validation and testing subsets.
To ensure the physical plausibility of the measurements, several clipping constraints were implemented before the imputation stage. Wind- and irradiance-related variables, including Wind_Speed, Max_Wind_Speed, Global_Horizontal_Radiation, Pyranometer_1, Weather_Daily_Rainfall, and Hail_Accumulation, were restricted to non-negative values. Temperature probe measurements were capped at 60 °C to eliminate unrealistic readings, while Wind_Direction values were constrained to the valid range of 0°–360°. In addition, irradiance-related variables were bounded to a maximum value of 1200 W/m². Any measurement below − 100, except for Active_Power, was considered invalid and treated as a missing value.
Missing values within the training subset were imputed using time-aware interpolation restricted exclusively to the training partition, ensuring that no information outside the training period was introduced. This was followed by forward filling to preserve temporal continuity. For the validation and testing subsets, only forward filling was applied, while interpolation was intentionally excluded to avoid potential information leakage. Furthermore, constant-valued features, defined as variables containing only a single unique value within the training subset, were removed from all dataset partitions. This step reduced data dimensionality and minimized the risk of model overfitting.
Step (4) – Feature engineering (temporal, lag, and rolling features).
To improve forecasting performance, several features were engineered independently within each dataset partition while ensuring that only locally available information was used. This approach preserved temporal integrity and prevented information leakage between the training, validation, and testing subsets.
Temporal features were generated to capture daily and seasonal patterns within the time-series data. These features included hour of day and day of year, together with their sinusoidal and cosinusoidal transformations (hour_sin, hour_cos, day_sin, and day_cos). The cyclical transformations enabled the models to better represent periodic behaviors associated with diurnal and annual variations.
In addition, lag-based features were generated from the Active_Power variable to incorporate historical dependencies into the forecasting process. Three lag features were constructed using lag intervals of 1-, 12-, and 288-time steps. Because lagging was performed after resampling, the effective temporal duration represented by each lag depended on the sampling resolution. For example, under the 5-minute resolution, the lag features corresponded to 5 min, 1 h, and 24 h, respectively, whereas under the 15-minute resolution they represented 15 min, 3 h, and 72 h.
Rolling statistical features were also introduced to capture short-term temporal trends and smoothing effects in the PV generation data. These included Active_Power_rolling_mean_12 and Active_Power_rolling_mean_288, representing moving averages computed over the previous 12- and 288-time steps, respectively. Because these rolling windows were applied after resampling, their effective temporal durations varied according to the sampling resolution. To prevent information leakage, all rolling statistics were shifted by one time step so that the current observation was excluded from feature construction. Finally, rows containing missing values generated during lagging or shifting operations were removed to ensure data completeness and consistency across all dataset partitions.
Step (5) – Feature selection, scaling, and explicit prevention of data leakage.
Feature selection using the SelectKBest method with the f_regression criterion was embedded within the model training pipeline and evaluated using a 5-fold TimeSeriesSplit cross-validation strategy. Feature selection was performed on scaled training features within each cross-validation fold to ensure consistent feature ranking under normalized distributions. This approach ensured that feature selection was conducted independently within each training fold, thereby preserving temporal order and preventing information leakage from future observations.
For each fold, the top 15 predictors were selected according to their f_regression scores. Feature-selection stability was subsequently analysed by identifying predictors consistently selected across cross-validation folds. The features consistently identified across the folds included Wind_Speed, Active_Power_rolling_mean_12, Pyranometer_1, Active_Power_lag_12, hour_cos, Active_Power_lag_1, Max_Wind_Speed, Temperature_Probe_1, Temperature_Probe_2, Global_Horizontal_Radiation, Weather_Temperature_Celsius, Active_Power_rolling_mean_288, Wind_Direction, Active_Power_lag_288, and hour_sin. The value of k = 15 was determined through preliminary experiments to provide an effective balance between predictive performance and model complexity.
Following feature selection, the target variable (Active_Power) was separated from the predictor variables, resulting in the feature matrices X_train, X_val, and X_test, together with the corresponding target vectors y_train, y_val, and y_test.
Feature scaling was subsequently applied using MinMaxScaler, which normalized all variables to the range [0, 1]. The scaler was fitted exclusively on the training subset and then applied to transform the validation and testing subsets. This procedure ensured that no information from unseen data contributed to the scaling parameters, thereby preventing information leakage.
Several additional measures were implemented throughout preprocessing and model development to preserve temporal integrity and avoid information leakage. Hyperparameter optimization using RandomizedSearchCV was conducted exclusively within the training subset using TimeSeriesSplit cross-validation to maintain chronological order. Similarly, walk-forward validation, described in Sect. 3.6, was performed exclusively on the training subset. Furthermore, all preprocessing parameters associated with imputation, feature engineering, feature selection, and scaling were derived exclusively from the training subset before being applied to the validation and testing subsets.
Figure 2 summarizes the complete Preprocessing workflow. The proposed preprocessing pipeline provides a robust, consistent, and temporally rigorous framework for model development and evaluation. The methodology ensures reproducibility, scalability across multiple temporal resolutions, and strict independence between the training, validation, and testing subsets, thereby supporting reliable and unbiased predictive performance assessment.
Preprocessing framework.
In this study, six machine learning algorithms, namely eXtreme Gradient Boosting (XGBoost), Decision Tree Regressor, Linear Regression, Ridge Regression, Lasso Regression, and Elastic Net, were systematically evaluated against a persistence model used as the baseline benchmark. The persistence model predicts Active_Power using a fixed lag of 288-time steps, corresponding to 1 day at 5-minute resolution, 2 days at 10-minute resolution, and 3 days at 15-minute resolution.
A fixed lag of 288 steps was selected instead of conventional 1-hour or 24-hour persistence baselines to provide a consistent step-based benchmark across all temporal resolutions. In addition, the corresponding multi-day forecasting horizons better capture persistent weather patterns, such as consecutive cloudy periods, which are relevant under seasonal stress conditions. A shorter persistence horizon, such as 1 h, would produce unrealistically low forecasting errors because of strong short-term autocorrelation, thereby artificially inflating skill scores and reducing the rigor of the benchmark. For comparison purposes, a conventional 24-hour persistence baseline was also evaluated, and the corresponding results are provided in Table S1 of the Supplementary Materials.
The modeling framework followed a structured pipeline integrating feature selection using SelectKBest (k = 15, f_regression criterion) to maintain consistency with the preprocessing framework while retaining the most statistically significant predictors. Hyperparameter optimization was conducted using RandomizedSearchCV combined with 5-fold TimeSeriesSplit cross-validation to preserve temporal ordering and minimize information leakage.
RandomizedSearchCV was selected over Bayesian optimization for three primary reasons. First, the hyperparameter search spaces presented in Table 3 are relatively low dimensional, allowing random sampling with 15–25 iterations to identify near-optimal configurations efficiently. Second, RandomizedSearchCV supports parallel execution across cross-validation folds, thereby reducing total tuning time compared with sequential Bayesian approaches. Third, its stochastic search strategy, combined with a fixed random seed, ensures reproducible results without the additional complexity associated with surrogate modeling. Nevertheless, Bayesian optimization remains a promising direction for future work involving larger models or more complex hyperparameter spaces.
For XGBoost, early stopping with 10 rounds was implemented to reduce the risk of overfitting. The explored hyperparameter search spaces for each algorithm are summarized in Table 3, while the optimized configurations and corresponding tuning times are presented in Table 4. Model performance was evaluated across forecasting intervals of 5, 10, and 15 min to examine the adaptability of the tuning process under different temporal resolutions.
The XGBoost algorithm, first presented in 201638, is a highly optimized implementation of the gradient boosting framework. It operates as an ensemble learning technique in which multiple weak learners, typically shallow decision trees, are combined to construct a robust predictive model. Through a sequential learning strategy, each newly generated tree attempts to correct the residual errors produced by previous trees, thereby improving predictive accuracy while maintaining computational efficiency and scalability. These characteristics make XGBoost particularly suitable for large-scale, high-dimensional, and complex datasets.
At its core, XGBoost follows supervised learning principles by learning relationships between input features and target outputs to generalize effectively to unseen data. The algorithm can be applied to both regression and classification problems. Its methodological foundation lies in gradient boosting, where trees are constructed sequentially to minimize residual prediction errors from preceding iterations. As illustrated in Fig. 3, the process begins with an initial model, after which subsequent trees progressively reduce the prediction error by modeling the residuals. This iterative refinement continues until a stopping criterion or convergence condition is satisfied. The final prediction is obtained as the cumulative sum of the contributions from all trees within the ensemble.
The Supervised Learning Process of XGBoost.
The procedure can be interpreted as an iterative refinement process, beginning with an initial prediction, commonly initialized to zero, after which successive trees are incrementally introduced to reduce the residual errors. Mathematically, the model can be expressed as:
where, (:{widehat{y}}_{i}) denotes the final predicted output for the ith data instance, K represents the total number of trees in the ensemble, and (:{f}_{k}left({x}_{i}right)) corresponds to the prediction generated by the Kth tree for the same data point.
In XGBoost, the optimization objective is composed of two components: the loss function and a regularization term. The loss function evaluates the degree of fit between the model’s predictions and the observed data, while the regularization term penalizes overly complex trees to encourage simpler, more generalizable structures. The generic formulation of the objective function ((:objleft(theta:right))) can be written as:
where, (:lleft({y}_{i}{widehat{,y}}_{i}right)) represents the loss function, which quantifies the discrepancy between the actual value (:{y}_{i}) and the model’s prediction (:{widehat{y}}_{i}), while (:varOmega:left({f}_{k}right)) denotes the regularization component designed to penalize excessive model complexity and prevent overfitting.
Rather than constructing the entire model in a single step, XGBoost adopts an iterative optimization strategy. The process begins with an initial prediction, typically set as (:{widehat{y}}_{i})(0) = 0. At each subsequent iteration, a new tree is introduced to refine the model and reduce residual errors. The prediction for the ith data point after the addition of the tth tree can therefore be expressed as:
In this expression, (:{widehat{y}}_{i}^{left(t-1right)}:)denotes the prediction obtained from the previous iteration, while (:{f}_{t}left({x}_{i}right)) represents the output of the tth tree for the ith data instance.
The regularization component, (:varOmega:left({f}_{k}right)), serves to control model complexity by introducing penalties related to both the number of leaves in a decision tree and the magnitude of the leaf weights. Its general formulation can be expressed as:
where, T denotes the total number of leaves in the tree, (:gamma:)represents a regularization parameter that regulates the complexity of the tree structure, and λ corresponds to the coefficient applied to penalize the squared magnitude of the leaf weights wj.
When constructing the tree, the decision on how to split a node is guided by the computation of the information gain for each potential division. The information gain associated with a candidate split is defined as:
where, GL and GR represent the aggregated gradient values for the left and right child nodes, respectively, while HL and HR denote the corresponding sums of the second-order derivatives.
By evaluating the information gain across all potential splits at each node, XGBoost systematically identifies the split that maximizes this gain. This selection strategy ensures that the chosen partition yields the greatest reduction in prediction error, thereby enhancing the overall performance and accuracy of the model.
Decision Tree Regression is a predictive modeling technique that employs a hierarchical tree structure to estimate continuous target values. The method recursively partitions the dataset into smaller subsets using feature-based thresholds selected to minimize prediction error. At each decision node, the algorithm determines both the optimal feature and the corresponding split threshold that maximize error reduction within the resulting partitions. This recursive process continues until predefined stopping criteria are satisfied, ultimately producing a tree in which each terminal node corresponds to a constant output value assigned to all observations within that region.
The structure of a decision tree regressor.
As illustrated in Fig. 4, a Decision Tree Regressor consists of three main components: the Root Node, representing the complete dataset at the initial stage; the Split Nodes, which partition the data according to feature thresholds to reduce prediction error; and the Leaf Nodes, which assign constant prediction values to the final partitions. Decision trees are particularly effective in modeling complex and nonlinear relationships without assuming linearity, thereby providing flexible and interpretable predictions through transparent decision rules.
However, a key limitation of decision trees is their tendency to overfit when the tree becomes excessively deep or contains too many splits, reducing generalization capability. To mitigate this issue, regularization techniques such as pruning and constraints on maximum tree depth or minimum samples per leaf are commonly applied to balance predictive accuracy and model complexity.
Linear Regression is a fundamental supervised learning algorithm that models the relationship between predictor variables and a target variable using an optimized linear function. The method assumes a proportional linear relationship between independent and dependent variables. Its simplicity, interpretability, and computational efficiency make it one of the most widely used predictive techniques and a foundational component of more advanced models, including logistic regression and neural networks. Owing to these characteristics, Linear Regression has broad applications in domains such as finance, healthcare, and energy forecasting.The equation describing the best-fit line in simple linear regression is given by:
where, ({hat y}) denotes the predicted outcome of the dependent variable, (:x) represents the independent variable, (:{beta:}_{1}) corresponds to the slope of the line (capturing the rate of change in (:widehat{y}) for each unit variation in (:x)), and (:{beta:}_{0}) refers to the intercept, which reflects the expected value of (:widehat{y}) when (:x) = 0.
The best-fit line is obtained by minimizing the residual sum of squares Errors (SSE):
where the residual represents the difference between the actual observed values and the corresponding predictions. To determine the best-fit line, the Least Squares method is commonly employed, minimizing squared deviations between the estimated line and observed data, thereby ensuring the most accurate approximation.
For multiple linear regression, where (:k) independent variables are considered, the hypothesis function is generalized as:
In this expression, (:h(x)) denotes the predicted value of the dependent variable, (:{beta:}_{0}) is the intercept term, and (:{beta:}_{1}),(::{beta:}_{2}), …, (:{beta:}_{k}) are regression coefficients capturing the contributions of each independent variable.
Ridge Regression, also known as L2 regularization, extends ordinary linear regression by incorporating a penalty term into the optimization objective to mitigate multicollinearity and reduce overfitting. This regularization constrains the magnitude of the regression coefficients, shrinking them toward zero and thereby improving model stability and generalization performance.
The linear regression model is expressed as:
where, y denotes the n×1 vector of observed target values, X represents the n×p matrix of predictor variables, β is the p×1 vector of regression coefficients, and (epsilon) corresponds to the n×1 error vector.
The OLS estimator of β is given by:
Ridge Regression modifies this by introducing a penalty term, where λ ≥ 0 is the ridge regularization parameter and (:I) is the identity matrix:
This shrinks regression coefficients toward zero, producing more stable estimates and mitigating adverse effects of multicollinearity.
Lasso Regression, also referred to as L1 regularization, addresses overfitting by incorporating an L1 penalty into the loss function. This penalty encourages sparsity by forcing certain regression coefficients to become exactly zero, thereby enabling automatic feature selection.
The Lasso loss function is defined as:
where, the first component, (sumnolimits_{i=1}^{m}{left({y}_{i}-{hat y}_{i}right)}^{2}), corresponds to the residual sum of squares, which quantifies the prediction error, while the second component, (:lambda::{sum:}_{j=1}^{n}left|{beta:}_{j}right|), represents the L1 penalty that promotes sparsity within the coefficient vector β. The parameter λ ≥ 0 acts as a regularization factor, governing the intensity of the penalty and thereby influencing the degree of shrinkage applied to the coefficients.
Elastic Net Regression combines the advantages of both Lasso (L1) and Ridge (L2) regularization, providing a balance between feature selection and multicollinearity reduction. This dual-penalty framework is particularly effective when predictor variables exhibit strong correlations.
The Elastic Net optimization objective is defined as:
In this formulation, the first component corresponds to the residual sum of squares, which captures the prediction error. The second component represents the L1 penalty inherited from Lasso, promoting sparsity in the coefficient vector and effectively selecting the most relevant features. The third component introduces the L2 penalty from Ridge, which mitigates the effects of multicollinearity by stabilizing coefficient estimates. The parameters that are associated with these terms, (:{lambda:}_{1}:)and (:{lambda:}_{2}), regulate the relative influence of the L1 and L2 penalties, thereby controlling the balance between feature selection and model stability.
The proposed models were evaluated based on two primary criteria: predictive accuracy and computational efficiency, both of which are essential for practical deployment in real-time PV forecasting systems. Predictive performance was assessed using a comprehensive set of evaluation metrics, including MAE, RMSE, nMAE, nRMSE, R², WAPE, SMAPE, and skill score. Collectively, these metrics capture different aspects of forecasting performance, including accuracy, robustness, error distribution, and explanatory capability under varying operating conditions.
In parallel, computational efficiency was evaluated by measuring the time required for hyperparameter tuning, model training, and testing at forecasting intervals of 5, 10, and 15 min. This analysis provides insight into the scalability and practical suitability of the models for large-scale or time-sensitive forecasting applications. The corresponding computational times are summarized in Table 5.
To improve robustness, a daytime filtering criterion was applied by selecting observations where Active_Power exceeded 10 kW, thereby ensuring that evaluation focused primarily on periods of substantial PV generation. Model validation employed a walk-forward strategy using 5-fold TimeSeriesSplit cross-validation to preserve chronological order while systematically assessing model stability across successive temporal segments.
Additional stress-testing experiments were conducted under challenging operating conditions, including night-time periods characterized by Active_Power values at or below 10 kW and high-variability intervals identified using a rolling standard deviation (SD) exceeding the 75th percentile threshold. Performance evaluation was conducted across training, validation, testing, and seasonal subsets to examine forecasting behavior under diverse temporal and environmental conditions.
The evaluation metrics are defined as follows:
MSE: This metric calculates the average of the squared deviations between actual (:{(y}_{i})) and predicted values (:left({widehat{text{y}}}_{i}right)), giving greater weight to larger errors. A lower MSE indicates better accuracy, with an optimal score of zero representing a perfect fit.
RMSE: Defined as the square root of the MSE, this metric expresses prediction error in the same units as the target variable. By construction, it places greater emphasis on larger deviations, making it particularly sensitive to significant errors.
MAE: This metric represents the average of the absolute differences between predicted and actual values, providing a straightforward measure of forecasting accuracy. Its interpretability and scale dependence make it widely applicable across domains, as it expresses the typical magnitude of error in the same units as the target variable.
R²: This metric quantifies the proportion of variance in the dependent variable that is accounted for by the model. Its value ranges between 0, indicating no explanatory power, and 1, representing a perfect fit.
where, (:stackrel{-}{{y}_{i}}) is the mean of observed values.
nMAE and nRMSE: These represent scaled versions of MAE and RMSE, respectively, where the errors are adjusted relative to a reference value such as the mean or the range of the target variable. Normalization provides a dimensionless measure of error, enabling comparability across datasets with different scales. The exact definition depends on the chosen normalization criterion, e.g.,
WAPE: This metric evaluates forecasting accuracy by weighing absolute percentage errors according to the magnitude of the actual values. It is particularly useful for datasets with imbalanced distributions, as it reduces bias introduced by small target values.
SMAPE: This metric is a modified form of MAPE that treats overestimation and underestimation more evenly, thereby reducing bias in percentage-based error evaluation.
Skill Score: This indicator evaluates the relative improvement of a model compared to a predefined baseline (such as a persistence forecast), calculated as Skill Score = (:1-frac{{RMSE}_{model}}{{RMSE}_{baseline}}:). A positive score signifies superior performance over the reference method. The formula follows the standard definition widely used in solar forecasting literature47. The persistence baseline was chosen as a fixed lag of 288-time steps, which corresponds to 1 day at 5 min, 2 days at 10 min, and 3 days at 15 min resolution. This baseline is deliberately weak, it does not account for diurnal patterns or weather changes – ensuring that high skill scores (e.g., 97–98%) reflect genuine predictive ability rather than an overly easy benchmark.
Together, these performance metrics, when integrated with the described validation procedures and stress-testing strategies, establish a comprehensive and robust framework for assessing model effectiveness under a wide range of PV forecasting conditions.
This section presents a comprehensive evaluation of the proposed forecasting models, XGBoost, Decision Tree, Linear Regression, Ridge Regression, Lasso Regression, and Elastic Net, against a persistence baseline for short-term PV prediction at 5-minute, 10-minute, and 15-minute horizons. The evaluation covers training, validation, testing, and seasonal subsets (Spring, Summer, Autumn, Winter), and incorporates walk-forward validation for temporal stability, stress testing for robustness under night-time and high-variability conditions, and computational efficiency analysis.
Performance is assessed using multiple metrics, including MAE, RMSE, nMAE, nRMSE, R², WAPE, SMAPE (restricted to daytime conditions where Active_Power > 10 kW), and Skill Score (percentage RMSE improvement over persistence). Visual tools such as time-series plots and scatter plots are used to support interpretation.
Overall, the discussion focuses on performance trends across forecasting horizons and seasons, highlights the superiority of tree-based models, evaluates computational trade-offs, and discusses limitations relevant to real-world grid integration.
Among the evaluation metrics, SMAPE and WAPE provide complementary perspectives beyond MAE and RMSE. While MAE and RMSE quantify absolute errors in kilowatts and are scale-dependent, SMAPE provides a scale-independent percentage error that treats over- and under-predictions symmetrically. This makes it particularly suitable for comparing performance across different PV systems and seasonal conditions.
In contrast, WAPE weights absolute errors by actual power magnitude, reducing distortion from low-generation periods such as nighttime or cloudy conditions. As shown in Tables 6, 7, 8 and 9, tree-based models consistently achieve lower SMAPE (approximately 0.6–1.4%) and WAPE (approximately 0.6–0.8%) compared to linear models, which reach SMAPE values up to 5.5% and WAPE up to 3.8%. These results confirm that tree-based approaches maintain high relative accuracy across varying operating conditions.
The analysis of Figs. 5 and 6 underscores the dominance of tree-based models, with Decision Tree and XGBoost excelling in capturing rapid PV power fluctuations over a 3-day horizon. Decision Tree’s MAE of 0.3583 kW and XGBoost’s R² of 0.9997 (Table 6) highlight their prowess in modeling nonlinear dynamics, critical for grid reliability under dynamic solar conditions. Linear models (e.g., Linear Regression’s RMSE of 3.7901 kW) and persistence (RMSE = 33.5494 kW) lag significantly, revealing their limitations in handling short-term variability. Linear models performed significantly worse than tree-based methods due to the inherently non-linear relationship between PV power and weather variables, the presence of complex feature interactions, and the abrupt regime shifts (e.g., cloud cover) that characterise solar generation. Linear models with lag features can capture smooth trends but fail to model the threshold-like behaviour needed for high-variability and stress conditions, as evidenced by their high RMSE. These results underscore that linear regressors are unsuitable for sub-hourly PV forecasting under dynamic weather. The robustness of tree-based models (Decision Tree and XGBoost) stems from three key factors. First, the autoregressive features (Active_Power_lag_1, lag_12, lag_288) encode historical dependencies that tree-based models exploit through threshold-based splits, allowing them to capture abrupt changes (e.g., cloud cover) that linear models smooth out. Second, meteorological variables (e.g., Pyranometer_1, Wind_Speed, Temperature_Probe_1) interact non-linearly with PV output – for example, a combined effect of high irradiance and moderate temperature on module efficiency. Tree models naturally partition such interactions without requiring manually engineered cross-terms. Third, the regression algorithm itself – recursive binary partitioning – enables decision trees to approximate step-wise, non-monotonic functions, which is essential for handling rapid regime shifts (sunrise/sunset, passing clouds). In contrast, linear regressors assume a constant rate of change, leading to large errors during stress conditions. Scatter plots across all horizons (Figs. 6, 8, 10) consistently show tree-based predictions clustering tightly around actual values, while linear approaches exhibit greater dispersion, a pattern reinforced here at 5 min. The high skill scores (96–98%) reflect the combination of a weak persistence baseline (RMSE ≈ 45 kW) and the excellent predictive ability of tree-based models, which capture the strong daily pattern of PV generation. Similar skill levels have been reported in other high-resolution forecasting studies48,49.
Combined Prediction vs. Actual (3 Days) – 5-minute Interval.
All Models Actual vs. Predicted – 5-minute Interval.
At the 10-minute horizon (Figs. 7 and 8), tree-based models maintain superior performance as temporal aggregation reduces short-term variability. Decision Tree achieves a MAE of 0.3423 kW, while XGBoost reaches an R² of 0.9999 (Table 7), indicating highly stable predictive performance.
Although the performance gap between tree-based and linear models slightly narrows due to smoothing effects at longer intervals, tree-based methods remain consistently superior. The persistence model (RMSE = 36.3667 kW) continues to serve as a weak baseline.
Combined Prediction vs. Actual (3 Days) – 10-minute Interval.
All Models Actual vs. Predicted – 10-minute Interval.
On the 15-minute horizon (Figs. 9 and 10), tree-based models continue to dominate. Decision Tree achieves a MAE of 0.3313 kW and a Skill Score of 97.8881% in summer, while XGBoost maintains an R² of 0.9999 (Table 8). Although temporal smoothing reduces variability, tree-based models consistently outperform linear models (e.g., Linear Regression RMSE = 2.4811 kW) and persistence (RMSE = 37.4153 kW), which exhibits the poorest performance. These results confirm that tree-based methods remain robust even under reduced temporal resolution, making them suitable for operational forecasting applications.
Combined Prediction vs. Actual (3 Days) – 15-minute Interval.
All Models Actual vs. Predicted – 15-minute Interval.
Seasonal evaluation across spring, summer, autumn, and winter reveals the influence of climatic variability on forecasting performance. Decision Tree performs best under stable summer conditions (MAE = 0.3349 kW, R² = 0.9999 at 15 min), while XGBoost demonstrates strong consistency across all seasons (R² ≈ 0.9996 in autumn). Linear models perform poorly during highly variable seasons such as autumn and winter (e.g., Linear Regression RMSE = 5.4889 kW at 5 min). Persistence remains the weakest baseline across all seasonal conditions (WAPE up to 29.65%). These results (Figs. 11, 12 and 13) confirm that nonlinear and adaptive models are essential for handling seasonal variability in PV forecasting.
Seasonal Actual vs. Predicted – 5-min (All 6 Models, All Seasons).
Seasonal Actual vs. Predicted – 10-min (All 6 Models, All Seasons).
Seasonal Actual vs. Predicted – 15-min (All 6 Models, All Seasons).
The Stress Test Performance Metrics are designed to assess the robustness and reliability of PV forecasting models when exposed to challenging conditions that fall outside typical operational scenarios. These tests specifically evaluate model performance during night-time intervals, defined as periods with Active_Power ≤ 10 kW when PV generation is negligible, and during high-variability intervals, identified by a rolling SD greater than the 75th percentile, which captures rapid fluctuations in power output often caused by weather dynamics or other transient factors.
Table 9 summarizes the evaluation results across all forecasting intervals, reporting performance indicators such as MAE, RMSE, nMAE, nRMSE, R2, and WAPE. These metrics were computed by applying targeted masks to the test dataset and then quantifying the discrepancies between predicted and actual Active_Power values. MAE and RMSE capture the magnitude of forecasting errors in absolute and squared terms, while their normalized counterparts (nMAE, nRMSE) provide scale-independent insight relative to mean power values. The coefficient of determination (R2) reflects the proportion of variance explained by the models, and WAPE (%) quantifies percentage errors weighted by the magnitude of observed values.
Furthermore, the results reveal that Decision Tree regression demonstrates superior performance in night-time scenarios (e.g., MAE = 0.1204 at the 5-minute horizon), suggesting greater resilience under low-power conditions. Conversely, XGBoost outperforms other models under high-variability scenarios (e.g.,R2 = 0.9979 at 5-minute intervals) due to its enhanced capacity to capture nonlinear dependencies and complex fluctuations. By contrast, larger error values observed for Linear Regression in high-variability contexts (e.g., MAE = 2.5058 and RMSE = 7.4835) underscore the difficulty of modeling rapid transients using simpler linear formulations, highlighting the importance of model-specific optimizations in such settings.
The Walk-Forward Validation Metrics are employed to evaluate the generalization ability and stability of PV forecasting models under conditions that closely resemble real-time deployment, ensuring their reliability in handling sequential time-series data. This validation strategy applies to a walk-forward approach using TimeSeriesSplit with five folds, where the models are iteratively trained on historical observations and validated on subsequent future segments while maintaining chronological order. In each fold, the training window expands over time, and validation occurs on the next contiguous temporal block, thereby simulating an operational forecasting environment.
Table 10 reports the results of this procedure in terms of RMSE, MAE, R2, nMAE, nRMSE, and WAPE, presented as mean values with corresponding ± SD calculated across folds. This provides not only an overall measure of performance but also an indication of variability and robustness. The evaluation is carried out by computing the designated metrics for each fold, based on the differences between predicted and actual Active_Power values, followed by averaging the results across all folds and quantifying the dispersion through SDs.
The findings highlight that Decision Tree regression achieved the lowest MAE at the 5-minute resolution (0.4417 ± 0.1188 kW), demonstrating consistent accuracy across folds. Similarly, XGBoost maintained exceptionally high explanatory power, with R2 = 0.9997 ± 0.0001, pointing to its strong reliability in capturing variance. The SD values further illustrate stability across folds, for example, XGBoost’s RMSE of 0.3212 at the 5-minute horizon signals robustness, whereas linear models exhibited greater variability such as Linear Regression with an RMSE SD of 0.8197. This comparative analysis confirms that walk-forward validation offers a reliable assessment framework, highlighting models that generalize effectively to sequential PV forecasting tasks while revealing sensitivities to data shifts.
Tables 6, 7, 8, 9 and 10 reveal Decision Tree’s lowest MAE (0.3313 at 15-minute training) and highest Skill Scores (up to 97.2388%), driven by its nonlinear capture of PV patterns. XGBoost balances robust generalization (R² ≈ 0.9997) with high computational cost (231.09 s tuning). Linear models are efficient (0.13 s tuning) but error-prone (e.g., RMSE = 5.4889 in autumn), and persistence (WAPE = 19.8290%) lag.
This study conducted a comprehensive benchmarking of six machine learning models for short-term PV power forecasting using sub-hourly field data resampled at 5-, 10-, and 15-minute intervals. The Decision Tree model consistently outperformed all others, achieving a minimum Mean Absolute Error (MAE) of 0.33 kW at the 15-minute resolution and maximum skill scores up to 98% across all temporal horizons. XGBoost also performed robustly, with an MAE within 0.35–0.55 kW and skill scores above 95%. In contrast, linear regression-based models exhibited higher errors (MAE > 0.65 kW) and lower skill, reflecting their limited capacity to capture nonlinear dynamics in PV generation. The proposed preprocessing pipeline, featuring temporal cyclical encoding and targeted lagged features, proved effective in enhancing prediction accuracy and model stability. Stress tests under night-time and high-variability irradiance intervals demonstrated that tree-based models retained high predictive skill (above 90%), highlighting their robustness in challenging scenarios.
A limitation of this study is that all experiments were conducted on a single PV site (Yulara, Australia); therefore, the generalizability of the findings to other geographical locations, climatic conditions, and plant capacities remains to be further validated. Overall, these quantitative results suggest that advanced tree-based machine learning approaches can offer advantages in accuracy, resilience, and operational relevance for integrating PV power into grids like the one studied.
To advance the field and address the study’s limitations, several avenues for future research are proposed. Future research should validate the proposed models on multiple PV plants located in different climatic zones and with varying installed capacities, to assess the transferability of the tree-based benchmarking results. Additionally, more advanced hyperparameter optimisation techniques, such as Bayesian optimisation, could be explored to further improve model efficiency and performance, especially when scaling larger datasets or more complex architectures. Expanding the methodological framework to incorporate hybrid and DL models, including attention-based architectures and ensemble strategies, may further enhance prediction accuracy and generalisation across sites. In particular, the hybrid and DL models mentioned in the Introduction (LSTM, BiLSTM, transformers) will be systematically benchmarked in future work to compare their performance against the tree-based baselines established here. Integrating real-time meteorological data and broader geospatial datasets will support the development of transferable forecasting systems and enable more granular adaptation to local climatic variability. Additionally, extending forecast horizons beyond 15 min, and applying probabilistic and uncertainty quantification techniques, can improve the operational value of forecasts for grid operators. Finally, focusing on the interpretability and transparency of machine learning models and investigating their real-time deployment and computational scalability in utility environments will be pivotal for wider adoption in smart grid and renewable energy applications.
The datasets used in this study, consisting of measurements from the Yulara Solar System, can be downloaded directly from the DKASC portal. They are publicly and freely available.Yulara Solar System: https://dkasolarcentre.com.au/download? location=yulara.
Autoregressive integrated moving average
Bidirectional long short-term memory
Convolutional long short-term memory
Convolutional neural network
Deep learning
Empirical mode decomposition
Gated recurrent unit
Interquartile range
Least absolute shrinkage and selection operator
Light gradient boosting machine
Long short-term memory
Mean absolute error
Machine learning
Mean squared error
Normalized mean absolute error
Normalized root mean squared error
Photovoltaic
Quantile regression neural network
Coefficient of determination
Root mean squared error
Standard deviation
Symmetric mean absolute percentage error
Sum of squared errors
Support vector regression
Variational mode decomposition
Weighted absolute percentage error
Wavelet packet decomposition
eXtreme gradient boosting
Gielen, D. et al. The role of renewable energy in the global energy transformation. Energy Strateg Rev. 24, 38–50. (2018). https://doi.org/10.1016/j.esr.2019.01.006
Article Google Scholar
Li, P. et al. Deep learning model for solar and wind energy forecasting considering Northwest China as an example. Results Eng. 24, 102939. https://doi.org/10.1016/j.rineng.2024.102939 (2024).
Article Google Scholar
Krishna, R. & Hemamalini, S. Improved TLBO algorithm for optimal energy management in a hybrid microgrid with support vector machine-based forecasting of uncertain parameters. Results Eng. 24, 102992. https://doi.org/10.1016/j.rineng.2024.102992 (2024). September.
Article Google Scholar
Smaili, I. H., Almalawi, D. R., Shaheen, A. M. & Mansour, H. S. E. Optimizing PV sources and shunt capacitors for energy efficiency improvement in distribution systems using subtraction-average algorithm. Mathematics https://doi.org/10.3390/math12050625 (2024).
Article Google Scholar
Fathima, A. H. & Palanisamy, K. Optimization in microgrids with hybrid energy systems – A review. Renew. Sustain. Energy Rev. 45, 431–446. https://doi.org/10.1016/j.rser.2015.01.059 (2015).
Article Google Scholar
Cui, X., Wu, X., Hu, Y., Jia, F., Meng, H., Chen, Y. and Shen, J., Smart Community Microgrid Energy Planning Strategy Based on Improved Grey Wolf Algorithm. In 2026 2nd International Conference on Electrical Automation and Artificial Intelligence (ICEAAI) (pp. 436-443). IEEE. (2026) https://doi.org/10.1109/ICEAAI68945.2026.11442504 (2026).
Article Google Scholar
Amato, G. et al. A method for practical stability region evaluation and a preventive control strategy for microgrids. IEEE Trans. Ind. Appl. 61 (3), 4890–4897. (2025). https://doi.org/10.1109/TIA.2025.3536431 (2025).
Article ADS Google Scholar
Ding, S., Li, R. & Tao, Z. A novel adaptive discrete grey model with time-varying parameters for long-term photovoltaic power generation forecasting. Energy Convers. Manag. 227, 113644. (2020). https://doi.org/10.1016/j.enconman.2020.113644
Article Google Scholar
Atiea, M. A., Shaheen, A. M., Alassaf, A. & Alsaleh, I. Enhanced solar power prediction models with integrating meteorological data toward sustainable energy forecasting, Int. J. Energy Res. (2024). https://doi.org/10.1155/er/8022398
Article Google Scholar
Hou, Z., Zhang, Y., Liu, Q. & Ye, X. A hybrid machine learning forecasting model for photovoltaic power. Energy Rep. 11, 5125–5138. (2024). https://doi.org/10.1016/j.egyr.2024.04.065
Article Google Scholar
Hassan, M. A., Bailek, N., Bouchouicha, K. & Nwokolo, S. C. Ultra-short-term exogenous forecasting of photovoltaic power production using genetically optimized non-linear auto-regressive recurrent neural networks. Renew. Energy. 171, 191–209. https://doi.org/10.1016/j.renene.2021.02.103 (2021).
Article Google Scholar
Bouzerdoum, M., Mellit, A. & Massi Pavan A. A hybrid model (SARIMA-SVM) for short-term power forecasting of a small-scale grid-connected photovoltaic plant. Sol Energy. 98, 226–235. (2013). https://doi.org/10.1016/j.solener.2013.10.002
Article ADS Google Scholar
Xun, T. et al. Photovoltaic power forecasting method based on adaptive classification strategy and HO-SVR algorithm. Energy Rep. 6, 921–928. https://doi.org/10.1016/j.egyr.2020.11.108 (2020).
Article Google Scholar
Ye, J., Zhao, B. & Deng, H. Photovoltaic Power Prediction Model Using Pre-train and Fine-tune Paradigm Based on LightGBM and XGBoost. Procedia Comput. Sci. 224, 407–412. https://doi.org/10.1016/j.procs.2023.09.056 (2023).
Article Google Scholar
Huang, X. et al. Time series forecasting for hourly photovoltaic power using conditional generative adversarial network and Bi-LSTM, Sci. Direct. (2022). https://doi.org/10.1016/j.energy.2022.123403
Article Google Scholar
Wang, F. et al. A day-ahead PV power forecasting method based on LSTM-RNN model and time correlation modification under partial daily pattern prediction framework. Energy Convers. Manag. 212, 112766. https://doi.org/10.1016/j.enconman.2020.112766 (2020). February.
Article Google Scholar
Li, S., Li, R., Yang, J., Wu, F. & Rashed, G. I. Combined Prediction of Photovoltaic Power Based on Sparrow Search Algorithm Optimized Convolution Long and Short-Term Memory Hybrid Neural Network. Electron 11 (10). https://doi.org/10.3390/electronics11101654 (2022).
Zhang, X. Developing a hybrid probabilistic model for short-term wind speed forecasting. Appl. Intell. 53 (1), 728–745. https://doi.org/10.1007/s10489-022-03644-8 (2023).
Article Google Scholar
Gan, D., Wang, Y., Yang, S. & Kang, C. Embedding based quantile regression neural network for probabilistic load forecasting. J. Mod. Power Syst. Clean. Energy. 6 (2), 244–254. https://doi.org/10.1007/s40565-018-0380-x (2018).
Article Google Scholar
Mellit, A., Pavan, A. M. & Lughi, V. Deep learning neural networks for short-term photovoltaic power forecasting. Renew. Energy. 172, 276–288. https://doi.org/10.1016/j.renene.2021.02.166 (2021).
Article Google Scholar
Shaik, M., Yadav, S. K. & Shaik, A. G. An EMD and Decision Tree-Based Protection Algorithm for the Solar PV Integrated Radial Distribution System. IEEE Trans. Ind. Appl. 57 (3), 2168–2177. https://doi.org/10.1109/TIA.2021.3058618 (2021).
Article Google Scholar
Wu, H. et al. Timesnet: Temporal 2D-Variation Modeling for General Time Series Analysis, 11th Int. Conf. Learn. Represent. ICLR 2023, pp. 1–23, (2023).
Ghimire, S., Deo, R. C., Raj, N. & Mi, J. Deep solar radiation forecasting with convolutional neural network and long short-term memory network algorithms. Appl. Energy. 253, 113541. https://doi.org/10.1016/j.apenergy.2019.113541 (2019).
Article Google Scholar
Qu, J., Qian, Z. & Pei, Y. Day-ahead hourly photovoltaic power forecasting using attention-based CNN-LSTM neural network embedded with multiple relevant and target variables prediction pattern. Energy 232, 120996. https://doi.org/10.1016/j.energy.2021.120996 (2021).
Article Google Scholar
Ferkous, K. et al. Optimized solar power forecasting: A multi-decomposition framework using VMD and swarm techniques. AIP Adv. (2025). https://doi.org/10.1063/5.0282210
Article Google Scholar
Ferkous, S. et al. Short-Term Photovoltaic Power Forecasting Using a Hybrid VMD-WPD-LSTM Model with Reconstruction. Arab. J. Sci. Eng. https://doi.org/10.1007/s13369-025-11035-7 (2026).
Article Google Scholar
Ferkous K, Menakh S, Guermoui M, Sekayar S, Laid FZ, Hierarchical frequency-aware transformer for robust short- and multi-horizon photovoltaic power forecasting. Sci. Direct. https://doi.org/10.1016/j.eswa.2026.131624 (2026).
Wang, T. et al. Short-term PV forecasting of multiple scenarios based on multi-dimensional clustering and hybrid transformer-BiLSTM with ECPO. Sci. Direct. https://doi.org/10.1016/j.energy.2025.137654 (2025).
Article Google Scholar
Gairaa, K., Voyant, C., Notton, G., Benkaciali, S. and Guermoui, M., 2022. Contribution of ordinal variables to short-term global solar irradiation forecasting for sites with low variabilities. Renewable Energy, 183, pp.890-902. https://doi.org/10.1016/j.renene.2021.11.028
Guermoui, M., Melgani, F. & Danilo, C. Multi-step ahead forecasting of daily global and direct solar radiation: A review and case study of Ghardaia region. J. Clean. Prod. 201, 716–734. https://doi.org/10.1016/J.JCLEPRO.2018.08.006 (Nov. 2018).
A. R. 2 Mawloud Guermoui 1, Amor Fezzani 1, Zaiani Mohamed 1, 9* Khaled Ferkous 3, Nadjem Bailek 4, Sabrina Bouallit 1, Abdelkader Riche 5, Mohit Bajaj 6, 7, 8, and E. A. 11 & S. S. M. G. 12 Shir Ahmad Dost Mohammadi 10*, An analysis of case studies for advancing photovoltaic power forecasting through multi-scale fusion techniques. https://doi.org/10.1038/s41598-024-57398-z
10 and Ievgen Zaitsev 11 Reski Khelif, 1 Mawloud Guermoui, 1 Abdelaziz Rabehi, 2 Ayoub Taallah, 3Abdelhalim Zoukel, 4, 5, Sherif, S. M. & Ghoneim, 6 Mohit Bajaj, 7, 8, 9Kareem M. AboRas, Short-Term PV Power Forecasting Using a Hybrid TVF-EMD-ELM Strategy. https://doi.org/10.1155/2023/6413716
Fan, Y., Ma, Z., Tang, W., Liang, J. & Xu, P. Using Crested Porcupine Optimizer Algorithm and CNN-LSTM-Attention Model Combined with Deep Learning Methods to Enhance Short-Term Power Forecasting in PV Generation. Energies 17 (14). https://doi.org/10.3390/en17143435 (2024).
Jacques, R. J. et al. Advancing short-term solar irradiance forecasting accuracy through a hybrid deep learning approach with Bayesian optimization. Results Eng. 23, 102461. https://doi.org/10.1016/J.RINENG.2024.102461 (Sep. 2024).
Suanpang, P., Suanpang, P., orgGoogl., S., Scholar & andPitchaya, O. J. 2 1, Machine Learning Models for Solar Power Generation Forecasting in Microgrid Application Implications for Smart Cities, https://doi.org/10.3390/su16146087
Ahmad, M. A. A. I. Solar power generation forecasting using ensemble approach based on deep learning and statistical methods, https://doi.org/10.1016/j.aci.2019.11.002
J. M. 2 Ayesha Nadeem 1,†, Muhammad Farhan Hanif 2, 3,†,, , Muhammad Sabir Naveed 2, Muhammad Tahir Hassan 3, Mustabshirha Gul 3, Naveed Husnain 3, AI-Driven precision in solar forecasting: Breakthroughs in machine learning and deep learning, vol. 10, no. 4. https://doi.org/10.3934/geosci.2024035
Abisoye, B. O., Sun, Y. & Zenghui, W. A survey of artificial intelligence methods for renewable energy forecasting: Methodologies and insights. Renew. Energy Focus. 48, 100529. (2024). https://doi.org/10.1016/J.REF.2023.100529
Article Google Scholar
Huang, X. et al. Time series forecasting for hourly photovoltaic power using conditional generative adversarial network and Bi-LSTM. Energy 246, 123403. (2022). https://doi.org/10.1016/J.ENERGY.2022.123403
Article Google Scholar
Author, S. F., Logo, O., Boland, J. & Logo, O. Time series model for real-time forecasting of Australian photovoltaic solar farms power output, https://doi.org/10.1063/5.0050621
Wu, X. et al. Short-term PV prediction using multiperiod similar days and TimeGAN-inception. Int. J. Electr. Power Energy Syst. (2025) https://doi.org/10.1016/j.ijepes.2025.111287
Article Google Scholar
Guo, S., Fan, H. & Huang, J. Ultra-short-term PV power prediction based on an improved hybrid model with sky image features and data two-dimensional purification. Energy 331, 137018. (2024). https://doi.org/10.1016/j.energy.2025.137018
Article Google Scholar
de Silva, J. L., de Paula, M. V., de Barros, J. & Barros, T. A. D. S. Anomaly detection workflow using random forest regressor in large-scale photovoltaic power plants. IEEE Access. 13, 54168–54176. (2025). https://doi.org/10.1109/ACCESS.2025.3554162
Article Google Scholar
Xu, C. et al. Enhancing Reinforcement Learning-Based Energy Management Through Transfer Learning With Load and PV Forecasting. IEEE Access. 13, 43956–43972. (2025). https://doi.org/10.1109/ACCESS.2025.3548990
Article Google Scholar
Vijay Babu, A. R. et al. Solar Energy Forecasting Using Machine Learning Techniques for Enhanced Grid Stability. IEEE Access. 13, 93735–93754. https://doi.org/10.1109/ACCESS.2025.3574093 (2025).
Article Google Scholar
Alwesabi, K. et al. A novel dual-attention Bi-GRU model optimized by hybrid arithmetic fire hawk optimizer for a precise predictive modeling of solar power generation. Comput. Electr. Eng. https://doi.org/10.1016/j.compeleceng.2025.110818 (2026).
Article Google Scholar
Liu, B. et al. Predictability and forecast skill of solar irradiance over the contiguous United States. Renew. Sustain. Energy Rev. https://doi.org/10.1016/j.rser.2023.113359 (2023).
Article PubMed PubMed Central Google Scholar
Gong, J., Qu, Z., Zhu, Z. & Xu, H. Parallel TimesNet-BiLSTM model for ultra-short-term photovoltaic power forecasting using STL decomposition and auto-tuning. Energy 320, 135286. https://doi.org/10.1016/j.energy.2025.135286 (2025).
Article Google Scholar
Huang, X. et al. Time series forecasting for hourly photovoltaic power using conditional generative adversarial network and Bi-LSTM. Energy 246, 123403. https://doi.org/10.1016/j.energy.2022.123403 (2022).
Article Google Scholar
Download references
This work was supported by the Ongoing Researchers Funding Program, King Saud University, Riyadh, Saudi Arabia, under Grant ORF-2026-337.
Electrical Engineering Department, Faculty of Engineering, Suez Canal University, Ismailia, 41522, Egypt
Hany S. E. Mansour & Amira S. Mohamed
Egypt-Japan KOSEN (EJ-KOSEN) Institute, 10th of Ramadan City 44629, Egypt
Hany S. E. Mansour & Hany S. Elnashar
Electrical Engineering Department, College of Engineering, Imam Mohammad Ibn Saud Islamic University (IMSIU), Riyadh, 11432, Saudi Arabia
Hassan M. Hussein Farh
College of Electrical and Information Engineering, Hunan University, Changsha, 410083, China
AL-Wesabi Ibrahim
Electrical Engineering Department, College of Engineering, King Saud University, Riyadh, 11421, Saudi Arabia
Abdullah M. Al-Shaalan
Department of Electrical, Telecommunications and Computer Engineering, Kampala International University, Ishaka, Bushenyi, Uganda
Abdullahi Bala Kunya
Faculty of Computers and Artificial Intelligence, Beni-Suef University, Benisuef, 62511, Egypt
Hany S. Elnashar
PubMed Google Scholar
PubMed Google Scholar
PubMed Google Scholar
PubMed Google Scholar
PubMed Google Scholar
PubMed Google Scholar
PubMed Google Scholar
Hany S. E. Mansour and Amira S. Mohamed: conceptualization, software, methodology, writing – original draft preparation. Hassan M. Hussein Farh, AL-Wesabi Ibrahim and Abdullahi Bala Kunya: investigation, conceptualization, methodology, writing – review and editing, software, data curation, visualization. Abdullah M. Al-Shaalan: project administration, data curation, visualization and validation. Hany S. Elnashar: methodology, writing – original draft preparation, supervision. All authors have read and agreed to the published version of the manuscript.
Correspondence to Hany S. E. Mansour or Abdullahi Bala Kunya.
The authors have no competing interests or other interests that might be perceived to influence the results and/or discussion reported in this paper.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Below is the link to the electronic supplementary material.
Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.
Reprints and permissions
Mansour, H.S.E., Mohamed, A.S., Farh, H.M.H. et al. High-resolution photovoltaic power forecasting using machine learning models under seasonal and stress conditions. Sci Rep 16, 19029 (2026). https://doi.org/10.1038/s41598-026-56832-8
Download citation
Received:
Accepted:
Published:
Version of record:
DOI: https://doi.org/10.1038/s41598-026-56832-8
Anyone you share the following link with will be able to read this content:
Sorry, a shareable link is not currently available for this article.
Provided by the Springer Nature SharedIt content-sharing initiative
Advertisement
Scientific Reports (Sci Rep)
ISSN 2045-2322 (online)
© 2026 Springer Nature Limited
Sign up for the Nature Briefing: AI and Robotics newsletter — what matters in AI and robotics research, free to your inbox weekly.