nowcasting_benchmark

This repository is an accompaniment to an article (available here or here) benchmarking common nowcasting and machine learning methodologies. It illustrates how to estimate each of the methods examined in the analysis in either R or Python. 17 methodologies were tested in nowcasting quarterly US GDP using data from Federal Reserve Economic Data (FRED). The variables chosen were those specified in Bok et al. (2018). The methodologies were tested on a period dating from Q1 2002 to Q3 2022.

In applied nowcasting exercises, ideally several methodologies should be employed and their results compared empirically for final model selection. In practice, this is difficult due to the fragmented landscape of different nowcasting methodology frameworks and implementations. This repository aims to make things significantly easier by providing fully runnable boilerplate code in R or Python for each methodology examined in the benchmarking analysis. The methodologies/ directory contains self-contained Jupyter notebooks illustrating how each methodology can be run in the nowcasting context, with an example using data from FRED and a testing period from 2005 to 2010. This shorter window was chosen to reduce runtime for illustration; users can select their own testing periods if desired.

All the notebooks assume initial input data is in the format of seasonally adjusted growth rates in the highest frequency of the data (monthly in this case), with a date column at the beginning and a separate column for each variable. Lower-frequency data should have their values listed in the final month of that period (e.g. December for yearly data; March, June, September, or December for quarterly data), with NAs in the intervening periods and for missing data at the end of series. An example is below.
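A minimal sketch of this format in pandas, with hypothetical column names and frequency codes (the exact metadata conventions follow the notebooks, not this snippet):

```python
import numpy as np
import pandas as pd

# Illustrative input format: monthly rows, a date column first, then one
# column per variable. The quarterly target ("gdp" here) only has values
# in the final month of each quarter; all other months are NA.
data = pd.DataFrame({
    "date": pd.date_range("2002-01-01", periods=6, freq="MS"),
    "ind_production": [0.2, -0.1, 0.4, 0.3, 0.1, -0.2],  # monthly series
    "gdp": [np.nan, np.nan, 0.8, np.nan, np.nan, 0.6],   # quarterly series
})

# accompanying metadata: one row per series, giving its frequency
metadata = pd.DataFrame({
    "series": ["ind_production", "gdp"],
    "frequency": ["m", "q"],
})
```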

Also necessary is a metadata CSV listing the name of each series/column and its frequency. Once these two conditions are met, it should be possible to run any of the methodologies on your own dataset, with adjustments as needed using the notebooks as a guide. Below is a short overview of each of the methodologies, followed by graphical results of the full benchmarking analysis showing predictions on different data vintages.

Recommendation for a single methodology

If you only have bandwidth or interest to try out one methodology, the LSTM is recommended. It is accessible, available in four different programming languages (both R and Python notebooks are included in the methodologies/ directory), and straightforward to estimate and generate predictions from given the data format stipulated above. It has shown strong predictive performance relative to the other methodologies, including during shock conditions, and, unlike some of the other methodologies, will not fail to estimate on certain datasets. It does, however, have hyperparameters that may need to be tuned if initial performance is not good. This can be done from within the nowcast_lstm library via the hyperparameter_tuning function. It is also the only implementation with built-in functionality for variable selection. In this analysis, input variables were taken as given, but often this is not the case; the variable_selection function will select the best-performing variables from a pool of candidate variables. Hyperparameter tuning and variable selection can also be performed together with the select_model function. See the documentation of the nowcast_lstm library for more information.

Methodologies

  • ARMA (model_arma.ipynb):
    • background: Wikipedia
    • language, library, and function: Python, ARIMA function of the pmdarima library
    • commentary: Univariate benchmark model. Acceptable performance in normal/non-volatile times, extremely limited use during any shock periods. Potential use as another way to fill "ragged-edge" missing data for component series in other methodologies, as opposed to mean-filling.
  • Bayesian mixed-frequency vector autoregression (model_bvar.ipynb):
    • background: Wikipedia, ECB working paper
    • language, library, and function: R, estimate_mfbvar function of mfbvar library
    • commentary: Difficult to get data into the proper format for the function to estimate properly, making dynamic/programmatic changing and selection of variables, and overall usage, hard but doable. Very performant methodology in this benchmarking analysis, ranking second-best in terms of RMSE and best in terms of MAE. However, predictions were very volatile, with the highest month-to-month revisions in predictions on average. It may also produce occasional large outlier predictions or fail to estimate on a dataset due to convergence or other issues. This library/implementation cannot handle yearly variables.
  • Decision tree (model_dt.ipynb):
    • background: Wikipedia
    • language, library, and function: Python, DecisionTreeRegressor function of the sklearn library
    • commentary: Simple methodology, not traditionally used in nowcasting. Doesn't handle time series natively; this is addressed by including additional variables for lags. Poor performance in this benchmarking analysis. The tree-based methodologies in this analysis (decision trees, random forest, gradient boosted trees, and XGBoost) learn most of their information from the latest available data, so they have difficulty predicting anything other than the mean in early data vintages. See model_gb.ipynb for a means of addressing this. They also have difficulty predicting values more extreme than any they have seen before, limiting their use in shock periods, e.g. during the COVID crisis. Has hyperparameters which may need to be tuned.
  • DeepVAR (model_deepvar.ipynb):
    • background: Working paper
    • language, library, and function: Python, DeepVAREstimator function of GluonTS library
    • commentary: Originally developed as a forecasting tool with the business context in mind. Generates probabilistic forecasts via an autoregressive recurrent neural network (RNN). Its performance in this analysis was poor, akin to that of an improved ARMA model. Depending on hyperparameters, it is slow to estimate compared with the other methodologies in this analysis. Its syntax is also complicated to get working for the nowcasting context.
  • Dynamic factor model (DFM) (model_dfm.ipynb):
    • background: Wikipedia, FRB NY paper, UNCTAD research paper
    • language, library, and function: R, dfm function of nowcastDFM library
    • commentary: De facto standard in nowcasting, very commonly used. Middling performance in this analysis. May require assigning variables to different "blocks" or groups, which can be an added complication. In this benchmarking analysis, the DFM without blocks (equivalent to one "global" block/factor) performed worse than the model with the blocks specified by the Fed. The model also fails to estimate on many datasets due to uninvertible matrices. Estimation may take a long time depending on convergence of the expectation-maximization algorithm, and estimating models with more than 20 variables can be very slow. This library/implementation cannot handle yearly variables.
  • Elastic net (model_elasticnet.ipynb):
    • background: Wikipedia
    • language, library, and function: Python, ElasticNet function of sklearn library
    • commentary: OLS with the introduction of L1 and L2 regularization penalty terms. Can potentially help with the multicollinearity issues of OLS in the nowcasting context. Performance is expectedly better than that of OLS in this benchmarking analysis, with less volatile predictions. Overall it was the best performer amongst the methodologies that do not natively handle time series. Introduces the alpha and L1-ratio hyperparameters, which need to be tuned.
  • Gradient boosted trees (model_gb.ipynb):
    • background: Wikipedia
    • language, library, and function: Python, GradientBoostingRegressor function of the sklearn library
    • commentary: Very performant model in traditional machine learning applications. Doesn't handle time series natively; this is addressed by including additional variables for lags. Poor performance in this benchmarking analysis. However, performance can be substantially improved by training separate models for different data vintages; details are in the model_gb.ipynb example file. This approach can be applied to any of the methodologies that don't handle time series (OLS, random forest, etc.), but it had the biggest positive impact in this benchmarking analysis for gradient boosted trees. Has hyperparameters which may need to be tuned.
  • Lasso (model_lasso.ipynb):
    • background: Wikipedia
    • language, library, and function: Python, Lasso function of sklearn library
    • commentary: OLS with introduction of L1 regularization penalty term. Can potentially help with multicollinearity issues of OLS in the nowcasting context. Performance is expectedly better than that of OLS in this benchmarking analysis, with less volatile predictions. Overall it was better than ridge regression, but worse than elastic net. Introduces the Lasso alpha hyperparameter which needs to be tuned.
  • Long short-term memory neural network (LSTM) (model_lstm.ipynb):
    • background: Wikipedia, first article, second article
    • language, library, and function: Python or R, LSTM function of the nowcast_lstm library (implementations also exist in MATLAB and Julia)
    • commentary: Very performant model, best performer in terms of RMSE, second-best in terms of MAE. Able to handle any frequency of data in either target or explanatory variables, easiest data setup process of any implementation in this benchmarking analysis. Couples high predictive performance with relatively low volatility, e.g. in contrast with Bayesian VAR, which also has good predictive performance, but is quite volatile. Can handle an arbitrarily large number of input variables without affecting estimation time and can be estimated on any dataset without error. Has hyperparameters which may need to be tuned.
  • Mixed-frequency vector autoregression (MF-VAR) (model_var.ipynb):
    • background: Wikipedia, Minneapolis Fed paper
    • language, library, and function: Python, VAR function of the PyFlux library
    • commentary: Has been used in nowcasting. Middling performance in this benchmarking analysis. The PyFlux implementation can be difficult to get working and may not run on versions of Python > 3.5.
  • Mixed data sampling regression (MIDAS) (model_midas.ipynb):
    • background: Wikipedia, paper
    • language, library, and function: R, midas_r function of midasr library
    • commentary: Has been used in nowcasting; solid performance in this benchmarking analysis. Difficult data setup process to estimate and generate predictions.
  • Midasml (model_midasml.ipynb):
    • background: Working paper
    • language, library, and function: R, cv.sglfit function of midasml library
    • commentary: Relatively new methodology that builds on MIDAS models by introducing LASSO regularization. Solid performance (third-best) in this analysis. Relatively difficult syntax to get working in the nowcasting context.
  • Multilayer perceptron (feedforward) artificial neural network (model_mlp.ipynb):
    • background: Wikipedia, paper
    • language, library, and function: Python, MLPRegressor function of sklearn library
    • commentary: Has been used in nowcasting; decent performance in this benchmarking analysis. Doesn't handle time series natively; this is addressed by including additional variables for lags. Has hyperparameters which may need to be tuned.
  • Ordinary least squares regression (OLS) (model_ols_ridge.ipynb):
    • background: Wikipedia
    • language, library, and function: Python, LinearRegression function of sklearn library
    • commentary: Extremely popular approach to regression problems. Doesn't handle time series natively; this is addressed by including additional variables for lags. Middling performance in this benchmarking analysis and very volatile; it will also likely suffer from multicollinearity if many variables are included.
  • Random forest (model_rf.ipynb):
    • background: Wikipedia
    • language, library, and function: Python, RandomForestRegressor function of the sklearn library
    • commentary: Popular methodology in classical machine learning, combining the predictions of many random decision trees. Doesn't handle time series natively; this is addressed by including additional variables for lags. Poor performance in this benchmarking analysis. Has hyperparameters which may need to be tuned.
  • Ridge regression (model_ridge.ipynb):
    • background: Wikipedia
    • language, library, and function: Python, Ridge function of the sklearn library
    • commentary: OLS with introduction of L2 regularization penalty term. Can potentially help with multicollinearity issues of OLS in the nowcasting context. Performance is expectedly slightly better than that of OLS in this benchmarking analysis, with less volatile predictions. Introduces the ridge alpha hyperparameter which needs to be tuned.
  • XGBoost (model_xgboost.ipynb):
    • background: Wikipedia
    • language, library, and function: Python, XGBRegressor function of the xgboost library
    • commentary: Similar methodology to gradient boosting, with added regularization and implementation tweaks. Performance is very similar to that of gradient boosted trees.
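Several of the methodologies above (OLS, ridge, lasso, elastic net, decision tree, random forest, gradient boosted trees, XGBoost, MLP) do not handle time series natively; the workaround they share is to add lagged copies of each variable as extra columns. A minimal sketch on hypothetical data, using sklearn's DecisionTreeRegressor and ElasticNet:

```python
import numpy as np
import pandas as pd
from sklearn.linear_model import ElasticNet
from sklearn.tree import DecisionTreeRegressor

rng = np.random.default_rng(0)
# hypothetical monthly indicator and a target it partially drives
df = pd.DataFrame({"x": rng.normal(size=120)})
df["y"] = 0.5 * df["x"] + 0.3 * df["x"].shift(1).fillna(0.0) + rng.normal(0, 0.1, 120)

# time series handled by adding lagged copies of each variable as columns
for lag in (1, 2, 3):
    df[f"x_lag{lag}"] = df["x"].shift(lag)
df = df.dropna()  # drop the first rows, which have incomplete lags

X, y = df.drop(columns="y"), df["y"]
tree = DecisionTreeRegressor(max_depth=4, random_state=0).fit(X, y)
enet = ElasticNet(alpha=0.1, l1_ratio=0.5).fit(X, y)
```

The same lag-augmented `X` can be fed to any of the non-time-series models listed above; only the estimator class changes.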

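The vintage-specific training trick described for gradient boosted trees (model_gb.ipynb) can be sketched roughly as follows: train one model per data-availability pattern, with not-yet-published columns filled at training time the same way they would be at prediction time. The data and vintage definitions here are hypothetical:

```python
import numpy as np
from sklearn.ensemble import GradientBoostingRegressor

rng = np.random.default_rng(0)
X = rng.normal(size=(200, 3))  # three hypothetical indicators
y = X @ np.array([0.5, 0.3, 0.2]) + rng.normal(0, 0.1, 200)

# each vintage corresponds to a set of already-published indicators;
# unavailable columns are mean-filled (zero here, since data are centered)
vintages = {"early": [0], "mid": [0, 1], "late": [0, 1, 2]}

models = {}
for name, avail in vintages.items():
    X_v = np.zeros_like(X)
    X_v[:, avail] = X[:, avail]  # mask out later-arriving columns
    models[name] = GradientBoostingRegressor(random_state=0).fit(X_v, y)

# at prediction time, use the model matching the current vintage
latest = np.zeros((1, 3))
latest[0, 0] = 0.4               # only the first indicator is published
pred = models["early"].predict(latest)
```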
Scatter plots

These plots show MAE and RMSE plotted against average revision (i.e., volatility). The ideal model would be in the lower left corner (low error and low volatility).

Graphical results

These plots show actuals and nowcasts at different time vintages for each methodology, ordered alphabetically.
