Sales forecasting, need to improve accuracy

Tracey · October 7, 2024, 9:54am

I’m having some difficulty with a sales forecasting project and need some help.

Dataset: Weekly sales data; So columns such as Store, Item, Week of Year, Sales. This is the most minimal part of the dataset. I can pull in some features such as store dimensional info, item dimensional info, price, and if it is on sale. The date range is about 150 weeks. About 10 unique items and 1000 unique stores.

Objective: Forecast 1 week out.

My accuracy metric, is 1 - ( sum of absolute errors / sum of actual sales ). I need to achieve an accuracy of at least 0.75.

What I have tried: ARIMA, ETS, xgboost and lightgbm. However, with all these models, I can only achieve an accuracy of 0.35 (with lightgbm). With the ML models, I have tried using tweedie objective, and used a plethora of lagged and rolling features. Most of my data are 0’s, and if they are not 0’s, tend to be smaller numbers (< 10). Making it hard to accurately forecast.

I’m at my wits end and would appreciate any advice.

Gaius · October 7, 2024, 9:56am

I hope my post’s responses are helpful.

My goal was somewhat similar. Hopefully, anything turns out OK.

The group was quite open about using various strategies.

Esther · October 7, 2024, 10:01am

Hey which one worked out for you

Jerry · October 7, 2024, 10:10am

To be honest, rolling average usually has a decent accuracy score. You could play with the number of weeks you look at. Try starting at 4 weeks.

DataDrifter · October 7, 2024, 10:15am

Start by doing a 13 week rolling average. How accurate is that as a predictive measure?

QuantumSleuth · October 7, 2024, 10:17am

This doesn’t sound like a very realistic goal.

The dimensionality on this is huge.

Accurately predicting 1000 individual stores, across 52 weeks, with 10 items, would require immense amounts of data.

My first question would be:

How accurately does your model predict for 10 items, when treating all stores as just one seller, without any features other than “week of year,” 1 through 52?

From there, think of how to engineer features with the data you have.

Pull together some of those features and run individual regressions on each, and determine if they are independent of each other.

Some questions to ask in feature engineering:

Are there additional patterns in the calendar that can be turned into features? Does adding in specific holidays as a feature, for example, improve accuracy?
Do some of the stores act more like some stores than others, such that you could chunk them into several groups? When you do, can you achieve higher accuracy by adding those groups as a feature?

And keep going down this path, from a very simple model with very few features, and examining one feature at a time.