What happens if we don't use cross-validation (CV) and just use trainset and test-set within our forecasting on tail of time data?

@JoaquinAmatRodrigo I have a question that I did not use cross-validation (CV) like you have done in _[Data Partition](https://cienciadedatos.net/documentos/py60-probabilistic-forecasting-prediction-intervals-multi-step-forecasting#Data-partition)_ section in recent notebook. I just split dat into train and test-set (without validation-set) and I plotted for your consideration.
![image](https://github.com/user-attachments/assets/0aecfa76-3f95-4135-a884-f101eb6bf9b6)

1. Do you think is it critical and used ML-based  regression models' learning could be suffer from over/under-fiting when i did not consider CV-set?
2. base on the picture do you think I _damaged_ nature of time-data in the plot after pre-processing stage (de-noise filter, fill missing sequences, detect and replaced global outliers)? in the plot as legend shows cleaned data divided into train-set and test-set and plot over raw time data.

I also did not also use GridSerachCV within my pipeline also due to save runtime! I have lost of `df`s samples I need to apply the designed pipeline. Can  you kindly comment on these Qs separately

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

What happens if we don't use cross-validation (CV) and just use trainset and test-set within our forecasting on tail of time data? #887

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

What happens if we don't use cross-validation (CV) and just use trainset and test-set within our forecasting on tail of time data? #887

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions