1,340 views
1 1 vote

So far, I have modeled on known historical data. What if there are variables known only after the fact?
Let me give you an example. I want to predict the outcome of the match, win, lose or draw. I use variables from previous games such as ball possession, number of shots, corners, etc. Let's say the Chelsea-Arsenal game is approaching Saturday. How am I supposed to build a model and predict the result if this data is not yet available for my event? What to do in such cases, is it possible to forecast such data?

 
50% Accept Rate Accepted 1 answers out of 2 questions

1 Answer

Best answer
0 0 votes
Your answer is actually based on what we always do in machine learning. We collect datasets, split to training and testing set, we train using the training set, and evaluate performance based on the testing set.

Assume you have 100 matches with all statistics and parameters you want to use in the training (such as ball possession, number of shots, corners, etc). You can take 80 of these matches for training and the rest of 20 matches for evaluating the model you created based on 80% of data simply because you already know that "future" statistics and outcome to compare with the output of your model to check the performance.

I hope this answers your question.
selected by

Related questions

0 0 votes
1 1 answer
1.0k
1.0k views
Kesz asked Nov 17, 2020
1,041 views
Hi. I have a question about model-based predictions when data is only available after the fact. Let me give you an example. I try to predict the result (HOME, AWAY or a D...
0 0 votes
0 0 answers
531
531 views
HbibOs asked Jun 21, 2021
531 views
Hello,I trained a CNN using synthetic data to perform a segmentation task on human faces. During the test and to evaluate the prediction of this network, I used 200 examp...
1 1 vote
1 1 answer
725
725 views
metelon asked Dec 15, 2020
725 views
When I standardized my data when I created my model. Do I need to save the standardization transformation when I want to predict with my model new data ?
0 0 votes
0 0 answers
550
550 views
Anas asked Nov 28, 2021
550 views
So say I have a column with categorical data like different styles of temperature: 'Lukewarm', 'Hot', 'Scalding', 'Cold', 'Frostbite',... etc.I know that we can use pd.ge...
1 1 vote
0 0 answers
1.1k
1.1k views
Gwanza asked Jan 21, 2022
1,077 views
I want to build a dynamic pricing model which means if product is too expansive for a client and there is a risk that we might loose a client we lower the price for them ...