I am working on an automated ML (Regression) algorithm where the flow of process is: User uploads a data -- Data Cleaning -- Encoding(Target Encoder) -- Fitting model -- results.
I am completely fine until this point, my confusion is when the user wants to test this in an unseen data without target variable, then I need to again perform Data cleaning -- Encoding and encoding technique I have used while fitting the model can work only if there is a target variable (unseen data will not have a target variable) and I cannot change the Encoding technique on unseen data as the testing data needs to go through the same procedure as the data used while fitting the model as per my knowledge.
Could someone please help me in finding a way to overcome this issue or any suggestions would be of great help.
Thanks in advace.
from Predicting unseen data on Target variable based Encoding Technique
No comments:
Post a Comment