onehotencoder and standardscaler
Machine Learning
asked by kaADSS

Comments

tofighi commented Feb 13, 2020
Please clarify your question by providing an example.

kaADSS commented Feb 13, 2020
Actually, I have found the answer. For every column in the list you pass to it, OneHotEncoder() detects how many categories the categorical variable has and creates one binary (1/0) column for each category. A variable with 2 choices therefore gets two columns, and a variable with 3 choices gets three. You need to check whether all of them are necessary: for 2 choices one column is enough, and for 3 choices two columns are enough to recover the actual value, which keeps the data size down. So you have to trim the redundant column yourself to reach the minimal encoding.

Answer

As you described in the comments, the additional columns are used as dummy variables.
tofighi answered Feb 18, 2020
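
To make the column counts discussed in this thread concrete, here is a minimal sketch using scikit-learn's OneHotEncoder. The toy `colors` data is hypothetical and not from the original question. The sketch also assumes a scikit-learn version that supports the `drop` parameter, which lets the encoder remove the redundant column instead of trimming it by hand.

```python
import numpy as np
from sklearn.preprocessing import OneHotEncoder

# Hypothetical toy data: a single categorical column with 3 choices.
colors = np.array([["red"], ["green"], ["blue"], ["green"]])

# Default behaviour: one binary column per category, so 3 columns here.
# fit_transform returns a sparse matrix by default; toarray() makes it dense.
full = OneHotEncoder().fit_transform(colors).toarray()

# drop="first" removes the column of the first category (in sorted order),
# leaving 2 columns. A row of all zeros then stands for the dropped category.
reduced = OneHotEncoder(drop="first").fit_transform(colors).toarray()

print(full.shape)     # (4, 3)
print(reduced.shape)  # (4, 2)
```

Dropping one column per variable is mainly useful for models that are sensitive to perfectly collinear inputs, such as unregularized linear regression. For many other models, keeping all the columns is harmless.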