Monthly Archives: June 2018

Leave One Out Encoding for Categorical Feature Variables on Spark

Categorical feature variables is a thorny issue for many supervised Machine Learning algorithms. Many learning algorithms can not handle categorical feature variables. In this post, we will go over an encoding scheme called Leave One Out Encoding, as implemented with … Continue reading

Posted in Big Data, Data Science, ETL, Spark | Tagged | 1 Comment