Categories
Data Science Statistics Writing

New on Win-Vector: Checking your Data for Signal

I have a new article up on the Win-Vector Blog, on checking your input variables for signal: An all too common approach to modeling in data science is to throw all possible variables at a modeling procedure and “let the algorithm sort it out.” This is tempting when you are not sure what are the […]