Simply Salford Blog

Data Mining & Sampling Issues: Do we need a 3-way partition of data (learn, validate, test)?

Posted by Dan Steinberg on Wed, May 14, 2014 @ 10:51 AM

The short answer to this question is “no” we do not think that the 3-way partition is mandatory for SPM core models such as CART and TreeNet.  Here we discuss the issue.

Read More

Topics: train and test data, partition, sample size

How To Build Your First Predictive Model in SPM [slideshare]

Posted by Heather Hinman on Thu, Dec 19, 2013 @ 05:34 AM

Using a new data mining tools can be like learning a new language. We get it. Why spend your time learning how to use the SPM software suite? Well, I'm glad you asked!

Read More

Topics: video, SPM, partition, predictive modeling, beginner, Data Prep

Data Mining: How to Partition Data into Train and Test

Posted by Dan Steinberg on Fri, May 18, 2012 @ 11:33 AM

There are several options for partitioning data randomly into train and test partitions, repeating the process to obtain different partitions.

Read More

Topics: CART, train and test data, partition