Simply Salford Blog

Predicting Shifts in El Niño Using Birds & Data Mining

Posted by Kimberly Fahrnkopf on Wed, Sep 3, 2014 @ 10:38 AM

Dr. Grant Humphries, from the Zoology department at the University of Otago, New Zealand, has spent the last three years studying how a bird species, the Sooty Shearwater, can help predict upcoming El Niño events. After much time and research, he has found a way to do so using data mining.

Topics: TreeNet, data mining, Variable Importance, big data, data science, prediction, predictive modeling, predictive model

A Few Words On Predictive Accuracy From The Experts

Posted by Heather Hinman on Tue, Feb 25, 2014 @ 05:22 AM

Predictive accuracy is repeatedly cited by data scientists as one of the most important demands on modern data mining algorithms and software. It ranks right alongside model-building speed, missing value handling, and memory efficiency. So, if it is so important, how do the experts TEST the accuracy of their models?

Topics: prediction, predictive modeling, predictive model

A Data Science Prediction for 2014 [In 120 Words]

Posted by Heather Hinman on Mon, Jan 6, 2014 @ 09:51 AM

January is commonly a time to reflect on the past and make predictions about the future. To each his own, I suppose, but I’m confident that many of us will agree on a few common themes for 2014.

Topics: data science, prediction

Case Study: Making Predictions with MARS

Posted by Heather Hinman on Tue, Oct 16, 2012 @ 03:41 PM

At the 2012 Salford Analytics and Data Mining Conference, Maria Lupetini from Qualcomm gave an easy-to-understand overview of how they are able to make predictions using Salford Systems' MARS (Multivariate Adaptive Regression Splines). A portion of the recorded presentation is below. Enjoy!

Topics: MARS, prediction

Prediction Generation in the Salford Predictive Modeler

Posted by Dan Steinberg on Wed, May 2, 2012 @ 10:00 AM

Once you have built an SPM model (CART, MARS, TreeNet, RandomForests) and
have saved the grove (.GRV) file, you are in a position to make predictions
for any other data set containing relevant predictors. Thus, if you trained
your model on file A using variables X1, X2, ..., X50, for example, you can now
generate predictions for file B, provided that file B contains at least some of
the same variables (and preferably all of the variables actually used in the model).
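
Before scoring, it can help to confirm which of the model's predictors are actually present in the new file. The following is a minimal Python sketch of that check, not SPM code: the function names, the column names, and the in-memory CSV text standing in for file B are all hypothetical and used only for illustration.

```python
import csv
import io


def missing_predictors(model_predictors, data_columns):
    """Return the model predictors absent from the data, in model order."""
    available = set(data_columns)
    return [name for name in model_predictors if name not in available]


def read_header(csv_text):
    """Return the column names from the first row of CSV text."""
    return next(csv.reader(io.StringIO(csv_text)))


# Hypothetical example: the model used X1..X3, but file B lacks X3.
model_predictors = ["X1", "X2", "X3"]
file_b = "ID,X1,X2,X4\n1,0.5,2.0,7\n"

print(missing_predictors(model_predictors, read_header(file_b)))  # ['X3']
```

An empty list means every predictor used by the model is available in the new file. A non-empty list names the variables that would have to be treated as missing during scoring.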


This process of prediction generation is called SCORING in our software, and
most models are built specifically so that they can be put into production to
generate predictions. The process can also be used for SIMULATION. In this case,
you prepare a data set that also contains the columns X1, X2, ..., X50, but
the values need not be real data. Instead, the file could contain
hypothesized or imagined values, or forecasted values, as in the case when you
want to make predictions for certain possible future scenarios.
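
One simple way to prepare such a simulation file is to enumerate every combination of a few hypothesized values per predictor. The Python sketch below illustrates the idea under stated assumptions: the helper names, the two predictors, and their candidate values are invented for the example, and the resulting CSV would still need to be scored by the saved model in a separate step.

```python
import csv
import io
from itertools import product


def build_scenarios(levels):
    """Return one row (dict) per combination of the hypothesized values."""
    names = list(levels)
    combos = product(*(levels[name] for name in names))
    return [dict(zip(names, combo)) for combo in combos]


def to_csv(rows):
    """Serialize rows to CSV text with a header line."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=list(rows[0]), lineterminator="\n")
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()


# Hypothetical scenario values for two of the model's predictors.
levels = {"X1": [0, 1], "X2": [10.0, 20.0, 30.0]}
scenarios = build_scenarios(levels)

print(len(scenarios))  # 6 rows: 2 values of X1 times 3 values of X2
print(to_csv(scenarios))
```

A full grid grows quickly as predictors are added, so in practice you would usually vary only the few variables of interest and hold the rest at fixed, representative values.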

Topics: SPM, prediction