Simply Salford Blog

9 Data Mining Challenges From a Data Scientist Like You

Posted by Salford Systems on Tue, Jan 19, 2016 @ 07:00 AM

Data mining has a plethora of challenging aspects. Some of these challenges are common among nearly all data scientists, analysts, and predictive modelers while others are more industry-specific. Nevertheless, we all run into a snag here and there (hopefully more like there, not here) and it can be a trying task to overcome our day-to-day or project-to-project challenges.

Read More

Topics: command line, sample size, big data, GUI, missing values, data analysis, data mining in education

Working With SPM Command Line [Video]

Posted by Eric Lee on Thu, Mar 27, 2014 @ 07:31 AM

Although we generally discuss how to use the SPM software as a GUI application, there are
many people who are more comfortable with the traditional command-line version. For data modelers who are working with extremely large amounts of data, and running the software on a server, the command-line (non-GUI) application may be more appropriate.

Read More

Topics: SPM, command line

9 Data Mining Challenges From Data Scientists Like You

Posted by Heather Hinman on Tue, Jul 23, 2013 @ 06:08 AM

Data mining has a plethora of challenging aspects. Some of these challenges are common among nearly all data scientists, analysts, and predictive modelers while others are more industry-specific. Nevertheless, we all run into a snag here and there (hopefully more like there, not here) and it can be a trying task to overcome our day-to-day or project-to-project challenges.

Read More

Topics: command line, sample size, big data, GUI, missing values, data analysis, data mining in education

Sanitizing Data: Keep the Details of Your Data Mining Project Private

Posted by Dan Steinberg on Mon, Apr 22, 2013 @ 11:04 AM

The SPM software suite includes a handy utility for changing all the variable names on your data to uninformative labels such X1, X2, etc. To convert a data set this way just follow the pattern:

Read More

Topics: SPM, command line

A Quick Tip on Faster Processing in SPM

Posted by Dan Steinberg on Mon, Oct 8, 2012 @ 11:06 AM

For those comfortable working scripts we have a tip on getting your results faster and with less overhead and little to no screen clutter:

From the menus select File/Submit Command File

Read More

Topics: SPM, command line, GUI, speed

Finding The Automatically Stored Command Log

Posted by Dan Steinberg on Wed, May 2, 2012 @ 10:09 AM

Most users of Salford Systems’ data mining tools (CART®, MARS®, TreeNet®, RandomForests® or the more recent integrated SPM™ package) rely on the GUI (Graphical User Interface) to do their work. The GUI makes life easy as you do not need to remember any command syntax and of course the GUI has many useful visual displays of important results. But there are some good reasons to learn how to work with command scripts which is the topic for the current posting. We will refer to our software as SPM (Salford Predictive Modeler) which includes all of our individual data mining engines.

It is useful to remember that almost everything you do during a GUI session using SPM has a “command equivalent.” That means that you could accomplish the identical model and results simply by submitting a set of commands to SPM instead of pointing and clicking. Even more useful to remember is that SPM automatically creates the equivalent set of commands for you as you work, saving the results to a text file. We will return to how to locate that text file a bit later.

Read More

Topics: command line, command log