Simply Salford Blog

9 Data Mining Challenges From a Data Scientist Like You

Posted by Salford Systems on Tue, Jan 19, 2016 @ 07:00 AM

Data mining has a plethora of challenging aspects. Some of these challenges are common among nearly all data scientists, analysts, and predictive modelers while others are more industry-specific. Nevertheless, we all run into a snag here and there (hopefully more like there, not here) and it can be a trying task to overcome our day-to-day or project-to-project challenges.

Read More

Topics: command line, sample size, big data, GUI, missing values, data analysis, data mining in education

Data Science in Biology: A Few Problems & Solutions [guest post]

Posted by Kimberly Fahrnkopf on Thu, Sep 11, 2014 @ 10:10 AM

Guest post by Grant Humphries, Post Doctoral researcher, University of California, Davis

Read More

Topics: TreeNet, data mining, big data, data science, predictive modeling, data analysis

Predicting Shifts in El Niño Using Birds & Data Mining

Posted by Kimberly Fahrnkopf on Wed, Sep 3, 2014 @ 10:38 AM

Dr. Grant Humphries, from the Zoology department at the University of Otago, New Zealand, has spent the last three years studying how a bird species called Sooty Shearwaters can help predict upcoming El Niño occurrences. After much time and research, he has figured out a way to do so using data mining.

Read More

Topics: TreeNet, data mining, Variable Importance, big data, data science, prediction, predictive modeling, predictive model

Musings on Becoming a Data Scientist [guest post]

Posted by Heather Hinman on Fri, May 9, 2014 @ 07:25 AM

Guest Post by Scott Terry, Rapid Progress Marketing and Modeling, LLC 

Read More

Topics: big data, data science

Buzzwords, De-buzzed: 5 Varying Definitions for Big Data

Posted by Heather Hinman on Thu, Jan 23, 2014 @ 08:30 AM

This article is an update (or add-on) to The Elusive Quest to Define Big Data published on August 6, 2013.

Read More

Topics: big data

The Elusive Quest To Define 'Big Data'

Posted by Heather Hinman on Tue, Aug 6, 2013 @ 08:37 AM

It can be difficult trying to define certain trendy terms like 'Big Data.' When I started out to write this article, I thought it would be quite easy to discuss how Big Data has become extremely relevant to so many companies of different sizes, shapes, and industries. However, what I found is that there is a lot of confusion out there about what Big Data really is. How do we define it?

Read More

Topics: big data

Unlock the Key to Big Data & Modern Regression Techniques at JSM 2013

Posted by Heather Hinman on Thu, Aug 1, 2013 @ 11:14 AM

The recent launch of the Salford Predictive Modeler software suite v7.0, the latest version of Salford Systems’ data mining software package, spurred enormous interest in the data science community, especially after its showcase webinar series: “The Evolution of Regression Modeling.” The series’ instructors, Salford Systems’ CEO and Founder Dr. Dan Steinberg and Senior Scientist Mikhail Golovnya, will continue this showcase of the software’s forefront predictive methodology at the Joint Statistical Meetings in Montreal.

For the last decade Salford Systems has hosted computer training workshops at JSM to educate the leading statisticians, analysts, data scientists, and researchers on its flagship products CART® decision trees, MARS® nonlinear regression, TreeNet® stochastic gradient boosting, and Random Forests®. In conjunction with the software suite’s new model compression techniques and hybrid modeling capabilities, Salford Systems will present yet another new algorithm that has been added to the Salford toolkit, Generalized PathSeeker (GPS). This technology includes methods like LASSO, Ridge, and regularized regression. 

Read More

Topics: training, big data, Regression, Tutorial

Confessions of a Data Scientist

Posted by Heather Hinman on Fri, Jul 26, 2013 @ 10:19 AM

All data scientists have their inner monologue of quirky comments and frustrating dilemmas. Sometimes, these comments  have the opportunity to be spoken out loud -- usually in the context of some informal meeting around the coffee pot, or in an under-the-breath mumble during a pacing rant of frustration. I have even witnessed the punching of a balloon and the kicking of a bean-bag chair (accompanied with some inaudible comments) over a data mining challenge that was driving the person bonkers.

Read More

Topics: big data, Data Prep, missing values, data analysis

9 Data Mining Challenges From Data Scientists Like You

Posted by Heather Hinman on Tue, Jul 23, 2013 @ 06:08 AM

Data mining has a plethora of challenging aspects. Some of these challenges are common among nearly all data scientists, analysts, and predictive modelers while others are more industry-specific. Nevertheless, we all run into a snag here and there (hopefully more like there, not here) and it can be a trying task to overcome our day-to-day or project-to-project challenges.

Read More

Topics: command line, sample size, big data, GUI, missing values, data analysis, data mining in education

Knowledge Discovery and Data Mining (KDD) 2012 Beijing Recap

Posted by Dan Steinberg on Wed, Aug 22, 2012 @ 12:56 PM

It was great to be in Beijing for the 2012 KDD meetings where we managed to meet up with old friends and meet new like-minded people during three well-organized long days. Salford was a sponsor and we had the opportunity to demonstrate both the English and the Chinese-language versions of the SPM Salford Predictive Modeler software suite at our exhibitor booth.

Read More

Topics: KDD 2012, big data