Tuesday, November 26, 2013

Applied Predictive Modeling – September 15, 2013


Applied Predictive Modeling Hardcover – September 15, 2013

Author: Visit Amazon's Max Kuhn Page | Language: English | ISBN: 1461468485 | Format: PDF, EPUB

Applied Predictive Modeling – September 15, 2013
Free download Applied Predictive Modeling Hardcover – September 15, 2013 from mediafire, rapishare, and mirror link

Review

This strong, technical, hands-on treatment clearly spells out the concepts, and illustrates its themes tangibly with the language R, the most popular open source analytics solution.

Eric Siegel, Ph.D. Founder, Predictive Analytics World, Author, Predictive Analytics: The Power to Predict Who Will Click, Buy, Lie, or Die

From the Back Cover

This text is intended for a broad audience as both an introduction to predictive models as well as a guide to applying them. Non-mathematical readers will appreciate the intuitive explanations of the techniques while an emphasis on problem-solving with real data across a wide variety of applications will aid practitioners who wish to extend their expertise. Readers should have knowledge of basic statistical ideas, such as correlation and linear regression analysis. While the text is biased against complex equations, a mathematical background is needed for advanced topics.

Dr. Kuhn is a Director of Non-Clinical Statistics at Pfizer Global R&D in Groton Connecticut. He has been applying predictive models in the pharmaceutical and diagnostic industries for over 15 years and is the author of a number of R packages. 

Dr. Johnson has more than a decade of statistical consulting and predictive modeling experience in pharmaceutical research and development.  He is a co-founder of Arbor Analytics, a firm specializing in predictive modeling and is a former Director of Statistics at Pfizer Global R&D.  His scholarly work centers on the application and development of statistical methodology and learning algorithms.

Applied Predictive Modeling covers the overall predictive modeling process, beginning with the crucial steps of data preprocessing, data splitting and foundations of model tuning.  The text then provides intuitive explanations of numerous common and modern regression and classification techniques, always with an emphasis on illustrating and solving real data problems.  Addressing practical concerns extends beyond model fitting to topics such as handling class imbalance, selecting predictors, and pinpointing causes of poor model performance—all of which are problems that occur frequently in practice.
 
The text illustrates all parts of the modeling process through many hands-on, real-life examples.  And every chapter contains extensive R code for each step of the process.  The data sets and corresponding code are available in the book’s companion AppliedPredictiveModeling R package, which is freely available on the CRAN archive.
 
This multi-purpose text can be used as an introduction to predictive models and the overall modeling process, a practitioner’s reference handbook, or as a text for advanced undergraduate or graduate level predictive modeling courses.  To that end, each chapter contains problem sets to help solidify the covered concepts and uses data available in the book’s R package.
 
Readers and students interested in implementing the methods should have some basic knowledge of R.  And a handful of the more advanced topics require some mathematical knowledge.

See all Editorial Reviews

Download latest books on mediafire and other links compilation Applied Predictive Modeling – September 15, 2013
  • Hardcover: 600 pages
  • Publisher: Springer; 2013 edition (September 15, 2013)
  • Language: English
  • ISBN-10: 1461468485
  • ISBN-13: 978-1461468486
  • Product Dimensions: 9.3 x 6.1 x 1.4 inches
  • Shipping Weight: 2.3 pounds (View shipping rates and policies)
  • Amazon Best Sellers Rank: #7,997 in Books (See Top 100 in Books)
    • #1 in Books > Medical Books > Basic Sciences > Biostatistics
    • #1 in Books > Textbooks > Medicine & Health Sciences > Research > Biostatistics
    • #6 in Books > Computers & Technology > Software > Mathematical & Statistical
I read "Applied predictive modeling" (which I will shorten to APM) shortly after I read "Introduction to statistical learning" (ISL) by James, Witten, Hastie and Tibshirani, and find that book both closest to APM, and helpful in highlighting APM's strengths.

The two books cover the same broad subject. If you google "kuhn caret", you will find Max Kuhn's (very informative) presentation of his "caret" R package, and its first slide will tell you that he uses "predictive modeling" as a synonym of "machine learning" - what Hastie and Tibshirani call "statistical learning". Adopting H&T's terminology choice, I will say that both books combine theory of "statistical learning" with hands-on illustrations and exercises implemented in R; the get-your-hands-dirty, try-it-out element is, in fact, ISL's key difference from the earlier, venerable "Elements of statistical learning".

Both books, inevitably, go over a catalog of statistical-learning techniques. The shorter ISL, in my opinion, is superior at explaining the concepts and communicating the principles, while APM takes the more straightforward approach of "beefing up" the catalog, by spending more pages on each item and including more items. While ISL is by design very accessible, APM can be more technical - the detail will surely be appreciated by any practitioner - and, as it talks about the various methods, it can and does discuss recent extensions, offering an extensive and "fresh" bibliography. R-wise, APM's advantage is not decisive (if you look at content, not line count) but big; the book naturally favors "caret" - which has a useful role, "wrapping" a plethora of third-party R packages, and providing a common interface, plus helpful utilities - but both references and uses the specialist packages as well.
There are many fine math-oriented predictive modeling books, such as Hastie (The Elements of Statistical Learning: Data Mining, Inference, and Prediction, Second Edition (Springer Series in Statistics)). Kuhn et al consider them "sister texts" and begin immediately to differentiate-- their approach is hands on and practical, for the express purpose of demonstrating HOW to sort, structure and predict via Python or R, for the purpose of accuracy and understanding of the DATA and trends, NOT learning the underlying math.

For a couple of pharmaceutical guys, (who BTW use R extensively, I've been an analyst in that industry), you'd think the examples would be new chemical or biological entities. Not so! The cases are fun and exciting, ranging from the nontrivial compression strength of concrete (want that bridge to hold when you cross?) to fuel economy, credit scoring, success in grant applications (boy their colleagues will love that one!), and cognitive impairment. I evaluate technology for patents at payroy dot com, and we have a log likelihood model using Bayesian and Monte Carlo that their grant section helped translate seamlessly to R! We're NOT talking pie in the sky pseudo code here, but real life, real results recipes.

The authors talk about the "scholarly veil" -- meaning we general workers and researchers don't always "deserve" to see the underlying process, software and data (and, other than open source, often can't afford it). Wow, do they pop that myth!

Applied Predictive Modeling – September 15, 2013 Download

Please Wait...

No comments:

Post a Comment