Data Science for Business: What You Need to Know about Data Mining and Data-Analytic Thinking 1st Edition
Written by renowned data science experts Foster Provost and Tom Fawcett, Data Science for Business introduces the fundamental principles of data science, and walks you through the "data-analytic thinking" necessary for extracting useful knowledge and business value from the data you collect. This guide also helps you understand the many data-mining techniques in use today.
Based on an MBA course Provost has taught at New York University over the past ten years, Data Science for Business provides examples of real-world business problems to illustrate these principles. You’ll not only learn how to improve communication between business stakeholders and data scientists, but also how to participate intelligently in your company’s data science projects. You’ll also discover how to think data-analytically, and fully appreciate how data science methods can support business decision-making.
- Understand how data science fits in your organization―and how you can use it for competitive advantage
- Treat data as a business asset that requires careful investment if you’re to gain real value
- Approach business problems data-analytically, using the data-mining process to gather good data in the most appropriate way
- Learn general concepts for actually extracting knowledge from data
- Apply data science principles when interviewing data science job candidates
- ISBN-10: 1449361323
- ISBN-13: 978-1449361327
- Edition: 1st
- Publisher: O'Reilly Media
- Publication date: September 17, 2013
- Language: English
- Dimensions: 7 x 0.9 x 9.19 inches
- Print length: 413 pages
Editorial Reviews
Review
"This book goes beyond data analytics 101. It's the essential guide for those of us (all of us?) whose businesses are built on the ubiquity of data opportunities and the new mandate for data-driven decision-making." -- Tom Phillips, CEO of Media6Degrees and Former Head of Google Search and Analytics
"Data is the foundation of new waves of productivity growth, innovation, and richer customer insight. Only recently viewed broadly as a source of competitive advantage, dealing well with data is rapidly becoming table stakes to stay in the game. The authors' deep applied experience makes this a must read--a window into your competitor's strategy." -- Alan Murray, Serial Entrepreneur; Partner at Coriolis Ventures
"This timely book says out loud what has finally become apparent: in the modern world, Data is Business, and you can no longer think business without thinking data. Read this book and you will understand the Science behind thinking data." -- Ron Bekkerman, Chief Data Officer at Carmel Ventures
"A great book for business managers who lead or interact with data scientists, who wish to better understand the principles and algorithms available without the technical details of single-disciplinary books." -- Ronny Kohavi, Partner Architect at Microsoft Online Services Division
Product details
- Publisher : O'Reilly Media
- Publication date : September 17, 2013
- Edition : 1st
- Language : English
- Print length : 413 pages
- ISBN-10 : 1449361323
- ISBN-13 : 978-1449361327
- Item Weight : 1.5 pounds
- Dimensions : 7 x 0.9 x 9.19 inches
- Best Sellers Rank: #16,297 in Books
- #2 in Data Mining (Books)
- #5 in Business Statistics
- #13 in Statistics (Books)
About the authors

Tom Fawcett holds a Ph.D. in machine learning and has worked in industry R&D for more than two decades for companies such as GTE Laboratories, NYNEX/Verizon Labs, and HP Labs. His published work has become standard reading in data science.

Foster Provost is Professor of Data Science at NYU and Ira Rennert Professor of Entrepreneurship and Information Systems at the NYU Stern School of Business. His award-winning research is read and cited broadly. Prof. Provost has co-founded several successful companies focusing on data science for marketing, fraud control, and other business applications.
Customer reviews
Top reviews from the United States
- Reviewed in the United States on March 7, 2015
Format: Paperback, Verified Purchase
Data Science for Business by Foster Provost and Tom Fawcett is a very important book about data mining and data analytic thinking. In 1971, Abbie Hoffman shocked the world when he demanded hippie readers (at the time, a likely oxymoron) "Steal This Book". While I wouldn't go so far as to encourage current and future data scientists to shoplift, I will demand that they READ THIS BOOK!
Not long ago, data was difficult and expensive to come by. Today, we're living in a world of far too much data, vast amounts of cheap computing power, and way too many poorly defined questions. Mix them all together and you're guaranteed to make a mess.
Going from data dearth to plethora presents substantive issues. In business, the balance between gut feel decision-making and analysis paralysis is changing, rapidly. Whether it moves too far from gut to paralysis, only time will tell. Through Data Science for Business, Provost and Fawcett offer practitioners a guide to equilibrium.
Read this book and you'll find yourself moving briskly down the road towards data analytic enlightenment. While not highly technical, the authors cover each topic with enough rigor to appreciate the tools being presented and the insights being offered.
From the outset, the authors are clear about the book's objectives: "The primary goals of this book are to help you view business problems from a data perspective and understand principles of extracting useful knowledge from data. There is fundamental structure to data-analytic thinking, and basic principles that should be understood. There are also particular areas where intuition, creativity, common sense, and domain knowledge must be brought to bear… As you get better at data-analytic thinking you will develop intuition as to how and where to apply creativity and domain knowledge."
This paragraph makes me think of all those undergrad and graduate students studying statistics at universities all over the world, my daughter included, who are being bombarded by one math or statistics class after another (Calculus III, Math Stat I and II, Linear Algebra, etc.). Yet, far too often, they enter the real world lacking "data analytic thinking" or a sense of "basic principles." They do, however, have a sense of being overwhelmed and underprepared. The epic battle between "frequentists" and "Bayesians" takes a back seat to what should be the real controversy in statistics departments around the world: the balance between "application" and "theory". The book's "primary goals" should be the walking orders of every statistics program at any college or university anywhere.
From the outset (page 2), the authors state, "Data mining is a craft. It involves the application of a substantial amount of science and technology, but the proper application still involves art as well." Absolutely true! It's great to read this stuff! This is followed by a concise discussion of CRISP-DM, a well-defined data mining process, whose concepts are elementary, essential, and integral to the responsible, proper, and successful practice of data mining.
From this point on, the authors proceed to accomplish their primary goals. They present such topics as predictive modeling, correlation, classification, clustering, regression, logistic regression, linear discriminants, and much more. Their presentations are user friendly, their real world examples are interesting, and their guidance and insights are extremely valuable.
My criticisms are limited to their website. The Data Science for Business site leaves me wanting more real world examples to enjoy, access to more resources and tools of the trade, more references to peruse, and a more rigorous approach to some of the solutions. Perhaps Data Science for Business the sequel is on the horizon?
Whether you're a seasoned statistician (or, data scientist), a young aspiring novice, or an adventurous business person looking to expand his/her horizons, Data Science for Business by Foster Provost and Tom Fawcett is well worth the price of admission and the reading time you'll invest.
Foster Provost and Tom Fawcett state, "[i]deally, we envision a book that any data scientist would give to his collaborators…" I'll do them one better, I'm giving it to my daughter!
- Reviewed in the United States on October 14, 2015
Format: Paperback, Verified Purchase
It's an excellent, even mandatory book for your Data Science shelf. I am glad I bought it. I am 67% of the way through reading this book. It has nowhere near enough material on some areas, though, and is just missing some material that you need for DS. That's actually OK because of course no single book is enough to cover everything you need to know in a field. Look how many books you may have bought just to get an undergrad degree, and I bet it was not just one book.
So here is a list of good and bad about this excellent book.
Its good points:
The profit curve. After reading this book, I will never use accuracy to select a model any more, as that's nearly a worthless metric, especially when there are marginal costs and marginal profits involved in an application scenario. The book is just amazingly good on describing how to select models based on estimated profit, foremost with the profit curve, and with selected supporting measures like the area under the ROC curve.
The expected profit computation and the cost-benefit matrix as a partner to the confusion matrix. This is great stuff. It's not even described in other data science courses that I have taken.
Other good points: ...And don't worry about the other good points (there are some). The profit curve analysis, and the lead-up to that, are superior.
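To make the expected-profit idea concrete, here is a minimal sketch of how a cost-benefit matrix can be paired with a confusion matrix. All counts and monetary values are invented for illustration and do not come from the book; `expected_profit` is a hypothetical helper name.

```python
# Illustrative numbers only; none of these figures come from the book.

def expected_profit(confusion, cost_benefit):
    """Expected profit per instance: sum over cells of rate(cell) * value(cell)."""
    total = sum(confusion.values())
    return sum(count / total * cost_benefit[cell]
               for cell, count in confusion.items())

def accuracy(confusion):
    """Fraction of instances classified correctly."""
    return (confusion["TP"] + confusion["TN"]) / sum(confusion.values())

# Hypothetical value of each outcome: a targeted responder nets 99,
# a targeted non-responder costs 1, and untargeted customers cost nothing.
cost_benefit = {"TP": 99.0, "FP": -1.0, "FN": 0.0, "TN": 0.0}

# Two hypothetical classifiers evaluated on the same 1,000 held-out customers.
targeting_model = {"TP": 60, "FP": 140, "FN": 40, "TN": 760}
target_nobody = {"TP": 0, "FP": 0, "FN": 100, "TN": 900}

for name, cm in [("targeting model", targeting_model),
                 ("target nobody", target_nobody)]:
    print(f"{name}: accuracy={accuracy(cm):.2f}, "
          f"expected profit per customer={expected_profit(cm, cost_benefit):.2f}")
```

With these made-up numbers, the "target nobody" classifier has the higher accuracy but zero expected profit, which is the kind of mismatch the reviewer is describing.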
Its bad points:
p.224: "We will train on the complete dataset and then test on the same dataset we trained on." What follows for the rest of the chapter is an inappropriate error analysis, because it is overly optimistic (but otherwise the techniques are great). The models have seen the training data. We should never assess (test) a model, let alone base the entire remainder of the chapter material, on error (accuracy) estimates produced from data that the models have already seen.
In most chapters, there is just not enough detail in the material, to enable this book to be used as a "correct reference" basis against which to write your own working code as you follow along with the text in whatever computer language you want to use for analysis.
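The first criticism above, that accuracy measured on the training data is overly optimistic, can be illustrated with a small sketch. It is not taken from the book: it assumes a toy 1-nearest-neighbour classifier and purely random labels, so there is nothing real to learn, yet the score on the training data still looks perfect.

```python
import random

def nearest_label(x, points):
    """Label of the stored point closest to x (a 1-nearest-neighbour classifier)."""
    return min(points, key=lambda p: abs(p[0] - x))[1]

def accuracy(model_points, eval_points):
    """Fraction of eval_points whose label matches the nearest stored point."""
    hits = sum(nearest_label(x, model_points) == y for x, y in eval_points)
    return hits / len(eval_points)

random.seed(0)
# Pure noise: the label is independent of the feature, so nothing can be learned.
data = [(random.random(), random.randint(0, 1)) for _ in range(400)]
train, holdout = data[:200], data[200:]

train_acc = accuracy(train, train)      # evaluated on data the model has seen
holdout_acc = accuracy(train, holdout)  # evaluated on unseen data
print(f"training accuracy: {train_acc:.2f}, holdout accuracy: {holdout_acc:.2f}")
```

Each training point is its own nearest neighbour, so the training accuracy is perfect by construction, while the holdout accuracy should hover around chance. Only the holdout figure says anything about how the model would behave on new data.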
In summary:
The book is outstanding. It is necessary for your DS bookshelf, but on the other hand it is nowhere near sufficient.
The data science course sequence by Johns Hopkins University identifies many of the elements of a nice overall outline as to what DS practitioners need to be able to do (and this is not even sufficient either):
Reproducible research; Experimental design; R programming (or python, or perhaps SAS or Octave, but some mathy language for sure); Exploratory data analysis; Regression models; Statistical inference; Practical machine learning; Scientific writing; Developing data products; Big data techniques (e.g. Apache Spark programming or at least MapReduce-style programming); SQL and NoSQL databases; Concurrent, distributed, and parallel programming; Advanced statistics (such as multiple testing corrections).
This book by Provost and Fawcett gives just a part of the necessary DS material. However, the part it provides is essential. I wish the biological data scientists in academia would adopt and integrate the cost-benefit matrix idea and the profit curve idea into their model selection techniques instead of relying mostly on the accuracy metric.
Also, a data scientist could do several follow-on added-value extensions to the profit curve chapter. You could produce a revenue (or cost) curve, since sometimes that matters more. You could quickly find alternatives which are nearly equi-profitable to the optimal profit but which exhibit (less revenue, less cost) or (more revenue, more cost). You could detail the model selection and profit consequences of fixed budgets. You could further assess the implications of marginal profit analysis on the optimal quantity when the profitability ratio changes. You could directly assess the data science solution against the best business wisdom solution and estimate what amount of profit is lost when using the old business wisdom decisions. It's a testament to this book's strong value that you can do a lot more based on its material.
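As a rough sketch of one of these extensions, the code below builds a profit curve from ranked model scores and then picks the best cutoff with and without a fixed budget on the number of customers targeted. The scores, the benefit and cost values, and the helper names `profit_curve` and `best_cutoff` are all hypothetical, chosen only to illustrate the idea.

```python
def profit_curve(scored, benefit_tp, cost_fp):
    """Cumulative profit when targeting the top-k instances, for k = 0..n.

    scored: list of (score, is_positive) pairs; higher score = targeted first.
    """
    ranked = sorted(scored, key=lambda pair: pair[0], reverse=True)
    curve, total = [0.0], 0.0
    for _, is_positive in ranked:
        total += benefit_tp if is_positive else -cost_fp
        curve.append(total)
    return curve

def best_cutoff(curve, max_targets=None):
    """Number of instances to target for maximum profit, optionally capped."""
    limit = len(curve) - 1
    if max_targets is not None:
        limit = min(limit, max_targets)
    return max(range(limit + 1), key=lambda k: curve[k])

# Hypothetical model scores with the true outcome of each instance.
scored = [(0.9, True), (0.8, True), (0.7, False), (0.6, True),
          (0.5, False), (0.4, False), (0.3, True), (0.2, False)]

curve = profit_curve(scored, benefit_tp=10.0, cost_fp=4.0)
unconstrained = best_cutoff(curve)
budgeted = best_cutoff(curve, max_targets=5)
print(f"no budget: target top {unconstrained}, profit {curve[unconstrained]}")
print(f"budget of 5: target top {budgeted}, profit {curve[budgeted]}")
```

With a budget cap, the best cutoff can sit well below the unconstrained optimum, so the comparison between candidate models should be made at the cutoffs the budget actually allows.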
Nice work. Recommended.
- Reviewed in the United States on March 7, 2016
Format: Paperback, Verified Purchase
This is an excellent textbook on data science. The text itself explains concepts and theories well and provides definitions, examples, and formulas that help the reader understand and apply these concepts. The information presented is well-organized, and the visual aids include ample graphs and charts. Section breaks are obvious with well-designed titles. Chapters are easy enough to read but don't over-simplify important concepts. Inclusion of Glossary, Bibliography, and index, as well as a detailed table of contents, makes it easy to navigate. The only exception our instructor took with the text during my course was their insistence that only the best data scientists should be considered. Removing this bias, the information provided was clear, concise, and helpful for anyone working with big data or in data analytics.
Top reviews from other countries
- Jim-C, reviewed in Germany on November 24, 2019
5.0 out of 5 stars: Great intro and quick refresher course
Format: Kindle, Verified Purchase
I found this book great to refresh some key concepts after being away from the field for many years.
The contents are good, well organised and they cover most of what you need to know.
The approach is not theoretical but practical and to the point.
The examples are also good, as is the level of detail.
And you have enough references to go deeper if you need.
Great job, I would love to have a second book to go deeper.
- ToninoCarotone, reviewed in Spain on August 11, 2014
5.0 out of 5 stars: Good purchase
Format: Paperback, Verified Purchase
Very good. It explains some techniques, but I liked it above all for how it explains the fundamentals. A good book for getting started with data science...
- Payam Mokhtarian, reviewed in Australia on March 6, 2018
5.0 out of 5 stars: Five Stars
Format: Paperback, Verified Purchase
Highly recommended book for those who want hands-on data science and the business principles of machine learning.
- Adriano, reviewed in Italy on October 27, 2017
5.0 out of 5 stars: Perfect for getting started, but also for those who already have experience
Format: Paperback, Verified Purchase
An excellent handbook for understanding the ABCs of data science, suitable both for those who know nothing and for seasoned experts.
I think it suits all the different kinds of readers: the developer, the manager, the executive, the operations person, the researcher, the analyst... There is material for everyone, and the language is calibrated to the different kinds of reader.
Recommended.
NOTE: it is in English.
- 2501, reviewed in France on November 11, 2021
5.0 out of 5 stars: Very interesting
Format: Paperback, Verified Purchase
Perhaps the most interesting book I have read on machine learning. It is not a book for beginners: if you have not already mastered the subject, you will not get much out of it. But if you already have some experience with the subject, it will help you understand quite a few subtleties that are usually never mentioned.