Python Data Science Handbook

Jake VanderPlas

出版时间

2016-06-25

ISBN

9781491912058

评分

★★★★★
书籍介绍
For many researchers, Python is a first-class tool mainly because of its libraries for storing, manipulating, and gaining insight from data. Several resources exist for individual pieces of this data science stack, but only with the Python Data Science Handbook do you get them all-IPython, NumPy, Pandas, Matplotlib, Scikit-Learn, and other related tools. Working scientists and data crunchers familiar with reading and writing Python code will find this comprehensive desk reference ideal for tackling day-to-day issues: manipulating, transforming, and cleaning data; visualizing different types of data; and using data to build statistical or machine learning models. Quite simply, this is the must-have reference for scientific computing in Python. With this handbook, you'll learn how to use: IPython and Jupyter: provide computational environments for data scientists using Python NumPy: includes the ndarray for efficient storage and manipulation of dense data arrays in Python Pandas: features the DataFrame for efficient storage and manipulation of labeled/columnar data in Python Matplotlib: includes capabilities for a flexible range of data visualizations in Python Scikit-Learn: for efficient and clean Python implementations of the most important and established machine learning algorithms
AI导读
核心看点
  • 系统讲解IPython、NumPy、Pandas等核心工具
  • 涵盖数据清洗、可视化及机器学习建模全流程
  • 提供丰富代码示例,适合快速查阅与实战参考
适合谁读
  • 具备Python基础,希望入门数据科学的开发者
  • 需要高效处理数据、进行统计分析的研究人员
  • 寻找Python数据科学生态系统工具书的技术人员
读前提醒
  • 建议结合GitHub上的Jupyter Notebook边读边练
  • 作为工具书按需查阅,无需从头到尾线性阅读
  • 注意核对勘误,部分早期版本代码可能存在小错
读者共识
  • 内容全面且浅显,是极佳的数据科学入门参考书
  • 相比官方文档更友好,提供了常用场景的具体示例
  • 适合有编程基础者,理论深度适中,实用性强

本导读基于书籍简介、目录、原文摘录、短评和书评生成,不等同于全文精读。

精彩摘录
  • "Looking through the Python 3.4 source code, we find that the integer (long) type definition effectively looks like this (once the C macros are expanded):"
  • "Here PyObject_HEAD is the part of the structure containing the reference count, type code, and other pieces mentioned before. Notice the difference here: a C integer is essentially a label for a position in memory whose bytes encode an integer value. A Python integer is a pointer to a position in me"
  • "速度很快,而且不需要选择超参数, 所以通常很适合作为初步分类手段, 在借助更复杂的模型进行优化之前使用。"
用户评论
好书,而且免费。
比 Google 和看文档快点
比Mckinney那本更适合做工具书
前四章重新看了遍 发现不少初期学的时候没注意的小技巧、、第五章作为ml的简单入门也是写的很易懂了、、
比 Python for data analysis 那本好啊
实践有余而理论不足 对数据处理来说很全能了,知识范围面很广 基于sklearn模块编写 感觉灵活性差些
Great toolbox!
conceptions in this version are moreorless clearer than the Chinese translation version. suitable for beginners. But better to use together with the documents of numpy and pandas. some operations used in the book are deprecated now.
Nunpy, Pandas, Matplotlib三剑客,讲的挺清楚。
basic python reference for numpy, pandas, matplotlib; introduction to machine learning with sklearn
下载
收藏