jermwatt/machine_learning_refined: Notes, examples, and Python demos for the 2nd ...

原作者: [db:作者] 来自: 网络收藏邀请

开源软件名称（OpenSource Name）：

jermwatt/machine_learning_refined

开源软件地址(OpenSource Url)：

https://github.com/jermwatt/machine_learning_refined

开源编程语言(OpenSource Language)：

Python 99.8%

开源软件介绍(OpenSource Introduction)：

Machine Learning Refined: Notes, Exercises, and Jupyter notebooks

Below you will find a range of resources that complement the 2nd edition of Machine Learning Refined (published by Cambridge University Press).

Sample chapters from the 2nd edition
A sampler of widgets / pedagogy
Online notes (jupyter notebooks)
What is new in the second edition?
How to use the book
Technical prerequisites
Coding exercises
Slides and additional instructor resources
Errata
Get a copy of the book
Reviews and Endorsements
Software installation and dependencies
Contact

A sampler of widgets and our pedagogy

(Back to top)

We believe mastery of a certain machine learning concept/topic is achieved only when the answer to each of the following three questions is affirmative.

Intuition Can you describe the idea with a simple picture?
Mathematical derivation Can you express your intuition in mathematical notation and derive underlying models/cost functions?
Implementation Can you code up your derivations in a programming language, say Python, without using high-level libraries?

Intuition comes first. Intuitive leaps precede intellectual ones, and because of this we have included over 300 color illustrations in the book that have been meticulously designed to enable an intuitive grasp of technical concepts. Many of those illustrations are snapshots of animations that show convergence of certain algorithms, evolution of certain models from underfitting all the way to overfitting, etc. This sort of concepts can be illustrated and intuited best using animations (as opposed to static figures). You'll find a large number of such animations in this repository -- which you can modify yourself too via the raw Jupyter notebook version of these notes. Here are just a few examples:


Cross-validation (regression)	Cross-validation (two-class classification)	Cross-validation (multi-class classification)


K-means clustering	Feature normalization	Normalized gradient descent


Rotation	Convexification	Dogification!


A nonlinear transformation	Weighted classification	The moving average


Batch normalization	Logistic regression


Polynomials vs. NNs vs. Trees (regression)	Polynomials vs. NNs vs. Trees (classification)


Changing gradient descent's steplength (1d)	Changing gradient descent's steplength (2d)


Convex combination of two functions	Taylor series approximation


Feature selection via regularization	Secant planes


Function approximation with a neural network	A regression tree

Mathematical optimization: the workhorse of machine learning. We highly emphasize the importance of mathematical optimization in our treatment of machine learning. Optimization is the workhorse of machine learning and is fundamental at many levels – from the tuning of individual models to the general selection of appropriate nonlinearities via cross-validation. Because of this a strong understanding of mathematical optimization is requisite if one wishes to deeply understand machine learning, and if one wishes to be able to implement fundamental algorithms. Part I of the book provides a complete introduction to mathematical optimization, covering zero-, first-, and second-order methods, that are relied upon later in deriving and tuning machine learning models.

Learning by doing. We place significant emphasis on the design and implementation of algorithms throughout the text with implementations of fundamental algorithms given in Python. These fundamental examples can then be used as building blocks for the reader to help complete the text’s programming exercises, allowing them to ”get their hands dirty” and ”learn by doing,” practicing the concepts introduced in the body of the text. While in principle any programming language can be used to complete the text’s coding exercises, we highly recommend using Python for its ease of use and large support community. We also recommend using the open-source Python libraries NumPy, autograd, and matplotlib, as well as the Jupyter notebook editor to make implementing and testing code easier. A complete set of installation instructions, datasets, as well as starter notebooks can be found in this repository.

Online notes

(Back to top)

A select number of Chapters/Sections are highlighted below and are linked to HTML notes that served as early drafts for the second edition of the textbook. You can find these html files as well as Jupyter notebooks which created them in the notes subdirectory.

Chapter 1. Introduction to Machine Learning

1.1 Introduction
1.2 Distinguishing Cats from Dogs: a Machine Learning Approach
1.3 The Basic Taxonomy of Machine Learning Problems
1.4 Mathematical Optimization
1.5 Conclusion

Chapter 8. Linear Unsupervised Learning

8.1 Introduction
8.2 Fixed Spanning Sets, Orthonormality, and Projections
8.3 The Linear Autoencoder and Principal Component Analysis
8.4 Recommender Systems
8.5 K-Means Clustering
8.6 General Matrix Factorization Techniques
8.7 Conclusion
8.8 Exercises
8.9 Endnotes

Chapter 9. Feature Engineering and Selection

9.1 Introduction
9.2 Histogram Features
9.3 Feature Scaling via Standard Normalization
9.4 Imputing Missing Values in a Dataset
9.5 Feature Scaling via PCA-Sphering
9.6 Feature Selection via Boosting
9.7 Feature Selection via Regularization
9.8 Conclusion
9.9 Exercises

Chapter 10. Principles of Nonlinear Feature Engineering

10.1 Introduction
10.2 Nonlinear Regression
10.3 Nonlinear Multi-Output Regression
10.4 Nonlinear Two-Class Classification
10.5 Nonlinear Multi-Class Classification
10.6 Nonlinear Unsupervised Learning
10.7 Conclusion
10.8 Exercises

Chapter 11. Principles of Feature Learning

11.1 Introduction
11.2 Universal Approximators
11.3 Universal Approximation of Real Data
11.4 Naive Cross-Validation
11.5 Efficient Cross-Validation via Boosting
11.6 Efficient Cross-Validation via Regularization
11.7 Testing Data
11.8 Which Universal Approximator Works Best in Practice?
11.9 Bagging Cross-Validated Models
11.10 K-Fold Cross-Validation
11.11 When Feature Learning Fails
11.12 Conclusion
11.13 Exercises

Chapter 12. Kernel Methods

12.1 Introduction
12.2 Fixed-Shape Universal Approximators
12.3 The Kernel Trick
12.4 Kernels as Measures of Similarity
12.5 Optimization of Kernelized Models
12.6 Cross-Validating Kernelized Learners
12.7 Conclusion
12.8 Exercises

Chapter 13. Fully Connected Neural Networks

13.1 Introduction
13.2 Fully Connected Neural Networks
13.3 Activation Functions
13.4 The Backpropagation Algorithm
13.5 Optimization of Neural Network Models
13.6 Batch Normalization
13.7 Cross-Validation via Early Stopping
13.8 Conclusion
13.9 Exercises

Chapter 14. Tree-Based Learners

14.1 Introduction
14.2 From Stumps to Deep Trees
14.3 Regression Trees
14.4 Classification Trees
14.5 Gradient Boosting
14.6 Random Forests
14.7 Cross-Validation Techniques for Recursively Defined Trees
14.8 Conclusion
14.9 Exercises

Appendix A. Advanced First- and Second-Order Optimization Methods

A.1 Introduction
A.2 Momentum-Accelerated Gradient Descent
A.3 Normalized Gradient Descent
A.4 Advanced Gradient-Based Methods
A.5 Mini-Batch Optimization
A.6 Conservative Steplength Rules
A.7 Newton’s Method, Regularization, and Nonconvex Functions
A.8 Hessian-Free Methods

Appendix B. Derivatives and Automatic Differentiation

B.1 Introduction
B.2 The Derivative
B.3 Derivative Rules for Elementary Functions and Operations
B.4 The Gradient
B.5 The Computation Graph
B.6 The Forward Mode of Automatic Differentiation
B.7 The Reverse Mode of Automatic Differentiation
B.8 Higher-Order Derivatives
B.9 Taylor Series
B.10 Using the autograd Library

Appendix C. Linear Algebra

C.1 Introduction
C.2 Vectors and Vector Operations
C.3 Matrices and Matrix Operations
C.4 Eigenvalues and Eigenvectors
C.5 Vector and Matrix Norms

What is new in the second edition?

(Back to top)

The second edition of this text is a complete revision of our first endeavor, with virtually every chapter of the original rewritten from the ground up and eight new chapters of material added, doubling the size of the first edition. Topics from the first edition, from expositions on gradient descent to those on One-versusAll classification and Principal Component Analysis have been reworked and polished. A swath of new topics have been added throughout the text, from derivative-free optimization to weighted supervised learning, feature selection, nonlinear feature engineering, boosting-based cross-validation, and more. While heftier in size, the intent of our original attempt has remained unchanged: to explain machine learning, from first principles to practical implementation, in the simplest possible terms.

How to use the book?

(Back to top)

Example ”roadmaps” shown below provide suggested paths for navigating the text based on a variety of learning outcomes and university courses taught using the present book.

Recommended study roadmap for a course on the essentials of machine learning, including requisite chapters (left column), sections (middle column), and corresponding topics (right column). This essentials plan is suitable for time-constrained courses (in quarter-based programs and universities) or self-study, or where machine learning is not the sole focus but a key component of some broader course of study.

Recommended study roadmap for a full treatment of standard machine learning subjects, including chapters, sections, as well as corresponding topics to cover. This plan entails a more in-depth coverage of machine learning topics compared to the essentials roadmap given above, and is best suited for senior undergraduate/early graduate students in semester-based programs and passionate independent readers.

Recommended study roadmap for a course on mathematical optimization for machine learning and deep learning, including chapters, sections, as well as topics to cover.

Recommended study roadmap for an introductory portion of a course on deep learning, including chapters, sections, as well as topics to cover.

Technical prerequisites

(Back to top)

To make full use of the text one needs only a basic understanding of vector algebra (mathematical functions, vector arithmetic, etc.) and computer programming (for example, basic proficiency with a dynamically typed language like Python). We provide complete introductory treatments of other prerequisite topics including linear algebra, vector calculus, and automatic differentiation in the appendices of the text.

Coding exercises

(Back to top)

In the mlrefined_exercises directory you can find starting wrappers for coding exercises from the first and second editions of the text.

Slides and additional instructor resources

(Back to top)

Slides for the 2nd edition of the text are available in pptx, jupyter, and reveal.js formats. Slides for the 1st edition of the text are also available.

Instructors may request a copy of this text for examination from the publisher's website. Cambridge University Press can also provide you with the solution manual to both editions of the text.

Errata

(Back to top)

Here you can find a regularly updated errata sheet for the second edition of the text. Please report any typos, bugs, broken links, etc., in the Issues Section of this repository or by contacting us directly via email (see contact section for more info).

Get a copy of the book

(Back to top)

Free sample chapters in pdf format
From Cambridge University Press
From Amazon
From Barnes & Noble

Reviews and Endorsements

(Back to top)

An excellent book that treats the fundamentals of machine learning from basic principles to practical implementation. The book is suitable as a text for senior-level and first-year graduate courses in engineering and computer science. It is well organized and covers basic concepts and algorithms in mathematical optimization methods, linear learning, and nonlinear learning techniques. The book is nicely illustrated in multiple colors and contains numerous examples and coding exercises using Python.

John G. Proakis, University of California, San Diego

Some machine learning books cover only programming aspects, often relying on outdated software tools; some focus exclusively on neural networks; others, solely on theoretical foundations; and yet more books detail advanced topics for the specialist. This fully revised and expanded text provides a broad and accessible introduction to machine learning for engineering and computer science students. The presentation builds on first principles and geometric intuition, while offering real-world examples, commented implementations in Python, and computational exercises. I expect this book to become a key resource for students and researchers.

Osvaldo Simeone, King's College, London

This book is great for getting started in machine learning. It builds up the tools of the trade from first principles, provides lots of examples, and explains one thing at a time at a steady pace. The level of detail and runnable code show what's really going when we run a learning algorithm.

David Duvenaud, University of Toronto

This book covers various essential machine learning methods (e.g., regression, classification, clustering, dimensionality reduction, and deep learning) from a unified mathematical perspective of seeking the optimal model parameters that minimize a cost function. Every method is explained in a comprehensive, intuitive way, and mathematical understanding is aided and enhanced with many geometric illustrations and elegant Python implementations.

Kimiaki Sihrahama, Kindai University, Japan

Books featuring machine learning are many, but those which are simple, intuitive, and yet theoretical are extraordinary 'outliers'. This book is a fantastic and easy way to launch yourself into the exciting world of machine learning, grasp its core concepts, and code them up in Python or Matlab. It was my inspiring guide in preparing my 'Machine Learning Blinks' on my BASIRA YouTube channel for both undergraduate and graduate levels.

Islem Rekik, Director of the Brain And SIgnal Research and Analysis (BASIRA) Laboratory

Software installation and dependencies

(Back to top)

After cloning this reposi

鲜花

握手

雷人

路过

鸡蛋

该文章已有0人参与评论

请发表评论

全部评论

专题导读

More+

10-27 六六分期app的软件客服如何联系？(六六分期

11-06 可心卡盟:win10系统火狐flash插件崩溃怎么

11-06 亲亲特价:怎么删除回收站图标

11-06 济南大学虚拟社区:鲁大师节能降温的具体办

11-06 xlueops.exe:无线网络安装向导

11-06 女斗合众国:win7系统cf与主机连接不稳定怎

11-06 0xc000022-[cf烟雾头]cf怎么调烟雾头

11-06 qizideyouhuo:应用程序无法正常启动0xc0000

11-06 ipz-185:win7系统vcf文件怎么打开

11-06 傻哥蹦迪:win10系统s4怎么打开usb调试

11-06 八神浩树gtaste:回收站清空了怎么恢复

11-06 妖尾之黑色守护:win10系统电脑没有1440x900

11-06 校园至尊魔王小说:win7系统浏览网页时字体

11-06 女斗合众国:win10系统访问共享文件夹提示请

11-06 tokyo hot n0654:恢复win7系统默认字体一招

11-06 雨酷仙境:设置win7系统转移临时文件夹腾出

11-06 阿穆纳伊之杖:win7系统开始菜单在右边还原

11-06 tunespotting:win10系统火狐flash插件总是

11-06 甘尔葛分析师：计谋网站seo关键词暴涨有什

11-06 蔡贵霖: 计谋网站seo关键词暴涨有什么秘密

11-06 博益网首页:ao3网页版进入不了解决方法

11-06 漏斗子专栏: 网站数据分析小白易懂精华篇

11-06 见证双虹怎么做:win7系统开启telnet命令的

11-06 颾狐蝶蜋:系统资源不足无法完成请求的服务

11-06 国光中学校歌:提交网站到alexa查询详细步骤

11-06 西安有情天:静态网页和动态网页的区别

11-06 红木雅尚斋:外部链接构造对网站的好处

11-06 前官礼遇：防止域名劫持–增强域安全性的10

11-06 密传二转答案: 中文分词算法有哪些

11-06 金泉家园邮编:百度快照劫持的表现及应对方

aleju/imgaug: Image augmentation for machine learning experiments.发布时间：2022-08-18

shunliz/Machine-Learning: 机器学习原理发布时间：2022-08-18

剪的笔顺,诠释剪的笔画,认识剪的部首

1 六六分期app的软件客服如何联系？(六六分期

六六分期app的软件客服如何联系？不知道吗？加qq群【895510560】即可！标题：六六分期

阅读：19131|2023-10-27

2 可心卡盟:win10系统火狐flash插件崩溃怎么

今天小编告诉大家如何处理win10系统火狐flash插件总是崩溃的问题，可能很多用户都不知

阅读：9972|2022-11-06

3 亲亲特价:怎么删除回收站图标

今天小编告诉大家如何对win10系统删除桌面回收站图标进行设置，可能很多用户都不知道

阅读：8317|2022-11-06

4 济南大学虚拟社区:鲁大师节能降温的具体办

今天小编告诉大家如何对win10系统电脑设置节能降温的设置方法，想必大家都遇到过需要

阅读：8686|2022-11-06

5 xlueops.exe:无线网络安装向导

我们在使用xp系统的过程中,经常需要对xp系统无线网络安装向导设置进行设置，可能很多

阅读：8627|2022-11-06

6 女斗合众国:win7系统cf与主机连接不稳定怎

今天小编告诉大家如何处理win7系统玩cf老是与主机连接不稳定的问题，可能很多用户都不

阅读：9643|2022-11-06

7 0xc000022-[cf烟雾头]cf怎么调烟雾头

电脑对日常生活的重要性小编就不多说了，可是一旦碰到win7系统设置cf烟雾头的问题，很

阅读：8611|2022-11-06

8 qizideyouhuo:应用程序无法正常启动0xc0000

我们在日常使用电脑的时候，有的小伙伴们可能在打开应用的时候会遇见提示应用程序无法

阅读：7991|2022-11-06

9 ipz-185:win7系统vcf文件怎么打开

今天小编告诉大家如何对win7系统打开vcf文件进行设置，可能很多用户都不知道怎么对win

阅读：8642|2022-11-06

10 傻哥蹦迪:win10系统s4怎么打开usb调试

今天小编告诉大家如何对win10系统s4开启USB调试模式进行设置，可能很多用户都不知道怎

阅读：7527|2022-11-06

客服电话

电子邮件

jermwatt/machine_learning_refined: Notes, examples, and Python demos for the 2nd ...

开源软件名称（OpenSource Name）：

开源软件地址(OpenSource Url)：

开源编程语言(OpenSource Language)：

开源软件介绍(OpenSource Introduction)：

Machine Learning Refined: Notes, Exercises, and Jupyter notebooks

Table of Contents

A sampler of widgets and our pedagogy

Online notes

Chapter 1. Introduction to Machine Learning

Chapter 2. Zero-Order Optimization Techniques

Chapter 3. First-Order Optimization Techniques

Chapter 4. Second-Order Optimization Techniques

Chapter 5. Linear Regression

Chapter 6. Linear Two-Class Classification

Chapter 7. Linear Multi-Class Classification

Chapter 8. Linear Unsupervised Learning

Chapter 9. Feature Engineering and Selection

Chapter 10. Principles of Nonlinear Feature Engineering

Chapter 11. Principles of Feature Learning

Chapter 12. Kernel Methods

Chapter 13. Fully Connected Neural Networks

Chapter 14. Tree-Based Learners

Appendix A. Advanced First- and Second-Order Optimization Methods

Appendix B. Derivatives and Automatic Differentiation

Appendix C. Linear Algebra

What is new in the second edition?

How to use the book?

Recommended study roadmap for a course on mathematical optimization for machine learning and deep learning, including chapters, sections, as well as topics to cover.

Recommended study roadmap for an introductory portion of a course on deep learning, including chapters, sections, as well as topics to cover.

Technical prerequisites

Coding exercises

Slides and additional instructor resources

Errata

Get a copy of the book

Reviews and Endorsements

Software installation and dependencies

请发表评论

全部评论

上一篇：

下一篇：

CVE-2022-35628

kube-rs/kube-rs: Rust Kubernetes client

PacktPublishing/Python-Machine-Learning-

armancodv/building-energy-model-matlab:

鲁东大学一米网:Win7系统USB驱动器RAM的操

剪的笔顺,诠释剪的笔画,认识剪的部首

六六分期app的软件客服如何联系？(六六分期

florent37/ViewAnimator: A fluent Android

florent37/Shrine-MaterialDesign2: implem

CVE-2020-36276

SimpleSoftwareIO/simple-sms: Send and re

关于我们

产品与服务

解决方案

139-2527-9053