Papers
arxiv:2501.14787

Matrix Calculus (for Machine Learning and Beyond)

Published on Jan 7, 2025
Authors:
,
,

Abstract

Course covers differential calculus for general vector spaces, practical applications including optimization and machine learning, introduces reverse-mode differentiation, and provides an introduction to automatic differentiation techniques.

AI-generated summary

This course, intended for undergraduates familiar with elementary calculus and linear algebra, introduces the extension of differential calculus to functions on more general vector spaces, such as functions that take as input a matrix and return a matrix inverse or factorization, derivatives of ODE solutions, and even stochastic derivatives of random functions. It emphasizes practical computational applications, such as large-scale optimization and machine learning, where derivatives must be re-imagined in order to be propagated through complicated calculations. The class also discusses efficiency concerns leading to "adjoint" or "reverse-mode" differentiation (a.k.a. "backpropagation"), and gives a gentle introduction to modern automatic differentiation (AD) techniques.

Community

🎉 The notes are part of the MIT open course: “Matrix Calculus For Machine Learning And Beyond”

🎁 Course Link: https://ocw.mit.edu/courses/18-s096-matrix-calculus-for-machine-learning-and-beyond-january-iap-2023/

🍓Link to PDF: https://arxiv.org/pdf/2501.14787

IMG_5786.jpeg

Sign up or log in to comment

Get this paper in your agent:

hf papers read 2501.14787
Don't have the latest CLI?
curl -LsSf https://hf.co/cli/install.sh | bash

Models citing this paper 0

No model linking this paper

Cite arxiv.org/abs/2501.14787 in a model README.md to link it from this page.

Datasets citing this paper 1

Spaces citing this paper 0

No Space linking this paper

Cite arxiv.org/abs/2501.14787 in a Space README.md to link it from this page.

Collections including this paper 1