Stochastic gradient descent (SGD)

Stochastic gradient descent is an iterative method for optimizing an objective function that replaces the full gradient with an estimate computed from a randomly selected subset of the data. This reduces the computational cost of each step in high-dimensional optimization problems, so iterations are faster, but the convergence rate is lower than with full-gradient methods. The method is widely used in machine learning.
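As a rough illustration of the idea, the following sketch fits a line y = w*x + b by minibatch SGD on mean squared error. The function name, the toy data, and the hyperparameters (learning rate, batch size, number of epochs) are all assumptions chosen for this example, not part of any course material listed here.

```python
import random


def sgd_linear_regression(xs, ys, lr=0.05, epochs=500, batch_size=4, seed=0):
    """Fit y ~ w*x + b with minibatch SGD on mean squared error.

    Hypothetical helper for illustration; hyperparameters are assumptions.
    """
    rng = random.Random(seed)
    w, b = 0.0, 0.0
    indices = list(range(len(xs)))
    for _ in range(epochs):
        # Visit the data in a fresh random order each epoch.
        rng.shuffle(indices)
        for start in range(0, len(indices), batch_size):
            batch = indices[start:start + batch_size]
            # Gradient estimate from this random subset only,
            # instead of the gradient over the full data set.
            grad_w = 0.0
            grad_b = 0.0
            for i in batch:
                err = (w * xs[i] + b) - ys[i]
                grad_w += 2.0 * err * xs[i]
                grad_b += 2.0 * err
            grad_w /= len(batch)
            grad_b /= len(batch)
            # Step against the estimated gradient.
            w -= lr * grad_w
            b -= lr * grad_b
    return w, b


# Toy, noise-free data generated from y = 3x + 1 (an assumption for the demo).
xs = [i / 10.0 for i in range(20)]
ys = [3.0 * x + 1.0 for x in xs]
w, b = sgd_linear_regression(xs, ys)
print(w, b)
```

Each update touches only batch_size points rather than all of them, which is where the cheaper iterations come from. On this toy data the fitted w and b should land close to 3 and 1.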

5 courses cover this concept

CS 294 - The Mathematics of Information and Data

UC Berkeley

Fall 2013

This course investigates the mathematical principles behind data and information analysis. It brings together concepts from statistics, optimization, and computer science, with a focus on large deviation inequalities and convex analysis. It is tailored to advanced graduate students who wish to incorporate these theories into their research.

+ 20 more concepts

CS 221 Artificial Intelligence: Principles and Techniques

Stanford University

Autumn 2022-2023

Stanford's CS 221 course teaches the foundational principles and practical implementation of AI systems. It covers machine learning, game playing, constraint satisfaction, graphical models, and logic. It is a rigorous course that requires solid foundational skills in programming, math, and probability.

+ 88 more concepts

11-785 Introduction to Deep Learning

Carnegie Mellon University

Spring 2020

This course provides a comprehensive introduction to deep learning, starting from foundational concepts and moving towards complex topics such as sequence-to-sequence models. Students gain hands-on experience with PyTorch and can fine-tune models through practical assignments. A basic understanding of calculus, linear algebra, and Python programming is required.

+ 40 more concepts

COS 324 - Introduction to Machine Learning

Princeton University

Fall 2017

This course is a thorough introduction to machine learning principles such as online learning, decision making, gradient-based learning, and empirical risk minimization. It also explores regression, classification, dimensionality reduction, ensemble methods, neural networks, and deep learning. The course material is self-contained and based on freely available resources.

+ 14 more concepts

CS231n: Deep Learning for Computer Vision

Stanford University

Spring 2022

This course is a deep dive into the details of deep learning architectures for visual recognition tasks. It equips students to implement and train their own neural networks and to understand state-of-the-art computer vision research. It requires Python proficiency and familiarity with calculus, linear algebra, probability, and statistics.

+ 55 more concepts