Locality-sensitive hashing is an algorithmic technique used for data clustering and nearest neighbor search. It hashes similar input items into the same "buckets" while maximizing hash collisions, and can be seen as a way to reduce the dimensionality of high-dimensional data. It is one of two main categories of hashing methods, the other being locality-preserving hashing.
Stanford University
Spring 2023
This course focuses on data mining and machine learning algorithms for large scale data analysis. The emphasis is on parallel algorithms with tools like MapReduce and Spark. Topics include frequent itemsets, locality sensitive hashing, clustering, link analysis, and large-scale supervised machine learning. Familiarity with Java, Python, basic probability theory, linear algebra, and algorithmic analysis is required.
No concepts data
+ 17 more concepts