IEEE.org logo.

Nonlinear Embedding Methods in Modern Data Visualization: Theory & Practice

Loading Events

Learning and representing low-dimensional structures from noisy and possibly high-dimensional data is an indispensable component of modern data science. Recently, a special class of nonlinear embedding methods has become particularly influential, most notably, the t-distributed stochastic neighbor embedding (t-SNE) and the uniform manifold approximation and projection (UMAP). Despite their empirical success in many research fields, these algorithms are oftentimes subject to criticisms such as lack of theoretical understanding, unclear interpretations, sensitivity to tuning parameters, etc.
This talk will present a novel theoretical framework for understanding and explaining the exceptional performance of t-SNE and other related algorithms for visualizing high-dimensional clustered data. The results uncover the intrinsic mechanism, the large-sample limits, and several fundamental principles behind the algorithms; they also have practical implications to improve the current nonlinear embedding methods in real-world applications, such as enabling efficient selection of tuning parameters, improving normativity of analytic praxis, and avoiding common interpretive pitfalls. Recognizing current limitations, it will also introduce some new approaches and ideas that may lead to more accountable and reliable dimension reduction and data visualization.
Join our talk to dive into data and explore the cutting-edge techniques of dimension reduction and data visualization, along with their practical applications.
Speaker(s): Rong Ma
Virtual: https://events.vtools.ieee.org/m/413700

Share This Story, Choose Your Platform!

Go to Top