Statistics (Machine Learning) Papers (@StatsPapers) / X

Statistics (Machine Learning) Papers

58K posts

Statistics (Machine Learning) Papers

@StatsPapers

New Statistics (Machine Learning) papers from arxiv.org: supervised, unsupervised. Thank you to arXiv for use of its open access interoperability.

Worldwide

arxiv.org/list/stat.ML/n…

Joined April 2010

Following

6,899

Followers

Statistics (Machine Learning) Papers
@StatsPapers
Sep 17, 2024
Automated by @PremiumAccts
A Survey on Statistical Theory of Deep Learning: Approximation, Training Dynamics, and Generative Models. arxiv.org/abs/2401.07187
24K
Statistics (Machine Learning) Papers
@StatsPapers
Jun 17, 2025
Automated by @PremiumAccts
Random Matrix Theory for Deep Learning: Beyond Eigenvalues of Linear Models.
arxiv.org
Random Matrix Theory for Deep Learning: Beyond Eigenvalues of Linear Models
Modern Machine Learning (ML) and Deep Neural Networks (DNNs) often operate on high-dimensional data and rely on overparameterized models, where classical low-dimensional intuitions break down. In...
7.9K
Statistics (Machine Learning) Papers
@StatsPapers
Jul 18, 2019
Automated by @PremiumAccts
Potential Outcome and Directed Acyclic Graph Approaches to Causality: Relevance for Empirical Practice in Economics. arxiv.org/abs/1907.07271
Statistics (Machine Learning) Papers
@StatsPapers
Jul 22, 2025
Automated by @PremiumAccts
Statistical and Algorithmic Foundations of Reinforcement Learning. arxiv.org/abs/2507.14444
8.4K
Statistics (Machine Learning) Papers
@StatsPapers
May 21, 2024
Automated by @PremiumAccts
Can a Transformer Represent a Kalman Filter?.
arxiv.org
Can a Transformer Represent a Kalman Filter?
Transformers are a class of autoregressive deep learning architectures which have recently achieved state-of-the-art performance in various vision, language, and robotics tasks. We revisit the...
17K
Statistics (Machine Learning) Papers
@StatsPapers
Aug 19, 2025
Automated by @PremiumAccts
Universal Learning of Nonlinear Dynamics.
arxiv.org
Universal Learning of Nonlinear Dynamics
We study the fundamental problem of learning a marginally stable unknown nonlinear dynamical system. We describe an algorithm for this problem, based on the technique of spectral filtering, which...
5.7K
Statistics (Machine Learning) Papers
@StatsPapers
Jun 26, 2019
Automated by @PremiumAccts
Monte Carlo Gradient Estimation in Machine Learning. arxiv.org/abs/1906.10652
Statistics (Machine Learning) Papers
@StatsPapers
May 24, 2017
Automated by @PremiumAccts
The Marginal Value of Adaptive Gradient Methods in Machine Learning. arxiv.org/abs/1705.08292
Statistics (Machine Learning) Papers
@StatsPapers
Apr 26, 2025
Automated by @PremiumAccts
Linear Convergence of Diffusion Models Under the Manifold Hypothesis. arxiv.org/abs/2410.09046
4.8K
Statistics (Machine Learning) Papers
@StatsPapers
Jun 2, 2025
Automated by @PremiumAccts
A Mathematical Perspective On Contrastive Learning.
arxiv.org
A Mathematical Perspective On Contrastive Learning
Multimodal contrastive learning is a methodology for linking different data modalities; the canonical example is linking image and text data. The methodology is typically framed as the...
4.1K
Statistics (Machine Learning) Papers
@StatsPapers
Jun 3, 2025
Automated by @PremiumAccts
Riemannian Principal Component Analysis. arxiv.org/abs/2506.00226
4.4K
Statistics (Machine Learning) Papers
@StatsPapers
Jul 22, 2025
Automated by @PremiumAccts
Diffusion Models for Time Series Forecasting: A Survey. arxiv.org/abs/2507.14507
3.5K
Statistics (Machine Learning) Papers
@StatsPapers
Feb 15, 2025
Automated by @PremiumAccts
Regularization can make diffusion models more efficient. arxiv.org/abs/2502.09151
5.8K
Statistics (Machine Learning) Papers
@StatsPapers
Jun 6, 2024
Automated by @PremiumAccts
The No Free Lunch Theorem, Kolmogorov Complexity, and the Role of Inductive Biases in Machine Learning.
arxiv.org
The No Free Lunch Theorem, Kolmogorov Complexity, and the Role of...
No free lunch theorems for supervised learning state that no learner can solve all problems or that all learners achieve exactly the same accuracy on average over a uniform distribution on...
8.4K