- Heavy tails in Bayesian NN [reporter: Marat Khusainov]
Approximate Fisher Information Matrix to Characterise the Training of Deep Neural Networks [reporter: "your name"]
Hyperparameters: optimize, or integrate out? [reporter: "your name"]- Learning Approximately Objective Priors [reporter: Galina Boeva]
- A widely applicable Bayesian information criterion [reporter: "your name"]
The Description Length of Deep Learning Models[reporter: "Gavrilyuk Alexander"]
* Scalable marginal likelihood estimation for model selection in deep learning article [reporter: "your name"]
- Rényi divergence [reporter: Kseniia Petrushina]
- Variational dropout [reporter: Eduard Vladimirov]
- Alpha-divergence [reporter: Boeva Galina]
- Learning to Discover Sparse Graphical Models [reporter: "your name"]
- Probabilistic circuits [reporter: "Parviz Karimov"]
- Hypertransformer: Model generation for supervised and semi-supervised few-shot learning [reporter: "Timofey Chernikov"]
- OUTRAGEOUSLY LARGE NEURAL NETWORKS: THE SPARSELY-GATED MIXTURE-OF-EXPERTS LAYER [reporter: Kseniia Petrushina]
- Approximate Fisher Information Matrix to Characterise the Training of Deep Neural Networks [reporter: "your name"]
- Hyperparameters: optimize, or integrate out? [reporter: "your name"]
- A widely applicable Bayesian information criterion [reporter: "your name"]
- Scalable marginal likelihood estimation for model selection in deep learning article [reporter: Maksim Tyurikov]
- Dangers of Bayesian Model Averaging under Covariate Shift [reporter: "Timofey Chernikov"]
- Can You Trust Your Model’s Uncertainty? Evaluating Predictive Uncertainty Under Dataset Shift [reporter: "Dmitry Protasov"]
- Weighted Random Search for Hyperparameter Optimization [reporter: Boeva Galina]
- c-TPE: Tree-Structured Parzen Estimator with Inequality Constraints for Expensive Hyperparameter Optimization [reporter: "your name"]
- Bayesian Optimization with Gradients [reporter: "your name"]
- Forward and Reverse Gradient-Based Hyperparameter Optimization [reporter: Maksim Tyurikov]
- Approximate Fisher Information Matrix to Characterise the Training of Deep Neural Networks [reporter: "your name"]
- Hyperparameters: optimize, or integrate out? [reporter: "your name"]
- A widely applicable Bayesian information criterion [reporter: "your name"]
- c-TPE: Tree-Structured Parzen Estimator with Inequality Constraints for Expensive Hyperparameter Optimization [reporter: "your name"]
- Bayesian Optimization with Gradients [reporter: "your name"]
- Neural Architecture Search without Training article [reporter: Dmitry Protasov]
- Bayesnas: A bayesian approach for neural architecture search article [reporter: "your name"]
- Bananas: Bayesian optimization with neural architectures for neural architecture search article [reporter: Boeva Galina]
- AutoML-Zero: Evolving Machine Learning Algorithms From Scratch [reporter: Kseniia Petrushina]
- Proving the Lottery Ticket Hypothesis: Pruning is All You Need [reporter: "your name"]
- Generalized Inner Loop Meta-Learning [reporter: "your name"]
- HOW TO TRAIN YOUR MAML [reporter: Dmitry Protasov]
- Discovering Inductive Bias with Gibbs Priors: A Diagnostic Tool for Approximate Bayesian Inference [reporter: "your name"]
- Variational multi-task learning with gumbel-softmax priors [reporter: "your name"]
- Dreaming to Distill: Data-free Knowledge Transfer via DeepInversion [reporter: Boeva Galina]
- Learning to select data for transfer learning with bayesian optimization [reporter: "your name"]
- Knowledge transfer via dense cross-layer mutual-distillation [reporter: "your name"]
- Data augmentation in Bayesian neural networks and the cold posterior effect [reporter: "Marat Khusainov"]
- Evolutionary MCMC [reporter: "your name"]
- AN INDUCTIVE BIAS FOR DISTANCES: NEURAL NETS THAT RESPECT THE TRIANGLE INEQUALITY [reporter: "your name"]
- MsC: Siamese networks + prob. metric learning [reporter: "your name"]
- Neural operator search [reporter: "your name"]
- Super resolution neural operator [reporter: "your name"]
- Functional MOE [reporter: "your name"]
- Neural ensemble search via Bayesian sampling [reporter: Kseniia Petrushina]
- The Variational Gaussian Process [reporter: Dmitry Protasov]
- State Space Expectation Propagation: Efficient Inference Schemes for Temporal Gaussian Processes [reporter: "your name"]