Pranshu Malviya

AI Research Scientist Intern at DRW PhD Candidate at MILA / Polytechnique Montreal

About

I build things at the intersection of optimization and continual learning. Currently an AI Research Scientist Intern at DRW in Montreal, and a PhD candidate at MILA / Polytechnique Montreal under Prof. Sarath Chandar.

My research asks how models can keep learning without forgetting, and how optimizers can navigate loss landscapes more effectively. Some highlights: Manifold Metric (CoLLAs 2025, Oral), Lookbehind-SAM (ICML 2024), Critical Momenta (TMLR 2024), and TAG (CoLLAs 2022).

Before Montreal, I completed my MS at IIT Madras working with Prof. Balaraman Ravindran and Prof. Chandar at RBCDSAI.

When not working: sketching, hiking, reading non-fiction, cricket, travelling (4/7 wonders so far).

Latest News

2026.03

New preprint: CoPeP on continual pretraining for protein language models

Benchmarking how protein language models handle continual pretraining -- with Darshan Patil, Mathieu Reymond, Quentin Fournier, and Sarath Chandar.

2025.09

Joined DRW as AI Research Scientist Intern

Working on AI/ML research at DRW in Montreal.

2025.05

Paper accepted at CoLLAs 2025 (Oral): Manifold Metric

A loss landscape approach for predicting model performance.

2024.09

Awarded PBEEE Doctoral Research Scholarship by FRQNT Quebec

Fonds de recherche du Québec — Nature et technologies doctoral scholarship.

2024.05

Paper accepted at ICML 2024: Lookbehind-SAM

k steps back, 1 step forward -- sharpness-aware minimization with lookbehind.

Research

Selected Publications

View the full list of my publications on Google Scholar

Manifold Metric: A Loss Landscape Approach for Predicting Model Performance

CoLLAs 2025 (Oral)

Using loss landscape geometry to predict model generalization without held-out data.

Authors: P. Malviya, J. Huang, A. Baratin, Q. Fournier, S. Chandar

Loss Landscape

Architectures

Paper

Lookbehind-SAM: k steps back, 1 step forward

ICML 2024

An efficient extension to Sharpness-Aware Minimization that leverages historical gradient information.

Authors: G. Mordido, P. Malviya, A. Baratin, S. Chandar

Optimization

SAM

Paper

arXiv

Github

Promoting Exploration in Memory-Augmented Adam using Critical Momenta

TMLR 2024

A memory-augmented optimizer that stores and retrieves critical momenta to promote exploration in the loss landscape.

Authors: P. Malviya, G. Mordido, A. Baratin, R. Babanezhad, J. Huang, S. Lacoste-Julien, R. Pascanu, S. Chandar

Optimization

OpenReview

arXiv

TAG: Task-based Accumulated Gradients for Lifelong Learning

CoLLAs 2022

A gradient accumulation method for continual learning that prevents catastrophic forgetting.

Authors: P. Malviya, B. Ravindran, S. Chandar

Continual Learning

Optimization

PMLR

arXiv

Github

An Introduction to Lifelong Supervised Learning

arXiv 2022

A comprehensive primer on lifelong/continual supervised learning — survey of the field covering task-incremental, class-incremental, and domain-incremental settings.

Authors: S. Sodhani, M. Faramarzi, S.V. Mehta, P. Malviya, M. Abdelsalam, J. Rajendran, S. Chandar

Continual Learning

Survey

Paper