Pranshu Malviya

AI Research Scientist Intern at DRW PhD Candidate at MILA / Polytechnique Montreal

PM

About

I build things at the intersection of optimization and continual learning. Currently an AI Research Scientist Intern at DRW in Montreal, and a PhD candidate at MILA / Polytechnique Montreal under Prof. Sarath Chandar.

My research asks how models can keep learning without forgetting, and how optimizers can navigate loss landscapes more effectively. Some highlights: Manifold Metric (CoLLAs 2025, Oral), Lookbehind-SAM (ICML 2024), Critical Momenta (TMLR 2024), and TAG (CoLLAs 2022).

Before Montreal, I completed my MS at IIT Madras working with Prof. Balaraman Ravindran and Prof. Chandar at RBCDSAI.

When not working: sketching, hiking, reading non-fiction, cricket, travelling (4/7 wonders so far).

Latest News

2026.03

New preprint: CoPeP on continual pretraining for protein language models

Benchmarking how protein language models handle continual pretraining -- with Darshan Patil, Mathieu Reymond, Quentin Fournier, and Sarath Chandar.

2025.09

Joined DRW as AI Research Scientist Intern

Working on AI/ML research at DRW in Montreal.

2025.05

Paper accepted at CoLLAs 2025 (Oral): Manifold Metric

A loss landscape approach for predicting model performance.

2024.09

Awarded PBEEE Doctoral Research Scholarship by FRQNT Quebec

Fonds de recherche du Québec — Nature et technologies doctoral scholarship.

2024.05

Paper accepted at ICML 2024: Lookbehind-SAM

k steps back, 1 step forward -- sharpness-aware minimization with lookbehind.

Research

Selected Publications

View the full list of my publications on Google Scholar

Manifold Metric: A Loss Landscape Approach for Predicting Model Performance

Using loss landscape geometry to predict model generalization without held-out data.

Authors: P. Malviya, J. Huang, A. Baratin, Q. Fournier, S. Chandar

Loss Landscape
Architectures

Lookbehind-SAM: k steps back, 1 step forward

An efficient extension to Sharpness-Aware Minimization that leverages historical gradient information.

Authors: G. Mordido, P. Malviya, A. Baratin, S. Chandar

Optimization
SAM

Promoting Exploration in Memory-Augmented Adam using Critical Momenta

A memory-augmented optimizer that stores and retrieves critical momenta to promote exploration in the loss landscape.

Authors: P. Malviya, G. Mordido, A. Baratin, R. Babanezhad, J. Huang, S. Lacoste-Julien, R. Pascanu, S. Chandar

Optimization

TAG: Task-based Accumulated Gradients for Lifelong Learning

A gradient accumulation method for continual learning that prevents catastrophic forgetting.

Authors: P. Malviya, B. Ravindran, S. Chandar

Continual Learning
Optimization

An Introduction to Lifelong Supervised Learning

A comprehensive primer on lifelong/continual supervised learning — survey of the field covering task-incremental, class-incremental, and domain-incremental settings.

Authors: S. Sodhani, M. Faramarzi, S.V. Mehta, P. Malviya, M. Abdelsalam, J. Rajendran, S. Chandar

Continual Learning
Survey

A Causal Approach for Unfair Edge Prioritization and Discrimination Removal

Using causal inference to identify and remove discriminatory edges in decision systems.

Authors: P. Ravishankar, P. Malviya, B. Ravindran

Causal Inference
Fairness

Others

Preprints and workshop papers

CoPeP: Benchmarking Continual Pretraining for Protein Language Models

D. Patil, P. Malviya, M. Reymond, Q. Fournier, S. Chandar

(arXiv 2026)
Torque-Aware Momentum

P. Malviya, G. Mordido, A. Baratin, R. Babanezhad, G.K. Dziugaite, R. Pascanu, S. Chandar

(arXiv 2024)
Interpolate: How Resetting Active Neurons can also improve Generalizability in Online Learning

P. Malviya, D. Patil, M. Hashemzadeh, Q. Fournier, S. Chandar

(Under review 2025)
Experimental Design for Nonstationary Optimization

D. Patil, P. Malviya, M. Hashemzadeh, S. Chandar

(Under review 2025)

Service & Teaching

ReviewerICML · ICLR · CoLLAs · TMLR
Teaching assistant
  • Deep Learning Dynamics — Polytechnique Montreal
  • Machine Learning — Polytechnique Montreal
  • Reinforcement Learning — IIT Madras (NPTEL)
  • Practical ML with TensorFlow — IIT Madras (NPTEL)
  • Big Data Lab — IIT Madras
Volunteering

Earlier Work

Fast CSP and Learning-based Minesweeper Solver

AI solver using Constraint Satisfaction Problems, XGBoost, and Deep Q-Learning — achieving 92% win rate on standard boards.

C++
Python
XGBoost
Deep Q-Learning
CSP

Graphical Interpretation of Data using ArangoDB

A technical walkthrough of modeling and querying graph data with ArangoDB — from setup to traversal queries and visualization.

ArangoDB
Graph Databases
NoSQL
Contact

Get in Touch

Want to chat? Feel free to reach out via email.