Sourav Chakraborty

Sourav Chakraborty
Ph.D. Student · Theoretical Machine Learning · University of Colorado

"All models are wrong, but some are useful." – George Box

[Name pronounced either as Saw-rv or Show-oo-rob in Bengali]

I am a fourth-year Ph.D. student in Computer Science at the University of Colorado Boulder, where I am advised by Prof. Lijun Chen. My research lies at the intersection of theoretical machine learning and online learning, with a focus on algorithms for sequential decision-making under uncertainty.

My research focuses on the mathematical foundations of online learning and sequential decision-making, particularly in uncertain or evolving environments. I study problems where feedback is partial or delayed, and where learning must adapt to structural constraints such as network topology, temporal drift, or incentive compatibility.

Prior to my Ph.D., I completed a Master’s in Computer Science at the University of Colorado Boulder. I earned my bachelor's degree from the Birla Institute of Technology, Mesra in Ranchi, India, and worked as a software engineer at Flipkart in Bangalore before moving to the U.S.

Beyond research, I find myself drawn to cinema, literature, and music. I keep an archive of films I've watched here, and a list of books here. Lately, I’ve also begun sharing fragments of music and poetry here. I’ve also carried with me, since childhood, a quiet devotion to cricket. I played for a local club during school, then for the Flipkart team, and now I follow the game as an ever-watchful spectator. Although I no longer play, the game continues to ground me, a thread back to childhood and to the quieter parts of myself that persist beneath everything else.

Official Email • LinkedIn • CV • Scholar

Taken in Ouray, Colorado,
known as "The Switzerland of America".

Updates & News

2025-07: Paper with Amit, Lijun & Claire got accepted at IEEE CDC 2025!
2024-12: Cleared the Computer Science PhD Prelims (Area) Exam!
2024-03: Continuing as a CS Lead Teaching Assistant for the AY 2024-25.
2024-01: My paper with Lijun got accepted at IEEE ACC 2024!
2023-08: Guest lectured for Fall 2023 Advanced ML course on Bandit models.
2023-03: Elected as the student representive for the CS Graduate Committee.
2023-03: Re-appointed as a CS Lead Teaching Assistant for the AY 2023-24.
2022-08: Started my Ph.D journey at the University of Colorado Boulder in CS!
2022-05: Graduated with Master of Science in Computer Science!
2022-04: Successfully defended Master's thesis! (link/ slides)
2022-02: Got offer for Ph.D from CS @ Colorado for Fall 2022!
2019-08: Started Master of Science in Computer Science at Colorado!
2016-12: Joined Flipkart as a Software Engineer, in Bangalore, India.
2016-06: Recieved Bachelor's degree in engineering completed from BIT Mesra.

Awards & Honors

2025-04: Won the Outstanding Teaching Assistant Award from the CS department.
2025-04: Won the Outstanding Service Award from the CS department.
2024-04: Won the Outstanding Research Paper Award from the CS department.
2024-04: Won the Full Conference Travel Fellowship for ACC in Toronto, Canada 2024.
2024-04: Won the CU research Expo research poster annual award for the annual year 2023-24.
2024-03: Won the Publication Recognition Award for the annual year 2023-24.
2022-09: Won the Early Career Development Fellowship from the CS department.
2022-05: Won the Lloyd Botway Award for Outstanding Master's student for "outstanding academics, teaching, research and service to the department"!
2022-04: Won the CU research Expo research poster annual award for the annual year 2021-22.
2022-04: Selected as a Lead Teaching Assistant (department lead) for CS @ CU for the annual year.

Research

I work on theoretical machine learning, with a focus on online learning and sequential decision-making under uncertainty. Much of my research studies how structural constraints, like limited feedback, local movement, or evolving graph structure, shape the learnability of an environment.

I design algorithms that are simple, robust, and provably efficient, often in settings where actions are constrained and global information is unavailable. Recent work includes bandit learning on dynamic graphs and exploration strategies that adapt to feedback, movement, and temporal drift.

Incentivized Lipschitz Bandits.
Sourav Chakraborty*, Amit Kiran Rege*, Claire Monteleoni and Lijun Chen
Accepted at IEEE Conference on Decision and Control (CDC) 2025 in Rio de Janeiro, Brazil.
IEEE version / arXiv

Incentivized Exploration in Non-stationary Stochastic Bandits.
Sourav Chakraborty and Lijun Chen
Accepted at IEEE American Control Conference (ACC) 2024 in Toronto, Canada.
Master's Thesis, Committee: Lijun Chen, Raf Frongillo, Bo Waggoner.
IEEE version / arXiv / thesis

Teaching

Summer 2025: Teaching Assistant for CSCI 1300 - Starting Computing
Spring 2025: Teaching Assistant for CSCI 3104 - Algorithms
Fall 2024: Teaching Assistant for CSCI 3104 - Algorithms
Summer 2024: Teaching Assistant for CSCI 2270 - Data Structures.
Spring 2024: Teaching Assistant for CSCI 3104 - Algorithms
Fall 2023: Teaching Assistant for CSCI 2275 - Programming and Data Structures.
Spring 2023: Teaching Assistant for CSCI 2270 - Data Structures.
Fall 2022: Teaching Assistant for CSCI 2270 - Data Structures.
Spring 2022: Teaching Assistant for CSCI 2270 - Data Structures.
Fall 2021: Instructor for CSCI 1200 - Introduction to Computational Thinking.
Summer 2021: Teaching Assistant for CSCI 1300 - Starting Computing.
Spring 2021: Teaching Assistant for CSCI 1300 - Starting Computing.
Fall 2020: Teaching Assistant for CSCI 1300 - Starting Computing.
Summer 2020: Instructor for CSCI 3022 - Introduction to Data Science with Probability & Statistics.

Selected Personal Projects

*Alphabetically

	Inverse Reinforcement Learning via Maximum Entropy Formulation. Tuhina Tripathi, Alexa Reed, Sourav Chakraborty April , 2022 report / code / demo / interface-code Final project for ASEN 5519: Decision Making Under Uncertainty. This project explores the use of Inverse Reinforcement Learning, via Maximum Entropy Formulation, in a Markov Decision Process. The concepts explored in this project were demonstrated using a grid world environment.
	Incentivized Exploration for Multi-Armed Bandits under Reward Drift. Sourav Chakraborty September, 2020 original paper / code Just playing around with the paper by Liu & Wang et alon Incentivized Exploration for Multi-Armed Bandits under Reward Drift where the players receive compensation for exploring arms other than the greedy choice and may provide biased feedback on reward drift.
	Contextual vectorized representation of words: Soam word embeddings Amit Baran Roy, Sourav Chakraborty* May, 2020 report / code A word embedding model implementation based on the popular skipgram architecture. It involves alterations of the scoring algorithm to give more weightage to the context words that are closer to the target word in a skipgram sliding window.
	Solving Games using the combination of Q-learning and Regret Matching Methods Sourav Chakraborty, Nagarajan Shanmuganathan May, 2020 report / code It is known well that Counterfactual regret minimization (CFR) has been used in games which have both terminal states and perfect recall to minimize regret. This project aims to relax those constraints and use a local no-regret algorithm (LONR) by Kash et al, which internally uses a Q-learning like update rule to games which do not have terminal states or perfect recall.
	Occupancy Network based 3D Image Reconstruction using Single-Depth View Amit Baran Roy, Aparajita Singh, Sourav Chakraborty, Tanmai Gajula Feb-April*, 2020 report / code The complete 3D geometry of an object from a single 2.5D depth view was acquired by using deep learning techniques such as generative adversarial networks and 3D convolution neural networks. The resolution of the final 3D voxelized output was improved by transforming the voxel representation into another representation called occupancy networks.

Last updated: July 2025
Thanks Jon Barron!