Gokul M K

I'm a Dual Degree student in Robotics at the Indian Institute of Technology Madras. I work with Prof. Anuj Tiwari at the DiRO Lab on Multi-agent Quadruped Robots. I also collaborate with Prof. Arvind Easwaran at the CPS Research Group, NTU Singapore, where I focus on trustworthy learning agents for F1TENTH Autonomous Racing.

I worked as an AI Research Intern at Qneuro India Pvt Ltd, developing deep learning models for inferring EEG, PCG Signals. I interned at HiRO Lab, IISC Bangalore, where I focused on learning for bimanual robot manipulation. I was part of Team Anveshak, the student-run Mars Rover team at IIT Madras that competes in international rover competitions. I initially contributed as an Embedded Engineer before being promoted to Electronics and Software Lead.

Email  /  CV  /  LinkedIn  /  Github

profile photo

Research Interests

I explore Reinforcement Learning, Deep Learning, Generative AI, LLMs and Robotics. My research interests lie in developing intelligent, model-based, heirarchical controllers for robotic applications. In parallel, I explore advancing reasoning capabilities in LLMs and fine-tuning them using RLHF and SFT.

Projects

Residual Reinforcement Learning for F1TENTH Racing
NTU Global Connect Fellow (2025)
Gokul M K, Eduardo de Conto, Subrat Prasad Panda, Arvind Easwaran

Improving the baseline tracking controller with residual Reinforcement Learning. Generalization across various racing tracks, static and dynamic obstacles.

Poster

Model-Based Koopman control for Legged Locomotion in Uneven Terrains
ISRO
Gokul M K, Anandhakrishnan, Suraj Kumar (ISRO), Anuj Tiwari

Testing the convex-MPC framework for quadruped locomotion with various gaits. Improving the control pipeline thorugh a data-driven Koopman Approach.

Cooperative Payload Transport using Multiple Quadrupeds in Uneven Terrain
Final Year Project
Gokul M K, Anuj Tiwari

Formulating an heirarchical model-based controller for intelligent cooperative payload transport by a group of quadrupeds in varying terrains.

Reinforcement Learning for Dynamic Swarm Navigation
Inter-IIT Techmeet 13.0, PS Team Lead
Gokul M K

Implemented a Decentralized Training and Decentralized Execution (DTCE) approach for tackling swarm navigation in continously evolving environments.

Report | Code

Modulated Dynamical Systems for Coordinated Bimanual Manipulation
HiRO Lab, Robotics Summer Intern, RBCCPS IISC Bangalore
Gokul M K, Dr. Ravi Prakash

Implemented the research by LASA Lab, EPFL on modulated dynamical systems for Coordinated Bimanual Robotic Manipulation. Also worked on implementing Tossingbot by learning the residual physics for throwing.

Exploring Various Robotic Grasping Algorithms
e-Yantra Summer Intern, IIT Bombay
Gokul M K, Archit Jain, Jaison Jose, Ravikumar Chaurasia

Compared various learning-based, analytical grasping algorithms and benchmarked their perfomance in simulation and hardware. Formulated a light-weight grasping algorithm using Euclidean clustering.

Video

Course Projects

Implicit Reinforcement Learning without Interaction at Scale
DA7400, Recent Advances in Reinforcement Learning
Keerthivasan M, Gokul M K

Addressing sub-optimality and diversity challenges in Offline-RL trained with large datasets collected for long-horizon tasks. Employed a heirarchical framework and validated its performance.

Report | Code

Denoising and Deblurring MVTEC AD Dataset
EE5179, Deep Learning for Imaging
Gokul M K, Keerthivasan M

Optimized RIDNet, a deep learning model to denoise and deblur MVTEC dataset. It achieved a PSNR of 32.8 on the training examples and 34.78 on the test examples. Benchmarked the results with other architectures.

Report | Code

Comparative Study of SMDP and Intra-Option Learning in the Taxi Domain
CS6700, Introduction to Reinforcement Learning
Gokul M K, Keerthivasan M

Compared two option learning methods, SMDP and Intra Option Q learning in the taxi gym environment. Intra-Option made faster updates while an action is taken rather than waiting for the option to end.

Report | Code

Trajectory Continuous Optimal Planning for a Mobile Manipulator
ED5215, Introduction to Motion Planning
Gokul M K, Keerthivasan M

Continuously tracing a trajectory using RRT* while minimizing the deviation in the end-effector pose through an Optimal control formulation. This shares similarities with 3D printing task by mobile manipulators.

Report | Code | Videos

Competitions

e-Yantra Robotics Competition 22-23
IIT Bombay
Gokul M K, Nikhil S

Programmed a Mobile Manipulator to identify, pluck and place coloured bell peppers inside a greenhouse.

Code | Video


Website source code from jonbarron