Hi, I'm

Rugved Katole

PhD Candidate in Computer Science

Pioneering techniques in world foundation models and Vision-Language-Action systems to tackle data scarcity in robotics. Building autonomous systems that make real-world impact—from precision agriculture to wildlife conservation.

What I've Built

Pioneering work in world foundation models, Vision-Language-Action systems, and robotics—from synthetic data generation to autonomous field deployments.

Accelerative Synthetic Data Generation

Intelligent diffusion-based filtering to detect and remove inauthentic synthetic videos 9× faster, optimizing workflows for scarce datasets with early exit diffusion pipelines.

Diffusion ModelsPyTorchSynthetic DataOptimization
View Project →

Physics-based Digital Twin for Wildlife Monitoring

Photorealistic NVIDIA Omniverse simulation with generative animal behaviors, herd dynamics, and drone testing for ecological research—reducing field deployment costs.

OmniverseDigital TwinsPhysics SimulationWildlife AI
View Project →

World Foundation Models + VLA Integration

Augmented real-world video datasets for Vision-Language-Action training, addressing data scarcity with minimal trajectories for efficient robotic policy learning.

VLA ModelsWorld ModelsFoundation ModelsRobotics
View Project →

Let's Talk

I'm always interested in discussing research collaborations, speaking opportunities, or consulting on AI and robotics projects. Best way to reach me is through Email or LinkedIn.