VL2G @ IITJ - People

Faculty

Anand Mishra

Computer Vision, Language and Knowledge Graphs

PhD Students

Yogesh Kumar

(PhD Thesis Submitted)

Deep Learning, Video Understanding

Neelu Verma

Chart Image Parsing and ChartQA

Gyan Prabhat

Deep Learning, Computer Vision, Handwritten Image Analysis

Meenal Joshi

Deep Learning, Computer Vision

Anik De

Document Intelligence, Indian Language Scene Text Understanding

	Research Area
MTech
Arnav Sharma	Audio-visual Event localization
Prateek	Indian language scene text understanding
Avi Bhandari	VLMs
Vishal Dangiwala	Video Understanding
BTech
Aditya Rathor	Scene Text Understanding
Akshat Jain	Reliable VLMs
Mehta Jay Kamalkumar	Reliable VLMs
Patel Darsh	Reliable VLMs
RA
K. Lokesh	Historical Document Image Analysis, AI for Healthcare
Dikshant Sharma	Computer Vision
Alik Sarkar	Video Understanding
Intern
Rongali Balaji (NIT Surat)	Massively Multilingual Word Restoration and Inpainting
Sagar Premani (NIT Jaipur)	Massively Multilingual Word Restoration and Inpainting

Alumni

	Research Area	First Employment
PHD
Abhirama Subramanyam Penamakuri	Multimodal Deep Learning, Knowledge-intensive visual tasks	PostDoctoral Researcher in MBZUAI
MTech
Pravin Kumar	Indian language scene text understanding
Ritu Singh	VLMs for Document Images	Pradhi AI
Mohit Sharma	Historical Manuscript Restoration	Applied Materials
Kranti Prakash	Video Understanding	Computer Vision Engineer @WESEE (Indian Navy)
Apoorv Shekhar	Visual Relationships	TuningBill
Dhriti Prasanna Paul	Computer Vision	Rakuten Mobile Japan
Deepti Gupta	Indic Scene Text Detection	Spanidea
Gaurav Pilankar	-	CDAC Pune
Pratik Vilasrao Somwanshi	-	Spanidea Systems
Kena Hemnani	-	Valeo India
Megha kumari	Math-based QA	CDAC Mohali
Rahul Kumar Chaudhary	-	Spanidea
Parsa Revanth	Multimodal Knowledge Graphs	-
Stuti Pathak	Flow2code	-
Rati Kumari	-	-
Ambikesh Kumar Singh	-	-
Arnav Sharma	Audio-visual Event localization	Microsoft
Avi Bhandari	VLMs	Lightstorm Telecom
Prateek	Indian language scene text understanding	[x]cube LABS, Hyderabad
Vishal Dangiwala	Video Understanding	Accenture
Btech
Suyash Maniyar	Document Understanding	MS at UMASS
Navlika Singh	Small VLMs	MSc at Imperial College London
Piyush Arora	Small VLMs	MSc at Imperial College London
Arvind Kumar Sharma	Visual Translation	Raapid AI
Shreyas Vaidya	Visual Translation	LTImindtree
Nakul Sharma	Graphical Elements Interpretation, LLMs	SpreeAI
Shreya Shukla	Graphical Elements Interpretation, LLMs, Code Generation	Mercedes Benz R&D
Abu Shahid	Video Understanding, Handwritten Recognition	Decimal Point Analytics
Maniyar Suyash	Document AI	Decimal Point Analytics
Ayush Anand	Object Detection, Document Intelligence	-
Mayank Maheshwari	Visual Relationships	Wadhawani AI
Vaibhav Mishra	-	Jio Platforms
Aditya Rathore	Scene Text Understanding	Samsung Electronics
RA
Uday Agarwal	Video Understanding	Visiting Scholar at CVML, National University of Singapore.
Prajwal Gatti	-	Ph.D. at University of Bristol
Revant Teotia	-	MS at Columbia University → PhD student at NYU