Faculty


Anand Mishra Anand Mishra

Computer Vision, Language and Knowledge Graphs

PhD Students


Abhirama

Abhirama Subramanyam Penamakuri

Multimodal Deep Learning, Knowledge-intensive visual tasks
Neelu Verma

Neelu Verma

Chart Image Parsing and ChartQA
Yogesh Kumar

Yogesh Kumar

Deep Learning, Video Understanding
Gyan Prabhat

Gyan Prabhat

Deep Learning, Computer Vision, Handwritten Image Analysis
Meenal Joshi

Meenal Joshi

Deep Learning, Computer Vision


d>
Research Area
MTech
Arnav Sharma Audio-visual Event localization
Prateek Indian language scene text understanding
Avi Bhandari VLMs
Vishal Dangiwala Video Understanding
BTech
Harshiv Shah Scene Text Understanding
Aditya Rathor Scene Text Understanding
RA
Anik De Document Intelligence, Indian Language Scene Text Understanding
Uday Agarwal Video Understanding
K. Lokesh Historical Document Image Analysis, AI for Healthcare
Dikshant Sharma Computer Vision
Intern
Rongali Balaji Svnit Massively Multilingual Word Restoration and Inpainting
Sagar Premani Massively Multilingual Word Restoration and Inpainting

Alumni


Research Area First Employment
MTech
Pravin Kumar Indian language scene text understanding
Ritu Singh VLMs for Document Images Pradhi AI
Mohit Sharma Historical Manuscript Restoration Applied Materials
Kranti Prakash Video Understanding Computer Vision Engineer @WESEE (Indian Navy)
Apoorv Shekhar Visual Relationships TuningBill
Dhriti Prasanna Paul Computer Vision Rakuten Mobile Japan
Deepti Gupta Indic Scene Text Detection Spanidea
Gaurav Pilankar - CDAC Pune
Pratik Vilasrao Somwanshi - Spanidea Systems
Kena Hemnani - Valeo India
Megha kumari Math-based QA CDAC Mohali
Rahul Kumar Chaudhary - Spanidea
Parsa Revanth Multimodal Knowledge Graphs -
Stuti Pathak Flow2code -
Rati Kumari - -
Ambikesh Kumar Singh - -
Btech
Arvind Kumar Sharma Visual Translation -
Shreyas Vaidya Visual Translation -
Nakul Sharma Graphical Elements Interpretation, LLMs SpreeAI
Shreya Shukla Graphical Elements Interpretation, LLMs, Code Generation Mercedes Benz R&D
Abu Shahid Video Understanding, Handwritten Recognition Decimal Point Analytics
Maniyar Suyash Document AI Decimal Point Analytics
Ayush Anand Object Detection, Document Intelligence -
Mayank Maheshwari Visual Relationships Wadhawani AI
Vaibhav Mishra - Jio Platforms
RA
Prajwal Gatti - Ph.D. at University of Bristol
Revant Teotia - MS at Columbia University → PhD student at NYU