Faculty
PhD Students
Abhirama Subramanyam Penamakuri
(PhD Thesis Submitted)
Multimodal Deep Learning, Knowledge-intensive visual tasks
| Research Area | |
|---|---|
| MTech | |
| Arnav Sharma | Audio-visual Event localization |
| Prateek | Indian language scene text understanding |
| Avi Bhandari | VLMs |
| Vishal Dangiwala | Video Understanding |
| BTech | |
| Harshiv Shah | Scene Text Understanding |
| Aditya Rathor | Scene Text Understanding |
| RA | |
| Uday Agarwal | Video Understanding |
| K. Lokesh | Historical Document Image Analysis, AI for Healthcare |
| Dikshant Sharma | Computer Vision |
| Alik Sarkar | Video Understanding | Intern |
| Rongali Balaji (NIT Surat) | Massively Multilingual Word Restoration and Inpainting |
| Sagar Premani (NIT Jaipur) | Massively Multilingual Word Restoration and Inpainting |
Alumni
| Research Area | First Employment | |
|---|---|---|
| MTech | ||
| Pravin Kumar | Indian language scene text understanding | |
| Ritu Singh | VLMs for Document Images | Pradhi AI |
| Mohit Sharma | Historical Manuscript Restoration | Applied Materials |
| Kranti Prakash | Video Understanding | Computer Vision Engineer @WESEE (Indian Navy) |
| Apoorv Shekhar | Visual Relationships | TuningBill |
| Dhriti Prasanna Paul | Computer Vision | Rakuten Mobile Japan |
| Deepti Gupta | Indic Scene Text Detection | Spanidea |
| Gaurav Pilankar | - | CDAC Pune |
| Pratik Vilasrao Somwanshi | - | Spanidea Systems |
| Kena Hemnani | - | Valeo India |
| Megha kumari | Math-based QA | CDAC Mohali |
| Rahul Kumar Chaudhary | - | Spanidea |
| Parsa Revanth | Multimodal Knowledge Graphs | - |
| Stuti Pathak | Flow2code | - |
| Rati Kumari | - | - |
| Ambikesh Kumar Singh | - | - |
| Btech | ||
| Suyash Maniyar | Document Understanding | MS at UMASS |
| Navlika Singh | Small VLMs | MSc at Imperial College London |
| Piyush Arora | Small VLMs | MSc at Imperial College London |
| Arvind Kumar Sharma | Visual Translation | Raapid AI |
| Shreyas Vaidya | Visual Translation | LTImindtree |
| Nakul Sharma | Graphical Elements Interpretation, LLMs | SpreeAI |
| Shreya Shukla | Graphical Elements Interpretation, LLMs, Code Generation | Mercedes Benz R&D |
| Abu Shahid | Video Understanding, Handwritten Recognition | Decimal Point Analytics |
| Maniyar Suyash | Document AI | Decimal Point Analytics |
| Ayush Anand | Object Detection, Document Intelligence | - |
| Mayank Maheshwari | Visual Relationships | Wadhawani AI |
| Vaibhav Mishra | - | Jio Platforms |
| RA | ||
| Prajwal Gatti | - | Ph.D. at University of Bristol |
| Revant Teotia | - | MS at Columbia University → PhD student at NYU |