Visual Text Matters: Improving Text-KVQA with Visual Text Entity Knowledge-aware Large Multimodal Assistant,
Abhirama Subramanyam Penamakuri, Anand Mishra.
EMNLP 2024.(NEW)
[
Paper]
[
Project Page]
Show Me the World in My Language: Establishing the First Baseline for Scene-Text to Scene-Text Translation,
Shreyas Vaidya*, Arvind Kumar Sharma*, Prajwal Gatti, Anand Mishra. (*: equal contribution)
ICPR 2024.(NEW)
[
Paper]
[
Project Page]
[
Code]
Sketch-guided Image Inpainting with Partial Discrete Diffusion Process,
Nakul Sharma, Aditay Tripathi, Anirban Chakraborty, Anand Mishra
CVPR Workshop 2024.(NEW)
[
Paper]
[
Code]
QDETRv: Query-Guided DETR for One-Shot Object Localization in Videos,
Yogesh Kumar, Saswat Mallick, Anand Mishra, Sowmya Rasipuram, Anutosh Maitra, Roshni Ramnani
AAAI 2024.(NEW)
[
Paper]
[
Code]
Composite Sketch+Text Queries for Retrieving Objects with Elusive Names and Complex Interactions,
Prajwal Gatti, Kshitij Parikh, Dhriti Paul, Manish Gupta, Anand Mishra.
AAAI 2024.(NEW)
[
Paper]
[
Code]
Answer Mining from a Pool of Images: Towards Retrieval-Based Visual Question Answering,
(NEW)
Abhirama Subramanyam Penamakuri, Anand Mishra, Manish Gupta, Mithun Das Gupta,
IJCAI 2023.
[Paper][Project Page][Code]
Towards Making Flowchart Images Machine Interpretable,
(NEW)
Shreya Shukla, Prajwal Gatti, Yogesh Kumar, Vikash Yadav, Anand Mishra,
ICDAR 2023.
[Paper][Project Page][Code]
Few-Shot Referring Relationships in Videos,
(NEW)
Yogesh Kumar, Anand Mishra,
CVPR 2023.
[Paper][Project Page][Code]
Contrastive Multi-View Textual-Visual Encoding:
Towards One Hundred Thousand-Scale One-Shot Logo Identification,
Nakul Sharma, Abhirama Subramanyam Penamakuri, Anand Mishra,
ICVGIP 2022.
[Paper][Project Page][Code]
VISTOT: Vision-Augmented Table-to-Text Generation,
Prajwal Gatti, Anand Mishra, Manish Gupta, Mithun Das Gupta,
EMNLP 2022.
[Paper][Project Page][Code]
COFAR: Commonsense and Factual Reasoning in Image Search
Prajwal Gatti, Abhirama Subramanyam Penamakuri, Revant Teotia, Anand Mishra, Shubhashis Sengupta, Roshni Ramnani
AACL-IJCNLP 2022.
[Paper][Project Page][Code]
Few-shot Visual Relationship Co-localization
,
Revant Teotia*, Vaibhav Mishra*, Mayank Maheshwari*, Anand Mishra,
ICCV 2021.
[Paper][Project Page][Code]
(*: equal contribution)
Look, Attend and Ask: Learning to Ask Questions by Reading Text in Images,
Soumya Jahagirdar, Shankar Gangisetty, Anand Mishra
ICDAR 2021 .
[Paper]
Sketch-Guided Object Localization in Natural Images,
Aditay Tripathi, Rajath R. Dani, Anand Mishra, Anirban Chakraborty
ECCV 2020 (Spotlight Presentation).
[Paper] [bibtex]
[Project page][Code]
[Know the paper in 90 seconds] [Know the paper in ten minutes]
From Strings to Things: Knowledge-enabled VQA model that can read and reason,
Ajeet Kumar Singh, Anand Mishra, Shashank Shekhar, and Anirban Chakraborty
ICCV 2019 (oral).
[Paper] [bibtex]
[Project page]