• Visual Text Matters: Improving Text-KVQA with Visual Text Entity Knowledge-aware Large Multimodal Assistant,
    Abhirama Subramanyam Penamakuri, Anand Mishra.
    EMNLP 2024.(NEW)
  • [Paper] [Project Page]

  • Show Me the World in My Language: Establishing the First Baseline for Scene-Text to Scene-Text Translation,
    Shreyas Vaidya*, Arvind Kumar Sharma*, Prajwal Gatti, Anand Mishra. (*: equal contribution)
    ICPR 2024.(NEW)
  • [Paper] [Project Page] [Code]

  • Sketch-guided Image Inpainting with Partial Discrete Diffusion Process,
    Nakul Sharma, Aditay Tripathi, Anirban Chakraborty, Anand Mishra
    CVPR Workshop 2024.(NEW)
  • [Paper] [Code]

  • QDETRv: Query-Guided DETR for One-Shot Object Localization in Videos,
    Yogesh Kumar, Saswat Mallick, Anand Mishra, Sowmya Rasipuram, Anutosh Maitra, Roshni Ramnani
    AAAI 2024.(NEW)
  • [Paper] [Code]

  • Composite Sketch+Text Queries for Retrieving Objects with Elusive Names and Complex Interactions,
    Prajwal Gatti, Kshitij Parikh, Dhriti Paul, Manish Gupta, Anand Mishra.
    AAAI 2024.(NEW)
  • [Paper] [Code]

  • Answer Mining from a Pool of Images: Towards Retrieval-Based Visual Question Answering, (NEW)
    Abhirama Subramanyam Penamakuri, Anand Mishra, Manish Gupta, Mithun Das Gupta,
    IJCAI 2023.
    [Paper][Project Page][Code]

  • Towards Making Flowchart Images Machine Interpretable, (NEW)
    Shreya Shukla, Prajwal Gatti, Yogesh Kumar, Vikash Yadav, Anand Mishra,
    ICDAR 2023.
    [Paper][Project Page][Code]

  • Few-Shot Referring Relationships in Videos, (NEW)
    Yogesh Kumar, Anand Mishra,
    CVPR 2023.
    [Paper][Project Page][Code]

  • Contrastive Multi-View Textual-Visual Encoding: Towards One Hundred Thousand-Scale One-Shot Logo Identification,
    Nakul Sharma, Abhirama Subramanyam Penamakuri, Anand Mishra,
    ICVGIP 2022.
    [Paper][Project Page][Code]

  • VISTOT: Vision-Augmented Table-to-Text Generation,
    Prajwal Gatti, Anand Mishra, Manish Gupta, Mithun Das Gupta,
    EMNLP 2022.
    [Paper][Project Page][Code]

  • COFAR: Commonsense and Factual Reasoning in Image Search
    Prajwal Gatti, Abhirama Subramanyam Penamakuri, Revant Teotia, Anand Mishra, Shubhashis Sengupta, Roshni Ramnani
    AACL-IJCNLP 2022.
    [Paper][Project Page][Code]

  • Few-shot Visual Relationship Co-localization ,
    Revant Teotia*, Vaibhav Mishra*, Mayank Maheshwari*, Anand Mishra,
    ICCV 2021.
    [Paper][Project Page][Code] (*: equal contribution)

  • Look, Attend and Ask: Learning to Ask Questions by Reading Text in Images,
    Soumya Jahagirdar, Shankar Gangisetty, Anand Mishra
    ICDAR 2021 .
    [Paper]

  • Sketch-Guided Object Localization in Natural Images,
    Aditay Tripathi, Rajath R. Dani, Anand Mishra, Anirban Chakraborty
    ECCV 2020 (Spotlight Presentation).
    [Paper] [bibtex] [Project page][Code] [Know the paper in 90 seconds] [Know the paper in ten minutes]

  • From Strings to Things: Knowledge-enabled VQA model that can read and reason,
    Ajeet Kumar Singh, Anand Mishra, Shashank Shekhar, and Anirban Chakraborty
    ICCV 2019 (oral).
    [Paper] [bibtex] [Project page]