Publications

publications by categories in reversed chronological order. generated by jekyll-scholar.

2025

  1. UR2N: Unified Retriever and ReraNker
    Riyaz Ahmad Bhat, Jaydeep Sen, Rudra Murthy, and 1 more author
    In Proceedings of the 31st International Conference on Computational Linguistics: Industry Track, 2025
  2. Granite Embedding Models
    Parul Awasthy, Aashka Trivedi, Yulong Li, and 8 more authors
    arXiv preprint arXiv:2502.20204, 2025

2024

  1. Towards understanding and mitigating the hallucinations in NLP and Speech
    Ashish Mittal, Rudra Murthy, Vishwajeet Kumar, and 1 more author
    In Proceedings of the 7th Joint International Conference on Data Science & Management of Data (11th ACM IKDD CODS and 29th COMAD), 2024
  2. PUB: A Pragmatics Understanding Benchmark for Assessing LLMs’ Pragmatics Capabilities
    Settaluri Lakshmi Sravanthi, Meet Doshi, Tankala Pavan Kalyan, and 3 more authors
    arXiv preprint arXiv:2401.07078, 2024
  3. Airavata: Introducing Hindi Instruction-tuned LLM
    Jay Gala, Thanmay Jayakumar, Jaavid Aktar Husain, and 8 more authors
    arXiv preprint arXiv:2401.15006, 2024
  4. Do LLMs understand Pragmatics? An Extensive Benchmark for Evaluating Pragmatic Understanding of LLMs
    Settaluri Lakshmi Sravanthi, Meet Doshi, Pavan Kalyan Tankala, and 2 more authors
    2024
  5. INDIC QA BENCHMARK: A Multilingual Benchmark to Evaluate Question Answering capability of LLMs for Indic Languages
    Abhishek Kumar Singh, Rudra Murthy, Jaydeep Sen, and 2 more authors
    arXiv preprint arXiv:2407.13522, 2024
  6. Mistral-SPLADE: LLMs for better Learned Sparse Retrieval
    Meet Doshi, Vishwajeet Kumar, Rudra Murthy, and 2 more authors
    arXiv preprint arXiv:2408.11119, 2024
  7. QUESTION GENERATION OVER TABLES AND TEXT
    Saneem Ahmed Chemmengath, Vishwajeet Kumar, Jaydeep Sen, and 1 more author
    Oct 2024
    US Patent App. 18/193,975
  8. Evaluating the Instruction-following Abilities of Language Models using Knowledge Tasks
    Rudra Murthy, Prince Kumar, Praveen Venkateswaran, and 1 more author
    arXiv preprint arXiv:2410.12972, Oct 2024
  9. MILU: A Multi-task Indic Language Understanding Benchmark
    Sshubam Verma, Mohammed Safi Ur Rahman Khan, Vishwajeet Kumar, and 2 more authors
    arXiv preprint arXiv:2411.02538, Oct 2024
  10. Benchmarking and Building Zero-Shot Hindi Retrieval Model with Hindi-BEIR and NLLB-E5
    Arkadeep Acharya, Rudra Murthy, Vishwajeet Kumar, and 1 more author
    arXiv preprint arXiv:2409.05401, Oct 2024
  11. SYSTEMS AND METHODS TO BUILD ONEQG: A UNIFIED QUESTION GENERATION SYSTEM ACROSS MODALITIES
    Vishwajeet Kumar, Jaydeep Sen, Saneem Ahmed Chemmengath, and 1 more author
    Nov 2024
    US Patent App. 18/317,703

2023

  1. Semi-Structured Object Sequence Encoders
    Rudra Murthy, Riyaz Bhat, Chulaka Gunasekara, and 5 more authors
    arXiv preprint arXiv:2301.01015, Nov 2023
  2. Denoising-based UNMT is more robust to word-order divergence than MASS-based UNMT
    Tamali Banerjee, Rudra Murthy, and Pushpak Bhattacharyya
    arXiv preprint arXiv:2303.01191, Nov 2023
  3. StarCoder: may the source be with you!
    Raymond Li, Loubna Ben Allal, Yangtian Zi, and 8 more authors
    arXiv preprint arXiv:2305.06161, Nov 2023
  4. Prompting with Pseudo-Code Instructions
    Mayank Mishra, Prince Kumar, Riyaz Bhat, and 3 more authors
    arXiv preprint arXiv:2305.11790, Nov 2023
  5. Towards Safer Communities: Detecting Aggression and Offensive Language in Code-Mixed Tweets to Combat Cyberbullying
    Nazia Nafis, Diptesh Kanojia, Naveen Saini, and 1 more author
    In The 7th Workshop on Online Abuse and Harms (WOAH), Nov 2023
  6. Modelling Political Aggression on Social Media Platforms
    Akash Rawat, Nazia Nafis, Dnyaneshwar Bhadane, and 2 more authors
    In Proceedings of the 13th Workshop on Computational Approaches to Subjectivity, Sentiment, & Social Media Analysis, Nov 2023
  7. A Study of Multilingual versus Meta-Learning for Language Model Pre-Training for Adaptation to Unseen Low Resource Languages
    Jyotsana Khatri, Rudra Murthy, Amar Prakash Azad, and 1 more author
    In Proceedings of Machine Translation Summit XIX, Vol. 1: Research Track, Nov 2023

2022

  1. Simple measures of bridging lexical divergence help unsupervised neural machine translation for low-resource languages
    Jyotsana Khatri, Rudra Murthy, Tamali Banerjee, and 1 more author
    Machine Translation, Nov 2022
  2. HiNER: A Large Hindi Named Entity Recognition Dataset
    Rudra Murthy, Pallab Bhattacharjee, Rahul Sharnagat, and 3 more authors
    arXiv preprint arXiv:2204.13743, Nov 2022
  3. Naamapadam: A Large-Scale Named Entity Annotated Data for Indic Languages
    Arnav Mhaske, Harshit Kedia, Sumanth Doddapaneni, and 4 more authors
    arXiv preprint arXiv:2212.10168, Nov 2022
  4. On Utilizing Constituent Language Resources to Improve Downstream Tasks in Hinglish
    Vishwajeet Kumar, Rudra Murthy, and Tejas Dhamecha
    In Findings of the Association for Computational Linguistics: EMNLP 2022, Nov 2022

2021

  1. Scrambled Translation Problem: A Problem of Denoising UNMT
    Tamali Banerjee, Rudra Murthy, and Pushpak Bhattacharyya
    In Proceedings of Machine Translation Summit XVIII: Research Track, Nov 2021
  2. Cognitively Aided Zero-Shot Automatic Essay Grading
    Sandeep Mathias, Rudra Murthy, Diptesh Kanojia, and 1 more author
    arXiv preprint arXiv:2102.11258, Nov 2021
  3. Crosslingual Embeddings are Essential in UNMT for Distant Languages: An English to IndoAryan Case Study
    Tamali Banerjee, Rudra Murthy, and Pushpak Bhattacharyya
    In Proceedings of Machine Translation Summit XVIII: Research Track, Nov 2021
  4. Role of Language Relatedness in Multilingual Fine-tuning of Language Models: A Case Study in Indo-Aryan Languages
    Tejas Indulal Dhamecha, Rudra Murthy, Samarth Bharadwaj, and 2 more authors
    In Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, Nov 2021
  5. Language Model Pretraining and Transfer Learning for Very Low Resource Languages
    Jyotsana Khatri, Rudra Murthy, and Pushpak Bhattacharyya
    In Proceedings of the Sixth Conference on Machine Translation, Nov 2021

2020

  1. A Study of Efficacy of Cross-lingual Word Embeddings for Indian Languages
    Jyotsana Khatri, Rudra Murthy, and Pushpak Bhattacharyya
    In Young Researchers’ Symposium, Proceedings of the 7th ACM IKDD CoDS and 25th COMAD, Nov 2020
  2. Happy Are Those Who Grade without Seeing: A Multi-Task Learning Approach to Grade Essays Using Gaze Behaviour
    Sandeep Mathias, Rudra Murthy, Diptesh Kanojia, and 2 more authors
    In Proceedings of the 1st Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 10th International Joint Conference on Natural Language Processing, Nov 2020
  3. Looking inside noun compounds: Unsupervised prepositional and free paraphrasing
    Girishkumar Ponkiya, Rudra Murthy, Pushpak Bhattacharyya, and 1 more author
    In Findings of the Association for Computational Linguistics: EMNLP 2020, Nov 2020

2019

  1. Addressing word-order Divergence in Multilingual Neural Machine Translation for extremely Low Resource Languages
    Rudra Murthy, Anoop Kunchukuttan, and Pushpak Bhattacharyya
    In 2019 Annual Conference of the North American Chapter of the Association for Computational Linguistics, Nov 2019

2018

  1. Judicious Selection of Training Data in Assisting Language for Multilingual Neural NER
    Rudra Murthy, Anoop Kunchukuttan, and Pushpak Bhattacharyya
    In Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), Nov 2018
  2. Improving NER Tagging Performance in Low-Resource Languages via Multilingual Learning
    Rudra Murthy, Mitesh M Khapra, and Pushpak Bhattacharyya
    ACM Transactions on Asian and Low-Resource Language Information Processing (TALLIP), Nov 2018

2017

  1. Identifying Raga Similarity in Hindustani Classical Music through Distributed Representation of Raga Names
    Joe Cheri Ross, Rudra Murthy, Kaustuv Kanti Ganguli, and 1 more author
    In Proceedings of the 13th International Symposium on CMMR, Nov 2017

2016

  1. A deep learning solution to Named Entity Recognition
    Rudra Murthy, and Pushpak Bhattacharyya
    In International Conference on Intelligent Text Processing and Computational Linguistics, Nov 2016

2015

  1. Unsupervised most frequent sense detection using word embeddings
    Sudha Bhingardive, Dhirendra Singh, Rudra Murthy, and 2 more authors
    In DENVER, Nov 2015
  2. Using Word Embeddings for Bilingual Unsupervised WSD
    Sudha Bhingardive, Dhirendra Singh, Rudra Murthy, and 1 more author
    In Proceedings of the 12th International Conference on Natural Language Processing, Nov 2015