publications

publications by categories in reversed chronological order. generated by jekyll-scholar.

2026

  1. Aurelius: Relation Aware Text-to-Audio Generation At Scale
    Yuhang He, He Liang, Yash Jain, Andrew Markham, and Vibhav Vineet
    In International Conference on Learning Representations (ICLR), 2026

2025

  1. rnj1.png
    Rnj-1: Building Instruments of Intelligence
    Essential AI
    2025
    Model Release
  2. Test-time Prompt Refinement for Text-to-Image Models
    Mohammad Abdul Hafeez Khan*Yash Jain*, Siddhartha Bhattacharyya, and Vibhav Vineet
    In ICCV Workshop on Multimodal Algorithmic Reasoning (MARS2), 2025
  3. ritta.png
    RiTTA: Modeling Event Relations in Text-to-Audio Generation
    Yuhang He, Yash Jain, Xubo Liu, Andrew Markham, and Vibhav Vineet
    In Empirical Methods in Natural Language Processing (EMNLP), 2025
  4. plum.png
    PLUM: Improving Inference Efficiency by Leveraging Repetition-Sparsity Trade-Off
    Sachit Kuhar, Yash Jain, and Alexey Tumanov
    Transactions on Machine Learning Research (TMLR), 2025
  5. lpo.png
    Local Prompt Optimization
    Yash Jain, and Vishal Chowdhary
    In NAACL 2025 (Main Conference), 2025
  6. GeoMeter: Probing Geometric Perception of Large Visual-Language Models
    Shehreen Azad, Yash Jain, Rishit Garg, Yogesh S Rawat, and Vibhav Vineet
    In IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), 2025

2024

  1. peekaboo.gif
    PEEKABOO: Interactive Video Generation via Masked-Diffusion
    Yash Jain*, Anshul Nasery*, Vibhav Vineet, and Harkirat Behl
    In IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2024
  2. 3m.jpg
    Multi-Stage Multi-Modal Pre-Training for Automatic Speech Recognition
    Yash Jain, D. Chan, P. Dheram, A. Khare, O. Shonibare, and 2 more authors
    In Joint International Conference on Computational Linguistics and Language Resources and Evaluation (LREC-COLING), 2024

2023

  1. damex.jpg
    DAMEX: Dataset-aware Mixture-of-Experts for Visual Understanding of Mixture-of-Datasets
    Yash Jain, Harkirat Behl, Zsolt Kira, and Vibhav Vineet
    In Advances in Neural Information Processing Systems (NeurIPS), 2023
  2. On the Utility of Virtual On-body Acceleration Data for Fine-grained Human Activity Recognition
    Zikang Leng, Yash Jain, Hyeokhyen Kwon, and Thomas Ploetz
    ACM International Symposium on Wearable Computers (ISWC), 2023

2022

  1. collossl.png
    Collossl: Collaborative Self-Supervised Learning for Human Activity Recognition
    Yash Jain*, Chi Ian Tang*, Chulhong Min, Fahim Kawsar, and Akhil Mathur
    Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies (UbiComp), 2022
  2. eating.png
    On the Effectiveness of Virtual IMU Data for Eating Detection with Wrist Sensors
    Yash Jain, Hyeokhyen Kwon, and Thomas Ploetz
    In ACM International Symposium on Wearable Computers (ISWC), 2022
  3. link_prediction.png
    Integrating Transductive and Inductive Embeddings Improves Link Prediction Accuracy
    Chitrank Gupta*Yash Jain*, Abir De, and Soumen Chakrabarti
    In Proceedings of the 31st ACM International Conference on Information and Knowledge Management (CIKM), 2022

2021

  1. Group Supervised Learning: Extending Self-Supervised Learning to Multi-Device Settings
    Yash Jain, Chi Ian Tang, Chulhong Min, Fahim Kawsar, and Akhil Mathur
    In Workshop on Self-Supervised Learning for Reasoning and Perception at ICML, 2021

2020

  1. rfid.png
    RFID Tattoo: A Wireless Platform for Speech Recognition
    Jingxian Wang, Chengfeng Pan, Haojian Jin, Vaibhav Singh, Yash Jain, and 3 more authors
    ACM Interactive, Mobile, Wearable and Ubiquitous Technologies (UbiComp), 2020