publications

publications by categories in reversed chronological order. generated by jekyll-scholar.

2026

Aurelius: Relation Aware Text-to-Audio Generation At Scale

Yuhang He, He Liang, Yash Jain, Andrew Markham, and Vibhav Vineet

In International Conference on Learning Representations (ICLR), 2026

PDF

2025

Rnj-1: Building Instruments of Intelligence

Essential AI

2025

Model Release

HTML Base Instruct
Test-time Prompt Refinement for Text-to-Image Models

Mohammad Abdul Hafeez Khan^*, Yash Jain^*, Siddhartha Bhattacharyya, and Vibhav Vineet

In ICCV Workshop on Multimodal Algorithmic Reasoning (MARS2), 2025

PDF
RiTTA: Modeling Event Relations in Text-to-Audio Generation

Yuhang He, Yash Jain, Xubo Liu, Andrew Markham, and Vibhav Vineet

In Empirical Methods in Natural Language Processing (EMNLP), 2025

PDF
PLUM: Improving Inference Efficiency by Leveraging Repetition-Sparsity Trade-Off

Sachit Kuhar, Yash Jain, and Alexey Tumanov

Transactions on Machine Learning Research (TMLR), 2025

PDF
Local Prompt Optimization

Yash Jain, and Vishal Chowdhary

In NAACL 2025 (Main Conference), 2025
GeoMeter: Probing Geometric Perception of Large Visual-Language Models

Shehreen Azad, Yash Jain, Rishit Garg, Yogesh S Rawat, and Vibhav Vineet

In IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), 2025

PDF

2024

PEEKABOO: Interactive Video Generation via Masked-Diffusion

Yash Jain^*, Anshul Nasery^*, Vibhav Vineet, and Harkirat Behl

In IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2024

Awarded PDF Code

Invited Talk at 5th Large Scale Holistic Video Understanding Workshop
Multi-Stage Multi-Modal Pre-Training for Automatic Speech Recognition

Yash Jain, D. Chan, P. Dheram, A. Khare, O. Shonibare, and 2 more authors

In Joint International Conference on Computational Linguistics and Language Resources and Evaluation (LREC-COLING), 2024

PDF

2023

DAMEX: Dataset-aware Mixture-of-Experts for Visual Understanding of Mixture-of-Datasets

Yash Jain, Harkirat Behl, Zsolt Kira, and Vibhav Vineet

In Advances in Neural Information Processing Systems (NeurIPS), 2023

PDF Code
On the Utility of Virtual On-body Acceleration Data for Fine-grained Human Activity Recognition

Zikang Leng, Yash Jain, Hyeokhyen Kwon, and Thomas Ploetz

ACM International Symposium on Wearable Computers (ISWC), 2023

PDF

2022

Collossl: Collaborative Self-Supervised Learning for Human Activity Recognition

Yash Jain^*, Chi Ian Tang^*, Chulhong Min, Fahim Kawsar, and Akhil Mathur

Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies (UbiComp), 2022

PDF Code
On the Effectiveness of Virtual IMU Data for Eating Detection with Wrist Sensors

Yash Jain, Hyeokhyen Kwon, and Thomas Ploetz

In ACM International Symposium on Wearable Computers (ISWC), 2022

PDF
Integrating Transductive and Inductive Embeddings Improves Link Prediction Accuracy

Chitrank Gupta^*, Yash Jain^*, Abir De, and Soumen Chakrabarti

In Proceedings of the 31st ACM International Conference on Information and Knowledge Management (CIKM), 2022

PDF

2021

Group Supervised Learning: Extending Self-Supervised Learning to Multi-Device Settings

Yash Jain, Chi Ian Tang, Chulhong Min, Fahim Kawsar, and Akhil Mathur

In Workshop on Self-Supervised Learning for Reasoning and Perception at ICML, 2021

PDF

2020

RFID Tattoo: A Wireless Platform for Speech Recognition

Jingxian Wang, Chengfeng Pan, Haojian Jin, Vaibhav Singh, Yash Jain, and 3 more authors

ACM Interactive, Mobile, Wearable and Ubiquitous Technologies (UbiComp), 2020

Awarded PDF

Best Long Paper Award, IJCAI 2021 SC Best Papers