publications
publications by categories in reversed chronological order. generated by jekyll-scholar.
2026
- Aurelius: Relation Aware Text-to-Audio Generation At ScaleIn International Conference on Learning Representations (ICLR), 2026
2025
- Test-time Prompt Refinement for Text-to-Image ModelsIn ICCV Workshop on Multimodal Algorithmic Reasoning (MARS2), 2025
-
RiTTA: Modeling Event Relations in Text-to-Audio GenerationIn Empirical Methods in Natural Language Processing (EMNLP), 2025 -
PLUM: Improving Inference Efficiency by Leveraging Repetition-Sparsity Trade-OffTransactions on Machine Learning Research (TMLR), 2025 -
Local Prompt OptimizationIn NAACL 2025 (Main Conference), 2025 - GeoMeter: Probing Geometric Perception of Large Visual-Language ModelsIn IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), 2025
2024
-
Multi-Stage Multi-Modal Pre-Training for Automatic Speech RecognitionIn Joint International Conference on Computational Linguistics and Language Resources and Evaluation (LREC-COLING), 2024
2023
- On the Utility of Virtual On-body Acceleration Data for Fine-grained Human Activity RecognitionACM International Symposium on Wearable Computers (ISWC), 2023
2022
-
On the Effectiveness of Virtual IMU Data for Eating Detection with Wrist SensorsIn ACM International Symposium on Wearable Computers (ISWC), 2022 -
Integrating Transductive and Inductive Embeddings Improves Link Prediction AccuracyIn Proceedings of the 31st ACM International Conference on Information and Knowledge Management (CIKM), 2022
2021
- Group Supervised Learning: Extending Self-Supervised Learning to Multi-Device SettingsIn Workshop on Self-Supervised Learning for Reasoning and Perception at ICML, 2021