publications
publications by categories in reversed chronological order. generated by jekyll-scholar.
2026
-
DEXSIM: Real-Time Dexterous Simulation With Unified Causal Video DiffusionICLR 2026 Workshop World Models, 2026
2025
-
SIMSplat: Predictive Driving Scene Editing with Language-aligned 4D Gaussian SplattingarXiv preprint arXiv:2510.02469, 2025 -
GOATex: Geometry & Occlusion-Aware TexturingAdvances in Neural Information Processing Systems, 2025 - SAFARI: Sample-specific Assessment Framework for AI in Real-world Interactions2025
2024
- Vlaad: Vision and language assistant for autonomous drivingIn Proceedings of the IEEE/CVF winter conference on applications of computer vision, 2024
- Towards efficient visual-language alignment of the q-former for visual reasoning tasksIn Findings of the Association for Computational Linguistics: EMNLP 2024, 2024
- Merlin: Multimodal embedding refinement via llm-based iterative navigation for text-video retrieval-rerank pipelineIn Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing: Industry Track, 2024
2023
-
-
Parameter-efficient fine-tuning of instructblip for visual reasoning tasksIn NeurIPS 2023 Workshop on Efficient Natural Language and Speech Processing, 2023