Publications

(2024). Trust but Verify: Programmatic VLM Evaluation in the Wild. arXiv preprint arXiv:2410.13121 (2024).

PDF Dataset Explore Dataset

(2024). FaithEval: Can Your Language Model Stay Faithful to Context, Even If "The Moon is Made of Marshmallows". arXiv preprint arXiv:2410.03727 (2024).

PDF Dataset Code Blog

(2024). SFR-RAG: Towards Contextually Faithful LLMs. arXiv preprint arXiv:2409.09916 (2024).

PDF Contextual Benchmark Blog Press Press Press

(2024). xGen-VideoSyn-1: High-fidelity Text-to-Video Synthesis with Compressed Representations. AI4VA Workshop at ECCV24.

PDF Code

(2024). xGen-MM (BLIP-3): A Family of Open Large Multimodal Models. EVAL-FoMo Workshop at ECCV 2024.

PDF Models

(2023). Diffusion Model Alignment Using Direct Preference Optimization. Conference on Computer Vision and Pattern Recognition (CVPR) 2024 .

PDF Code

(2023). ConRad: Image Constrained Radiance Fields for 3D Generation from a Single Image. Advances in Neural Information Processing Systems 33 (NeurIPS 2023).

PDF Poster

(2021). The Functional Correspondence Problem. International Conference on Computer Vision (ICCV) 2021.

PDF Project Page

(2021). Audio-Visual Floorplan Reconstruction. International Conference on Computer Vision (ICCV) 2021.

PDF Code Talk (5min) Slides Poster

(2016). Pose from Action: Unsupervised Learning of Pose Features based on Motion. Workshop on Action and Anticipation for Visual Learning at ECCV 2016..

PDF Poster

(2016). Stochastic Multiple Choice Learning for Training Diverse Deep Ensembles. Advances in Neural Information Processing Systems (NIPS) 2016.

PDF Poster

(2015). Why M Heads are Better than One: Training a Diverse Ensemble of Deep Networks. arxiv preprint arXiv:1511.06314.

PDF

(2014). Combining the Best of Graphical Models and ConvNets for Semantic Segmentation. arxiv preprint arXiv:1412.4313.

PDF

(2013). Automatic Segmentation of Adipose Tissue from Thigh Magnetic Resonance Images. International Conference on Image Analysis and Recognition (ICIAR) 2013.

PDF