I am a Ph.D. student in Computer Science at Rice University advised by Professor Vicente Ordóñez Román, working on computer vision and language. I received a Master of Computer Science at the University of Virginia and my B.S. in Engineering at the Tecnológico de Costa Rica. Previously I spent 10 years working as a Software Engineer at different Tech Companies. Here is my CV and a Research Summary of my work during my graduate studies.
I am on the academic job market to start in Fall 2024. Please feel free to contact me.
![]() |
On the Transferability of Visual Features in Generalized Zero-Shot Learning. Paola Cascante-Bonilla, Leonid Karlinsky, James Seale Smith, Yanjun Qi, Vicente Ordonez. November 2022. [code] [bibtex] |
![]() |
Dense and Aligned Captions (DAC) Promote Compositional Reasoning in VL Models (spotlight). Sivan Doveh, Assaf Arbelle, Sivan Harary, Roei Herzig, Donghyun Kim, Paola Cascante-Bonilla, Amit Alfassy, Rameswar Panda, Raja Giryes, Rogerio Feris, Shimon Ullman, Leonid Karlinsky. Thirty-seventh Conference on Neural Information Processing Systems. NeurIPS 2023. New Orleans, Lousiana. December 2023. [arxiv] [bibtex] | |
![]() |
Going Beyond Nouns With Vision & Language Models Using Synthetic Data. Paola Cascante-Bonilla, Khaled Shehada, James Seale Smith, Sivan Doveh, Donghyun Kim, Rameswar Panda, Gül Varol, Aude Oliva, Vicente Ordonez, Rogerio Feris, Leonid Karlinsky. The 19th International Conference on Computer Vision. ICCV 2023. Paris, France. December 2023. [arxiv] [project page] [bibtex] | |
![]() |
ConStruct-VL: Data-Free Continual Structured VL Concepts Learning. James Seale Smith, Paola Cascante-Bonilla, Assaf Arbelle, Donghyun Kim, Rameswar Panda, David Cox, Diyi Yang, Zsolt Kira, Rogerio Feris, Leonid Karlinsky. 2023 Conference on Computer Vision and Pattern Recognition. CVPR 2023. Vancouver, Canada. June 2023. [arxiv] [bibtex] | |
![]() |
CODA-Prompt: COntinual Decomposed Attention-based Prompting for Rehearsal-Free Continual Learning. James Seale Smith, Leonid Karlinsky, Vyshnavi Gutta, Paola Cascante-Bonilla, Donghyun Kim, Assaf Arbelle, Rameswar Panda, Rogerio Feris, Zsolt Kira. 2023 Conference on Computer Vision and Pattern Recognition. CVPR 2023. Vancouver, Canada. June 2023. [arxiv] [bibtex] | |
![]() |
SimVQA: Exploring Simulated Environments for Visual Question Answering. Paola Cascante-Bonilla, Hui Wu, Letao Wang, Rogerio Feris, Vicente Ordonez. 2022 Conference on Computer Vision and Pattern Recognition. CVPR 2022. New Orleans, Lousiana. June 2022. [arxiv] [project page] [bibtex] |
|
![]() |
Evolving Image Compositions for Feature Representation Learning. Paola Cascante-Bonilla, Arshdeep Sekhon, Yanjun Qi, Vicente Ordonez. The 32nd British Machine Vision Conference. BMVC 2021. Virtual Conference. November 2021. [arxiv] [project page] [bibtex] |
|
![]() |
Curriculum Labeling: Revisiting Pseudo-Labeling for Semi-Supervised Learning. Paola Cascante-Bonilla, Fuwen Tan, Yanjun Qi, Vicente Ordonez. The 35th AAAI Conference on Artificial Intelligence. AAAI 2021. Virtual Conference. February 2021. [arxiv] [code] [bibtex] |
|
![]() |
Drill-down: Interactive Retrieval of Complex Scenes using Natural Language Queries. Fuwen Tan, Paola Cascante-Bonilla, Xiaoxiao Guo, Hui Wu, Song Feng, Vicente Ordonez. Conf. on Neural Information Processing Systems. NeurIPS 2019. Vancouver, Canada. December 2019. [arxiv] [code] [bibtex] |
|
Moviescope: Large-scale Analysis of Movies using Multiple Modalities. Paola Cascante-Bonilla, Kalpathy Sitaraman, Mengjia Luo, Vicente Ordonez. August 2019. [arxiv] [project page] [bibtex] Media coverage: techxplore article |
||
![]() |
Chat-crowd: A Dialog-based Platform for Visual Layout Composition. Paola Cascante-Bonilla, Xuwang Yin, Vicente Ordonez, Song Feng. North American Chapter of the Association for Computational Linguistics. NAACL 2019. System Demonstrations. Minneapolis, Minnesota. June 2019. [arxiv] [project page] [code] [bibtex] |