Hernán J. Maina

About

Hi! I’m Hernán 👋

I’m a Computer Scientist currently pursuing a Ph.D. at Facultad de Matemática, Astronomía, Física y Computación (FaMAF) · Universidad Nacional de Córdoba (UNC) · Argentina, with a doctoral fellowship from CONICET. I work at the intersection of computer vision and natural language processing. My research, under the supervision of Dra. Luciana Benotti^↗, focuses on understanding the challenges of multimodal vision-and-language models to help visually impaired people perform daily tasks and interact with their environment. In particular, I work with visual question answering (VQA) systems, and the detection and recognition of written text in images.

Before that, I completed a Master’s degree with a thesis on real-time recognition of urban bus lines, under the supervision of Dr. Jorge Sánchez^↗. This work applied computer vision techniques to improve public transport accessibility for blind and low-vision users.

While my academic background has shaped my problem-solving approach, my interests go beyond research. I’m passionate about developing technology with real-world impact — especially in accessibility and inclusive design — and I’m always open to new challenges and collaborations that push AI out of the lab and into people’s hands. I’m also an enthusiast of robotics and the Internet of Things (IoT), constantly exploring how these technologies can be integrated into intelligent systems that interact with the physical world.

Curious about what doing a Ph.D. in Computer Science is really like — and what it takes to get through it? Head over to the Maybe you’d like to know section. I’ve shared some milestones, side projects, and things I’ve learned along the way.

Publications

Here you can find a selection of my academic work, organized by year. This includes peer-reviewed articles, workshop papers, co-authored research, and academic work associated with degree programs (entries marked with 🎓 highlight milestones from my academic journey).

My work explores the intersection of computer vision, natural language processing, and accessibility — with a focus on how multimodal AI systems can support blind and low-vision individuals in their everyday lives. Some of my work has also examined social biases and stereotypes embedded in language models, and how these may impact the fairness and inclusiveness of AI systems.

2025 ⤴

ROSA: Addressing text understanding challenges in photographs via ROtated SAmpling.
Hernán Maina , Guido Ivetta , Mateo Lione Stuto , Julian Martin Eisenschlos , Jorge Sánchez , Luciana Benotti . arXiv.org. Preprint.
[PDF]
Low-resource domain adaptation while minimizing energy and hardware resource consumption.
Hernán Maina , Nicolás Wolovick , Luciana Benotti . arXiv.org. Preprint. A shorter version of this paper was accepted at the WiNLP 2023 Workshop (not publicly released).
[PDF]

2024 ⤴

CVQA: Culturally-diverse Multilingual Visual Question Answering Benchmark.
David Romero , Chenyang Lyu , Teresa Lynn , Injy Hamed , … Hernán Maina , … Alham Fikri Aji* . 38th Conference on Neural Information Processing Systems (NeurIPS 2024) Track on Datasets and Benchmarks.. Puplished.
[PDF]
Selectively Answering Visual Questions.
Julian Eisenschlos , Hernán Maina , Guido Ivetta , Luciana Benotti . Findings of the Association for Computational Linguistics: ACL 2024. Published.
[PDF]
Exploring stereotypes and biases in language technologies in Latin America.
Hernán Maina , Laura Alonso Alemany , Guido Ivetta , Mariela Rajngewerc , Beatriz Busaniche , Luciana Benotti . Communications of the Association for Computing Machinery (ACM). Published.
[PDF]
Detecting correct answers to open questions and its impact on language models' confidence scores.
Guido Ivetta , Hernán Maina , Luciana Benotti . Conference: LatinX in AI at North American Chapter of the Association for Computational Linguistics Conference 2024. Published.
[PDF]

2023 ⤴

Bias assessment for experts in discrimination, not in computer science.
Laura Alonso Alemany , Luciana Benotti , Hernán Maina , Lucía González , Lautaro Martínez , Beatriz Busaniche , Alexia Halvorsen , Amanda Mata Rojo , Mariela Rajngewerc . Proceedings of the First Workshop on Cross-Cultural Considerations in NLP (C3NLP). Published.
[PDF] [Code] [Demo]
A methodology to characterize bias and harmful stereotypes in natural language processing in Latin America.
Laura Alonso Alemany , Luciana Benotti , Hernán Maina , Lucía González , Mariela Rajngewerc , Lautaro Martínez , Jorge Sánchez , Mauro Schilman , Guido Ivetta , Alexia Halvorsen , Amanda Mata Rojo , Matías Bordone , Beatriz Busaniche . arXiv.org. Preprint.
[PDF] [Code] [Demo]

2022 ⤴

A tool to overcome technical barriers for bias assessment in human language technologies.
Laura Alonso Alemany , Luciana Benotti , Lucía González , Hernán Maina , Beatriz Busaniche , Alexia Halvorsen , Matías Bordone , Jorge Sánchez . arXiv.org. Preprint.
[PDF]
What kinds of errors do reference resolution models make and what can we learn from them?.
Jorge Sánchez , Mauricio Mazuecos , Hernán Maina , Luciana Benotti . Findings of the Association for Computational Linguistics: NAACL 2022. Published.
[PDF] [Video]
Automatic multi-modal processing of language and vision to assist people with visual impairments.
Hernán Maina , Luciana Benotti . North American Chapter of the Association for Computational Linguistics Conference: LatinX in AI (LXAI) Research Workshop 2022, Virtual. Published.
[PDF] [Slides] [Poster]
An interpretable representation of dialog history in referential visual dialog.
Mauricio Mazuecos , Franco Luque , Jorge Sánchez , Hernán Maina , Thomas Vadora , Luciana Benotti . North American Chapter of the Association for Computational Linguistics Conference: LatinX in AI (LXAI) Research Workshop 2022, Virtual. Published.
[PDF]

2021 ⤴

Region under Discussion for visual dialog.
Mauricio Mazuecos , Franco M. Luque , Jorge Sánchez , Hernán Maina , Thomas Vadora and Luciana Benotti . Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing. Published.
[PDF] [Video] [Code]

2020 ⤴

Stop the Bus!: Computer vision for automatic recognition of urban bus lines.
Maina Hernán J , Sánchez Jorge A . XXI Simposio Argentino de Inteligencia Artificial (ASAI 2020)-JAIIO 49 (Modalidad virtual). Published.
[PDF] [Slides] [Video] [Code]

2019 ⤴

(🎓 Master's Thesis) Visión artificial para el reconocimiento automático, en tiempo real, de líneas urbanas de autobuses.
Maina Hernán J , Sánchez Jorge A . Thesis work to obtain Master's Degree in Computer Science. Published.
[PDF] [Slides] [Code]

Maybe you'd like to know

Here are some highlights and milestones from my academic journey — including presentations, collaborations, courses, grants, and side projects that shaped my Ph.D. experience along the way.

December 2024
- 🎙️ Speaker at the Responsible Artificial Intelligence Workshop, presenting the talk “Accesibilidad en modelos de Visión & Lenguaje (VLMs)”. The event was part of a collaboration between FaMAF · UNC, the CALMS project (Imperial College London), and the HESEIA project (Fundación Vía Libre). [Slides]

May 2024
- ✍️ Wrote a blog post for the High Performance Computing Center (CCAD) at the UNC, titled “Modelos de lenguaje grandes en GPUs chicas”. [Post]

December 2023
- 🎓🧪 Final project for the “Automatic Processing of Situated Language in a Visual Environment” postgraduate course, taught by Dra. Luciana Benotti at FaMAF · UNC. Replicated results from prior research papers and extended them to address a specific problem involving grounded language understanding in visual scenes. [Report1] [Report2]

November 2023
- 🎓🛠️ Workshop instructor (seminar accredited) for “Modelos de Lenguaje a tu Medida”, delivered during the 2nd Argentine NLP Workshop, Córdoba, Argentina.
  
  “This workshop, introduced key concepts in domain adaptation for Large Language Models (LLM). It covered distinctions between pre-training, fine-tuning, and domain adaptation, along with hands-on practice adapting BERT and GPT-2 models to Spanish and English datasets. Participants worked step-by-step using Colab notebooks and shared their results on Hugging Face platform”. [GitHub].

September 2023
- 🎙️🛠️ Workshop instructor at “Modelos de Lenguaje a tu Medida”, part of the 12th Regional Free Software Conference Córdoba, Argentina. [GitHub]

March 2023
- 🎙️ Poster Presentation at the Latin American Meeting In Artificial Intelligence – KHIPU 2023. [Poster]

December 2022
- 🏅 Received a Travel Grant to attend KHIPU 2023 in Montevideo, Uruguay.

August 2022
- 🧩 Joined the FAIR Project under the supervision of Dra. Laura Alonso Alemany and Dra. Luciana Benotti, developing tools to detect Biases in Large Language Models. Project led by Fundación Vía Libre, supported by Women at the Table, the A+ Alliance, EthicsTechLab (University of Notre Dame), and Heinrich Böll Foundation.

July 2022
- 🏅 Diversity & Inclusion (D&I) grant recipient to virtually attend to The 2022 Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL 2022).

December 2021
- 🎓🧪 Final project for the “Text Mining” postgraduate course, taught by Dra. Laura Alonso Alemany at FaMAF · UNC. Developed a classifier to generate new categories of visual questions using the VizWiz-VQA dataset. [Report] [GitHub]

November 2021
- 🎓 Seminar presenter of “VQA Systems Designed for People with Visual Impairments” Delivered at FaMAF · UNC, Argentina.
  
  “The seminar provided an overview of Vision-and-Language (V&L) systems and their relevance in accessibility contexts. It explored why the intersection of visual and linguistic modalities is essential for supporting people with visual impairments. A special focus was given to the VizWiz-VQA dataset—created by and for blind users—which presents unique challenges such as blurry or misframed images, lack of visual context, conversational questions, and unanswerable queries.”. [Slides]
- 🏅 Diversity & Inclusion (D&I) grant recipient to attend to The 2021 Conference on Empirical Methods in Natural Language Processing (EMNLP 2021).
- 🎙️ CEIMAF Talk — “Inteligencia artificial que habla de lo que ve”. Presented during Computing Day, part of Science Month celebrations. [Video]

July 2021
- 🧪 Completed the Aprendizaje Automático con Datos Escasos course, taught by Jorge A. Sánchez, as part of the ECI – Escuela de Ciencias Informáticas at UBA. [Certification]

June 2021
- 🎓🧪 Final project for the “Parallel Computing” postgraduate course, taught by Nicolás Wolovick at FaMAF · UNC. Developed VizWiz-BERT, a domain-adapted BERT-based model for the VizWiz-VQA dataset, focusing on optimizing training performance through multi-GPU parallelization. [Report]

April 2021
- 🏅 Diversity & Inclusion (D&I) grant recipient for attend to The 16th Conference of the European Chapter of the Association for Computational Linguistics (EACL 2021)
- 🏅 Awarded a Doctoral Fellowship by CONICET

December 2020
- 🎓🧪 Final project for the “Neural Networks” postgraduate course, taught by Dr. Francisco Tamarit at FaMAF · UNC. Training of Conditional Deep Convolutional Generative Adversarial Networks (cDCGANs) model to automatic image colorization. [Report]
- 🧪 Completed the Build Basic GANs course on Coursera. [Certification]

September 2020
- 🧪 Completed the Natural Language Processing with Classification and Vector Spaces course on Coursera. [Certification]