Andreas Goulas

I am currently a PhD Student at Queen Mary University of London and a Research Assistant at the Information Technologies Institute (ITI), Centre for Research and Technology Hellas (CERTH).

I graduated with a Diploma in Electrical and Computer Engineering from the Aristotle University of Thessaloniki. My research focuses on Video Understanding with Multimodal LLMs.

Email: agoulas at iti dot gr

profile picture

Publications

For an up-to-date list of publications, visit my Google Scholar profile.

Conference and Workshop Papers

  1. N. Pantelidis, E. Kosmidou, D. Galanopoulos, D. Georgalis, S. Pasios, K. Apostolidis, A. Goulas, M. Pegia, G. Tsionkis, K. Gkountakos, G. Kouvrakis, A. Moumtizdou, I. Gialampoukidis, S. Vrochidis, V. Mezaris, I. Kompatsiaris. “VERGE in VBS 2026”. Proc. 32nd Int. Conf. on MultiMedia Modelling (MMM2026), Prague, CZ, Jan 2026.
  2. A. Goulas, D. Galanopoulos, I. Patras, V. Mezaris. “MLLM Frame Subset Ensembling for Audio-Visual Video QA and MLLM-based Reranking for Ad-hoc Video Search in TRECVID 2025”. Proc. TRECVID 2025 Workshop, Dec 2025.
  3. D. Galanopoulos, A. Goulas, V. Mezaris. “Cross-modal Image Recommendation for News Articles by Multimodal Foundation Models-based Retrieval-Reranking”. Proc. 2025 Multimedia Evaluation Workshop (MediaEval’25), Dublin, Ireland, Oct. 2025.
  4. A. Goulas, V. Mezaris, I. Patras. “VidCtx: Context-aware Video Question Answering with Image Models”. IEEE Int. Conf. on Multimedia and Expo (ICME 2025), Nantes, France, June-July 2025.
  5. D. Galanopoulos, A. Goulas, A. Leventakis, I. Patras, V. Mezaris. “An LLM Framework for Long-form Video Retrieval and Audio-Visual Question Answering Using Qwen2/2.5”. Proc. IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), Nashville, TN, USA, June 2025.
  6. A. Goulas, N. Malamas, A. L. Symeonidis. “A Methodology for Enabling NLP Capabilities on Edge and Low-Resource Devices”. Proc. Int. Conf. on Applications of Natural Language to Information Systems (NLDB), Valencia, Spain, June 2022.
  7. N. Gkalelis, A. Goulas, D. Galanopoulos, V. Mezaris. “ObjectGraphs: Using Objects and a Graph Convolutional Network for the Bottom-Up Recognition and Explanation of Events in Video”. Proc. IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), June 2021.

Recognition