Computer Vision

Image and video understanding, geometry, and recognition.

Projects

Completed

Vision, Language and Visual Retrieval

Multimodal methods connecting images and language: large-scale visual retrieval, semantic art understanding, and image caption generation.

Computer Vision, Machine Learning

Completed

Self-Supervised Monocular Depth Estimation

Learning to predict depth from single images without ground-truth supervision, with a focus on dynamic scenes and challenging conditions.

Computer Vision, Machine Learning, Monocular Depth Estimation

Completed

3D Reconstruction and Multi-View Stereo

Recovering accurate 3D shape from collections of images, using multi-view stereo, volumetric graph-cuts, and probabilistic depth-map fusion.

3D Reconstruction, Computer Vision

Publications

BaseBoostDepth: Exploiting Larger Baselines For Self-supervised Monocular Depth Estimation. Kieran Saunders, Luis J. Manso and George Vogiatzis. arXiv (Cornell University) · 2024
Self-supervised Monocular Depth Estimation: Let’s Talk About The Weather. Kieran Saunders, George Vogiatzis and Luis J. Manso. arXiv (Cornell University) · 2023
Dyna-DM: Dynamic Object-aware Self-supervised Monocular Depth Maps. Kieran Saunders, George Vogiatzis and Luis J. Manso. 2023
Dyna-DM: Dynamic Object-aware Self-supervised Monocular Depth Maps. Kieran Saunders, George Vogiatzis and Luis J. Manso. arXiv (Cornell University) · 2022
Variational Recurrent Sequence-to-Sequence Retrieval for Stepwise Illustration. Vishwash Batra, Aparajita Haldar, Yulan He, Hakan Ferhatosmanoğlu, George Vogiatzis and Tanaya Guha. Lecture notes in computer science · 2020
Learning non-metric visual similarity for image retrieval. Noa García and George Vogiatzis. Image and Vision Computing · 2019
How to Read Paintings: Semantic Art Understanding with Multi-modal Retrieval. Noa García and George Vogiatzis. Lecture notes in computer science · 2019
A Deep Learning Approach to Automatic Caption Generation for News Images. Vishwash Batra, Yulan He and George Vogiatzis. Aston Publications Explorer (Aston University) · 2019
Asymmetric Spatio-Temporal Embeddings for Large-Scale Image-to-Video Retrieval. Noa García and George Vogiatzis. Aston Publications Explorer (Aston University) · 2018
Neural Caption Generation for News Images. Vishwash Batra, Yulan He and George Vogiatzis. 2018
How to Read Paintings: Semantic Art Understanding with Multi-Modal Retrieval. Noa García and George Vogiatzis. arXiv (Cornell University) · 2018
Dress Like a Star: Retrieving Fashion Products from Videos. Noa García and George Vogiatzis. 2017
Large-Scale Data for Multiple-View Stereopsis. Henrik Aanæs, Rasmus Ramsbøl Jensen, George Vogiatzis, Engin Tola and Anders Bjorholm Dahl. International Journal of Computer Vision · 2016
Our 3D Vision Data-Sets in the Making. Henrik Aanæs, Knut Conradsen, Alessandro Dal Corso, Anders Bjorholm Dahl, Alessio Del Bue, Mads Emil Brix Doest, Jeppe Revall Frisvad, Sebastian Hoppe Nesgaard Jensen, Jannik Boll Nielsen, Jonathan Dyssel Stets and George Vogiatzis. Technical University of Denmark, DTU Orbit (Technical University of Denmark, DTU) · 2015
Large Scale Multi-view Stereopsis Evaluation. Rasmus Ramsbøl Jensen, Anders Bjorholm Dahl, George Vogiatzis, Engin Tola and Henrik Aanæs. 2014
A Generative Model for Online Depth Fusion. Oliver J. Woodford and George Vogiatzis. Lecture notes in computer science · 2012
WITHDRAWN: Video-based, real-time multi-view stereo. George Vogiatzis and Carlos Hernández. Image and Vision Computing · 2012
Video-based, real-time multi-view stereo. George Vogiatzis and Carlos Hernández. Image and Vision Computing · 2011
Automatic Object Segmentation from Calibrated Images. Neill D. F. Campbell, George Vogiatzis, Carlos Hernández and Roberto Cipolla. 2011
Live 3D shape reconstruction, recognition and registration. Carlos Hernández, Frank Perbet, Minh-Tri Pham, George Vogiatzis, Oliver J. Woodford, Atsuto Maki, Björn Stenger and Roberto Cipolla. 2011
Shape from Photographs: A Multi-view Stereo Pipeline. Carlos Hernández and George Vogiatzis. Studies in computational intelligence · 2010
Practical 3D Reconstruction Based on Photometric Stereo. George Vogiatzis and Carlos Hernández. Studies in computational intelligence · 2010
Using Multiple Hypotheses to Improve Depth-Maps for Multi-View Stereo. Neill D. F. Campbell, George Vogiatzis, Carlos Hernández and Roberto Cipolla. Lecture notes in computer science · 2008
Automatic 3D object segmentation in multiple views using volumetric graph-cuts. Neill D. F. Campbell, George Vogiatzis, Carlos Hernández and Roberto Cipolla. Image and Vision Computing · 2008
Multiview Stereo via Volumetric Graph-Cuts and Occlusion Robust Photo-Consistency. George Vogiatzis, Carlos Hernández Esteban, Philip H. S. Torr and Roberto Cipolla. IEEE Transactions on Pattern Analysis and Machine Intelligence · 2007
Probabilistic visibility for multi-view stereo. Carlos Hernández, George Vogiatzis and Roberto Cipolla. 2007
Automatic 3D Object Segmentation in Multiple Views using Volumetric Graph-Cuts. Neill D. F. Campbell, George Vogiatzis, Carlos Hernández and Roberto Cipolla. 2007
Reconstructing relief surfaces. George Vogiatzis, Philip H. S. Torr, Steven M. Seitz and Roberto Cipolla. Image and Vision Computing · 2007
Lighting-Up Geometry: Accurate 3D Modelling of Museum Artifacts with a Torch and a Camera. George Vogiatzis, Carlos Hernández and Roberto Cipolla. Cambridge University Engineering Department Publications Database · 2006
Multi-View Stereo via Volumetric Graph-Cuts. George Vogiatzis, Philip H. S. Torr and Roberto Cipolla. 2005
Using frontier points to recover shape, reflectance and illumination. George Vogiatzis, Paolo Favaro and Roberto Cipolla. 2005
Reconstructing Relief Surfaces. George Vogiatzis, Philip H. S. Torr, Steven M. Seitz and Roberto Cipolla. 2004
Bayesian Stochastic Mesh Optimization for 3D reconstruction. George Vogiatzis, Philip H. S. Torr and Roberto Cipolla. 2003