An energy-minimization-based approach to scene text recognition that seamlessly integrates multiple cues.
Also applied to the challenging open-vocabulary setting, where no word-specific lexicon is available.
Comprehensive experimental evaluation on several benchmark datasets.