We propose a novel method for classifying video text as graphics or scene text and as 2D or 3D text.
An iterative procedure is presented for identifying text candidates.
Stroke width and medial axis are used to distinguish graphics text from scene text.
Gradient directions and medial axis are combined to distinguish 2D text from 3D text.
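
To make the stroke-width cue concrete, the following is a minimal sketch of one way a stroke-width regularity feature could be computed from a binary text-candidate mask. It is not the method proposed here: the function names `run_length_widths` and `stroke_width_variation` are hypothetical, the per-pixel stroke width is approximated by the shorter of the horizontal and vertical foreground runs, and the medial axis is left out entirely. How such a feature relates to the graphics or scene label is for a classifier to decide and is not claimed by this sketch.

```python
from statistics import mean, pstdev


def run_length_widths(mask):
    """Approximate stroke width at every foreground pixel of a binary mask.

    Assumption for illustration: the width at a pixel is the shorter of the
    horizontal and vertical runs of foreground pixels passing through it.
    Values are returned in row-major order.
    """
    rows, cols = len(mask), len(mask[0])
    horiz = [[0] * cols for _ in range(rows)]
    vert = [[0] * cols for _ in range(rows)]

    # Label each foreground pixel with the length of its horizontal run.
    for r in range(rows):
        c = 0
        while c < cols:
            if mask[r][c]:
                start = c
                while c < cols and mask[r][c]:
                    c += 1
                for k in range(start, c):
                    horiz[r][k] = c - start
            else:
                c += 1

    # Label each foreground pixel with the length of its vertical run.
    for c in range(cols):
        r = 0
        while r < rows:
            if mask[r][c]:
                start = r
                while r < rows and mask[r][c]:
                    r += 1
                for k in range(start, r):
                    vert[k][c] = r - start
            else:
                r += 1

    return [
        min(horiz[r][c], vert[r][c])
        for r in range(rows)
        for c in range(cols)
        if mask[r][c]
    ]


def stroke_width_variation(mask):
    """Coefficient of variation (std / mean) of the approximate stroke widths."""
    widths = run_length_widths(mask)
    if not widths:
        return 0.0
    return pstdev(widths) / mean(widths)


# A bar of constant width 2 and a wedge whose width grows from 1 to 4.
uniform_bar = [[0, 1, 1, 0] for _ in range(6)]
wedge = [
    [1, 0, 0, 0],
    [1, 1, 0, 0],
    [1, 1, 1, 0],
    [1, 1, 1, 1],
]
```

Here `stroke_width_variation(uniform_bar)` is zero because every pixel has width 2, while the wedge yields a positive value. A practical system would more likely sample widths only along the medial axis, for example from a distance transform, rather than at every pixel.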