A direct approach to scene text-to-speech synthesis