CASIA-10K is a Chinese scene text dataset. It contains 10,000 images captured in a variety of scenarios, of which 7,000 are for training and 3,000 are for testing. Each text line is annotated with the 8 coordinates (four corner points) of a quadrilateral. In the evaluation stage, line-level predictions are required.
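As an illustration of the quadrilateral annotation described above, here is a minimal Python sketch that turns 8 coordinates into four corner points and computes the enclosed area. The on-disk annotation format is not specified here, so the comma-separated layout `x1,y1,x2,y2,x3,y3,x4,y4` (optionally followed by extra fields such as a transcription) is an assumption, and the helper names `parse_quad` and `quad_area` are hypothetical.

```python
from typing import List, Tuple

Point = Tuple[float, float]


def parse_quad(line: str) -> List[Point]:
    """Parse one annotation line into four (x, y) corner points.

    Assumed layout (not confirmed by the dataset description):
    x1,y1,x2,y2,x3,y3,x4,y4[,extra fields...]
    """
    fields = line.strip().split(",")
    if len(fields) < 8:
        raise ValueError(f"expected at least 8 coordinates, got {len(fields)}")
    # Only the first 8 fields are treated as coordinates; the rest is ignored.
    coords = [float(value) for value in fields[:8]]
    return [(coords[i], coords[i + 1]) for i in range(0, 8, 2)]


def quad_area(quad: List[Point]) -> float:
    """Area of a simple polygon via the shoelace formula."""
    total = 0.0
    for i in range(len(quad)):
        x1, y1 = quad[i]
        x2, y2 = quad[(i + 1) % len(quad)]
        total += x1 * y2 - x2 * y1
    return abs(total) / 2.0


if __name__ == "__main__":
    # A made-up axis-aligned box, 100 wide and 40 high.
    quad = parse_quad("10,20,110,20,110,60,10,60")
    print(quad)
    print(quad_area(quad))  # 4000.0
```

Keeping each text line as an ordered list of four points, rather than an axis-aligned box, preserves the rotation and perspective distortion that a quadrilateral annotation is meant to capture.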
Download CASIA-10K (5.2 GB)
Haidian | Beijing | China
Phone: (+86-10) 8254-4797
Fax: (+86-10) 8254-4594
Email: liucl@nlpr.ia.ac.cn
Website: www.nlpr.ia.ac.cn/pal/