Scene Text Localization & Recognition Resources

Read this institute-wise: English, 简体中文.

Read this year-wise: English, 简体中文.

Tags: [STL] (Scene Text Localization), [TR] (Text Recognition)

[STL] (Scene Text Localization) Detect text regions in an input scene image

[TR] (Text Recognition) Recognize the text content

Last update: Sep. 17, 2023

1. Papers & Code

Overview

University of Oxford

Shenzhen Institutes of Advanced Technology

South China University of Technology

Fudan University

Huazhong University of Science and Technology

Universitat Autònoma de Barcelona

Stanford University

Seoul National University

Megvii Technology Inc: Face++

Institute of Automation, Chinese Academy of Sciences

University of California, San Diego

University of California, Santa Cruz

Cornell University

Pennsylvania State University

University of Science and Technology Beijing

Pohang University of Science and Technology

École d'Ingénieurs en Informatique

Czech Technical University in Prague (České vysoké učení technické v Praze)

Google Inc

Microsoft Inc

Samsung R&D Institute China

Vicarious FPC Inc

Chinese State Key Laboratory of Management and Control for Complex Systems

Visual Computing Department, Institute for Infocomm Research

University of Florida

University of Southern California

Hikvision Research Institute

University of Adelaide

City University of New York

The University of Hong Kong

Zhejiang University

University of Potsdam

Arizona State University

Stevens Institute of Technology

Nanyang Technological University

Alibaba Group

Chinese Academy of Sciences

University of Cambridge

Peking University

SenseTime Research

Naver Clova AI Research

Baidu

Nanjing University

The Chinese University of Hong Kong

Malong Technologies

University of Rochester

Facebook AI Research

University of Maryland

Penta-AI

Central China Normal University

Tencent

Tsinghua University

University of Science and Technology of China

University of Electronic Science and Technology of China

Indian Statistical Institute

Institute of Information Engineering, Chinese Academy of Sciences

University of Chinese Academy of Sciences

Amazon

Heritage Institute of Technology

Indian Institute of Technology

Xidian University

Tongji University

Harbin Institute of Technology

Shanghai Jiao Tong University

Ping An Property & Casualty Insurance

Hefei University of Technology

Beihang University

Boston University

Carnegie Mellon University

Northwestern Polytechnical University

VinAI Research

University of Tokyo

University of Surrey

The Technion – Israel Institute of Technology

University of Illinois at Urbana-Champaign

National Laboratory of Pattern Recognition

Shenzhen University

University of the Philippines

Beijing Jiaotong University

Wuhan University

Helsing AI

Purdue University

2. Datasets

SCUT-CTW1500 2018

Task: text localization (various text styles) and recognition

download

Total Text Dataset 2017

1,555 images covering three text orientations: horizontal, multi-oriented, and curved

Task: text localization (various text styles) and recognition

download

PowerPoint Text Detection and Recognition Dataset 2017

21,384 images, 21,384+ text instances

Task: text localization and recognition

download

COCO-Text (Computer Vision Group, Cornell) 2016

63,686 images, 173,589 text instances, 3 fine-grained text attributes.

Task: text localization and recognition

download

Synthetic Word Dataset (Oxford, VGG) 2014

9 million images covering 90k English words

Task: text recognition, segmentation

download

The Street View House Number Dataset (SVHN) 2012

Real-world street view house-number images annotated with digit positions and class labels.

Task: number localization, text recognition

download

IIIT 5K-Words 2012

5,000 images from scene text and born-digital sources (2k training and 3k testing images)

Each image is a cropped word image of scene text with case-insensitive labels

Task: text recognition

download
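
Because the labels are case-insensitive, recognition output for a benchmark like this would typically be compared with the ground truth after case folding. The following is a minimal sketch under that assumption; the function name `word_accuracy` and the list-of-strings inputs are hypothetical and not part of any dataset toolkit.

```python
def word_accuracy(predictions, labels):
    """Return the fraction of words recognized exactly, ignoring letter case.

    `predictions` and `labels` are assumed to be parallel lists of strings,
    one entry per cropped word image.
    """
    if len(predictions) != len(labels):
        raise ValueError("predictions and labels must have the same length")
    if not labels:
        return 0.0
    # casefold() gives a case-insensitive comparison key for each word.
    correct = sum(
        pred.casefold() == truth.casefold()
        for pred, truth in zip(predictions, labels)
    )
    return correct / len(labels)
```

A whole-word match is the strictest choice here; an edit-distance based score would be a reasonable alternative when partial credit matters.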

StanfordSynth (Stanford, AI Group) 2012

Small single-character images of 62 characters (0-9, a-z, A-Z)

Task: text recognition

download

MSRA Text Detection 500 Database (MSRA-TD500) 2012

500 natural images (resolutions vary from 1296x864 to 1920x1280)

Chinese, English or mixture of both

Task: text detection

Street View Text (SVT) 2010

350 high-resolution images (average size 1260 × 860), split into 100 training and 250 testing images

Only word-level bounding boxes are provided, with case-insensitive labels

Task: text localization

KAIST Scene_Text Database 2010

3000 images of indoor and outdoor scenes containing text

Korean, English (Number), and Mixed (Korean + English + Number)

Task: text localization, segmentation and recognition

Chars74k 2009

Over 74K character images taken from natural images, plus a set of synthetically generated characters

Small single-character images of 62 characters (0-9, a-z, A-Z)

Task: text recognition
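
The 62-character set above (0-9, a-z, A-Z) maps naturally onto integer class labels for a character classifier. The following is a minimal sketch of such a mapping; the ordering and the names `CHARSET`, `encode`, and `decode` are assumptions made for illustration and may not match the class indices used in the dataset's own annotation files.

```python
import string

# Assumed class order: digits, then lowercase, then uppercase (62 classes).
CHARSET = string.digits + string.ascii_lowercase + string.ascii_uppercase
CHAR_TO_INDEX = {ch: i for i, ch in enumerate(CHARSET)}


def encode(text):
    """Convert a string into a list of class indices (KeyError if out of set)."""
    return [CHAR_TO_INDEX[ch] for ch in text]


def decode(indices):
    """Convert a list of class indices back into a string."""
    return "".join(CHARSET[i] for i in indices)
```

Keeping upper and lower case as separate classes preserves the full 62-way task; merging them would give a 36-class case-insensitive variant.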

ICDAR Benchmark Datasets

| Dataset | Description | Competition Paper |
| --- | --- | --- |
| ICDAR 2017 | over 173,589 labeled text regions in over 63,686 images | paper link |
| ICDAR 2015 | 1000 training images and 500 testing images | paper link |
| ICDAR 2013 | 229 training images and 233 testing images | paper link |
| ICDAR 2011 | 229 training images and 255 testing images | paper link |
| ICDAR 2005 | 1001 training images and 489 testing images | paper link |
| ICDAR 2003 | 181 training images and 251 testing images (word level and character level) | paper link |

3. Competitions

4. Online OCR Service

| Name | Description |
| --- | --- |
| Tesseract OCR | API, free |
| Online OCR | API, free |
| Free OCR | API, free |
| New OCR | API, free |
| ABBYY FineReader Online | No API, not free |
| Super Online Transfer Tools (Chinese) | API, free |
| Online Chinese Recognition | API, free |

5. Blogs