Conway Stroke Data

A data set compiled manually by Conway (@yawnoc), used in the Android keyboard app Stroke Input Method (筆畫輸入法).

Stroke input method (generic, not the app)

The (generic) stroke input method is found on all dumbphones in Hong Kong and the surrounding region.

It is the simplest Chinese input method in existence. All strokes are classified into 5 types, entered via keypad:

| # | Stroke type | Comment |
|---|---|---|
| 1 | 橫 Horizontal | Includes rises (提) etc. |
| 2 | 豎 Vertical | |
| 3 | 撇 Throw | |
| 4 | 點 Dot | Includes presses (捺) |
| 5 | 折 Break | Basically everything else |
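As a rough illustration of the classification above, the sketch below turns a list of stroke names into the digits typed on the keypad. The names `STROKE_DIGITS`, `ALIASES` and `encode` are made up for this example and are not part of the repository; only the two foldings named in the table (提 into 橫, 捺 into 點) are modelled.

```python
# Keypad digit for each of the five stroke types in the table above.
STROKE_DIGITS = {"橫": "1", "豎": "2", "撇": "3", "點": "4", "折": "5"}

# Strokes that the table folds into another type (hypothetical helper table).
ALIASES = {"提": "橫", "捺": "點"}


def encode(strokes):
    """Return the keypad digit sequence for a list of stroke names."""
    return "".join(STROKE_DIGITS[ALIASES.get(stroke, stroke)] for stroke in strokes)


print(encode(["橫", "豎"]))  # 十: horizontal, then vertical
print(encode(["撇", "捺"]))  # 人: throw, then press (counted as a dot)
```

With this mapping 十 comes out as `12` and 人 as `34`.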

Picture of a dumbphone with stroke input method on keys 1 to 5.

Contents of this repository

A. Manually compiled data

The following files contain data manually compiled by Conway (@yawnoc):

codepoint-character-sequence.txt

phrases-traditional.txt, phrases-simplified.txt

ranking-traditional.txt, ranking-simplified.txt

B. Automatically generated data

The following files contain data automatically generated by running generate.py, which parses codepoint-character-sequence.txt:

characters-traditional.txt, characters-simplified.txt

sequence-characters.txt
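To give a feel for what deriving a sequence-to-characters listing from a character-to-sequence listing involves, here is a minimal sketch. It is not the code in generate.py. It assumes, purely for illustration, that each data line holds three tab-separated fields (codepoint, character, stroke sequence) and that blank lines and lines starting with `#` can be skipped; the real file format may differ. The function name `group_by_sequence` is hypothetical.

```python
from collections import defaultdict


def group_by_sequence(lines):
    """Map each stroke sequence to the characters that share it.

    Assumed (hypothetical) line format: codepoint<TAB>character<TAB>sequence.
    """
    groups = defaultdict(list)
    for line in lines:
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # assumption: blank lines and comments carry no data
        _codepoint, character, sequence = line.split("\t")
        groups[sequence].append(character)
    return dict(groups)


sample = [
    "U+5341\t十\t12",
    "U+4EBA\t人\t34",
    "U+516B\t八\t34",
]
for sequence, characters in sorted(group_by_sequence(sample).items()):
    print(sequence, "".join(characters))
```

Characters with the same stroke sequence (here 人 and 八) end up in one group, which is why a separate ranking is needed to order candidates.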

C. Scripts

.bash_aliases

generate.py

sort.py

D. Tests

test_generate.py

test_sort.py

Miscellanea for convenient reference (in comments)

Unicode strokes

CJK Strokes (Unicode block) (U+31C0 to U+31E3):

㇀㇁㇂㇃㇄㇅㇆㇇㇈㇉㇊㇋㇌㇍㇎㇏
㇐㇑㇒㇓㇔㇕㇖㇗㇘㇙㇚㇛㇜㇝㇞㇟
㇠㇡㇢㇣
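The listing above can be regenerated from the code point range alone. A small sketch, using only the range stated here (U+31C0 to U+31E3); the variable names are arbitrary:

```python
FIRST, LAST = 0x31C0, 0x31E3  # range quoted above, inclusive

# Build the characters from their code points.
strokes = "".join(chr(codepoint) for codepoint in range(FIRST, LAST + 1))

# Print in rows of 16, matching the layout of the listing.
for start in range(0, len(strokes), 16):
    print(strokes[start:start + 16])
```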

Unicode composition

Ideographic Description Characters (Unicode block) (U+2FF0 to U+2FFB):

⿰⿱⿲⿳⿴⿵⿶⿷⿸⿹⿺⿻
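These characters are written in prefix position: each one is followed by the components it arranges. Within this range, ⿲ (U+2FF2) and ⿳ (U+2FF3) take three components and the others take two. The sketch below parses such a description into a nested tuple; `IDC_ARITY` and `parse_ids` are names invented for this example, and it only handles the twelve characters listed above.

```python
# Number of components each description character takes (U+2FF0 to U+2FFB).
IDC_ARITY = {
    chr(codepoint): (3 if codepoint in (0x2FF2, 0x2FF3) else 2)
    for codepoint in range(0x2FF0, 0x2FFC)
}


def parse_ids(text):
    """Parse a prefix-notation description sequence into a nested tuple."""

    def parse(pos):
        if pos >= len(text):
            raise ValueError("description sequence ended early")
        char = text[pos]
        pos += 1
        arity = IDC_ARITY.get(char)
        if arity is None:
            return char, pos  # an ordinary component
        operands = []
        for _ in range(arity):
            operand, pos = parse(pos)
            operands.append(operand)
        return (char, *operands), pos

    tree, end = parse(0)
    if end != len(text):
        raise ValueError("trailing characters after description sequence")
    return tree


print(parse_ids("⿰女子"))      # 好: 女 to the left of 子
print(parse_ids("⿰氵⿰木目"))  # 湘: 氵 to the left of 相 (itself 木 beside 目)
```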