Skip to content

Commit 4c77f90

Browse files
authored
Improve old Unicode support (#86)
1 parent 7a4607b commit 4c77f90

20 files changed

+19366
-6
lines changed

data/2.0.14-blocks.txt

Lines changed: 71 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,71 @@
1+
# Start Code; End Code; Block Name
2+
0000; 007F; Basic Latin
3+
0080; 00FF; Latin-1 Supplement
4+
0100; 017F; Latin Extended-A
5+
0180; 024F; Latin Extended-B
6+
0250; 02AF; IPA Extensions
7+
02B0; 02FF; Spacing Modifier Letters
8+
0300; 036F; Combining Diacritical Marks
9+
0370; 03FF; Greek
10+
0400; 04FF; Cyrillic
11+
0530; 058F; Armenian
12+
0590; 05FF; Hebrew
13+
0600; 06FF; Arabic
14+
0900; 097F; Devanagari
15+
0980; 09FF; Bengali
16+
0A00; 0A7F; Gurmukhi
17+
0A80; 0AFF; Gujarati
18+
0B00; 0B7F; Oriya
19+
0B80; 0BFF; Tamil
20+
0C00; 0C7F; Telugu
21+
0C80; 0CFF; Kannada
22+
0D00; 0D7F; Malayalam
23+
0E00; 0E7F; Thai
24+
0E80; 0EFF; Lao
25+
0F00; 0FBF; Tibetan
26+
10A0; 10FF; Georgian
27+
1100; 11FF; Hangul Jamo
28+
1E00; 1EFF; Latin Extended Additional
29+
1F00; 1FFF; Greek Extended
30+
2000; 206F; General Punctuation
31+
2070; 209F; Superscripts and Subscripts
32+
20A0; 20CF; Currency Symbols
33+
20D0; 20FF; Combining Marks for Symbols
34+
2100; 214F; Letterlike Symbols
35+
2150; 218F; Number Forms
36+
2190; 21FF; Arrows
37+
2200; 22FF; Mathematical Operators
38+
2300; 23FF; Miscellaneous Technical
39+
2400; 243F; Control Pictures
40+
2440; 245F; Optical Character Recognition
41+
2460; 24FF; Enclosed Alphanumerics
42+
2500; 257F; Box Drawing
43+
2580; 259F; Block Elements
44+
25A0; 25FF; Geometric Shapes
45+
2600; 26FF; Miscellaneous Symbols
46+
2700; 27BF; Dingbats
47+
3000; 303F; CJK Symbols and Punctuation
48+
3040; 309F; Hiragana
49+
30A0; 30FF; Katakana
50+
3100; 312F; Bopomofo
51+
3130; 318F; Hangul Compatibility Jamo
52+
3190; 319F; Kanbun
53+
3200; 32FF; Enclosed CJK Letters and Months
54+
3300; 33FF; CJK Compatibility
55+
4E00; 9FFF; CJK Unified Ideographs
56+
AC00; D7A3; Hangul Syllables
57+
D800; DB7F; High Surrogates
58+
DB80; DBFF; High Private Use Surrogates
59+
DC00; DFFF; Low Surrogates
60+
E000; F8FF; Private Use
61+
F900; FAFF; CJK Compatibility Ideographs
62+
FB00; FB4F; Alphabetic Presentation Forms
63+
FB50; FDFF; Arabic Presentation Forms-A
64+
FE20; FE2F; Combining Half Marks
65+
FE30; FE4F; CJK Compatibility Forms
66+
FE50; FE6F; Small Form Variants
67+
FE70; FEFF; Arabic Presentation Forms-B
68+
FF00; FFEF; Halfwidth and Fullwidth Forms
69+
FEFF; FEFF; Specials
70+
FFF0; FFFF; Specials
71+

0 commit comments

Comments
 (0)