xref: /aosp_15_r20/external/AFLplusplus/dictionaries/utf8.dict (revision 08b48e0b10e97b33e7b60c5b6e2243bd915777f2)
1*08b48e0bSAndroid Build Coastguard Worker# https://www.cl.cam.ac.uk/~mgk25/ucs/examples/UTF-8-test.txt
2*08b48e0bSAndroid Build Coastguard Worker
3*08b48e0bSAndroid Build Coastguard Worker
4*08b48e0bSAndroid Build Coastguard Worker# Defines byteorder and endianess
5*08b48e0bSAndroid Build Coastguard Workerbyte_order="\xFE\xFF"
6*08b48e0bSAndroid Build Coastguard Worker
7*08b48e0bSAndroid Build Coastguard Worker# Reorder the display of text for RTL reading
8*08b48e0bSAndroid Build Coastguard Workerright_to_left="\x20\x2E"
9*08b48e0bSAndroid Build Coastguard Worker
10*08b48e0bSAndroid Build Coastguard Worker# Mongolian Vowel Separator: invisible and has the whitespace property
11*08b48e0bSAndroid Build Coastguard Workerinvisible_separator="\x18\x03"
12*08b48e0bSAndroid Build Coastguard Worker
13*08b48e0bSAndroid Build Coastguard Worker# Invisible zero-width character.
14*08b48e0bSAndroid Build Coastguard Workerword_join="\x20\x60"
15*08b48e0bSAndroid Build Coastguard Worker
16*08b48e0bSAndroid Build Coastguard Worker# Reserved code point
17*08b48e0bSAndroid Build Coastguard Workerreserved="\xfe\xfe"
18*08b48e0bSAndroid Build Coastguard Worker
19*08b48e0bSAndroid Build Coastguard Worker# Invalid code point
20*08b48e0bSAndroid Build Coastguard Workerinvalid1="\xff\xff"
21*08b48e0bSAndroid Build Coastguard Workerinvalid2="\x01\xff\xff"
22*08b48e0bSAndroid Build Coastguard Workerinvalid3="\xfdd0"
23*08b48e0bSAndroid Build Coastguard Worker
24*08b48e0bSAndroid Build Coastguard Worker# unassigned code point
25*08b48e0bSAndroid Build Coastguard Workerunassigned="\x0f\xed"
26*08b48e0bSAndroid Build Coastguard Worker
27*08b48e0bSAndroid Build Coastguard Worker# illegal low half-surrogate
28*08b48e0bSAndroid Build Coastguard Workerillegal_low="\xde\xad"
29*08b48e0bSAndroid Build Coastguard Worker
30*08b48e0bSAndroid Build Coastguard Worker# illegal high half-surrogate
31*08b48e0bSAndroid Build Coastguard Workerillegal_high="\xda\xad"
32*08b48e0bSAndroid Build Coastguard Worker
33*08b48e0bSAndroid Build Coastguard Worker# private use area code usbed by apple for its logo
34*08b48e0bSAndroid Build Coastguard Workerapple="\xf8\xff"
35*08b48e0bSAndroid Build Coastguard Worker
36*08b48e0bSAndroid Build Coastguard Worker# hostname normalization
37*08b48e0bSAndroid Build Coastguard Workerfullwidth_solidus="\xff\x0f"
38*08b48e0bSAndroid Build Coastguard Worker
39*08b48e0bSAndroid Build Coastguard Worker# numerical mapping and a value
40*08b48e0bSAndroid Build Coastguard Workerbold_eight="\x01\xd7\xd6"
41*08b48e0bSAndroid Build Coastguard Worker
42*08b48e0bSAndroid Build Coastguard Worker# # U+00DF normalizes to "ss" during IDNA2003's mapping phase,
43*08b48e0bSAndroid Build Coastguard Worker# different from its IDNA2008 mapping. See http://www.unicode.org/reports/tr46/
44*08b48e0bSAndroid Build Coastguard Workerweird="\x00\xdf"
45*08b48e0bSAndroid Build Coastguard Worker
46*08b48e0bSAndroid Build Coastguard Worker# U+FDFD expands by 11x (UTF-8) and 18x (UTF-16) under NFKC/NFKC
47*08b48e0bSAndroid Build Coastguard Workerexpansion="\xfd\xfd"
48*08b48e0bSAndroid Build Coastguard Worker
49*08b48e0bSAndroid Build Coastguard Worker# U+0390 expands by 3x (UTF-8) under NFD
50*08b48e0bSAndroid Build Coastguard Workerexpansion2="\x03\x90"
51*08b48e0bSAndroid Build Coastguard Worker
52*08b48e0bSAndroid Build Coastguard Worker# U+1F82 expands by 4x (UTF-16) under NFD
53*08b48e0bSAndroid Build Coastguard Workerexpansion3= "\x1F\x82"
54*08b48e0bSAndroid Build Coastguard Worker
55*08b48e0bSAndroid Build Coastguard Worker# U+FB2C expands by 3x (UTF-16) under NFC
56*08b48e0bSAndroid Build Coastguard Workerexpansion4="\xFB\x2C"
57*08b48e0bSAndroid Build Coastguard Worker
58*08b48e0bSAndroid Build Coastguard Worker# Lowecaser expansion: https://twitter.com/jifa/status/625776454479970304
59*08b48e0bSAndroid Build Coastguard Workerlow_exp1="\x02\x3a"
60*08b48e0bSAndroid Build Coastguard Workerlow_exp2="\x02\x3e"
61*08b48e0bSAndroid Build Coastguard Workerlow_exp3="\x00\xdf"
62*08b48e0bSAndroid Build Coastguard Workerlow_exp4="\x1e\x9e"
63*08b48e0bSAndroid Build Coastguard Worker
64*08b48e0bSAndroid Build Coastguard Worker# Null byte
65*08b48e0bSAndroid Build Coastguard Workernull="\x00\x00"
66*08b48e0bSAndroid Build Coastguard Worker"\xfc\x80\x80\x80\x80\x80"
67*08b48e0bSAndroid Build Coastguard Worker"fc\x80\x80\x80\x80\xaf"
68*08b48e0bSAndroid Build Coastguard Worker
69*08b48e0bSAndroid Build Coastguard Worker# Confusing new lines
70*08b48e0bSAndroid Build Coastguard Worker"\x00\x1b"
71*08b48e0bSAndroid Build Coastguard Worker"\x00\x85"
72*08b48e0bSAndroid Build Coastguard Worker"\x20\x28"
73*08b48e0bSAndroid Build Coastguard Worker"\x20\x29"
74