While 16-bit Wide-character Unicode is supported, 32-bit Unicode isn't.
In my C environment, we are using wchar_t by default for all characters;
encoding UTF-8 --> stemming --> decoding UTF-8 would be quite inefficient for a big number of terms.
At the time, in libstemmer_c/runtime/api.h i'm defining:
typedef wchar_t symbol;
and can so use the ISO-8859 stem variants without conversions.
For some of the provided languages only the UTF-8 variants exist, what leads us to this issue.