Arctos is UTF encoded.
Many products are encoded in different character sets by default. It is possible that “special” characters which cannot be translated to Unicode by the tool will be converted to a placeholder character, such as � or . (The actual character depends on the tool being used. Many source characters may be converted to the same placeholder.)
Some data are encoded with HTML escape characters rather than Unicode. These will appear visually identical in Arctos (which in general renders HTML), but are not searchable. For example,
á will appear identical in many forms. A search for “á” will find only the latter.
Arctos disallows characters not in POSIX character class PRINT for most fields, but things like doubled or trailing spaces can still appear in data and will make searching difficult.
Many characters, or combinations of characters, are visually similar. For example, see the Wikipedia entry on the degree symbol.
Some programs or editors silently truncate text. Arctos never will; some text length limitations exist, and violating them will always result in an error, never string manipulation.