Arc Forum

2 points by nex3 6389 days ago

Arc actually does support UTF-8.

  arc> ("uber" 0)
  #\u

I imagine it only officially supports ASCII because it will be migrated away from MzScheme eventually.

Note: Those "u"s are supposed to have umlauts, but that's apparently normalized away somewhere. The point is, u with an umlaut is treated as a single character by the current implementation.

1 point by mascarenhas 6389 days ago

Well, indexing will most certainly break, but making an encoding-agnostic reader/writer is easy. I hope PG does that when/if Arc goes standalone.

-----

1 point by nex3 6389 days ago

I'm sure it'll be agnostic, if by "agnostic" you mean that it just reads in strings as a sequence of bytes. It would be easier to do that than to check for non-ASCII characters and handle them specially.
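
To make the indexing concern concrete, here is a minimal sketch in Python rather than Arc. That choice is an assumption made purely for illustration, and it says nothing about how Arc or MzScheme represent strings internally. It compares indexing the same text as decoded characters and as a raw sequence of UTF-8 bytes; the variable names are hypothetical.

```python
# Hypothetical illustration in Python, not Arc: index the same text
# as decoded characters and as raw UTF-8 bytes.
text = "\u00fcber"              # "u with an umlaut" + "ber", escaped so the source stays ASCII

raw = text.encode("utf-8")      # what a purely byte-oriented reader would hold

first_char = text[0]            # character-level indexing yields the whole letter
first_byte = raw[0:1]           # byte-level indexing yields only the first of its two bytes

print(len(text), len(raw))      # character count vs. byte count
print(repr(first_char), first_byte)

# A lone lead byte is not valid UTF-8 on its own, so it cannot be decoded.
try:
    first_byte.decode("utf-8")
    lone_byte_is_valid = True
except UnicodeDecodeError:
    lone_byte_is_valid = False
print(lone_byte_is_valid)
```

Under these assumptions the text is four characters but five bytes, so a reader that stores strings as bytes can pass UTF-8 through unchanged, while index 0 returns half of a character instead of the character itself.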

-----