commit -- better utf8/unicode support

Jeff Hostetler (
Sun, 17 Oct 1999 12:57:05 -0500

over the past few days i've checked in a bunch-o-code
that improves upon our unicode support/commitment. in

[] we now have a plain-text UTF8 importer and exporter
(these use the suffix .utf8) (the .txt i/e still
expects latin-1 text.)

[] i removed bob's temporary patch for #512 where we
had to fold fancy punctation (dashes, quotes, etc
(in the range 0x2000-0x2030) into us-ascii. this
mainly affects the RTF importer.

[] removed (on unix) and rewrote (on win32) all logic
dealing with the CharWidths array -- this was preventing
us from laying out any non-latin-1 characters.

[] opened up all non-latin-1 keybindings. let me know
if any of these are broken.

[] removed all latin-1 restrictions on the various

[] improved font cacheing strategy on win32.

with all of this, we are now closer to actually being
able to deal with unicode -- but we aren't there yet.
there are several problems remaining that we need help

[1] on win95/98, these pseudo-OS's do not support
unicode. so, we need to deal with various CodePage

[2] on unix, i don't think we have any unicode fonts,
so some weird (wrong) things get drawn when a non-Latin-1
character is entered -- it's correct in the document, but
just drawn wrong. this may be a font issue or it may be
in the code which accesses the .pfa/.pfb files.

[3] on WinNT, we probably need to switch to use ToUnicode()
rather than ToAscii() in the keyboard handler and experiment


This archive was generated by hypermail 1.03b2.