| { filsa.net: Polyglot Frontier - Introduction | Resource Index | Get Involved } |
|
|
Originally published on 9/25/97; 1:25:46 PMString Manipulation ProblemsWhy double-byte text has to be handled differently, and solutions so far. |
|
|
|
What's the problem? Simply, double-text contains characters that are mis-interpreted by Frontier. The problem characters are [in brackets]: backslash [\], the chevron [«], the left-brace [{], the atmark [@]. (and I think the quote ["], for glossary entries-5/11/98 PS) When Frontier sees these characters in the HTML suite, it does special processing. For example, the left brace and the quote trigger macro processing and glossary substitution, the chevron is used to find and delete comments in outline rendering, and the backslash is always used in Frontier to escape the next character. However, when these problem characters appear in the second byte of double-byte text, Frontier can't distinguish them, and proceeds to process them as it normally would. This destroys the Japanese text. The Scope of the Problem This problem affects the html.suite, specifically, html.refGlossary and html.processMacros. It also affects most of the string-handling verbs, or scripts that rely on string-handling verbs. So it's a big problem. Attempts have been made to patch the html suite and the string verbs. Here's a chart of the progress.
A note about words In the chart above, verbs that deal with counting words point to a note. This is it, from an email from "Nobumi Iyanaga".
The concept of "word" is different in "space delimited" languages like English and other "roman" languages, and the Japanese or other languages, which have no delimiter like that... Current Solutions, Future Solutions Currently, double-byte text in Frontier runs on patches. (namely, suites.MWU). It works, but a more pleasing and solid solution involves making Frontier recognize and handle double-byte text on it's own. The best place to take care of this is at the kernel level. Although it will require the time and attention of the Userland development team, once completed it will not have to be dealt with again, on any platform. And it will open up a wider market for Frontier in overseas markets.
|
|
Previous | Next |
|
| Other filsa.net services: Frontier Scripting | Script Archive |
|
This is part of Phil's Frontier Scripting Site. |