OBD:TSGA: Difference between revisions

OBD:TSGA (view source)

Revision as of 16:39, 21 November 2022

1,258 bytes added , 21 November 2022

m

there there

Geyser

Administrators

5,391

edits

@@ Line 4: / Line 4: @@
 Every TSGA file contains 256 elements of 20 bytes each, thus taking up 5,120 bytes without the header. Together with the 8-byte header and 32-byte padding, each TSGA takes up 5,152 bytes in a .dat (another 20 bytes are taken up in the instance descriptor array).
-For Western languages, a TSFT has only one TSGA which corresponds to an ASCII-style encoding table:
+For Western languages, a TSFT has only one TSGA which corresponds to an ASCII-based [[wp:code page|code page]]:
-*the first 32 symbols (0 to 31) are non-printable control characters, so these elements are always all-zero
+*the first 32 symbols (0 to 31) are non-printable control characters, so these elements are always all zero;
-*the next 96 symbols (32 to 127) correspond to standard ASCII
+*the next 96 symbols (32 to 127) correspond to standard US-ASCII (except that 127 is usually non-printable);
-*the upper half of the table (128 to 255) is filled with non-standard punctuation and non-Latin characters
+*the upper half of the table (128 to 255) contains additional punctuations and ligatures, Latin characters with diacritics and, in the case of the Russian version, Cyrillic script.
+The code page is not the same for all Western Oni versions.
+Asian localizations of Oni use their own extended encoding system and glyph data either in place of Oni's (Chinese), or in combination with it (Japanese).
+See [[OBD:Text encoding]] for details on the encoding systems, a list of the available characters in each language version, and a review of known issues.
-"Non-Latin" refers to European characters which use diacritics or to Cyrillic characters; the exact character set varies by language version of the game. Asian localizations of Oni use their own extended encoding system and glyph data in place of Oni's. See [[OBD:Text encoding]] for details on the encoding systems and the available characters in each language version.
 ----
@@ Line 35: / Line 39: @@
 The pixels stored in TSFT (packed 4-by-4 as little-Endian unsigned int32s) are treated as a scanline, row major, top to bottom and left to right. The width of a glyph is not always a multiple of 4 pixels, so the scanline can wrap around, i.e. a new row of pixels can start in the middle of a 4-byte element. The start of a glyph, however, is always aligned on a 4-byte element of the TSFT array. The end of a glyph is padded with 0xDEAD.
+----
+;Difference in the PS2 implementation
+{{Table}}
+|- ALIGN=CENTER VALIGN=TOP
+{{OBDtr| 0x0C | int32    |FFC800| 00 00 00 00 |unused| position of the pixel glyph data in the .raw part of the [[OBD:TSFT|TSFT]] file, in quarter-bytes }}
+{{OBDtr| 0x10 | int32    |C800C8| 00 00 00 00 |unused| runtime only? }}
+|}
+In the PS2 implementation, glyphs are also rectangular grayscale bitmaps, but the storage is much more compact, with only 2 bits per pixel (i.e., 4 pixels per byte), and the scanline is stored continuously for the whole font, without padding/aligning the beginning of a glyph to whole bytes or 4-byte blocks. The scanline is stored in the .raw part of the TSFT file (as opposed to the PC and Mac implementation, which uses a variable array in the .dat). Within a byte, the pixel order is low-to-high (see the TSFT page for an example).
+It is not 100% clear why the template checksum is different for PS2, even though the TSGA structure is essentially the same; the field at 0x0C seems to be an (unsigned) int32 in both cases. The runtime field at 0x10 is also an unsigned int32, rather than a bunch of flags.