BYML: Difference between revisions
imported>NiceneNerd |
imported>Leoetlino (add ACNH and v4 info) |
||
Line 1: | Line 1: | ||
<onlyinclude>'''BYML''' is a simplified, binary form of [[wikipedia:YAML|YAML]].</onlyinclude> | <onlyinclude>'''BYML''' is a simplified, binary form of [[wikipedia:YAML|YAML]].</onlyinclude> | ||
Only the version 2 | Only the version 2+ format is documented in this article. | ||
== Structure == | == Structure == | ||
Line 14: | Line 14: | ||
| align="center" | 0x00 | | align="center" | 0x00 | ||
| align="center" | 2 | | align="center" | 2 | ||
| | | “BY” (big endian) or “YB” (little endian). | ||
|- | |- | ||
| align="center" | 0x02 | | align="center" | 0x02 | ||
| align="center" | 2 | | align="center" | 2 | ||
| Version 0x0002 in Breath of the Wild. | | Version 0x0002 in ''Breath of the Wild''. 1, 3 and 4 are also valid version numbers. | ||
|- | |- | ||
| align="center" | 0x04 | | align="center" | 0x04 | ||
Line 40: | Line 40: | ||
! align="center" width="8%" | Identifier | ! align="center" width="8%" | Identifier | ||
! align="center" width="17%" | Type | ! align="center" width="17%" | Type | ||
! width=" | ! width="63%" | Description | ||
! width="10%" | Version | |||
|- | |- | ||
| align="center" | 0xA0 | | align="center" | 0xA0 | ||
| align="center" | Value (Index) | | align="center" | Value (Index) | ||
| String. Value is an index into the string table. | | String. Value is an index into the string table. | ||
| 2+ | |||
|- | |||
| align="center" | 0xA1 | |||
| align="center" | Container | |||
| Binary data. | |||
| 4+ | |||
|- | |- | ||
| align="center" | 0xC0 | | align="center" | 0xC0 | ||
| align="center" | Container | | align="center" | Container | ||
| Array. Node is an array of nodes, typically, though not necessarily, all of the same format. | | Array. Node is an array of nodes, typically, though not necessarily, all of the same format. | ||
| 2+ | |||
|- | |- | ||
| align="center" | 0xC1 | | align="center" | 0xC1 | ||
| align="center" | Container | | align="center" | Container | ||
| Hash. Node is a mapping from strings in the hash key table to other nodes. | | Hash. Node is a mapping from strings in the hash key table to other nodes. | ||
| 2+ | |||
|- | |- | ||
| align="center" | 0xC2 | | align="center" | 0xC2 | ||
| align="center" | Container (Special) | | align="center" | Container (Special) | ||
| String table. Special purpose node type only seen in the hash key table and the string table. | | String table. Special purpose node type only seen in the hash key table and the string table. | ||
| 2+ | |||
|- | |- | ||
| align="center" | 0xD0 | | align="center" | 0xD0 | ||
| align="center" | Value | | align="center" | Value | ||
| Bool. Node is 1 or 0 representing true or false respectively. | | Bool. Node is 1 or 0 representing true or false respectively. | ||
| 2+ | |||
|- | |- | ||
| align="center" | 0xD1 | | align="center" | 0xD1 | ||
| align="center" | Value | | align="center" | Value | ||
| Int. Node is a signed 32 bit integer value. | | Int. Node is a signed 32 bit integer value. | ||
| 2+ | |||
|- | |- | ||
| align="center" | 0xD2 | | align="center" | 0xD2 | ||
| align="center" | Value | | align="center" | Value | ||
| Float. Node is a binary32 floating-point number. | | Float. Node is a binary32 floating-point number. | ||
| 2+ | |||
|- | |- | ||
| align="center" | 0xD3 | | align="center" | 0xD3 | ||
| align="center" | Value | | align="center" | Value | ||
| UInt. Node is an unsigned 32 bit integer value. The game uses this for some CRC32 hashes and for masks. | | UInt. Node is an unsigned 32 bit integer value. The game uses this for some CRC32 hashes and for masks. | ||
| 2+ | |||
|- | |- | ||
| align="center" | 0xD4 | | align="center" | 0xD4 | ||
| align="center" | Value (Special) | | align="center" | Value (Special) | ||
| Int64. Node is a 64 bit integer value. | | Int64. Node is a 64 bit integer value. | ||
| 3+ | |||
|- | |- | ||
| align="center" | 0xD5 | | align="center" | 0xD5 | ||
| align="center" | Value (Special) | | align="center" | Value (Special) | ||
| UInt64. Node is an unsigned 64 bit integer value. | | UInt64. Node is an unsigned 64 bit integer value. | ||
| 3+ | |||
|- | |- | ||
| align="center" | 0xD6 | | align="center" | 0xD6 | ||
| align="center" | Value (Special) | | align="center" | Value (Special) | ||
| Double. Node is a binary64 floating-point number. | | Double. Node is a binary64 floating-point number. | ||
| 3+ | |||
|- | |- | ||
| align="center" | 0xFF | | align="center" | 0xFF | ||
| align="center" | Value | | align="center" | Value | ||
| Null. Value is always 0. | | Null. Value is always 0. | ||
| 3+ | |||
|} | |} | ||
Line 98: | Line 116: | ||
For special value nodes, the value is an offset to a 64 bit integer / floating-point number relative to the start of the file. | For special value nodes, the value is an offset to a 64 bit integer / floating-point number relative to the start of the file. | ||
==== 0xA1 - Binary Data Node ==== | |||
{| class="wikitable" | |||
! align="center" | Offset | |||
! align="center" | Size | |||
! Description | |||
|- | |||
| align="center" | 0x00 | |||
| align="center" | 4 | |||
| Data length | |||
|- | |||
| align="center" | 0x04 | |||
| align="center" | variable | |||
| Data | |||
|} | |||
==== 0xC0 - Array Node ==== | ==== 0xC0 - Array Node ==== | ||
Line 197: | Line 230: | ||
| ''Breath of the Wild'' (Switch) || Little endian || 2 || Uses SMO's library (in al::), but with unused code and support for big endian eliminated | | ''Breath of the Wild'' (Switch) || Little endian || 2 || Uses SMO's library (in al::), but with unused code and support for big endian eliminated | ||
|- | |- | ||
| ''Super Mario Odyssey'' (Switch) || Little endian (supports big endian as well) || 2 | | ''Super Mario Odyssey'' (Switch) || Little endian (supports big endian as well) || 3 || | ||
|- | |||
| ''Splatoon 2'' (Switch) || Little endian (supports big endian as well) || 3? || Uses SMO's library, but in the Lp::Utl:: namespace | |||
|- | |- | ||
| '' | | ''Animal Crossing: New Horizons'' (Switch) || Little endian (supports big endian as well) || 4 || | ||
|} | |} | ||
Revision as of 13:13, 21 March 2020
BYML is a simplified, binary form of YAML.
Only the version 2+ format is documented in this article.
Structure
The file is broken up into a node structure, with possible interlinking between the nodes. Each node has a one byte format identifier. The File begins with a header which points to three special nodes; the hash key table; the string table node and the root node.
Header
Offset | Size | Description |
---|---|---|
0x00 | 2 | “BY” (big endian) or “YB” (little endian). |
0x02 | 2 | Version 0x0002 in Breath of the Wild. 1, 3 and 4 are also valid version numbers. |
0x04 | 4 | Offset to the hash key table, relative to start (usually 0x010). May be 0 if no hash nodes are used. Must be a string value node (0xc2). |
0x08 | 4 | Offset to the string table, relative to start. May be 0 if no strings are used. Must be a string value node (0xc2). |
0x0c | 4 | Offset to the root node, relative to start. May be 0 if the document is totally empty. Must be either an array node (0xc0) or a hash node (0xc1). |
Nodes
Every node format has a unique one byte identifier as follows. Some nodes are considered value nodes as indicated below. The container nodes have a longer encoding in the file, which must be four byte aligned. The order of encoding full nodes within the file does not seem to matter.
Identifier | Type | Description | Version |
---|---|---|---|
0xA0 | Value (Index) | String. Value is an index into the string table. | 2+ |
0xA1 | Container | Binary data. | 4+ |
0xC0 | Container | Array. Node is an array of nodes, typically, though not necessarily, all of the same format. | 2+ |
0xC1 | Container | Hash. Node is a mapping from strings in the hash key table to other nodes. | 2+ |
0xC2 | Container (Special) | String table. Special purpose node type only seen in the hash key table and the string table. | 2+ |
0xD0 | Value | Bool. Node is 1 or 0 representing true or false respectively. | 2+ |
0xD1 | Value | Int. Node is a signed 32 bit integer value. | 2+ |
0xD2 | Value | Float. Node is a binary32 floating-point number. | 2+ |
0xD3 | Value | UInt. Node is an unsigned 32 bit integer value. The game uses this for some CRC32 hashes and for masks. | 2+ |
0xD4 | Value (Special) | Int64. Node is a 64 bit integer value. | 3+ |
0xD5 | Value (Special) | UInt64. Node is an unsigned 64 bit integer value. | 3+ |
0xD6 | Value (Special) | Double. Node is a binary64 floating-point number. | 3+ |
0xFF | Value | Null. Value is always 0. | 3+ |
Value Nodes
Value nodes can only be encoded as children of container nodes. Each value node has a direct four byte encoding.
For string nodes, this encoding is simply a four byte index into the string table respectively.
For special value nodes, the value is an offset to a 64 bit integer / floating-point number relative to the start of the file.
0xA1 - Binary Data Node
Offset | Size | Description |
---|---|---|
0x00 | 4 | Data length |
0x04 | variable | Data |
0xC0 - Array Node
Offset | Size | Description |
---|---|---|
0x00 | 1 | 0xC0 node type. |
0x01 | 3 | Number of entries in array. |
0x04 | N | Array of N node types; the type of each element in the array. |
0x04 + N’ | 4*N | Array of N node values. For regular value nodes, this is a 4 byte node value. For other nodes, this is a 4 byte offset to the node relative to the start of the file. |
N’ is N rounded up to the nearest multiple of 4.
0xC1 - Hash Node
Hash / dictionary nodes are used to encode name value collections. Entries must be lexicographically sorted[1].
Offset | Size | Description |
---|---|---|
0x00 | 1 | 0xC1 node type. |
0x01 | 3 | Number of entries in dictionary. |
What then follows is a variable length array of dictionary entries. Each entry has the following structure.
Offset | Size | Description |
---|---|---|
0x00 | 3 | Name. Value is an index in the hash key table. |
0x03 | 1 | The node type. |
0x04 | 4 | Value. For regular value nodes, this is the 4 byte node value. For other nodes, this is a 4 byte offset to the node relative to the start of the file. |
0xC2 - String Table Node
Offset | Size | Description |
---|---|---|
0x00 | 1 | 0xC2 node type. |
0x01 | 3 | Number of entries in the string table. |
0x04 | Y=4*(N+1) | Array of N+1 offsets to each string relative to the start of the node. The last entry is an offset to the end of the last string. |
0x04+Y’ | variable | Array of N null-terminated strings stored in alphabetical order. |
Y’ is Y rounded up to the nearest multiple of 4.
Usage in Nintendo games
Game | Endianness | Version | Notes |
---|---|---|---|
Breath of the Wild (Wii U) | Big endian | 2 | Uses SMO's library (in al::) |
Breath of the Wild (Switch) | Little endian | 2 | Uses SMO's library (in al::), but with unused code and support for big endian eliminated |
Super Mario Odyssey (Switch) | Little endian (supports big endian as well) | 3 | |
Splatoon 2 (Switch) | Little endian (supports big endian as well) | 3? | Uses SMO's library, but in the Lp::Utl:: namespace |
Animal Crossing: New Horizons (Switch) | Little endian (supports big endian as well) | 4 |
The v3 format seems to be identical to v2 but with the 64-bit node types added.
Extensions
Nintendo uses several file extensions for BYML files in BotW. The basic format is the same even though each kind has a different purpose and contains different parameters. Known extensions:
Tools
Tool | Cross-platform | Setup | Known issues |
---|---|---|---|
byml-v2 | Yes |
Install the |
None |
ninten-file-tool | Yes |
|
|
Byaml Editor | Yes | ||
Wild Bits | Yes |
|
On some system configurations, launching with |
Switch Toolbox | No |
|
This article was adapted from documentation in the handsomematt/botw-modding repo.
- ↑ Nintendo's BYML library performs a binary search when looking up items by key in a hash node.