bring back bom-as-ws (#34)

* bring back bom-as-ws

* Update SPEC.md

Co-authored-by: Lily Ballard <lily@sb.org>

Co-authored-by: Lily Ballard <lily@sb.org>
This commit is contained in:
Kat Marchán 2020-12-15 23:15:50 -08:00 committed by GitHub
parent 32f38622ee
commit 5cf6a5a276
No known key found for this signature in database
GPG Key ID: 4AEE18F83AFDEB23
1 changed files with 5 additions and 7 deletions

12
SPEC.md
View File

@ -17,12 +17,8 @@ The toplevel concept of KDL is a Document. A Document is composed of zero or
more [Nodes](#node), separated by newlines and whitespace, and eventually
terminated by an EOF.
All KDL documents should:
* Be UTF-8 encoded
* Ignore UTF-8 byte order marks ("BOM") anywhere in the file, even when it's
not the first set of bytes in a stream.
* Conform to the specifications in this document.
All KDL documents should be UTF-8 encoded and conform to the specifications in
this document.
#### Example
@ -198,7 +194,9 @@ linespace := newline | ws | single-line-comment
newline := `000D` | `000A` | `000D` `000A` | `0085` | `000C` | `2028` | `2029`
ws := unicode-space | multi-line-comment
ws := bom | unicode-space | multi-line-comment
bom := `FFEF`
unicode-space := See Table (All White_Space unicode characters which are not `newline`)