nicer formatting for unicode in grammar

This commit is contained in:
Kat Marchán 2020-12-16 12:07:56 -08:00
parent 5cf6a5a276
commit 1bfbe78022
No known key found for this signature in database
GPG Key ID: AEB529C08A3C7E9E
1 changed files with 27 additions and 27 deletions

54
SPEC.md
View File

@ -113,24 +113,24 @@ space](https://www.unicode.org/Public/UCD/latest/ucd/PropList.txt):
| Name | Code Pt | | Name | Code Pt |
|----------------------|---------| |----------------------|---------|
| Character Tabulation | `0009` | | Character Tabulation | `U+0009` |
| Space | `0020` | | Space | `U+0020` |
| No-Break Space | `00A0` | | No-Break Space | `U+00A0` |
| Ogham Space Mark | `1680` | | Ogham Space Mark | `U+1680` |
| En Quad | `2000` | | En Quad | `U+2000` |
| Em Quad | `2001` | | Em Quad | `U+2001` |
| En Space | `2002` | | En Space | `U+2002` |
| Em Space | `2003` | | Em Space | `U+2003` |
| Three-Per-Em Space | `2004` | | Three-Per-Em Space | `U+2004` |
| Four-Per-Em Space | `2005` | | Four-Per-Em Space | `U+2005` |
| Six-Per-Em Space | `2006` | | Six-Per-Em Space | `U+2006` |
| Figure Space | `2007` | | Figure Space | `U+2007` |
| Punctuation Space | `2008` | | Punctuation Space | `U+2008` |
| Thin Space | `2009` | | Thin Space | `U+2009` |
| Hair Space | `200A` | | Hair Space | `U+200A` |
| Narrow No-Break Space| `202F` | | Narrow No-Break Space| `U+202F` |
| Medium Mathematical Space | `205F` | | Medium Mathematical Space | `U+205F` |
| Ideographic Space | `3000` | | Ideographic Space | `U+3000` |
### Newline ### Newline
@ -139,13 +139,13 @@ lines](https://www.unicode.org/versions/Unicode13.0.0/ch05.pdf):
| Acronym | Name | Code Pt | | Acronym | Name | Code Pt |
|---------|-----------------|---------| |---------|-----------------|---------|
| CR | Carriage Return | `000D` | | CR | Carriage Return | `U+000D` |
| LF | Line Feed | `000A` | | LF | Line Feed | `U+000A` |
| CRLF | Carriage Return and Line Feed | `000D` + `000A` | | CRLF | Carriage Return and Line Feed | `U+000D` + `U+000A` |
| NEL | Next Line | `0085` | | NEL | Next Line | `U+0085` |
| FF | Form Feed | `000C` | | FF | Form Feed | `U+000C` |
| LS | Line Separator | `2028` | | LS | Line Separator | `U+2028` |
| PS | Paragraph Separator | `2029` | | PS | Paragraph Separator | `U+2029` |
Note that for the purpose of new lines, CRLF is considered _a single newline_. Note that for the purpose of new lines, CRLF is considered _a single newline_.
@ -192,11 +192,11 @@ escline := '\\' ws* (single-line-comment | newline)
linespace := newline | ws | single-line-comment linespace := newline | ws | single-line-comment
newline := `000D` | `000A` | `000D` `000A` | `0085` | `000C` | `2028` | `2029` newline := See Table (All line-break white_space)
ws := bom | unicode-space | multi-line-comment ws := bom | unicode-space | multi-line-comment
bom := `FFEF` bom := '\u{FFEF}'
unicode-space := See Table (All White_Space unicode characters which are not `newline`) unicode-space := See Table (All White_Space unicode characters which are not `newline`)