| | Commit message | Author | Age | Files | Lines |
---|---|---|---|---|---|
* | Update build scripts and data | Mathias Bynens | 2016-04-04 | 10 | -72/+99 |
| | The spec changed the order of some listings here and there. | | | | |
* | Update dependencies | Mathias Bynens | 2015-01-26 | 3 | -3/+8 |
* | Update spec URLs | Mathias Bynens | 2014-09-15 | 2 | -2/+2 |
* | Minor cleanup | Mathias Bynens | 2014-08-31 | 1 | -1/+1 |
* | Clean up dependencies | Mathias Bynens | 2014-05-24 | 2 | -4/+1 |
* | Simplify `regexInvalidRawCodePoint` | Mathias Bynens | 2014-05-24 | 1 | -12/+12 |
| | By using a lookahead the regular expression to match lone surrogates gets a bit more compact. This also fixes an encoding issue caused by a typo that was introduced in 74fd7cd1b3fad9f464b778acff8ffeb0d3a47163. | | | | |
* | Split scripts into separate files per exported value | Mathias Bynens | 2014-05-24 | 9 | -88/+116 |
* | Clean up scripts | Mathias Bynens | 2014-05-24 | 2 | -34/+22 |
* | Fix off-by-one error in printable ASCII range | Mathias Bynens | 2014-05-24 | 1 | -6/+6 |
* | Don’t encode invalid code points whose character references would refer to another code point | Mathias Bynens | 2014-05-24 | 1 | -0/+28 |
| | I.e. the code points listed in the first column of the overrides table at <http://www.whatwg.org/specs/web-apps/current-work/multipage/tokenization.html#table-charref-overrides>. Closes #19. | | | | |
* | Make lone high surrogates trigger parse errors too | Mathias Bynens | 2014-05-24 | 1 | -0/+1 |
* | Add `strict` option to `he.encode` | Mathias Bynens | 2014-05-24 | 2 | -4/+26 |
| | Ref. #19. | | | | |
* | Scrape the spec for more data | Mathias Bynens | 2014-05-23 | 1 | -34/+59 |
| | Let’s also scrape the spec for the list of code points for symbols that cause parse errors when they occur in raw HTML source. Note: `invalid-character-reference-code-points.json` is identical to `invalid-raw-code-points.json` except U+000D (CR) is not included in the latter, because lone CR are converted to LF before tokenization. See http://whatwg.org/html/parsing.html#preprocessing-the-input-stream | | | | |
* | Update dependencies | Mathias Bynens | 2014-04-24 | 1 | -2/+3 |
* | Update dependencies | Mathias Bynens | 2013-10-15 | 2 | -21/+13 |
* | encode: Add `encodeEverything` option | Mathias Bynens | 2013-08-07 | 2 | -9/+46 |
| | Ref. #12. | | | | |
* | Fix logic error in `scrape-spec.js` | Mathias Bynens | 2013-08-04 | 1 | -7/+12 |
* | data/decode-map-overrides: Filter keys mapping to themselves | Mathias Bynens | 2013-08-02 | 1 | -16/+22 |
| | @fb55 pointed out that `decode-map-overrides.json` can be simplified: any code points that map to their respective symbols can be removed if they are included in `invalid-code-points.json`. Good catch! Closes #13. | | | | |
* | Tweak script that scrapes the spec | Mathias Bynens | 2013-08-02 | 1 | -7/+6 |
* | Avoid compiling `regexDecode` at runtime | Mathias Bynens | 2013-07-30 | 1 | -1/+4 |
* | Update dependencies | Mathias Bynens | 2013-07-27 | 3 | -3/+3 |
* | Tweak template variable names | Mathias Bynens | 2013-07-01 | 1 | -2/+2 |
* | Make sure code points are stored as numbers | Mathias Bynens | 2013-07-01 | 1 | -9/+14 |
* | Generate an array of all code points that trigger a parse error | Mathias Bynens | 2013-07-01 | 3 | -55/+100 |
| | Ref. #9. | | | | |
* | Make file names of data files more consistent | Mathias Bynens | 2013-06-30 | 2 | -3/+4 |
* | Remove stray `require('regenerate')` | Mathias Bynens | 2013-06-29 | 1 | -1/+0 |
* | Clean up data processing scripts | Mathias Bynens | 2013-06-28 | 3 | -55/+103 |
* | Future-proof the script that scrapes the character reference overrides | Mathias Bynens | 2013-06-28 | 1 | -36/+32 |
* | Move `src/data.js` to the `scripts` folder | Mathias Bynens | 2013-06-27 | 1 | -0/+92 |
* | Add script that parses the table in section 12.2.4.69 | Mathias Bynens | 2013-06-27 | 1 | -0/+41 |
| | See http://whatwg.org/html/tokenization.html#tokenizing-character-references. Ref. #4. | | | | |
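
The 2014-05-24 commit "Simplify `regexInvalidRawCodePoint`" mentions using a lookahead to match lone surrogates. The following is a minimal sketch of that general technique, not the regular expression from the repository; the names `regexLoneSurrogate` and `hasLoneSurrogate` are hypothetical and chosen only for illustration.

```javascript
// Hypothetical sketch; this is not the pattern used in the repository.
// A high surrogate is lone when no low surrogate follows it, which the
// negative lookahead checks without consuming the next code unit.
// A low surrogate is lone when it starts the string or follows anything
// other than a high surrogate.
const regexLoneSurrogate =
  /[\uD800-\uDBFF](?![\uDC00-\uDFFF])|(?:[^\uD800-\uDBFF]|^)[\uDC00-\uDFFF]/;

// No `g` flag, so `test` keeps no state between calls.
function hasLoneSurrogate(string) {
  return regexLoneSurrogate.test(string);
}

console.log(hasLoneSurrogate('a\uD83D\uDCA9b')); // false (well-formed pair)
console.log(hasLoneSurrogate('a\uD83Db'));       // true (lone high surrogate)
console.log(hasLoneSurrogate('\uDCA9b'));        // true (lone low surrogate)
```

The lookahead keeps the high-surrogate branch to a single character class plus an assertion, which is presumably the kind of compaction the commit message refers to.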