path: root/scripts
Commit message | Author | Age | Files | Lines
* Update build scripts and data | Mathias Bynens | 2016-04-04 | 10 | -72/+99
    The spec changed the order of some listings here and there.
* Update dependencies | Mathias Bynens | 2015-01-26 | 3 | -3/+8
* Update spec URLs | Mathias Bynens | 2014-09-15 | 2 | -2/+2
* Minor cleanup | Mathias Bynens | 2014-08-31 | 1 | -1/+1
* Clean up dependencies | Mathias Bynens | 2014-05-24 | 2 | -4/+1
* Simplify `regexInvalidRawCodePoint` | Mathias Bynens | 2014-05-24 | 1 | -12/+12
    By using a lookahead the regular expression to match lone surrogates gets a bit more compact.
    This also fixes an encoding issue caused by a typo that was introduced in 74fd7cd1b3fad9f464b778acff8ffeb0d3a47163.
* Split scripts into separate files per exported value | Mathias Bynens | 2014-05-24 | 9 | -88/+116
* Clean up scripts | Mathias Bynens | 2014-05-24 | 2 | -34/+22
* Fix off-by-one error in printable ASCII range | Mathias Bynens | 2014-05-24 | 1 | -6/+6
* Don’t encode invalid code points whose character references would refer to another code point | Mathias Bynens | 2014-05-24 | 1 | -0/+28
    I.e. the code points listed in the first column of the overrides table at <http://www.whatwg.org/specs/web-apps/current-work/multipage/tokenization.html#table-charref-overrides>.
    Closes #19.
* Make lone high surrogates trigger parse errors too | Mathias Bynens | 2014-05-24 | 1 | -0/+1
* Add `strict` option to `he.encode` | Mathias Bynens | 2014-05-24 | 2 | -4/+26
    Ref. #19.
* Scrape the spec for more data | Mathias Bynens | 2014-05-23 | 1 | -34/+59
    Let’s also scrape the spec for the list of code points for symbols that cause parse errors when they occur in raw HTML source.
    Note: `invalid-character-reference-code-points.json` is identical to `invalid-raw-code-points.json` except U+000D (CR) is not included in the latter, because lone CR are converted to LF before tokenization.
    http://whatwg.org/html/parsing.html#preprocessing-the-input-stream
* Update dependencies | Mathias Bynens | 2014-04-24 | 1 | -2/+3
* Update dependencies | Mathias Bynens | 2013-10-15 | 2 | -21/+13
* encode: Add `encodeEverything` option | Mathias Bynens | 2013-08-07 | 2 | -9/+46
    Ref. #12.
* Fix logic error in `scrape-spec.js` | Mathias Bynens | 2013-08-04 | 1 | -7/+12
* data/decode-map-overrides: Filter keys mapping to themselves | Mathias Bynens | 2013-08-02 | 1 | -16/+22
    @fb55 pointed out that `decode-map-overrides.json` can be simplified: any code points that map to their respective symbols can be removed if they are included in `invalid-code-points.json`. Good catch!
    Closes #13.
* Tweak script that scrapes the spec | Mathias Bynens | 2013-08-02 | 1 | -7/+6
* Avoid compiling `regexDecode` at runtime | Mathias Bynens | 2013-07-30 | 1 | -1/+4
* Update dependencies | Mathias Bynens | 2013-07-27 | 3 | -3/+3
* Tweak template variable names | Mathias Bynens | 2013-07-01 | 1 | -2/+2
* Make sure code points are stored as numbers | Mathias Bynens | 2013-07-01 | 1 | -9/+14
* Generate an array of all code points that trigger a parse error | Mathias Bynens | 2013-07-01 | 3 | -55/+100
    Ref. #9.
* Make file names of data files more consistent | Mathias Bynens | 2013-06-30 | 2 | -3/+4
* Remove stray `require('regenerate')` | Mathias Bynens | 2013-06-29 | 1 | -1/+0
* Clean up data processing scripts | Mathias Bynens | 2013-06-28 | 3 | -55/+103
* Future-proof the script that scrapes the character reference overrides | Mathias Bynens | 2013-06-28 | 1 | -36/+32
* Move `src/data.js` to the `scripts` folder | Mathias Bynens | 2013-06-27 | 1 | -0/+92
* Add script that parses the table in section 12.2.4.69 | Mathias Bynens | 2013-06-27 | 1 | -0/+41
    See http://whatwg.org/html/tokenization.html#tokenizing-character-references.
    Ref. #4.
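
The 2014-05-24 entry for `regexInvalidRawCodePoint` says a lookahead makes the lone-surrogate match more compact. The log does not show the generated pattern, so what follows is only a minimal sketch of the general idea. The names `regexLoneSurrogate` and `hasLoneSurrogate` are hypothetical and do not come from the repository.

```javascript
// Illustrative only: hypothetical names, not the pattern produced by the
// build scripts in this directory.
//
// A high surrogate (U+D800..U+DBFF) is "lone" when it is not followed by a
// low surrogate (U+DC00..U+DFFF); the negative lookahead expresses that
// without consuming the next code unit.
// A low surrogate is "lone" when it is at the start of the string or is
// preceded by anything other than a high surrogate.
const regexLoneSurrogate =
  /[\uD800-\uDBFF](?![\uDC00-\uDFFF])|(?:[^\uD800-\uDBFF]|^)[\uDC00-\uDFFF]/;

// No `g` flag is used, so `test` keeps no state between calls.
function hasLoneSurrogate(string) {
  return regexLoneSurrogate.test(string);
}
```

Under these assumptions, a well-formed pair such as `'\uD83D\uDE00'` is not reported, while `'\uD83D'` or `'a\uDE00'` on its own is.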