Expose the encoding of a document #121

SimonSapin · 2013-11-30T22:08:36Z

Based on various information provided by the API’s user (see #120) and in the input stream (BOM, <meta charset>), the parser decides which character encoding to use.

The encoding actually used for a parsed document should be exposed somehow. It is needed, e.g., as a fallback encoding for scripts and stylesheets referenced by the document.
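
To make the request concrete, here is a minimal sketch of the kind of decision described above, using only the standard library. It is not html5lib's actual algorithm: the function name `decide_encoding`, the precedence order (BOM, then caller-supplied encoding, then `<meta charset>`, then a default), the `windows-1252` default, and the `(name, certainty)` return shape are all assumptions made for illustration.

```python
import codecs
import re

# Byte-order marks recognised by this sketch (hypothetical, incomplete table).
_BOMS = [
    (codecs.BOM_UTF8, "utf-8"),
    (codecs.BOM_UTF16_LE, "utf-16-le"),
    (codecs.BOM_UTF16_BE, "utf-16-be"),
]

# Deliberately naive pattern; a real parser prescans the bytes far more carefully.
_META_CHARSET = re.compile(
    rb"<meta[^>]*charset\s*=\s*[\"']?([A-Za-z0-9_.:-]+)", re.IGNORECASE
)


def decide_encoding(data, user_encoding=None, default="windows-1252"):
    """Return (encoding_name, certainty) for a byte string.

    Assumed precedence: BOM, caller-supplied encoding, <meta charset>, default.
    """
    for bom, name in _BOMS:
        if data.startswith(bom):
            return name, "certain"
    if user_encoding is not None:
        # codecs.lookup normalises the label and raises LookupError if unknown.
        return codecs.lookup(user_encoding).name, "certain"
    match = _META_CHARSET.search(data[:1024])
    if match:
        try:
            label = match.group(1).decode("ascii")
            return codecs.lookup(label).name, "tentative"
        except LookupError:
            pass  # Unknown label: fall through to the default.
    return default, "tentative"
```

The returned pair is the value a caller would want to read back after parsing, for example to pick the fallback encoding for a linked stylesheet.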


SimonSapin · 2013-11-30T22:21:10Z

This is actually accessible as HTML5Parser.tokenizer.stream.charEncoding.
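
A small accessor along these lines could wrap that attribute path. This is only a sketch: the helper name `used_encoding` is hypothetical, the `(name, certainty)` tuple shape of `charEncoding` is an assumption, and the stand-in object below exists only so the snippet runs without html5lib installed.

```python
from types import SimpleNamespace


def used_encoding(parser):
    """Return parser.tokenizer.stream.charEncoding, or None before parsing.

    Assumes the tokenizer attribute only exists once a parse has started.
    """
    tokenizer = getattr(parser, "tokenizer", None)
    if tokenizer is None:
        return None
    return tokenizer.stream.charEncoding


# Stand-in with the same attribute shape as a parser that has finished parsing.
fake_parser = SimpleNamespace(
    tokenizer=SimpleNamespace(
        stream=SimpleNamespace(charEncoding=("utf-8", "certain"))
    )
)
# Stand-in for a parser on which parse() has not been called yet.
unparsed = SimpleNamespace()
```

With a real parser, the same function would be called on the `HTML5Parser` instance after parsing.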

SimonSapin added a commit to SimonSapin/html5lib-python that referenced this issue Nov 30, 2013

Add a usedEncoding method to HTML5Parser, fix html5lib#121

9aab922

gsnedders closed this as completed in 808d102 Jan 5, 2014
