HTML Parsing: Introduce a new HTML rewriting stream that allows for more flexible manipulation of HTML content, enhancing your ability to customize and transform HTML as needed. This feature also includes text and attribute value escaping to ensure your HTML remains valid during transformations.
Error Handling: Add comprehensive error reporting for various scenarios, including duplicate attributes and specific insertion modes, improving the robustness of HTML parsing and serialization.
Improvements
Performance: Enhance performance by fixing a regression in attribute duplication checks, ensuring faster and more efficient parsing of HTML documents.
Location Information: Implement location information for EOF tokens and improve location calculations for elements, providing better context and accuracy in error reporting and debugging.
Documentation: Update and expand documentation for various components, including the SAX parser and HTML rewriting stream, making it easier for users to understand and utilize the toolset effectively.
Fixes
Fix issues related to empty DOCTYPE field values, ensuring they are converted to empty strings for better compliance with HTML standards.
Resolve various naming inconsistencies in error messages, improving clarity and user understanding when parsing errors occur.
Others
Streamline the codebase by removing unnecessary components and adopting ES6 syntax, leading to cleaner and more maintainable code.
Transition to a multi-package repository structure, enhancing organization and modularity of the project.