- Improve parsing to correctly handle spaces in css style #222
- Rewriting of parsing to use
Span<char>instead of Regex for +25% performance gain 🚀 - Set Timeout on remaining Regex to prevent any DoS attack
- Remove extra border space in table #156
- Added .NET 10 as an explicit target
- Fix page break. Add support for
break-beforeandbreak-aftercss style #220 - Support registering custom bookmark with
data-bookmark#219
- Fix a fatal crash when trying to convert multiple images #215
- New feature to allow to reference external image instead of embedding them #216
- Fix a potential issue on image streams that are disposed too early.
- Support table col with percentage width #206
- Fix handling Uri with an anchor #209
- New option DefaultStyles.NumberedHeadingStyle to support an alternate heading style #210
(wrong packaging, same code as 3.2.5)
- Fix a crash with the new whitespace handling introduced in 3.2.3 #191
- Fix crash when the html contains 2 images with identical source path #193
- Support margin auto for table alignment #194
- Fix handling whitespace between runs #195
- Whitelist more mime-types as specified by the IANA standard #196
- Support EMF file #196
- Correct handling of
figcaption(allow nested phrasings) #197 - Numbering list now supports type attribute
<ol type="1|a|A|i|I">#198 - Always restart nested numbering list #198
- Fix table borders being removed even when the specified word table style has borders #199
- Defensive code when download image stream is truncated #201
- Table inside list is constrained to not exceed page margin #202
- Table now supports width:auto for auto-fit content #202
- Fix a crash with the new whitespace handling introduced in 3.2.3 #191
- Table inside list must be aligned with the list item #192
- Improve support of table alignment #187
- Fix a crash if a span is empty
- Heading with only digits should not be considered as a numbering #189
- Fix whitespaces inserted between spans #179 and #185
- Support percentage size (typically width:100%) for img node #188
- Supports a feature to disable heading numbering #175
- Support center image with margin auto #171
- Support deprecrated align attribute for block #171
- Fix parsing of style attribute with a key with no value
- Improve parsing of style attribute to avoid an extra call to HtmlDecode
- Extend support of nested list for non-W3C compliant html #173
- Change way to apply table 100% width
- Allow to apply percentage widths cells
- Fix indentation of numbering list #166
- Bordered container must render its content with one bordered frame #168
- Fix serialisation of the "Harvard" style for lower-roman list
- Fix ParseHeader/Footer where input with multiple paragraphs output only the latest
- Ensure to apply default style for paragraphs, to avoid a paragraph between 2 list is mis-guessed
- Add new public API to allow parsing into Header and Footer #162. Some API methods as been flagged as obsolete with a clear message of what to use instead. This is not a breaking changes as it keep existing behaviour.
- Add support for
SVGformat (either from img src or the SVG node tag) - Automatically create the
_topbookmark if needed - Fix a crash when a hyperlink contains both
imgandfigcation - Fix a crash when
liis empty #161
- Fix respecting layout with
div/pending with line break #158 - Prevent crash when header/footer is incomplete and parsing image #159
- Fix combining 2 runs separated by a break, 2nd line should not be prefixed by a space
- Fix table Cell borders are wrongly applied on the run #156
- Correctly handle RTL layout for text, list, table and document scope #86 #66
- Support property line-height #52
- Fallback to
backgroundstyle attribute as many users use this simplified attribute version - In
HtmlDomExpression.CreateFromHtmlNode, use the correct casting toIElementrather thanIHtmlElement, to prevent crash ifsvgnode is encountered
- Ensure to count existing images from header and footer too #113
- Preserve line break pre for OSX/Windows
- Prevent a crash when the provided style is missing its type
- Defensive code to avoid 2 rowSpan+colSpan with a cell in between to crash #59
- AngleSharp is now the backend parser for Html
- Refactoring to use the Interpreter/Composite design pattern, which ease the code maintenance
- Lots of new unit test cases (190+)
- Rewriting of
list(correct handling of nested style, restarting numbers and consecutive) - Rewriting of
table(row span, col span, col tags driving styles) - Parallel download of images at early stage of the parsing.
- Fix signing the assembly
- Enable Nullable reference types
- support latest version of OpenXML SDK (3.1.0) which introduces breaking changes, but also support embedding SVG and JPEG2000 files.
- fix caching the provisioned images
- drop support for .Net Standard 1.3
do not use as the signing assembly was in failure #138
- better table border style
- keep processing html even if downloading image generates an error
- support for styling OL, UL and LI elements
- support latest version of OpenXML SDK (2.12.0) which introduces an API to add an OpenXmlElement to the correct XSD order
- restore support for .NET 4.6+, Net Standard 1.3+
- use cleaner name for base-64 images description
- support latest version of OpenXML SDK (2.11.0+) which fix fatal issue
- drop support for .NET 4.0, .Net Standard 1.4
- optimize number of nested list numbering (thanks to BenGraf)
- fix an issue where some styles weren't being applied
- fix reading JPEG images with SOF2 progressive DCT encoding
- fix nested list numbering
- fix manual provisioning of images
- img respect both border attribute and border style attribute
This brings .Net Core support:
- better inline styling
- numbering list with nested list is more stable
- allow parsing unit with decimals
- color can be either rgb(a), hsl(a), hex or named color.
- parser is more stable
- imported from codeplex.com