I want TagSoup settings to use HTML5 standars.
I am using tagsoup Parser which is adhearing to HTML4 which doesn't allow a <div> inside an <a> tag. hence, parsing a wrong HTML. However, HTML5 allows the use of the same. How do I makethe tagsoup (org.ccil.cowan.tagsoup) to use HTML5 standards.
eg,
<a>
<div></div>
</a>
becomes,
<a></a>
<div></div>
I had the same problem with the following structure:
became,
I resolved it by using a custom HTMLSchema:
...