HTML tags in content will make the parsed result a mess
Le — Jan 24, 2018 05:09AM CET
Thanks for your great tool! When I use Push to Kindle to push a page from Safari online books, it does not handle HTML tags that appear as literal text in the article content well. It seems to treat them as real HTML tags. Here is an example of the result:
With a tool like egrep, it doesn’t seem particularly common or useful to simply match lines with HTML tags. But, exploring a regular expression that matches HTML tags exactly can be quite fruitful, especially when we delve into more advanced tools in the next chapter.
Looking at simple cases like ‘</code>’ and ‘<code><HR></code>’, we might think to try <span class="ent">⌈</span><code><.*></code><span class="ent">⌋</span>. This simplistic approach is a frequent first thought, but it’s certainly incorrect. Converting <span class="ent">⌈</span><code><.*></code><span class="ent">⌋</span> into English reads “match a ‘<code><</code>’
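To illustrate why the quoted pattern `<.*>` is too simplistic, here is a minimal Python sketch. The sample string and the function names are made up for illustration and are not from the excerpt. The second pattern, `<[^>]*>`, is a common tighter alternative shown only for contrast; it is still not a complete HTML tag matcher (for example, it breaks on a `>` inside an attribute value).

```python
import re

def find_tags_naive(text):
    """Return what the naive pattern <.*> matches (greedy, so it overshoots)."""
    return re.findall(r"<.*>", text)

def find_tags_bounded(text):
    """Return matches of <[^>]*>, which stops at the first closing '>'."""
    return re.findall(r"<[^>]*>", text)

# Hypothetical sample line containing three separate tags.
sample = "a <B>bold</B> word and an <HR> rule"

print(find_tags_naive(sample))    # ['<B>bold</B> word and an <HR>']
print(find_tags_bounded(sample))  # ['<B>', '</B>', '<HR>']
```

Because `.*` is greedy, the naive pattern runs from the first `<` on the line to the last `>`, swallowing the text between tags.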
1 Community Answer
Keyvan Minoukadeh - Jan 25, 2018 at 01:55AM CET
Hi there, could you please give us the URL of the page you're trying to send so we can take a look?