index
:
html-parser
master
A small html parser written in ANSI C that doesn't perform any dynamic allocation and neither copies any memory.
about
summary
refs
log
tree
commit
diff
log msg
author
committer
range
path:
root
/
src
Age
Commit message (
Collapse
)
Author
2021-07-19
Javascript string doesn't escape script end tag, so dont escape that. Fixes ↵
dec05eba
some websites
2021-07-19
Fix script tag not getting closed until the end
dec05eba
2021-07-03
Fix attribute value without quote (first char was skipped)
dec05eba
2021-07-03
mm
dec05eba
2021-07-03
Call tag end for void tags (such as br)
dec05eba
2021-07-02
Do not callback for text that is empty after stripping
dec05eba
2021-07-02
Return error if too many unclosed tags
dec05eba
2021-07-02
Return non-0 value from callback to cancel parsing (and return the value in ↵
dec05eba
html_parser_parse)
2021-04-28
Remove warning stderr output
dec05eba
2021-04-28
Simplify api, use const char* for input string
dec05eba
2021-04-28
Make tags case insensitive
dec05eba
2021-04-12
Parse attribute value without quotes correctly with whitespace
dec05eba
2020-10-19
Take into consideration < /html> and <br/ >
dec05eba
2020-10-19
Revert tag name to previous after void tag end (such as <br>)
dec05eba
2020-06-06
Convert uppercase to lowercase for tag and attribute name, fix missing ↵
dec05eba
is_identifier_char
2020-05-14
Use text_stripped for javascript as well
dec05eba
2020-02-03
Make c89 compliant
dec05eba
2020-01-06
Support rss with colon in tag name
DEC05EBA
2020-01-04
Call parse callback even for empty text/javascript. Makes tools like xpath ↵
DEC05EBA
easier
2020-01-01
Do not skip whitespaces in text in pre tags
DEC05EBA
2020-01-01
Ignore comments, ignore end tags without a start tag.
DEC05EBA
Fixes tags closing too soon
2019-12-31
Fix bug where end tag can be reported for a broken void tag
DEC05EBA
2019-12-31
Change name of StringView to prevent name conflicts with other projects
DEC05EBA
2019-12-31
Initial commit, copied from wwwhtml-parser. Works. Need to add unescape of ↵
DEC05EBA
html sequences