Ecosyste.ms: OpenCollective
An open API service for software projects hosted on Open Collective.
github.com/inikulin/parse5
HTML parsing/serialization toolset for Node.js. WHATWG HTML Living Standard (aka HTML5)-compliant.
https://github.com/inikulin/parse5
ParsingFrame -> Preprocessor
88da225418ff2834c5ee081bf505e87978a701f3 authored almost 12 years ago
TreeConstructor->Parser
87c97e4e8919b025950ccfa80c33e898d3a51703 authored almost 12 years ago
In head insertion mode implementation started
de2976d33c088fc0bd394831bcac28a25fcba28a authored almost 12 years ago
New insertion modes and refactorings
45dae59479196646bffbf0358354c4c50eb6e017 authored almost 12 years ago
More insertion modes
76c49ab8f9864c4c52e402757933cae2cbe30d82 authored almost 12 years ago
more refactorings
4f1de9948aee7641f1ddda2f9ac427f043f064aa authored almost 12 years ago
Several refactorings
a7eb2304e99fc87435b607659fce00a7de70d06b authored almost 12 years ago
Before html insertion mode implemented. Error identifiers removed for now
3e19a39e9641a3d53c2b8080c7bca5edec9c8d08 authored almost 12 years ago
Tree constructor implementation started
9451bd2ac9dfd59f3a9b035d57c542de7c46535a authored almost 12 years ago
Refactorings: isDigit method extracted, tokenizer fields initialized with appropriate values. tree_constructor.js added
60f8f15795a532e9bc3c4c4c290a5e771fa2bf57 authored almost 12 years ago
States now are named functions for easier debugging
5421e9b7f7d967df6cbcd2b2d611c17e11605f79 authored almost 12 years ago
Trie identifiers changed to not interfere with code points
ca8c9772e689059409e6eeeca05290cdad54bf0d authored almost 12 years ago
Bunch of renamings
e534224717619e120a241a45f8055c48142e3c68 authored almost 12 years ago
Latest optimization-related refactorings
69d021a17533dc8b3d96aa8bff60a05f920e2683 authored almost 12 years ago
Huge optimization: work with code points instead of chars.
c2804ce77ee448be3677e9aea4a1afe44366b8e9 authored almost 12 years ago
Make it right..
c46384f289f1611222bf6893513678ba286d440b authored almost 12 years ago
Cache mechanism previously used in ParsingFrame was ineffective. Gaps mechanism introduced. 65% performance boost!!!
f077a998a4c15cf65147062f6d14ea603d0291b0 authored almost 12 years ago
text_utils.js -> unicode.js
e2585a25f56e2bf3bbbe4c0e95a836d3eec5df14 authored almost 12 years ago
Moved text utils back to tokenizer to support inlining
dd317fd30c6243bab0c52a1bd7649eea42061194 authored almost 12 years ago
Several optimizations. Char code check moved to ParsingFrame to support inlining
fbe8696026f0462401bfc4ef7f29823cae589a0f authored almost 12 years ago
Fixed optimization
9d598ca3171d22a2223c672a941f084af8850149 authored almost 12 years ago
Optimizations added
f063288cfdcfcd58d9ba49554f4ec5a371659948 authored almost 12 years ago
States refactorings
08e24454da579273a9189111238467479492971a authored almost 12 years ago
No broken tests left. There was an error in test, according to spec '<!----!\\u0000-->' input should emit 2 errors: one for '--!' sequence in comment and second for NULL character in comment. Now some additional work required to support UTF-16 chars in some case, then refactorings and optimizations come to place
f7a342bd2a45233c306f61eed16751fe23f1aac0 authored almost 12 years ago
Bug fixes. 1 failing test left
6a35d31b301bbc5863279f1423622ea8c74bdd74 authored almost 12 years ago
Several refactorings and bug fixes. Only 4 failing tests left.
5a59b2da45813786593ca09ee56ce673e83d4284 authored almost 12 years ago
End tag token errors handling
a36713fd3378c65f08d47126aebf8519e833d833 authored almost 12 years ago
Several simple refactorings
67e5acd70fc019dcf549cacc69a2becd1e98da30 authored almost 12 years ago
Parsing frame done right. Text-related stuff moved to text_utils.js
0ec7bf6531765e16df7a34cd5d06f597883f6cef authored almost 12 years ago
wtf
b2e3edeecd54ab2d310bf9ce9ced8e251a572615 authored almost 12 years ago
Parsing frame implemented. Requires serious refactoring and revision. consumeIfMatch functionality is currently broken
706227e09e8697ee8430fd8ea9970987d40ed917 authored almost 12 years ago
Numeric entities consumption implemented
3390223f2c3ffbfe5c01d8fe6c542ed32d101162 authored almost 12 years ago
Numeric entity consumption implementation started. Tests restructured.
fa0ed5d281bb54396fb0a848d6801fb7ddd9da7d authored almost 12 years ago
Named entity consumption refactored. Several additional refactorings performed
bf8a8a3eb21a57c924ce1ae913a53f0a6d0d6714 authored almost 12 years ago
Buggy and shitty named entity consumption implemented. This requires a serious revision. However at the moment only 1818 of 6677 tests in html5lib tokenizer test suite fails
109e2faed76ee465c4e8e0cce6b50508a7d830b4 authored almost 12 years ago
oops once again
4dbffac77778f262cae200abee2238b690eac207 authored almost 12 years ago
Named entities trie added
b0281f5530c3af94980b83f861571da1563827a7 authored almost 12 years ago
Oops
42ae9c0e2c7a73d2b074c0e0df99dbe3144a3cfe authored almost 12 years ago
Lexer renamed to Tokenizer for consistency
2f7138f207f0ed16429fb6f4366d557338abb701 authored almost 12 years ago
Character consumption done right. Character reference consumption implementation started
604a87718dc9d2a07848fc9658cdf1e53ca2a0f8 authored almost 12 years ago
Tests added
c00ce8a5898d5bf3d234527fdb49de1703044a69 authored almost 12 years ago
All simple states implemented. Todo next: 1)implement character reference parsing and all related states 2)Implement all states with conditional consumption 3)Test Lexer using html5lib test suite
d1ba8df867c4d3d91852cb1b3bffec57aeaa75b7 authored almost 12 years ago
More states and lookahead implemented
24b52a2141e7865522f9bd3f7c762017cde359de authored almost 12 years ago
More lexer states =/. Unnecessary lexer entities dropped.
14bee04667a0599f5fb1e96c6c376545a3550820 authored almost 12 years ago
Lexer line length storing fixed. LICENSE reformatted
40f203c93df619e86821dbdf2e0f268d97c4a264 authored almost 12 years ago
Merge commit '4fe91f94b05733e5a2c04c6816057fe6a5c5b1d6'
53718fed8355cea222b16c3cb7d19837c173c02f authored almost 12 years ago
New bunch of states. Now WHATWG version of specification used (nothing is changed actually except state numeration =))
8c58aa7ca06d663d43d0a9a00d3a8c243e845be2 authored almost 12 years ago
Create LICENSE.md
4fe91f94b05733e5a2c04c6816057fe6a5c5b1d6 authored almost 12 years ago
Numerous bugs fixed. Constants for 'magic literals' added. Mooooore lexer states implemented.
d1804aed598e8b3896aa0ba1895804f67e01f11d authored almost 12 years ago
More lexer states implemented. Code reformatted to improve readability
3892a38fef73f95e9914a00e21b7b3ff80864dd6 authored almost 12 years ago
A bunch of new states added. God, this is a boring job, I tell ya =)
56c224c094e9ed93ea5d882050f3ffeeae2be932 authored almost 12 years ago
More lexer states implemented
8f0e2d725209db868acc06ffe2ff897483f4deba authored almost 12 years ago
Lexer implementation started.
8ef28cd445b1401369f8415b93bc4a35385a0d55 authored almost 12 years ago