github.com/inikulin/parse5 commits | Ecosyste.ms: OpenCollective

ParsingFrame -> Preprocessor

88da225418ff2834c5ee081bf505e87978a701f3 authored almost 12 years ago

TreeConstructor->Parser

87c97e4e8919b025950ccfa80c33e898d3a51703 authored almost 12 years ago

In head insertion mode implementation started

de2976d33c088fc0bd394831bcac28a25fcba28a authored almost 12 years ago

New insertion modes and refactorings

45dae59479196646bffbf0358354c4c50eb6e017 authored almost 12 years ago

More insertion modes

76c49ab8f9864c4c52e402757933cae2cbe30d82 authored almost 12 years ago

more refactorings

4f1de9948aee7641f1ddda2f9ac427f043f064aa authored almost 12 years ago

Several refactorings

a7eb2304e99fc87435b607659fce00a7de70d06b authored almost 12 years ago

Before html insertion mode implemented. Error identifiers removed for now

3e19a39e9641a3d53c2b8080c7bca5edec9c8d08 authored almost 12 years ago

Tree constructor implementation started

9451bd2ac9dfd59f3a9b035d57c542de7c46535a authored almost 12 years ago

Refactorings: isDigit method extracted, tokenizer fields initialized with appropriate values. tree_constructor.js added

60f8f15795a532e9bc3c4c4c290a5e771fa2bf57 authored almost 12 years ago

States now are named functions for easier debugging

5421e9b7f7d967df6cbcd2b2d611c17e11605f79 authored almost 12 years ago

Trie identifiers changed to not interfere with code points

ca8c9772e689059409e6eeeca05290cdad54bf0d authored almost 12 years ago

Bunch of renamings

e534224717619e120a241a45f8055c48142e3c68 authored almost 12 years ago

Latest optimization-related refactorings

69d021a17533dc8b3d96aa8bff60a05f920e2683 authored almost 12 years ago

Huge optimization: work with code points instead of chars.

c2804ce77ee448be3677e9aea4a1afe44366b8e9 authored almost 12 years ago

Make it right..

c46384f289f1611222bf6893513678ba286d440b authored almost 12 years ago

Cache mechanism previously used in ParsingFrame was ineffective. Gaps mechanism introduced. 65% performance boost!!!

f077a998a4c15cf65147062f6d14ea603d0291b0 authored almost 12 years ago

text_utils.js -> unicode.js

e2585a25f56e2bf3bbbe4c0e95a836d3eec5df14 authored almost 12 years ago

Moved text utils back to tokenizer to support inlining

dd317fd30c6243bab0c52a1bd7649eea42061194 authored almost 12 years ago

Several optimizations. Char code check moved to ParsingFrame to support inlining

fbe8696026f0462401bfc4ef7f29823cae589a0f authored almost 12 years ago

Fixed optimization

9d598ca3171d22a2223c672a941f084af8850149 authored almost 12 years ago

Optimizations added

f063288cfdcfcd58d9ba49554f4ec5a371659948 authored almost 12 years ago

States refactorings

08e24454da579273a9189111238467479492971a authored almost 12 years ago

No broken tests left. There was an error in test, according to spec '' input should emit 2 errors: one for '--!' sequence in comment and second for NULL character in comment. Now some additional work required to support UTF-16 chars in some case, then refactorings and optimizations come to place

f7a342bd2a45233c306f61eed16751fe23f1aac0 authored almost 12 years ago

Bug fixes. 1 failing test left

6a35d31b301bbc5863279f1423622ea8c74bdd74 authored almost 12 years ago

Several refactorings and bug fixes. Only 4 failing tests left.

5a59b2da45813786593ca09ee56ce673e83d4284 authored almost 12 years ago

End tag token errors handling

a36713fd3378c65f08d47126aebf8519e833d833 authored almost 12 years ago

Several simple refactorings

67e5acd70fc019dcf549cacc69a2becd1e98da30 authored almost 12 years ago

Parsing frame done right. Text-related stuff moved to text_utils.js

0ec7bf6531765e16df7a34cd5d06f597883f6cef authored almost 12 years ago

wtf

b2e3edeecd54ab2d310bf9ce9ced8e251a572615 authored almost 12 years ago

Parsing frame implemented. Requires serious refactoring and revision. consumeIfMatch functionality is currently broken

706227e09e8697ee8430fd8ea9970987d40ed917 authored almost 12 years ago

Numeric entities consumption implemented

3390223f2c3ffbfe5c01d8fe6c542ed32d101162 authored almost 12 years ago

Numeric entity consumption implementation started. Tests restructured.

fa0ed5d281bb54396fb0a848d6801fb7ddd9da7d authored almost 12 years ago

Named entity consumption refactored. Several additional refactorings performed

bf8a8a3eb21a57c924ce1ae913a53f0a6d0d6714 authored almost 12 years ago

Buggy and shitty named entity consumption implemented. This requires a serious revision. However, at the moment only 1818 of 6677 tests in the html5lib tokenizer test suite fail

109e2faed76ee465c4e8e0cce6b50508a7d830b4 authored almost 12 years ago

oops once again

4dbffac77778f262cae200abee2238b690eac207 authored almost 12 years ago

Named entities trie added

b0281f5530c3af94980b83f861571da1563827a7 authored almost 12 years ago

Oops

42ae9c0e2c7a73d2b074c0e0df99dbe3144a3cfe authored almost 12 years ago

Lexer renamed to Tokenizer for consistency

2f7138f207f0ed16429fb6f4366d557338abb701 authored almost 12 years ago

Character consumption done right. Character reference consumption implementation started

604a87718dc9d2a07848fc9658cdf1e53ca2a0f8 authored almost 12 years ago

Tests added

c00ce8a5898d5bf3d234527fdb49de1703044a69 authored almost 12 years ago

All simple states implemented. Todo next: 1) Implement character reference parsing and all related states 2) Implement all states with conditional consumption 3) Test Lexer using html5lib test suite

d1ba8df867c4d3d91852cb1b3bffec57aeaa75b7 authored almost 12 years ago

More states and lookahead implemented

24b52a2141e7865522f9bd3f7c762017cde359de authored almost 12 years ago

More lexer states =/. Unnecessary lexer entities dropped.

14bee04667a0599f5fb1e96c6c376545a3550820 authored almost 12 years ago

Lexer line length storing fixed. LICENSE reformatted

40f203c93df619e86821dbdf2e0f268d97c4a264 authored almost 12 years ago

Merge commit '4fe91f94b05733e5a2c04c6816057fe6a5c5b1d6'

53718fed8355cea222b16c3cb7d19837c173c02f authored almost 12 years ago

New bunch of states. Now WHATWG version of specification used (nothing is changed actually except state numeration =))

8c58aa7ca06d663d43d0a9a00d3a8c243e845be2 authored almost 12 years ago

Create LICENSE.md

4fe91f94b05733e5a2c04c6816057fe6a5c5b1d6 authored almost 12 years ago

Numerous bugs fixed. Constants for 'magic literals' added. Mooooore lexer states implemented.

d1804aed598e8b3896aa0ba1895804f67e01f11d authored almost 12 years ago

More lexer states implemented. Code reformatted to improve readability

3892a38fef73f95e9914a00e21b7b3ff80864dd6 authored almost 12 years ago

A bunch of new states added. God, this is a boring job, I tell ya =)

56c224c094e9ed93ea5d882050f3ffeeae2be932 authored almost 12 years ago

More lexer states implemented

8f0e2d725209db868acc06ffe2ff897483f4deba authored almost 12 years ago

Lexer implementation started.

8ef28cd445b1401369f8415b93bc4a35385a0d55 authored almost 12 years ago