Ecosyste.ms: OpenCollective
An open API service for software projects hosted on Open Collective.
github.com/ooni/pipeline
OONI data processing pipeline
https://github.com/ooni/pipeline
Add binario workflow
4cd191e35f0bbcfaa6ba0d4c730e8439201d8cdb authored over 9 years ago
Import traceback
0de2b2c4b7d02e0b640dfe734c8a2a479275d568 authored over 9 years ago
Adjust parallelism. Print the traceback.
d77c9ca9692da6d269980df49fc3513d6eae7753 authored over 9 years ago
Fix bug in sanitisation procedure
19fe34bc2748494df235835d2d9a0f8fa969012e authored over 9 years ago
Print the traceback..
9196365ac78f2f7e3924fabe78528646693fe59f authored over 9 years ago
Carry the exception along..
85e956d344d79c35847f7f114cdbf6de7b1f0fd2 authored over 9 years ago
Add function to skip over misformatted YAML reports
686a2192622a32e446eeaf43c4b3a8df38f8f506 authored over 9 years ago
Debug various problems with the pipeline by running it on the test cluster.
* Need clj in path
* Set all the dependencies
* Wrong topology
* Fix imports and names
* Add src...
Rename to report (singular)
9ada3384338330103beef11d4aa4978f7815b469 authored over 9 years ago
Update README
15ad5a05584c15c7ec815b3f86d430b67b794751 authored over 9 years ago
Add aggregate task chart
b6e504efb15fc0beb5bbaeaa70ef8c472e380dd3 authored over 9 years ago
Refactor pipeline to use a KafkaSpout and 2 Bolts
18e849be6ab0ec5320d56d885f1906c481172d1b authored over 9 years ago
Fix typos
6f73b236b733c23f94572cdc63b360deba66ad27 authored over 9 years ago
Properly move kafka messages
1869221ffee25861c5df000e398c664cf45172e2 authored over 9 years ago
Update project.clj to use the hortonworks repo
Thanks to @dan-blanchard for the swift reply and fix
8df70710a485365a2657285b4c398d33d69860e2 authored over 9 years ago
Also add footer to report bucket
6efdae7d9e2eba85438600ca50401c36246a5355 authored over 9 years ago
It's called nimbus y0
5d27553dc0e3e59f3681dae0a66b7893cbf69e08 authored over 9 years ago
Ack tuples
072ef3014ab59bdef6bf36e0204f15348268391f authored over 9 years ago
Extra argument was missing..
a0745d0acdb0f6d841b01e24a2971c4625e7ffea authored over 9 years ago
Move flush print into function
77d28cfd068ae593953c0a49f27288bc16877cc8 authored over 9 years ago
Update storm version
075ad0c7082c94cc0be3feca7908d1bb8767dfc1 authored over 9 years ago
Also check for timeouts inside of the consume loop
56783f7e0c3093defec2f35ffaef268605afba5f authored over 9 years ago
Clean dead code
7719cd1c04a852d0758d31a0391ba130ffdb1f82 authored over 9 years ago
Properly call consume_messages()
8648abb4854c4df32c67f2cd65959bc368a54d2b authored over 9 years ago
Set the timeout for the kafka consumer and after the timeout is reached check for timed out buckets
26233ce2b1778a64366564b774d07dd327eb47e2 authored over 9 years ago
Implement TimedOutStringIO and refactor logging
12f6489831ef60d40de75cda9646b7a2f99c0026 authored over 9 years ago
Remove too much verbosity
534ff67a453a74f6b3828a62514774d2576d3bcd authored over 9 years ago
Add some debug lines to kafka-consume
01424e83a1c608b1ba7d19c51276a0ee8baf3547 authored over 9 years ago
Cast everything to string
06e50b7bfda265aa7fb3c3ddd2b690f0d3fa786d authored over 9 years ago
Update config and requirements
9db851c92e66b9d82ecec48bca16c6b9b881d5ab authored over 9 years ago
Actually the bolt probably wants more parallelism for serialization
4f7a980a48a1660a550a32acbb203bfc263cb37b authored over 9 years ago
Specify the parallelism of them bolts and sprouts
f1ecd4efe97dbea6a6b63d6ac77ff2f8f70f0b4c authored over 9 years ago
"" is not ''
e004757188ce8a344511e28bc531c3d6f10bfeeb authored over 9 years ago
Last try
64b2184cfdd0ae33dcaf5c4ed1f9bb396e887d3b authored over 9 years ago
Cast to binary type
7787fd24af134e5110d212bcd8155bc62b69fa3c authored over 9 years ago
No encoding, no fun.
5a03b72256544b16a83287efe6923b4694395bb0 authored over 9 years ago
Fix naming of record types
4b6cfdae7a4db5131e9fceb18bad9c9e0db37d57 authored over 9 years ago
Configure production
98780b96171d420a39779a753659cd5b27aa3da1 authored over 9 years ago
Add report topology requirements.txt
7482351b179e62080e402d1bd73fc624662800ec authored over 9 years ago
They call them tuples, but they actually mean lists...
5d824eff0632f24b5adfbb9c3d4a30b2aaf9dfd0 authored over 9 years ago
Emit a 3 tuple
210f2a19a1060f2befabfeda720adf497b0af5e0 authored over 9 years ago
Implement a basic unittest
797838b1cf53ecd32272fd30002a73de7910d691 authored over 9 years ago
Need tell
8c4600b52bf73e77b62a1c374bc2c13e27f546d7 authored over 9 years ago
Actually it's gzip..
cf6067db55781f063cd70afee3c31356eae902d4 authored over 9 years ago
boto works differently..
c2209e3cdf880e429818e5bbeb818846b0bff510 authored over 9 years ago
Move open into init
03c8e6f9476149ab39950536e4b3872dafcb2038 authored over 9 years ago
boto reports don't support the with construct
f1815b01dda59c034441b34c3f3de4f2e2420424 authored over 9 years ago
Fix calling of parse
85427656b8f7b26b26984d2a44fdb734470c7294 authored over 9 years ago
Use a python file for configuration instead of config.json
9e0ff18915336e6cd2a345474c412dfa4a393640 authored over 9 years ago
Almost mistake singular and plural
00982a335061f98eba56987a20307e8f256c6856 authored over 9 years ago
It's actually a KafkaBolt
7621a6cf6b4b0c5d5e76bcae581ec91fc027ab7a authored over 9 years ago
Rename report to reports
dd4d594809f3b54bb82a9e16ffdc2f78a09c6e0f authored over 9 years ago
Refactor pipeline to use streamparse
338749b87155feef631dc0f1597a6833a720c38e authored over 9 years ago
Implement finished method.
Keep track of queue of messages that have been processed and flush them when finished
447d5ba291cb8b52fe5973c3efa759f7199ce5a1 authored over 9 years ago
Add some debugging info
e9afcd725f08952278741718d14e5c4fb4bac04c authored over 9 years ago
Fix path
54a1342b0894aa11e9f9152ed1ff286ba8761af3 authored over 9 years ago
Len is on the date bucket StringIO
9c928ae7d40e92f99251b95fb86f0255a8bff2a2 authored over 9 years ago
Prepend a character to identify the type of message
f263a069ded872cfaad7086baf718d094c2ba68e authored over 9 years ago
Use the bucket manager
8221c409f773fe81d5c4ff8301864f5d083b7da4 authored over 9 years ago
Implement bucket based kafka consumer
a142cb7935b00a1b7d2906e4474da1a26e137c56 authored over 9 years ago
Add the bridge_address key when it's missing
8dd3c8b05a714c334ff81c4ef0702acebcf081d7 authored over 9 years ago
Fix missing imports
a3c15a0f5284e575879fe8f84890dadb471ca64a authored over 9 years ago
Disable output and serialise to error message
6c16e1d1269c5c55b8aec4cf2c892fb40d9a9c92 authored over 9 years ago
Fix calling of close
1def874b863da023d87a106a1fa26020e59fa2d4 authored over 9 years ago
For the moment kafka-python seems like the most stable library
a1df40e1a8bb15e6a7aa66a47b765b5db758a1d6 authored over 9 years ago
Fix bug in sanitisers
ab6f17b463b411b46310a7f0cf7b18c662464d49 authored over 9 years ago
Fix references
dbc8af2cbf89f2db8070b55493d0f7f6c97fde65 authored over 9 years ago
Switch back to pykafka
e9f75a31f0520c5bbac6fa0bfbc6ebbd8f5714c6 authored over 9 years ago
Use older API for kafka-python
4770a00ec187d6ca9964b613d6435259b3f7b113 authored over 9 years ago
Implement consumer using kafka-python
29f2226f6c7636a47c2979c3eb45030f071f8731 authored over 9 years ago
Add simple Kafka consumer and producer
35a587e1715445fbef76062a6932b4e3d2d89715 authored over 9 years ago
Add support for publishing topics to kafka
356d8a1cc91537f49458c218afdca1b9c9662925 authored over 9 years ago
parse quickstart (http://streamparse.readthedocs.org/en/latest/quickstart.html)
a30baa5033557bd72ee26be5c1e57b9fb30582c2 authored over 9 years ago
Refactor util and workflow
9b3d3040ba2bd20ed39b1af775979547d9eaa824 authored over 9 years ago
First commit
c2b28ca272d6a265052b2df0ed9f1de38812a73a authored over 9 years ago