Ecosyste.ms: OpenCollective

An open API service for software projects hosted on Open Collective.

github.com/pelias/openaddresses

Pelias import pipeline for OpenAddresses.
https://github.com/pelias/openaddresses

Fix comment

2eccbd9b538ca57068eae7cdc589e76c90063f6f authored almost 9 years ago by Julian Simioni <[email protected]>
Split creation and setup of import pipline

A lot of import pipeline details lived in one place. As many details as
possible are now moved i...

9cee7e2e96cf7d8e27636df62257da6b18fe4d9b authored almost 9 years ago by Julian Simioni <[email protected]>
Move creation of combined steram to import_pipelines

924a420f67ff83651a4f9989e90db3a477d6ccda authored almost 9 years ago by Julian Simioni <[email protected]>
Fix handling of files in peliasConfig

If a user's peliasConfig specified a list of files, the importer was not
properly joining the re...

b90f3abe641aff9d9778c77397662cd9a4c7320f authored almost 9 years ago by Julian Simioni <[email protected]>
Test that default path from config is used when no params specified

240da87ed6a800a9e0422a93c6a1de696f5a3881 authored almost 9 years ago by Julian Simioni <[email protected]>
Move file listing code to interpretUserArgs

af47ce66f8205ea40e969288768e7f250dbf31c1 authored almost 9 years ago by Julian Simioni <[email protected]>
Use temp module for temporary directories in tests

73843715a64081ce8e17d6206072f022040b7c63 authored almost 9 years ago by Julian Simioni <[email protected]>
Add peliasConfig as optional 2nd parameter

9db5976b999318b8a0527f0b9da44c1f54315912 authored almost 9 years ago by Julian Simioni <[email protected]>
Add test for command line dir behavior

83ba2bc9dc0b3a00fd18a5f98cf08776831e571c authored almost 9 years ago by Julian Simioni <[email protected]>
Move interpretUserArgs to its own module

a72967ff66c5d1608d4652fe6adaa19adb835999 authored almost 9 years ago by Julian Simioni <[email protected]>
Count and log invalid records

Count records that don't have a lat, lon, housenumber or streetname and
display the total count ...

2eef14b7288cba58d6c160ca2090add628dec565 authored almost 9 years ago by Julian Simioni <[email protected]>
Trim all string fields in cleanupStream

841ccdd8550fcc550938827e14e301762d6647c4 authored almost 9 years ago by Julian Simioni <[email protected]>
Extract documentStream from createRecordStream

621ae11a0e2154282235182c1e97df181b6593d9 authored almost 9 years ago by Julian Simioni <[email protected]>
Use conditional instead of try/catch

It's expected that zipcodes aren't always present, so a try/catch isn't
appropriate.

cdef1f841bca3e6ad1d91ef7469ad80f7900e6aa authored almost 9 years ago by Julian Simioni <[email protected]>
Move tests to separate files

cf865980fd48205d22097a9581eccbed0287083d authored almost 9 years ago by Julian Simioni <[email protected]>
Test quoted, all whitespace field

a2729555e7b1bac60246cee1b37e2b99fa352361 authored almost 9 years ago by Julian Simioni <[email protected]>
Move all property cleanup to its own stream

c08b3973d637b45977ad32e92755d197658fe5df authored almost 9 years ago by Julian Simioni <[email protected]>
Simplify error handling for housenumber/street name

Both of these fields are required, so if there is an error setting
either of them, the document ...

a16805d218a99239a07b034d71ebef846750d50b authored almost 9 years ago by Julian Simioni <[email protected]>
Extract invalid record filtering into its own stream

3fda9d6eb3c9f548c7a653860c460b8234d96c20 authored almost 9 years ago by Julian Simioni <[email protected]>
Simplify isValidCsvRecord implementation

bf699dcc66130831b2c499a9ebc054014bbc237f authored almost 9 years ago by Julian Simioni <[email protected]>
Test the case where the CSV row contains only whitespace

0401b3844f2d7bbde32bf4fd047250d6a9146b1e authored almost 9 years ago by Julian Simioni <[email protected]>
Merge pull request #26 from pelias/remove_leading_zeros_from_numbered_streets

Add cleanup module with numbered street handling

fbebf5c4873475820bfbf4c902515d678d2d041f authored almost 9 years ago by Julian Simioni <[email protected]>
Add precommit-hook settings to package.json

c97d304d44afc490edd9b215e419bc488d00eeff authored almost 9 years ago by Julian Simioni <[email protected]>
Add cleanup module with numbered street handling

This cleanup code handles numbered streets with leading zeros. For
example, `05th Street` will b...

a50a997f9130f2763554d98835a105e80c3ce373 authored almost 9 years ago by Julian Simioni <[email protected]>
Merge branch 'switch-to-glob-for-recursive-csv-file-find'

400ead03793ee8f678f3dda21247e3c331ac631d authored about 9 years ago by Julian Simioni <[email protected]>
switched to glob package for finding all .csv files from root instead of expecting all .csv files to be in root

250af50c2c4fcc9c8117ef7089c41f028bd38bab authored about 9 years ago by Stephen Hess <[email protected]>
2.1.0

c0f78bec241d14a5472e5192c9ebff244a160bb3 authored over 9 years ago by Peter Johnson <[email protected]>
update pelias-config dep

159755b2394617bff2bdb1506032fb9b71b3b858 authored over 9 years ago by Peter Johnson <[email protected]>
Merge branch 'fix-cli-dir-handling'

f4a4a0d47fd0f25b767eb8586767c83ef5a49895 authored over 9 years ago by Julian Simioni <[email protected]>
pelias-config dependency min version

50ae7e1d07f61103be2f18e02bcb94cc54bc7961 authored over 9 years ago by Peter Johnson <[email protected]>
Handle case where entire openaddresses config is empty

This can happen on import when the user is specifying a directory of
files and hasn't set anythi...

30f4532587588c4b9c4f94fc6ba6d5bf6dcc6163 authored over 9 years ago by Julian Simioni <[email protected]>
2.0.0

e0171fa571e316395fb1c697f19b81ff3c3d221f authored over 9 years ago by Peter Johnson <[email protected]>
Merge pull request #18 from pelias/ngram

migrate from FST to ngrams

da55f9e561786d9808e1ca5f56d47c2f2a17ef10 authored over 9 years ago by Peter Johnson a.k.a. insertcoffee <[email protected]>
migrate from FST to ngrams

99b24baeb617d961654187e2dcfde88052c2ee14 authored over 9 years ago by Peter Johnson <[email protected]>
Merge pull request #16 from pelias/issue_15

resolves #15

d0133d55a48a8f33acebd518b5a7d3823edf02dc authored over 9 years ago by Peter Johnson a.k.a. insertcoffee <[email protected]>
resolves #15

e664c3e105f9c245120df22b57898a386b4b5034 authored over 9 years ago by Peter Johnson <[email protected]>
Remove repeated spaces.

lib/import_pipelines.js
-Remove repeated spaces with a single space.

test/(openaddresses_sampl...

596967326c4a12d4ebedb0e3218c94ab1a4dfe44 authored over 9 years ago by Severyn Kozak <[email protected]>
Set Address data in Documents (closes #14).

669739d51acc883a1d05b2e1a2eaa02a70534ed0 authored over 9 years ago by Severyn Kozak <[email protected]>
package.json: add a script to run the import.

17f5fcb77c3134182ee3e49a01ff803a0b6050e5 authored over 9 years ago by Severyn Kozak <[email protected]>
Revert "Switch to latest address-deduplicator version."

This reverts commit bbf81fc9948b6998476d4d4eb8f7abc7a4f30b4b.

b6733cf8bc585a3d8bbfdb4f51a160d1acd24954 authored almost 10 years ago by Severyn Kozak <[email protected]>
Switch to latest address-deduplicator version.

package.json
-Link to a version with an attempted memory fix.

bbf81fc9948b6998476d4d4eb8f7abc7a4f30b4b authored almost 10 years ago by Severyn Kozak <[email protected]>
Periodically log the total number of bad records.

lib/import_pipelines.js
-Instead of logging a message per bad CSV record encountered
(which mi...

3690444a6396071ce2d476f9362635c96335775b authored almost 10 years ago by Severyn Kozak <[email protected]>
Extract address validation to own function.

lib/import_pipelines.js
-Extract the address-validation conditional to its own function,
`isVa...

fe487147c44ec5f88de273c4620de6dbb4d3da40 authored almost 10 years ago by Severyn Kozak <[email protected]>
lib/import_pipelines.js: remove leftover console.log().

c915794b8a27ed53626ba7a23b3039215377bb8f authored almost 10 years ago by Severyn Kozak <[email protected]>
Add tests for document validity.

test/openaddresses_sample.csv
-Make the data more unpredictable by adding extra additional
col...

5e1dff45dab7e61dc3e032825f9da4f5454d156c authored almost 10 years ago by Severyn Kozak <[email protected]>
Make record validity check more flexible.

lib/import_pipelines.js
-A change in the OA schema (the addition of a `POSTCODE`) field
broke ...

c0eaab2cb219e10e9372119fd97509c59e42bfb9 authored almost 10 years ago by Severyn Kozak <[email protected]>
Merge branch 'cherry-pick-files'

README.md, import.js
-Close #13: allow cherry-picking the files to import from
pelias-config.

8ca2a06cf71d6a5b865f16b78b8316b64688b0f0 authored almost 10 years ago by Severyn Kozak <[email protected]>
Update README files option documentation.

README.md
-Update the documentation of the `files` property in
`pelias-config` against the cha...

0c29a570b34bccf3f0367a462c05d60877f43f48 authored almost 10 years ago by Severyn Kozak <[email protected]>
Import all files for an empty files array.

import.js
-When an empty `files` array is specified in the config, import
all files from the t...

28f2a0ffb1d94e33785f6661689995ea7e2acbad authored almost 10 years ago by Severyn Kozak <[email protected]>
Update README documentation.

README.md, import.js
-Update documentation to reflect changes in the past two
commits.
-Docum...

6052174c72318943912d6801497e5a9a702334e0 authored almost 10 years ago by Severyn Kozak <[email protected]>
Filter out non-CSV files after reading a directory.

import.js
-The code that filters out non-CSV files was meant to only run
on files read from a ...

207098a42e0b8da905dda8990536ebb9627ac7a1 authored almost 10 years ago by Severyn Kozak <[email protected]>
Allow cherry-picking files in pelias-config.

import.js
-Check the `pelias-config` openaddresses configuration object
for a `files` property...

22bb61d31bcaf6d1d1b98330c37119f2fed8fd1f authored almost 10 years ago by Severyn Kozak <[email protected]>
Log the total amount of time taken.

import.js
-Log the total amount of time the import took, as a convenience
(instead of forcing ...

2fbd05372a87917785aa52d6e4cb474b07e123c4 authored almost 10 years ago by Severyn Kozak <[email protected]>
Remove usage information from README.

README.md
-Remove the copy-and-pasted `node import.js --help` from the
"usage" section to prev...

40e395d3d10873c40a97db63a3ff4b71c5170e5a authored almost 10 years ago by Severyn Kozak <[email protected]>
Integrate latest admin-lookup.

import.js, package.json
-Integrate the latest admin-lookup (v2.0.0), which ships with a
pseudo...

244ec72a4d55d15d504113ee60c605f06ed7ab29 authored almost 10 years ago by Severyn Kozak <[email protected]>
Increase deduper batch size, limit to 1 live req.

import.js
-Concurrent requests don't make a ton of sense for what seems
like a CPU-bound problem.

63acc17c51d74708ced4b6446c46a3bf2a72abd3 authored almost 10 years ago by Severyn Kozak <[email protected]>
Integrate the latest pelias-admin-lookup.

7a6dbdda9af49919a583c15ad2bb3c0b4d366e50 authored almost 10 years ago by Severyn Kozak <[email protected]>
Update references to pelias-deduplicator-stream.

import.js, package.json
-Update references to the `pelias-deduplicator-stream` package
to refl...

ec53109421ad74e62f645b5c24b82fc87754076e authored almost 10 years ago by Severyn Kozak <[email protected]>
Make all package.json dep versions fuzzy.

package.json
-Use the `^` in dependency versions to make npm install the
latest compatible ones.

38020693d767746bb5877f258e46a688e98cc594 authored almost 10 years ago by Severyn Kozak <[email protected]>
Switch out winston with pelias-logger.

import.js, lib/import_pipelines.js, package.json
-Use the new `pelias-logger` package instead o...

215c3ab39a9266e1b41eb7fb7e742b1689930f42 authored almost 10 years ago by Severyn Kozak <[email protected]>
Parse CSVs with the csv-parse package.

lib/import_pipelines.js, package.json
-OpenAddresses data got trickier to parse now that it con...

69fa7fdc1a7772986ed0666d6508c6beae513eeb authored almost 10 years ago by Severyn Kozak <[email protected]>
Integrate new pelias-admin-lookup.

import.js, package.json
-Integrate the `pelias-admin-lookup` package, which replaces
`pelias-h...

a08f2c8b6debcd1c30f0e1f4b8ff4bef46427808 authored almost 10 years ago by Severyn Kozak <[email protected]>
Use minimist for args. Read OA path from config.

import.js, package.json
-Use the `minimist` package for argument parsing.
-Make the data path ...

bf3cd07c733b47af0c8c8d53ab4befb31b922017 authored almost 10 years ago by Severyn Kozak <[email protected]>
Set pelias-hierarchy-lookup dep to latest version.

package.json
-Set the `pelias-hierarchy-lookup` package to the latest
version, which should fi...

c533419494060813d7535aaf37033759b53fcb99 authored almost 10 years ago by Severyn Kozak <[email protected]>
Add conditionals to check directory arg.

import.js
-Add two conditionals to `interpretUserArgs()` to check whether
the provided directo...

0194254e59b0624f5c126386164d9eba39343323 authored almost 10 years ago by Severyn Kozak <[email protected]>
.gitignore: add a .gitignore for node_modules/.

8e938bb14093f6796f1cafdb80bde861571cd9fe authored almost 10 years ago by Severyn Kozak <[email protected]>
Remove now obsolete admin_data/.

admin_data/(country_codes, state_codes/(ca, us)).json
-Remove all `admin_data` files, which wer...

78a7279bad3364002e2ba6615771a44cdffacc4d authored almost 10 years ago by Severyn Kozak <[email protected]>
Merge branch 'issue/9'

lib/import_pipelines.js
-Fix #9.

080c36d1616a3128b8b8ca3233eff4f135cef92a authored almost 10 years ago by Severyn Kozak <[email protected]>
v1.0.4

206b708c308cc15ddc27ec6efb609020e2e5c0e8 authored almost 10 years ago by Severyn Kozak <[email protected]>
Fix #9. Remove fast-csv.

lib/import_pipelines.js, package.json
-The replacement of `fast-csv` with a dead-simple CSV par...

a5034192b1a9abafccf917b90255e9753d6d04fe authored almost 10 years ago by Severyn Kozak <[email protected]>
Potentially fix #9.

lib/import_pipelines.js
-`fast-csv`, the package used to parse OpenAddresses CSVs,
appeared to...

6a1485d8c2efe6160517f9c8a4a5cc33311f4244 authored almost 10 years ago by Severyn Kozak <[email protected]>
package.json: update suggester-pipeline dep.

ac3125df2cbe4400a32805e84ef892467d248900 authored almost 10 years ago by Severyn Kozak <[email protected]>
.jshintrc: decrease line-length limit to 120.

f74817d1aaa7629782c83dfcb9d4e53fcaa0f9d9 authored almost 10 years ago by Severyn Kozak <[email protected]>
package.json: add jshint to devDependencies.

abd5b1e83e1ac6ee2624bde2a2767181dce49417 authored almost 10 years ago by Severyn Kozak <[email protected]>
v1.0.3

7ec1ac6a8521a1586bbb42e22e673ae741816959 authored almost 10 years ago by Severyn Kozak <[email protected]>
Fix jshint complaints.

import.js, lib/import_pipelines.js, test/test.js
-Fix the few things that `jshint` complained a...

e12a58c46cc7c6c9d0cb61d8fc1f804b0443bf7a authored almost 10 years ago by Severyn Kozak <[email protected]>
Integrate jshint. Add jshintrc, precommit-hook.

.jshintignore, .jshintrc, package.json
-Add a `.jshintrc`, and the `precommit-hook` package to ...

9203d4d813d01aa81e8b58009dc903f87006d733 authored almost 10 years ago by Severyn Kozak <[email protected]>
Add more logging statements, logger config.

import.js
-Add some configuration options to Winston.
-Add more diagnostic logging statements ...

39ca510b0f4e45c059a203fb6ef5cde9106ddf5f authored almost 10 years ago by Severyn Kozak <[email protected]>
Add the winston logger.

import.js, lib/import_pipelines.js, package.json
-Add the `winston` logger, and replace all `co...

8d055480a305d0a608db6ae1463a2f6b89e38e57 authored almost 10 years ago by Severyn Kozak <[email protected]>
v1.0.2

1428fe3c0d1a93169c28b68f0f9ad3e2daadcbdd authored almost 10 years ago by Severyn Kozak <[email protected]>
Add usage info and travis badge to README.

README.md
-Add usage information, as copied from `node import.js --help`,
and this repo's trav...

af0078b5753495683d2343df01e959e325086f67 authored almost 10 years ago by Severyn Kozak <[email protected]>
Add support for a --help argument.

import.js
-Add a `--help` argument, to print the usage message.

5fcbc2e6fceb808fcf0c9e6714e09a0aef49f617 authored almost 10 years ago by Severyn Kozak <[email protected]>
Merge branch 'test_coverage'

import.js, test/
-Improve the module's test coverage substantially, and refactor
some bits to ...

1c5f36d5ec53c0da3316bcc05b3a926e298e4ea0 authored almost 10 years ago by Severyn Kozak <[email protected]>
Add a test for interpretUserArgs().

test/test.js
-Add a unit-test for the newly revised `interpretUserArgs()` in
`import.js`.

27d75d1e0047b85272c1a252e88076b099344fbf authored almost 10 years ago by Severyn Kozak <[email protected]>
Refactor handleUserArgs() into a testable function.

import.js
-Rewrite `handleUserArgs()` such that it returns an object
containing the interprete...

e81acb979c3267584eac92e667e975bcf2e6d480 authored almost 10 years ago by Severyn Kozak <[email protected]>
Add a test for createPeliasElasticsearchPipeline().

test/test.js
-Add a test for the interface of
`createPeliasElasticsearchPipeline()`.

904e32272a11daf057cf6a457f041c1690c04803 authored almost 10 years ago by Severyn Kozak <[email protected]>
Add unit test for createRecordStream().

test/(test.js, openaddresses_sample.csv)
-Add a thorough test for `createRecordStream()`, to ch...

cf3b3b6e28634b2093b1834c31dc7f543e79ebb6 authored almost 10 years ago by Severyn Kozak <[email protected]>
Merge branch 'issue_8'

import.js, lib/import_pipelines.js
-Break up `import.js` into another module, and add some
doc...

e73f4e588fe3cf910bed21433dccae295e734e92 authored almost 10 years ago by Severyn Kozak <[email protected]>
Narrow a try/except in createRecordStream().

lib/import_pipelines.js
-The `try/except` block in `createRecordStream()` performed more
actio...

01005fa0076bd29a3accde8a1c7de9d715140b74 authored almost 10 years ago by Severyn Kozak <[email protected]>
Add in-code documentation where missing.

import.js, lib/import_pipelines.js
-Add documentation comments to all functions that were missi...

b61be91a992b3109498e581a2f18d2790a6c1ac9 authored almost 10 years ago by Severyn Kozak <[email protected]>
Break up import.js into another module.

import.js, lib/import_pipelines.js
-Move functions that create parts of the import data pipelin...

6a19a4b65a2d00bec0448a4c0c42f45cc0100c1d authored almost 10 years ago by Severyn Kozak <[email protected]>
Merge branch 'rewrite_admin_values'

lib/create_admin_values.js, test/test.js
-Remove the remnants of the old admin-value assignment...

4fbe9dbc6e219046a3a0023634067f75f260b602 authored almost 10 years ago by Severyn Kozak <[email protected]>
package.json: update pelias-hierarchy-lookup.

a36b99bbd336b1542d06ef9a010c83d1b8bddaa8 authored almost 10 years ago by Severyn Kozak <[email protected]>
Remove the create_admin_values module.

lib/create_admin_values.js, test/test.js
-Remove the `create_admin_values` module, as we're now...

781f7e0aa96deb93a3900c6e73bf4ae5da152a11 authored almost 10 years ago by Severyn Kozak <[email protected]>
Merge branch 'rewrite_admin_values'

import.js, package.json
-Use `pelias-admin-hierarchy` to perform admin value lookups
against Q...

83d8b69624a588a4e5b24dc06665620e6b4da1ba authored almost 10 years ago by Severyn Kozak <[email protected]>
import.js: remove accidentally added debugging line.

f9dad42bf583b336d9622801e637bda61af7b7cd authored almost 10 years ago by Severyn Kozak <[email protected]>
Make deduplication/hierarchy-lookup configurable.

import.js
-Make address deduplication and admin hierarchy lookup optional
steps of the import....

b55dff96a326e97bab08bc561afff401d0fe6ca6 authored almost 10 years ago by Severyn Kozak <[email protected]>
Add an improved usage message to the import script.

import.js
-Add a real CLI usage message.

28d33eaeeb54974c6f2fe00d26a53953aa05123f authored almost 10 years ago by Severyn Kozak <[email protected]>
Add a WIP warning to the README.

README.md
-Add a warning indicating that this importer is strictly a WIP,
and not entirely rea...

3de0efe3fe2a64c60adf14384ba74521049b40c9 authored almost 10 years ago by Severyn Kozak <[email protected]>
Use hierarchy-lookup to identify admin values.

import.js, package.json
-Use the `hierarchy-lookup` package to identify address admin
values, ...

756923a485c1fc55846115ace4be42a1f2dc67c5 authored almost 10 years ago by Severyn Kozak <[email protected]>