Ecosyste.ms: OpenCollective

An open API service for software projects hosted on Open Collective.

libpostal

libpostal is a C library for parsing/normalizing street addresses around the world
Collective - Host: opensource - https://opencollective.com/libpostal - Code: https://github.com/openvenues/libpostal

[language_classifier] Features for address language classification: quadgrams for most languages, unigrams for ideographic characters, and script for single-script languages like Thai, Hebrew, etc.

github.com/openvenues/libpostal - b13462f8efaedf0c3ea45506de815e5b7d38201a authored about 9 years ago
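The feature scheme described in the language classifier commit above can be sketched roughly as follows. This is only an illustration in Python, not libpostal's C implementation: the function names, the feature prefixes (`quad:`, `uni:`, `word:`), the handling of short words, and the restriction of "ideographic" to the CJK Unified Ideographs block are all assumptions. The script feature is left out.

```python
from collections import Counter


def is_ideographic(ch):
    # Assumption: only the CJK Unified Ideographs block (U+4E00..U+9FFF)
    # is treated as ideographic in this sketch.
    return "\u4e00" <= ch <= "\u9fff"


def language_features(text, n=4):
    """Count character n-grams per word, or unigrams for ideographic text."""
    features = Counter()
    for word in text.lower().split():
        ideographs = [ch for ch in word if is_ideographic(ch)]
        if ideographs:
            # Words containing ideographs contribute one feature per ideograph;
            # any other characters in the word are ignored here.
            for ch in ideographs:
                features["uni:" + ch] += 1
            continue
        if len(word) < n:
            # Assumption: words shorter than n are kept whole.
            features["word:" + word] += 1
            continue
        for i in range(len(word) - n + 1):
            features["quad:" + word[i:i + n]] += 1
    return features
```

Calling `language_features("Main Street")` yields one count each for `quad:main`, `quad:stre`, `quad:tree` and `quad:reet`.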
[fix] sort hash keys by value

github.com/openvenues/libpostal - 29930fa7b665ab7e9936f9ad72b8e78a4feb9ab8 authored about 9 years ago
[optimization] Using sparse updates in stochastic gradient descent. The update is decomposed into the gradient of the loss function (zero for features not observed in the current batch) and the gradient of the regularization term. In L2-regularized models, repeatedly applying the regularization gradient is equivalent to an exponential decay of the weights. Before computing the gradient for the current batch, we bring the weights up to date only for the features observed in that batch, and update only those values.

github.com/openvenues/libpostal - 62017fd33d708a227373d4091929178b0fb069fe authored about 9 years ago
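A minimal Python sketch of the lazy-regularization idea in the commit above, assuming binary logistic regression, a constant learning rate `eta`, an L2 strength `lam`, and batches of a single example. None of these names or choices come from libpostal; `dense_sgd` is included only to show that the sparse variant reaches the same weights. With a constant learning rate the decay appears in its discrete form, a factor of `(1 - eta * lam)` per skipped step.

```python
import math


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def sparse_sgd(examples, n_features, eta=0.1, lam=0.01):
    """examples: list of (features, label); features maps index -> value."""
    w = [0.0] * n_features
    last_updated = [0] * n_features  # number of steps already applied per weight
    decay = 1.0 - eta * lam          # per-step shrinkage from the L2 term
    for t, (x, y) in enumerate(examples):
        # Bring only the observed weights up to date with the skipped decay steps.
        for j in x:
            w[j] *= decay ** (t - last_updated[j])
        p = sigmoid(sum(w[j] * v for j, v in x.items()))
        # Apply this step's decay and loss gradient to the observed weights only.
        for j, v in x.items():
            w[j] = w[j] * decay - eta * (p - y) * v
            last_updated[j] = t + 1
    # Apply any remaining decay once training ends.
    total_steps = len(examples)
    for j in range(n_features):
        w[j] *= decay ** (total_steps - last_updated[j])
    return w


def dense_sgd(examples, n_features, eta=0.1, lam=0.01):
    """Reference version that touches every weight on every step."""
    w = [0.0] * n_features
    for x, y in examples:
        p = sigmoid(sum(w[j] * v for j, v in x.items()))
        grad = [0.0] * n_features
        for j, v in x.items():
            grad[j] = (p - y) * v
        w = [wj - eta * (g + lam * wj) for wj, g in zip(w, grad)]
    return w
```

The per-step cost of `sparse_sgd` depends on the number of features in the example rather than on `n_features`, which is the point of storing a last-updated timestamp per weight.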
[math] Matrix arithmetic

github.com/openvenues/libpostal - aa22db11b2fbc9a00d31ccea30f2900d8035172d authored about 9 years ago
[fix] NULL check

github.com/openvenues/libpostal - 197b18f3cffeb4575a188cc526bd6af70d96092c authored about 9 years ago
[math] Adding array_{op}_times_scalar methods

github.com/openvenues/libpostal - 9c4b5ccbb176734c0e095787fdfacd8773dc41b8 authored about 9 years ago
[math] Unique columns as array for CSR sparse matrix

github.com/openvenues/libpostal - 2f1e2139caff8cff7cf4f788c576c50d00f7aa8f authored about 9 years ago
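As a rough illustration of the commit above, in the CSR (compressed sparse row) layout the column index of every stored value lives in one flat array, so the unique columns are the distinct entries of that array. The function name and the `indptr`/`indices`/`data` naming below follow common CSR conventions and are assumptions, not libpostal's API.

```python
def csr_unique_columns(indices):
    """Return the sorted distinct column indices stored in a CSR matrix."""
    return sorted(set(indices))


# 3x5 matrix with non-zeros at (0,1), (0,4), (1,1), (2,0), (2,4)
indptr = [0, 2, 3, 5]          # row i owns indices[indptr[i]:indptr[i + 1]]
indices = [1, 4, 1, 0, 4]      # column of each stored value
data = [1.0, 2.0, 3.0, 4.0, 5.0]

columns = csr_unique_columns(indices)
```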
[classification] Pre-allocating memory in logistic regression trainer, storing last updated timestamps for sparse stochastic gradient descent and using the new gradient API

github.com/openvenues/libpostal - 023c04d78f61d9d65adfb053312e0844f7d1dfb5 authored about 9 years ago
[classification] Sparse version of logistic regression gradient which, given an array of the features/columns used in the input batch, only updates the gradient for that batch, even for the operations which otherwise would apply to the entire matrix (scaling by -1/m, regularization)

github.com/openvenues/libpostal - 562cc06eaf59bdfe06286040b4ab1324b3f61e13 authored about 9 years ago
[fix] Writing matrix dimension as 64-bit

github.com/openvenues/libpostal - 5ca4bba1d544c533b93b06807b16ce03aa39952f authored about 9 years ago
[classification] Training structures for logistic regression and stochastic (minibatch) gradient descent update

github.com/openvenues/libpostal - 8f054eeeb1c72d76c3b3a9a966d08416b06e8610 authored about 9 years ago
[classification] Multinomial logistic regression, gradient and cost function

github.com/openvenues/libpostal - 4acf10c3a4547d6d9412b54db482bc5efececf08 authored about 9 years ago
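For reference, the cost and gradient of multinomial logistic regression named in the commit above can be sketched in plain Python as below. This is the textbook softmax formulation with an average negative log-likelihood, without regularization. The function names, the dense list-of-lists layout, and the sign convention are assumptions for illustration and may differ from the C code.

```python
import math


def softmax(scores):
    # Subtract the maximum for numerical stability.
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def cost_and_gradient(theta, xs, ys):
    """theta: n_features x n_classes weights; xs: dense rows; ys: class indices."""
    n_features, n_classes, m = len(theta), len(theta[0]), len(xs)
    cost = 0.0
    grad = [[0.0] * n_classes for _ in range(n_features)]
    for x, y in zip(xs, ys):
        scores = [sum(x[j] * theta[j][k] for j in range(n_features))
                  for k in range(n_classes)]
        p = softmax(scores)
        cost -= math.log(p[y])
        for j in range(n_features):
            for k in range(n_classes):
                # d(cost)/d(theta[j][k]) = x_j * (p_k - 1[k == y])
                grad[j][k] += x[j] * (p[k] - (1.0 if k == y else 0.0))
    return cost / m, [[g / m for g in row] for row in grad]
```

With all weights at zero every class has probability 1/K, so the cost equals log K; that makes a convenient sanity check.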
[optimization] Stochastic gradient descent with gain schedule a la Leon Bottou

github.com/openvenues/libpostal - 8b7052971174f2aed16607b7972c972363099239 authored about 9 years ago
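The commit above does not say which schedule is used. One commonly cited schedule from Léon Bottou's SGD recommendations is gamma_t = gamma_0 / (1 + gamma_0 * lambda * t), sketched here in Python. The function name and default values are assumptions.

```python
def bottou_gain(t, gamma_0=0.1, lam=0.01):
    """Learning rate at step t: gamma_0 / (1 + gamma_0 * lam * t)."""
    return gamma_0 / (1.0 + gamma_0 * lam * t)
```

The rate starts at `gamma_0` and shrinks roughly as 1/t once `gamma_0 * lam * t` dominates the denominator.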
[math] Sparse matrix from dense

github.com/openvenues/libpostal - 6b164d263eb31813794eb05fb5757200cc0789dc authored about 9 years ago
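A small Python sketch of converting a dense matrix to CSR form, as a possible reading of the commit above. The function name and the returned tuple are assumptions for illustration.

```python
def csr_from_dense(rows):
    """Convert a list of dense rows into CSR arrays (indptr, indices, data)."""
    indptr, indices, data = [0], [], []
    for row in rows:
        for j, value in enumerate(row):
            if value != 0:
                indices.append(j)   # column of the stored value
                data.append(value)
        indptr.append(len(indices))  # marks where the next row starts
    return indptr, indices, data
```

An all-zero row simply repeats the previous `indptr` entry, so it takes no space in `indices` or `data`.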
[features] Functions for dealing with minibatches

github.com/openvenues/libpostal - ba8fc716df0a6e533cb89928d4d44862ae4c223c authored about 9 years ago
[fix] only strdup when necessary in feature counting functions

github.com/openvenues/libpostal - 06638d2885b439467524624d9d05abbc804fc570 authored about 9 years ago
[math] Matrix scalar arithmetic functions

github.com/openvenues/libpostal - 31a3a2a3faeccd4ee69e1fc047c250a5a96a5aeb authored about 9 years ago
[sparse] Only increase size of sparse matrix on finalize row if it needs to be

github.com/openvenues/libpostal - b6ce94166bcce7fff4c27b09109192e8f97f85b4 authored about 9 years ago
[fix] adding functions to string_utils header

github.com/openvenues/libpostal - 2e67afab090b87154327a81ae4848ed47d8a15da authored about 9 years ago
[fix] making *_hash_sort_keys_by_value static

github.com/openvenues/libpostal - a8b9a2c153c2a5551344a20b2874e0c0d8355d81 authored about 9 years ago
[utils] char_array_cat_printf was forcing a doubling of the size of the buffer, which is wasteful when called many times. Now only initiates a realloc if the char_array is almost full. Also adding cstring_array_from_strings which takes a list of char *s

github.com/openvenues/libpostal - 0d5cf0d6d70e24ec6bb2b4c5dd8420314cd2f163 authored about 9 years ago
[phrases] trie_num_keys

github.com/openvenues/libpostal - 8c019998d7d8abc5ed944ced859d157337f9fc28 authored about 9 years ago
[mv] Moving trie_new_from_hash to a module

github.com/openvenues/libpostal - 22668945cbf75adbb74493d3b3038d7171aeecbd authored about 9 years ago
[tokenization] is_whitespace

github.com/openvenues/libpostal - 33e9a05ebf1bd93f3a4845aca644544738580c2a authored about 9 years ago
[features] No copy versions of feature counts functions

github.com/openvenues/libpostal - 6e1435ac4886483c3711b38a3495290a4b380aa2 authored about 9 years ago
[utils] Adding hash sort by values for numeric types

github.com/openvenues/libpostal - a740417cab18bf10675fe0f81ad19920895f0fed authored about 9 years ago
[fix] using string_equals, handles NULLs

github.com/openvenues/libpostal - 6ef7c902788a84f6c606abaf5e78bd41f437c6cc authored about 9 years ago
[fix] free normalized string in address parser data set

github.com/openvenues/libpostal - c0214d6023e6e289a3bdf56eaa834baf5604ecd1 authored about 9 years ago
[math] Adding vector sort and vector argsort to numeric vectors

github.com/openvenues/libpostal - 6a5ad96a178bfae1b881299879f47742d2c0b908 authored about 9 years ago
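For readers unfamiliar with the term in the commit above, an argsort returns the indices that would sort a vector rather than the sorted values themselves. A minimal Python sketch (the function name is an assumption):

```python
def argsort(values):
    """Return the indices that would sort values in ascending order."""
    return sorted(range(len(values)), key=values.__getitem__)
```

Because Python's sort is stable, equal values keep their original relative order in the result.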
[math] Floating point equality with relative epsilon comparisons

github.com/openvenues/libpostal - 7aea79281e1f532ded8afacca253764779475652 authored about 9 years ago
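A relative-epsilon comparison, as named in the commit above, scales the tolerance by the magnitude of the operands. The sketch below is one common form of the idea in Python. The function name and the default tolerance are assumptions, and the standard library's `math.isclose` offers similar behaviour.

```python
def float_equals(a, b, rel_eps=1e-9):
    """True if a and b differ by at most rel_eps times the larger magnitude."""
    if a == b:
        return True
    return abs(a - b) <= rel_eps * max(abs(a), abs(b))
```

A purely relative test never treats a small non-zero value as equal to exactly zero, so comparisons against zero usually need an additional absolute tolerance.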
[dictionaries] All professional suffixes should use the abbreviated form as the canonical

github.com/openvenues/libpostal - 81624f8b6d0775c5b93bd4d73e17dc8d8dc73aa0 authored about 9 years ago
[api] More spacing fixes and using language information in normalize string

github.com/openvenues/libpostal - 780966a59b0e11bd88b5e0bcaec8811a87b659d1 authored about 9 years ago
[normalize] Adding normalize_string_languages method which can use additional transliterators

github.com/openvenues/libpostal - ff75c5cc50376e9d0a7b52b91c080959074a51f5 authored about 9 years ago
[dictionaries] ulitsa is the proper transliteration for Russian

github.com/openvenues/libpostal - 7906f5542d5416ae7aeb3983e2b28421b3fa8efb authored about 9 years ago
[fix] Freeing languages in Python

github.com/openvenues/libpostal - 7bd1336b3b1231ee1b41203932e18e049b04617f authored about 9 years ago
[dictionaries] New Japanese abbreviations from the OSM wiki

github.com/openvenues/libpostal - cc89b768d840c104d30d07bef11d19f11f794bab authored about 9 years ago
[dictionaries] Santi/SS in Italian

github.com/openvenues/libpostal - ffe9c2a971a4f2c26467e60064f940bf5fe4f2e8 authored about 9 years ago
[dictionaries] New German toponym abbreviations from the OSM wiki

github.com/openvenues/libpostal - ecfdbc3ec2fc3d9267e98a5a5c7f51a2b4542922 authored about 9 years ago
[dictionaries] Adding service road to English

github.com/openvenues/libpostal - a6f7924f12defd7ca0d0f3a17533919c9201fdef authored about 9 years ago
[dictionaries] Adding no to English ambiguous

github.com/openvenues/libpostal - 684c238ca03782e5f4cce76451131f0b1b2425e7 authored about 9 years ago
[fix] Ubuntu build

github.com/openvenues/libpostal - 1b0567a8815d28c7a500b2e367dffaccd8be3f25 authored about 9 years ago
[build] Adding /usr/local/lib and /usr/local/include to sparkey build

github.com/openvenues/libpostal - d0b5985cb7b7bd33d46b280f5b8a1098d1dcf131 authored about 9 years ago
[build] Adding -L/usr/local/lib to LDFLAGS before searching for snappy

github.com/openvenues/libpostal - 508459a9f9afbd9a16d65bc327e834f811c37334 authored about 9 years ago
[docs] Fleshing out parser description, correcting city name in Russian address

github.com/openvenues/libpostal - d6362ba0fcee7a1d2dc04ae8a6cedbeedf8dc3f6 authored about 9 years ago
[fix] array_zero

github.com/openvenues/libpostal - 45b5e2dd6f03ccbac1aaa5e167cdab04e7830b06 authored about 9 years ago
[math] sparse_matrix_new_shape

github.com/openvenues/libpostal - fb4c984f1560b9900eff7accc1a9521ab35ebdf9 authored about 9 years ago
[features] Using a str=>double hashtable for feature counts

github.com/openvenues/libpostal - 72ad01cbc3b9f1bbab7b18bc31a151fb1b9d7661 authored about 9 years ago
[mv] Moving token type checking to header

github.com/openvenues/libpostal - e4dba2297d1b902d007a2af715ada5c329c288dc authored about 9 years ago
[fix] Leak in expanding strings that have a separable prefix and suffix. Other than that, ran through 78 million expansions with no discernible memory issues

github.com/openvenues/libpostal - 0fa1c2389cd5f0086777500a3f5a77903cd0c552 authored about 9 years ago
[fix] Check for result.len > 0 in false start continuation numex parsing, plus additional safety check during replacement

github.com/openvenues/libpostal - deeb8f007e11a3c381c118720cb622392b77ef4c authored about 9 years ago
[build] Adding json_encode.c to the address parser client sources

github.com/openvenues/libpostal - 507dd631f80cb34373d6ab928c44561dcaf8643c authored about 9 years ago
[unicode] Upgrading to latest utf8proc from JuliaLang (Unicode 8)

github.com/openvenues/libpostal - 5e6d24ff7e5a7f513808cc87a87e815dc3b3909d authored about 9 years ago
[fix] using a char_array instead of copying the string in normalize_string

github.com/openvenues/libpostal - 3fbb3c587a2924b1fc1fce5f2bbba36fb67bfa6e authored about 9 years ago
[fix] Fixing false start continuations in numex parsing

github.com/openvenues/libpostal - 2eea999692ee2b244757cea61c2cdf9a4da16869 authored about 9 years ago
[fix] In trie search, moving fall-off and tail checks inside the inner character loop, adding tail position as a separate variable from the offset in the string

github.com/openvenues/libpostal - 850d82de6e3a041a5580e54c140de477839692a1 authored about 9 years ago
[transliteration] In set match checks, use the current index, not current index - char_len

github.com/openvenues/libpostal - 19173d3a6eb432ceab36df12da3345fb8dd2c2e4 authored about 9 years ago
[transliteration] Distinguishing between variables with numbers and backreferences in transliteration rules

github.com/openvenues/libpostal - e9e05bb9294c4c76a7b9b1aa65592e3d72a83d7e authored about 9 years ago
[fix] Stepping through codepoints first then through chars in trie_search_prefixes_from_index (used in transliteration and numex)

github.com/openvenues/libpostal - aaa1fc0387ec5e02aad6ca589b127b9778c6d691 authored about 9 years ago
[fix] Compare the remaining part of the current UTF-8 character using simple string comparison, since it may be in the middle of a valid UTF-8 character

github.com/openvenues/libpostal - baa8e3cc3fd8286741424c03bccc5a707715fe54 authored about 9 years ago
[docs] README fixes

github.com/openvenues/libpostal - 57040b873340ef9eeb337ee3b957076eb567ebcd authored about 9 years ago
[fix] Encode strings as JSON in address parser cli

github.com/openvenues/libpostal - ceda863e9f29f85f8d77d6fde8993e3ad67e89cf authored about 9 years ago
[fix] Adding Korean-Latin-BGN to excluded transliterators

github.com/openvenues/libpostal - e55ff54be18338f0f4f71062fa6565dcbe6ba506 authored about 9 years ago
[transliteration] Fixing group replacement in transliteration in the case of multiple groups, not adding to phrase length when checking context

github.com/openvenues/libpostal - c7fb7f685d8ed6f15286b74a3d5d7203764a617e authored about 9 years ago
[transliteration] Removing Korean-Latin-BGN, not a great transliterator and AFAICT, ICU doesn't use it either

github.com/openvenues/libpostal - 682c316775c5942a38925e986e2b26866d19807d authored about 9 years ago
[fix] regenerating transliteration data

github.com/openvenues/libpostal - ab124465e6bfff721cff64f2c87ef328e344c922 authored about 9 years ago
[fix] update to control characters for generating the transliteration rules

github.com/openvenues/libpostal - ccf509edb1661253bf72691f2b054baf6ff191f4 authored about 9 years ago
[fix] Special tokens like emails/urls/phone numbers bypass normalization

github.com/openvenues/libpostal - 5439f4679fea98af7aef75f602be3d486156c854 authored about 9 years ago
[fix] Prefixes and suffixes that are the same length as the original token should be handled as regular expansions

github.com/openvenues/libpostal - cf2a0efa11d9f0f8a1602450509ea4d10d61a1e2 authored about 9 years ago
[fix] Options out of order

github.com/openvenues/libpostal - aaecd7961af7a7b7dce67da0b8d0a48da047fd2d authored about 9 years ago
[api] Node was complaining about non-trivial designated initializers (probably the bit fields), so converting to old-school initializer

github.com/openvenues/libpostal - 48cb2b5c7b2299cfb64331af59464772ba541d64 authored about 9 years ago
[fix] Strip punctuation in final output in cases where there are no expansions

github.com/openvenues/libpostal - 97906c86a8d44df5b2a810e3fdded7fc7adec76d authored about 9 years ago
[fix] do not add a token if prefix/suffix expansions are inseparable and canonical

github.com/openvenues/libpostal - 4497c4501eec03cf6ba7f862e13b55551f681fa4 authored about 9 years ago
[fix] Making a copy even on pure Latin-script transliteration, since string_trim modifies the string in place, which occasionally causes issues

github.com/openvenues/libpostal - f8da44e8b0bbd1d81238bcf0fb038ba3fa30df3f authored about 9 years ago
[fix] Bug in suffix expansion affecting inseparable suffixes like burg as well as ordinal suffixes like first=>1st

github.com/openvenues/libpostal - 39e83961ef8d1844925d2eed62e84052b635aa9c authored about 9 years ago
[transliteration] Making sure the Python script to generate transliteration data works on the new CLDR format

github.com/openvenues/libpostal - b2a944830ad842af2735d503570cdcacd048c597 authored about 9 years ago
[expansion] Fixing extra space on prefix/suffix expansions

github.com/openvenues/libpostal - b4a8a69226be390056a1dfcd9223ad5f65948d05 authored about 9 years ago
[fix] Partial matches, ultimate misses in concatenated suffixes

github.com/openvenues/libpostal - df47dad817e7a8d18d804a017aaab3c9083e0559 authored about 9 years ago
[fix] Handling case of concatenated suffixes like straße when they stand alone

github.com/openvenues/libpostal - 66073c17d52cf43b080fedd5e5b6cf86d06bea25 authored about 9 years ago
[fix] Moving Python bindings up-front in the README

github.com/openvenues/libpostal - b71755bf7f5fe0f40e122db4192a32ba4020f241 authored about 9 years ago
[api] Adding a --json option to expand cli

github.com/openvenues/libpostal - 31ed88bf6a5df15d5ef010405cfdb1dfae3ba85d authored about 9 years ago
[api] Simple JSON encoding for strings, UTF-8 rather than Unicode

github.com/openvenues/libpostal - 41ea105bb41c5f996ce9208d12fe6907285ef096 authored about 9 years ago
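The distinction in the commit above, emitting UTF-8 directly rather than `\uXXXX` escapes, can be illustrated with Python's standard `json` module. This shows the two output styles only. It is not the C encoder from the commit, and the wrapper name is an assumption.

```python
import json


def json_encode_string(s):
    """Encode s as a JSON string literal, keeping non-ASCII characters as-is."""
    return json.dumps(s, ensure_ascii=False)


utf8_style = json_encode_string("Straße")  # non-ASCII characters kept
escaped_style = json.dumps("Straße")       # non-ASCII characters escaped
```

Quotes, backslashes and control characters are still escaped in both styles; only the treatment of non-ASCII characters differs.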
[fix] Print usage info on -h/--help to libpostal cli

github.com/openvenues/libpostal - af78614f62effa304021ec5450a6d66be56b40b6 authored about 9 years ago
[fix] task list

github.com/openvenues/libpostal - f4ee9c264562fbfe872ae2fa5331a8016728f358 authored about 9 years ago
[fix] Python syntax highlighting for README instructions

github.com/openvenues/libpostal - 54cc1b8b2d4e3029edf6c2d680185cbf06a0d7f0 authored about 9 years ago
Merge pull request #11 from nvkelso/master

andthus > and thus in Transliteration section

github.com/openvenues/libpostal - f3b4a4e894c2fcc3bc448267d35245a0e488471a authored about 9 years ago
[docs] README updates, better explanations of normalization and parsing

github.com/openvenues/libpostal - 59cc6d34178e50b69ea34f9bb71ac22ebf12c4a0 authored about 9 years ago
Merge pull request #1 from nvkelso/nvkelso/readme-translit-typo

andthus > and thus in Transliteration section

github.com/openvenues/libpostal - 11a9c47cea98f8004d952dfccfd512ffb3591c2e authored about 9 years ago
andthus > and thus in Transliteration section

github.com/openvenues/libpostal - 7ff7027cdb885ed9bc98dc5c96d641bcd562da5c authored about 9 years ago
[fix] Note about ldconfig

github.com/openvenues/libpostal - 3e44910664c0a1147e1fe2e46589457abc8400ca authored about 9 years ago
[fix] README parses

github.com/openvenues/libpostal - ef941a663464207501c648e755a97c3a2819ce65 authored about 9 years ago
[fix] README addition

github.com/openvenues/libpostal - 6cccc3ee4649f518456252de4fdf45ec7a0af85a authored about 9 years ago
[docs] Updating README with parsing info/examples

github.com/openvenues/libpostal - d1833a8f8fd8209c177377855b2c79fdeb4fe980 authored about 9 years ago
[build] Removing setup.py fanciness. Install the C library first, then run setup.py or pip install

github.com/openvenues/libpostal - 83ba053373783598b2c2155367016c75a665653f authored about 9 years ago
[numex] Return true if numex table already loaded

github.com/openvenues/libpostal - e0c0ed2d04bb87e4fb04b61ceffabc1b9edf61aa authored about 9 years ago
[fix] default for libdir

github.com/openvenues/libpostal - 7e04017851453495d2705b3d151693a6647fe512 authored about 9 years ago
[build] Build shared lib in site-packages

github.com/openvenues/libpostal - 40641209ee4013741d2755bb5a62514caf801a1b authored about 9 years ago