Ecosyste.ms: OpenCollective

An open API service for software projects hosted on Open Collective.

github.com/Caucasus-Rosetta/Lingua-Corpus

Multilingual and monolingual corpora of Caucasus languages for Natural Language Processing (NLP)
https://github.com/Caucasus-Rosetta/Lingua-Corpus

Extract all text from Adiga Maqa

danielinux7 opened this issue 2 months ago
Extract all text from Adiga Psatha

danielinux7 opened this issue 2 months ago
add kbd ap

maltunok opened this pull request 3 months ago
re-add ap pdfs

maltunok opened this pull request 3 months ago
Revert "add kbd ap pdfs and scrapper code"

danielinux7 opened this pull request 3 months ago
add kbd ap pdfs and scrapper code

maltunok opened this pull request 3 months ago
Scrape all archive of KBR newspaper

danielinux7 opened this issue 3 months ago
Fine-tune a NLLB-200 model for translating

Bachstelze opened this issue 9 months ago
XLM-R with Abkhaz

Bachstelze opened this issue 11 months ago
Corpus alignment

Bachstelze opened this issue over 1 year ago
Add subtitle to apsny habar video

danielinux7 opened this issue almost 3 years ago
[Common Voice] averageClipDuration is not accurate

danielinux7 opened this issue almost 3 years ago
[Common Voice] Add accents to the official Abkhazian corpus

danielinux7 opened this issue almost 3 years ago
Cyrillic-to-Latin converter for Abkhazian e-books

danielinux7 opened this issue about 3 years ago
[LibreLingo] Research to learn a language without an intermediate language.

danielinux7 opened this issue about 3 years ago
Prepare a document of projects

danielinux7 opened this issue about 3 years ago
[stats] Random sampling for bulk submission

danielinux7 opened this issue about 3 years ago
Interview with Prefect Strangers magazine

danielinux7 opened this issue about 3 years ago
Open Text Bank

danielinux7 opened this issue about 3 years ago
Project proposals

danielinux7 opened this issue about 3 years ago
Setup platform for open text bank

danielinux7 opened this issue about 3 years ago
[Oct] Cleaning and correcting sentences

danielinux7 opened this issue about 3 years ago
Need to localize xdg-user-dirs

danielinux7 opened this issue over 3 years ago
CC0 agreements for written works

danielinux7 opened this issue over 3 years ago
[Sep] Cleaning and correcting sentences

danielinux7 opened this issue over 3 years ago
Prepare sentences from CC0 text

danielinux7 opened this issue over 3 years ago
Setup Dima's computer

danielinux7 opened this issue over 3 years ago
Steps for Gnome testing and localization environment

danielinux7 opened this issue over 3 years ago
Use translation memory to localize Gnome 41

danielinux7 opened this issue over 3 years ago
Subtitles (English,Turkish,Abkhaz) in apsny habar

danielinux7 opened this issue over 3 years ago
Train NMT domain specific ru-ab model

danielinux7 opened this issue over 3 years ago
Extract bi-text from Telegram, Gnome, FireFox, OO and LO

danielinux7 opened this issue over 3 years ago
StarDic

danielinux7 opened this issue over 3 years ago
Enable Gnome

danielinux7 opened this issue over 3 years ago
OpenOffice.org 2.3.0

danielinux7 opened this issue over 3 years ago
FireFox 3.0.12

danielinux7 opened this issue over 3 years ago
Collecting sentences for Common Voice - 100k

danielinux7 opened this issue over 3 years ago
Common Voice 12

danielinux7 opened this issue over 3 years ago
Parse Ru-Ab dictionary

danielinux7 opened this issue over 3 years ago
Create Mispronunciation.ipynb

Plkmoi opened this pull request over 3 years ago
New data set for BLEU scoring

danielinux7 opened this issue over 3 years ago
Training Ru-Ab model

danielinux7 opened this issue over 3 years ago
Python script to identify mismatched punctuation

danielinux7 opened this issue over 3 years ago
Sponsor

danielinux7 opened this issue over 3 years ago
Common Voice 4

danielinux7 opened this issue over 3 years ago
Common Voice 16

Radmir717 opened this issue over 3 years ago
Naulinux in Abkhaz

danielinux7 opened this issue over 3 years ago
Common Voice voice recording

danielinux7 opened this issue over 3 years ago
Prepare a presentation at the Uni 10/06

danielinux7 opened this issue over 3 years ago
Text search

Radmir717 opened this issue over 3 years ago
Clean text:

danielinux7 opened this issue over 3 years ago
Automate Project board

danielinux7 opened this issue over 3 years ago
Create Chechen-Arabic.tsv

Plkmoi opened this pull request over 3 years ago
No Number

Plkmoi opened this pull request over 3 years ago
Ab-es

Plkmoi opened this pull request over 3 years ago
Ab-Ce

Plkmoi opened this pull request over 3 years ago
Ady-ru upload

Plkmoi opened this pull request over 3 years ago
Ab - ar Corpus files

Plkmoi opened this pull request over 3 years ago
Ab-ar

Plkmoi opened this pull request over 3 years ago
Abkhazian Arabic

Plkmoi opened this issue over 3 years ago
Dictionary lists for rare words

Bachstelze opened this issue over 4 years ago
Paraphrase start and end words

Bachstelze opened this issue over 4 years ago
Back translation

danielinux7 opened this issue over 4 years ago
Testing punctuation join_corpus

danielinux7 opened this issue over 4 years ago
Testing paraphrase join_corpus

danielinux7 opened this issue over 4 years ago
--punctuation usage with join_corpus script

danielinux7 opened this issue over 4 years ago
FileNotFoundError Join_corpus script

danielinux7 opened this issue over 4 years ago
Rare word counting for paraphrases

Bachstelze opened this issue over 4 years ago
Corpus generation with back-translation

Bachstelze opened this issue over 4 years ago
Parallelize the corpus processing script

Bachstelze opened this issue over 4 years ago
Document kaggle notebooks

danielinux7 opened this issue over 4 years ago
R&D

danielinux7 opened this issue over 4 years ago
Add number filters

Bachstelze opened this issue over 4 years ago
Wordnet compatibility

Bachstelze opened this issue over 4 years ago
Validation and testing corpus

Bachstelze opened this issue over 4 years ago
Punctuation filter

Bachstelze opened this issue over 4 years ago
Training methods to enhance NMT models

danielinux7 opened this issue over 4 years ago
MASS training

Bachstelze opened this issue over 4 years ago
Set up AB - RU transformer

danielinux7 opened this issue over 4 years ago
Setting up RU-AB transformer

danielinux7 opened this issue over 4 years ago
Setup translation api server

danielinux7 opened this issue over 4 years ago
Ru-Ab transformer

danielinux7 opened this issue over 4 years ago
Adyghe OCR

danielinux7 opened this issue over 4 years ago
GCS training

danielinux7 opened this issue over 4 years ago
Add capability to join_corpus tool

danielinux7 opened this issue over 4 years ago
Replace the Latin letters with their Cyrillic equivalents !important

danielinux7 opened this issue over 4 years ago
Testing the integrity of the ab-ru Transformer model

danielinux7 opened this issue over 4 years ago
Ab-Ru corpus correction

Bachstelze opened this issue over 4 years ago
Draft alignment

Bachstelze opened this issue over 4 years ago
Multilingual dictionary parsing

Bachstelze opened this issue over 4 years ago
Create LICENSE

danielinux7 opened this pull request over 4 years ago
Possible resources

Bachstelze opened this issue over 4 years ago
Optional latin script in the dictionary

Bachstelze opened this issue over 4 years ago
The readable writing of paraphrases

Bachstelze opened this issue almost 5 years ago
generate_synphrases wrong paraphrase (postfix)

danielinux7 opened this issue almost 5 years ago
generate synphrases less sentences than count

danielinux7 opened this issue almost 5 years ago
Enlarge the corpus with a Russian-Abkhazian dictionary

Bachstelze opened this issue almost 5 years ago
Monolingual corpus

Bachstelze opened this issue almost 5 years ago