Found 5,891 repositories (showing 30)
psychopy
For running psychology and neuroscience experiments
nltk
NLTK Data
acl-org
Official style files for papers submitted to venues of the Association for Computational Linguistics
😀😄😂😭 A curated list of Sentiment Analysis methods, implementations and misc. 😥😟😱😤
Tatoeba
Tatoeba is a platform whose purpose is to create a collaborative and open dataset of sentences and their translations.
LexPredict
LexNLP by LexPredict
BLKSerene
An Integrated Corpus Tool With Multilingual Support for the Study of Language, Literature, and Translation
open-dict-data
Monolingual wordlists with pronunciation information in IPA
ayanonagon
Parsimmon is a wee linguistics toolkit for iOS written in Swift.
rime
Rime Cantonese input schema | 中州韻粵語拼音輸入方案
nonamestreet
WeChat official account corpus
proycon
PyNLPl, pronounced as 'pineapple', is a Python library for Natural Language Processing. It contains various modules useful for common, and less common, NLP tasks. PyNLPl can be used for basic tasks such as the extraction of n-grams and frequency lists, and to build simple language models. There are also more complex data types and algorithms. Moreover, there are parsers for file formats common in NLP (e.g. FoLiA/Giza/Moses/ARPA/Timbl/CQL). There are also clients to interface with various NLP-specific servers. PyNLPl most notably features a very extensive library for working with FoLiA XML (Format for Linguistic Annotation).
theimpossibleastronaut
A curated list of anything remotely related to linguistics
jacksonllee
Cantonese Linguistics and NLP
tshatrov
Linguistic tools for Japanese-language texts
CUNY-CL
Massively multilingual pronunciation mining
quadrismegistus
Prosodic: a metrical-phonological parser, written in Python. For English and Finnish, with flexible language support.
ged
A generic, language-neutral framework for extending Ruby objects with linguistic methods.
csebuetnlp
This repository contains the code, data, and models of the paper titled "XL-Sum: Large-Scale Multilingual Abstractive Summarization for 44 Languages" published in Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021.
tj
Linguistics module for Node - inflection, transformation, i18n and more
OpenCorpora
A web-based engine for creating and annotating textual corpora
mshang
JavaScript/canvas linguistics syntax tree generator.
csebuetnlp
This repository contains the official release of the model "BanglaBERT" and associated downstream finetuning code and datasets introduced in the paper titled "BanglaBERT: Language Model Pretraining and Benchmarks for Low-Resource Language Understanding Evaluation in Bangla" accepted in Findings of the Annual Conference of the North American Chapter of the Association for Computational Linguistics: NAACL-2022.
hangulize
Hangulize transcribes non-Korean words into Hangul
acl-org
The official tool for creating proceedings for conferences of the Association for Computational Linguistics (ACL).
sublee
Korean Alphabet Transcription
Crawler for linguistic corpora
interrogator
A toolkit for corpus linguistics
what-studio
Chooses correct Korean particle morphs for arbitrary words.
glottolog
Collaborative data curation for Glottolog