Found 8,383 repositories(showing 30)
first20hours
This repo contains a list of the 10,000 most common English words in order of frequency, as determined by n-gram frequency analysis of the Google's Trillion Word Corpus.
tdebatty
Implementation of various string similarity and distance algorithms: Levenshtein, Jaro-winkler, n-Gram, Q-Gram, Jaccard index, Longest Common Subsequence edit distance, cosine similarity ...
dav
This tool provides an efficient implementation of the continuous bag-of-words and skip-gram architectures for computing vector representations of words. These representations can be subsequently used in many natural language processing applications and for further research.
Lonami
(tele)gramme.rs - use Telegram's API from Rust
sinovation
A BERT-based Chinese Text Encoder Enhanced by N-gram Representations
rockymadden
:dart: String metrics and phonetic algorithms for Scala (e.g. Dice/Sorensen, Hamming, Jaccard, Jaro, Jaro-Winkler, Levenshtein, Metaphone, N-Gram, NYSIIS, Overlap, Ratcliff/Obershelp, Refined NYSIIS, Refined Soundex, Soundex, Weighted Levenshtein).
PreviousAlone
Another gram by NextAlone
proycon
PyNLPl, pronounced as 'pineapple', is a Python library for Natural Language Processing. It contains various modules useful for common, and less common, NLP tasks. PyNLPl can be used for basic tasks such as the extraction of n-grams and frequency lists, and to build simple language model. There are also more complex data types and algorithms. Moreover, there are parsers for file formats common in NLP (e.g. FoLiA/Giza/Moses/ARPA/Timbl/CQL). There are also clients to interface with various NLP specific servers. PyNLPl most notably features a very extensive library for working with FoLiA XML (Format for Linguistic Annotation).
FGRibreau
🇫🇷 NodeJS language detection library using n-gram
klarna-incubator
Gram is Klarna's own threat model diagramming tool
deborausujono
Python implementation of CBOW and skip-gram word vector models, and hierarchical softmax and negative sampling learning algorithms
ranelpadon
Touch typing trainer using N-grams as data source, with options to customize the auto-generated lessons and specify the minimum typing performance needed. There are sound/color effects as well.
dalinvip
cw2vec: Learning Chinese Word Embeddings with Stroke n-gram Information
shuyo
Language Detection with Infinity-gram
fanglanting
A complete pytorch implementation of skip-gram
microsoft
GRAM: Generative Radiance Manifolds for 3D-Aware Image Generation (CVPR 2022 Oral)
liuhuanyong
Self complemented word embedding methods using CBOW,skip-Gram,word2doc matrix , word2word matrix ,基于CBOW、skip-gram、词-文档矩阵、词-词矩阵四种方法的词向量生成
harrypython
It's A Gram Live is a Python script that create a Instagram Live and provide you a rtmp server and stream key to streaming using sofwares like OBS-Studio.
sbos
Adaptive Skip-gram implementation in Julia
jctian98
E2E system with LF-MMI; word N-gram for Mandarin
mongoid
An n-gram-based full-text search implementation for the Mongoid ODM.
pgbigm
The pg_bigm module provides full text search capability in PostgreSQL. This module allows a user to create 2-gram (bigram) index for faster full text search.
zedom1
Code for chinese error detection module, using n-gram and bi-lstm
andreekeberg
Machine learning based text classification in JavaScript using n-grams and cosine similarity
proycon
Colibri core is an NLP tool as well as a C++ and Python library for working with basic linguistic constructions such as n-grams and skipgrams (i.e patterns with one or more gaps, either of fixed or dynamic size) in a quick and memory-efficient way. At the core is the tool ``colibri-patternmodeller`` whi ch allows you to build, view, manipulate and query pattern models.
jwieting
Code to train and use models from "Charagram: Embedding Words and Sentences via Character n-grams".
ispamm
Official PyTorch repository for GRAM
feedbackmine
ruby language detection library using n-gram
arntanguy
C++ Implementation of Savitzky-Golay filtering based on Gram polynomials
kmario23
Training an n-gram based Language Model using KenLM toolkit for Deep Speech 2