Found 2,375 repositories(showing 30)
facebookresearch
Library for fast text representation and classification.
piskvorky
Topic Modelling for Humans
brightmart
all kinds of text classification models and more with deep learning
649453932
中文文本分类,TextCNN,TextRNN,FastText,TextRCNN,BiLSTM_Attention,DPCNN,Transformer,基于pytorch,开箱即用。
bentrevett
Tutorials on getting started with PyTorch and TorchText for sentiment analysis.
duoergun0729
兜哥出品 <一本开源的NLP入门书籍>
Kyubyong
Pre-trained word vectors of 30+ languages
yongzhuo
中文长文本分类、短句子分类、多标签分类、两句子相似度(Chinese Text Classification of Keras NLP, multi-label classify, or sentence classify, long or short),字词句向量嵌入层(embeddings)和网络层(graph)构建基类,FastText,TextCNN,CharCNN,TextRNN, RCNN, DCNN, DPCNN, VDCNN, CRNN, Bert, Xlnet, Albert, Attention, DeepMoji, HAN, 胶囊网络-CapsuleNet, Transformer-encode, Seq2seq, SWEM, LEAM, TextGCN
plasticityai
A fast, efficient universal vector embedding utility package.
babylonhealth
Multilingual word vectors in 78 languages
chenyuntc
1st Place Solution for Zhihu Machine Learning Challenge . Implementation of various text-classification models.(知乎看山杯第一名解决方案)
brightmart
Pre-training of Deep Bidirectional Transformers for Language Understanding: pre-train TextCNN
ShawnyXiao
Text classification models implemented in Keras, including: FastText, TextCNN, TextRNN, TextBiRNN, TextAttBiRNN, HAN, RCNN, RCNNVariant, etc.
jimichan
一个生产级、高性能、模块化、可扩展的中文NLP工具包。(中文分词、平均感知机、fastText、拼音、新词发现、分词纠错、BM25、人名识别、命名实体、自定义词典)
zlsdu
Word2vec, Fasttext, Glove, Elmo, Bert, Flair pre-train Word Embedding
oborchers
Compute Sentence Embeddings Fast!
ncbi-nlp
BioWordVec & BioSentVec: pre-trained embeddings for biomedical words and sentences
gfthr
The best rich editor (TextView) on IOS platform ,maybe be the fastest
shibing624
pytextclassifier is a toolkit for text classification. 文本分类,LR,Xgboost,TextCNN,FastText,TextRNN,BERT等分类模型实现,开箱即用。
Lizhen0628
使用rnn,lstm,gru,fasttext,textcnn,dpcnn,rnn-att,lstm-att,兼容huggleface/transformers,以及以transforemrs作为词嵌入模型,后面接入cnn、rnn、attention等等做文本分类。以及各个模型的对比
AnubhavGupta3377
Implementation of State-of-the-art Text Classification Models in Pytorch
ArtistScript
中文文本摘要/关键词提取
ThoughtRiver
Fast word vectors with little memory usage in Python
中文文本分类任务,基于PyTorch实现(TextCNN,TextRNN,FastText,TextRCNN,BiLSTM_Attention, DPCNN, Transformer,Bert,ERNIE),开箱即用!
yongzhuo
Macadam是一个以Tensorflow(Keras)和bert4keras为基础,专注于文本分类、序列标注和关系抽取的自然语言处理工具包。支持RANDOM、WORD2VEC、FASTTEXT、BERT、ALBERT、ROBERTA、NEZHA、XLNET、ELECTRA、GPT-2等EMBEDDING嵌入; 支持FineTune、FastText、TextCNN、CharCNN、BiRNN、RCNN、DCNN、CRNN、DeepMoji、SelfAttention、HAN、Capsule等文本分类算法; 支持CRF、Bi-LSTM-CRF、CNN-LSTM、DGCNN、Bi-LSTM-LAN、Lattice-LSTM-Batch、MRC等序列标注算法。
LlmKira
⚡️ 80x faster Fasttext language detection out of the box | Split text by language
apcode
Simple embedding based text classifier inspired by fastText, implemented in tensorflow
brightmart
all kinds of baseline models for long text classificaiton( text categorization)
vngrs-ai
State-of-the-art, lightweight NLP tools for Turkish language. Developed by VNGRS.
dalinvip
cw2vec: Learning Chinese Word Embeddings with Stroke n-gram Information