Search Results

Found 2,064 repositories(showing 30)

Lemma

etodd

🧡56

Immersive first-person parkour in a surreal, physics-driven voxel world.

572

Updated 1 week ago

lemmatization-lists

michmech

💛71

Machine-readable lists of lemma-token pairs in 23 languages.

362

ODbL-1.0

Updated 4 days ago

lemmatizationnlp

A Python wrapper of the Yandex Mystem 3.1 morphological analyzer (http://api.yandex.ru/mystem). The original tool is shipped as a binary and this library makes it easy to integrate it in Python projects. Let us know in the issues if you would like to be involved into the developments or maintenance of this project. If you have any fix or suggestion, please make a pull request. We are very open to accepting any contributions.

292

NOASSERTION

Python

Updated 1 month ago

languagelemmalemmatization+11

udify

Hyperparticle

❤️46

A single model that parses Universal Dependencies across 75 languages. Given a sentence, jointly predicts part-of-speech tags, morphology tags, lemmas, and dependency trees.

225

MIT

Python

Updated 2 months ago

allennlpdeep-learningdependency-parser+4

Lemmachine

larrytheliquid

🧡60

REST'ful web framework in Agda

135

MIT

Haskell

Updated 3 weeks ago

lemmatizer

yohasebe

❤️40

Lemmatizer for text in English. Inspired by Python's nltk.corpus.reader.wordnet.morphy

112

MIT

Ruby

Updated 6 months ago

lemmatizernlpruby+2

elasticsearch-analysis-lemmagen

vhyza

❤️40

Elasticsearch lemmatizer for 15 languages

110

Apache-2.0

Java

Updated 2 months ago

analyzerelasticsearchelasticsearch-plugin+3

lemma

sleepyeinstein

❤️47

Remote CLI tools at your fingertips

102

144

NOASSERTION

Updated 1 month ago

Turkish-Lemmatizer

akoksal

🧡55

Lemmatization for Turkish Language

Python

Updated 3 weeks ago

lemmatizationlemmatizernatural-language-processing+2

COCA-English-Anki-Deck

Ecattea

💛70

This Anki deck contains top 5,000 high-frequency English lemmas (as ranked by COCA) in an English-only environment. Each atomic card presents a single sense, with expert-level definitions from Merriam-Webster’s Learner’s Dictionary and dual-track audio (native recordings + TTS) to boost both memorization and listening practice.

NOASSERTION

HTML

Updated 2 days ago

ankicocadeck+5

PyBot-A-ChatBot-For-Answering-Python-Queries-Using-NLP

abhishek305

🧡65

Pybot can change the way learners try to learn python programming language in a more interactive way. This chatbot will try to solve or provide answer to almost every python related issues or queries that the user is asking for. We are implementing NLP for improving the efficiency of the chatbot. We will include voice feature for more interactivity to the user. By utilizing NLP, developers can organize and structure knowledge to perform tasks such as automatic summarization, translation, named entity recognition, relationship extraction, sentiment analysis, speech recognition, and topic segmentation. NLTK has been called “a wonderful tool for teaching and working in, computational linguistics using Python,” and “an amazing library to play with natural language.The main issue with text data is that it is all in text format (strings). However, the Machine learning algorithms need some sort of numerical feature vector in order to perform the task. So before we start with any NLP project we need to pre-process it to make it ideal for working. Converting the entire text into uppercase or lowercase, so that the algorithm does not treat the same words in different cases as different Tokenization is just the term used to describe the process of converting the normal text strings into a list of tokens i.e words that we actually want. Sentence tokenizer can be used to find the list of sentences and Word tokenizer can be used to find the list of words in strings.Removing Noise i.e everything that isn’t in a standard number or letter.Removing Stop words. Sometimes, some extremely common words which would appear to be of little value in helping select documents matching a user need are excluded from the vocabulary entirely. These words are called stop words.Stemming is the process of reducing inflected (or sometimes derived) words to their stem, base or root form — generally a written word form. Example if we were to stem the following words: “Stems”, “Stemming”, “Stemmed”, “and Stemtization”, the result would be a single word “stem”. A slight variant of stemming is lemmatization. The major difference between these is, that, stemming can often create non-existent words, whereas lemmas are actual words. So, your root stem, meaning the word you end up with, is not something you can just look up in a dictionary, but you can look up a lemma. Examples of Lemmatization are that “run” is a base form for words like “running” or “ran” or that the word “better” and “good” are in the same lemma so they are considered the same.

Python

Updated 8 hours ago

nlpnltk-librarynumpy+4

kan-extensions

ekmett

❤️35

Kan extensions, Kan lifts, the Yoneda lemma, and (co)monads generated by a functor

NOASSERTION

Haskell

Updated 2 months ago

lemmy

sorenlind

🧡55

🤘Lemmy is a lemmatizer for Danish 🇩🇰 and Swedish 🇸🇪

MIT

Python

Updated 1 week ago

danishlemmalemmatizer+3

yoneda

emilyriehl

❤️40

comparative formalizations of the Yoneda lemma for 1-categories and infinity-categories

Lean

Updated 1 month ago

javascript-lemmatizer

takafumir

❤️45

JavaScript Lemmatizer is a lemmatization library to retrieve a base form from an English inflected word.

MIT

JavaScript

Updated 2 months ago

wink-lemmatizer

winkjs

❤️40

English lemmatizer

MIT

JavaScript

Updated 2 months ago

lemmalemmatizationlemmatizer+3

Words-CEFR-Dataset

Maximax67

🧡55

A dataset mapping English words to CEFR levels based on the CEFR-J dataset, word lemmas, stems, parts of speech (POS), and frequency data from the N-Gram Google dataset. Ideal for NLP tasks, language proficiency assessment, and linguistic research.

MIT

Jupyter Notebook

Updated 2 weeks ago

LEMMA

csguoh

🧡65

[IJCAI2023] Your text images can be clearer!

Apache-2.0

Python

Updated 1 day ago

MO-Problem-Journal

AnglyPascal

🧡50

A journal of theorems, lemmas and problems for Mathematical Olympiads.

MIT

TeX

Updated 1 month ago

journallemmasmath+3

NLPSwift

VamshiIITBHU14

❤️35

NSLinguisticTagger provides a uniform interface to a variety of natural language processing functionality with support for many different languages and scripts. One can use this class to segment natural language text into paragraphs , sentences, or words and tag information about those segments such as parts of speech, lexical class, lemma!

Swift

Updated 1 year ago

coremliosnlp+2

lucene-stanford-lemmatizer

larsmans

❤️40

A library that adds some NLP capabilities to the Lucene search engine

GPL-3.0

Java

Updated 4 years ago

lemma

xiamx

🧡50

A Morphological Parser (Analyser) / Lemmatizer written in Elixir.

Apache-2.0

Elixir

Updated 2 months ago

elixirerlanglemmatization+4

spanish_data

doozan

💛70

Spanish to English dictionary, frequency list, and lemma data

CC-BY-4.0

Makefile

Updated 3 days ago

dictionaryspanishwiktionary

Lemma

cvfosammmm

🧡65

Note-taking app, written in Python with Gtk

NOASSERTION

Python

Updated 13 hours ago

elasticsearch-ukrainian-lemmatizer

mrgambal

🧡50

Ukrainian lemmatizer plugin for ElasticSearch

MIT

Java

Updated 1 month ago

elasticsearchlemmaplugin+1

FrenchLefffLemmatizer

ClaudeCoulombe

🧡50

A French Lemmatizer in Python based on the LEFFF

NOASSERTION

Python

Updated 1 month ago

korean_lemmatizer

lovit

❤️25

한국어 용언 분석기 (원형 복원, 용언 형태소 분석)

Python

Updated 6 months ago

lemma

mailgun

❤️40

Mailgun Cryptographic Tools

Apache-2.0

Updated 1 year ago

spacy-spanish-lemmatizer

pablodms

❤️35

Spanish rule-based lemmatization for spaCy

MIT

Python

Updated 1 year ago

LemmatizedAncientGreekXML

gcelano

❤️15

No description available

Updated 5 months ago

GitHub Explorer

Search Results

Lemma

lemmatization-lists

pymystem3

udify

Lemmachine

lemmatizer

elasticsearch-analysis-lemmagen

lemma

Turkish-Lemmatizer

COCA-English-Anki-Deck

PyBot-A-ChatBot-For-Answering-Python-Queries-Using-NLP

kan-extensions

lemmy

yoneda

javascript-lemmatizer

wink-lemmatizer

Words-CEFR-Dataset

LEMMA

MO-Problem-Journal

NLPSwift

lucene-stanford-lemmatizer

lemma

spanish_data

Lemma

elasticsearch-ukrainian-lemmatizer

FrenchLefffLemmatizer

korean_lemmatizer

lemma

spacy-spanish-lemmatizer

LemmatizedAncientGreekXML

Lemma

lemmatization-lists

pymystem3

udify

Lemmachine

lemmatizer

elasticsearch-analysis-lemmagen

lemma

Turkish-Lemmatizer

COCA-English-Anki-Deck

PyBot-A-ChatBot-For-Answering-Python-Queries-Using-NLP

kan-extensions

lemmy

yoneda

javascript-lemmatizer

wink-lemmatizer

Words-CEFR-Dataset

LEMMA

MO-Problem-Journal

NLPSwift

lucene-stanford-lemmatizer

lemma

spanish_data

Lemma

elasticsearch-ukrainian-lemmatizer

FrenchLefffLemmatizer

korean_lemmatizer

lemma

spacy-spanish-lemmatizer

LemmatizedAncientGreekXML