GitHub Explorer

by Alexey Ratnikov


dyna-clust-predict

LukeLikesDirt • Public

Prediction of optimal sequence similarity cut-offs for classification and clustering of metabarcoding data using vsearch global alignment and F-measure optimisation as a confidence measure
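As a rough illustration of what F-measure optimisation over similarity cut-offs can look like, here is a minimal Python sketch. It is not taken from the repository. It does not call vsearch: the pairwise similarity matrix is hand-written toy data. The function names (`cluster_at`, `f_measure`, `best_cutoff`) are hypothetical. Two further assumptions are the single-linkage clustering and the F-measure definition, which is the class-size-weighted best F1 per taxon, a common choice for comparing clusters with taxonomy. The project may use different choices for both.

```python
from collections import Counter, defaultdict


def cluster_at(sim, cutoff):
    """Single-linkage clustering: join any pair with similarity >= cutoff."""
    n = len(sim)
    parent = list(range(n))

    def find(i):
        # Union-find lookup with path halving.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    for i in range(n):
        for j in range(i + 1, n):
            if sim[i][j] >= cutoff:
                parent[find(i)] = find(j)
    return [find(i) for i in range(n)]


def f_measure(labels, clusters):
    """Class-size-weighted mean of the best F1 each taxon reaches in any cluster."""
    n = len(labels)
    class_sizes = Counter(labels)
    cluster_sizes = Counter(clusters)
    overlap = Counter(zip(labels, clusters))
    best = defaultdict(float)
    for (lab, clu), k in overlap.items():
        precision = k / cluster_sizes[clu]
        recall = k / class_sizes[lab]
        f1 = 2 * precision * recall / (precision + recall)
        best[lab] = max(best[lab], f1)
    return sum(class_sizes[lab] / n * best[lab] for lab in class_sizes)


def best_cutoff(sim, labels, cutoffs):
    """Return (score, cutoff) with the highest F-measure; ties go to the higher cutoff."""
    return max((f_measure(labels, cluster_at(sim, c)), c) for c in cutoffs)


# Toy data (assumed, not from the project): two taxa with two sequences each.
labels = ["A", "A", "B", "B"]
sim = [
    [1.00, 0.98, 0.90, 0.89],
    [0.98, 1.00, 0.91, 0.90],
    [0.90, 0.91, 1.00, 0.97],
    [0.89, 0.90, 0.97, 1.00],
]

score, cutoff = best_cutoff(sim, labels, [0.85, 0.95, 0.99])
print(cutoff, score)
```

In this toy case the loose cut-off merges both taxa into one cluster and the strict cut-off splits every sequence into its own cluster. The intermediate cut-off, which recovers the two taxa exactly, therefore scores highest.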

Created on Feb 28, 2026

Updated on Mar 19, 2026

Repository Health Score

55/100 (Fair)

Score Breakdown

Activity: 20/30 (67%)

Regular updates - updated this month

Recent Commits

fix(dereplicate_lca): replace NA/empty taxonomy ranks with unclassified
Luke Florence • 3 weeks ago • e3b316f

fix(compute_sim) Changed incorrect args minsim -> min_sim & ncpu -> n_cpu
Luke Florence • 3 weeks ago • e19d0e6

fix(reformat): strip parentheses from fully-wrapped genus names
Luke Florence • 3 weeks ago • 2f31936

fix(compute_sim): replace base R I/O with data.table to fix OOM
LukeLikeDirt • 3 weeks ago • 4c80afe

fix(04_prepare_subsets): remove stale fasta_in arg and uncomment ITS1/ITS2 runs
LukeLikeDirt • 4 weeks ago • c238c49

refactor(predict): update max_proportion default (0.5→1) and max_seq_no default (25000→20000) to match dnabarcoder defaults
Luke Florence • 4 weeks ago • b6ac7b2

refactor(predict): update max_proportion default (0.5→1) and max_seq_no default (25000→20000) to match dnabarcoder defaults
Luke Florence • 4 weeks ago • 95c2b4b

refactor(subset): remove unique-sequence step and switch to pure random sampling
Luke Florence • 4 weeks ago • 7b1f4a5

feat: add length_filter and dereplicate_lca to pipeline
Luke Florence • 4 weeks ago • 1ace343

feat: add length_filter and dereplicate_lca to pipeline
Luke Florence • 4 weeks ago • c55b557

feat: add length_filter and dereplicate_lca to pipeline
Luke Florence • 4 weeks ago • f45eb96

refactor(subset): species-weighted sampling and species-identified pre-filter
Luke Florence • 1 month ago • 209d71a

fix: remove parenthesis-to-double-underscore replacement from reformat.R
Luke Florence • 1 month ago • 4696fe8

build: add cutadapt to conda environment
Luke Florence • 1 month ago • e73985c

feat: extend is_identified() to exclude spikes, prokaryotes, and organelles from prediction
Luke Florence • 1 month ago • 04703dd
