Found 6 repositories(showing 6)
NVlabs
OmniVinci is an omni-modal LLM for joint understanding of vision, audio, and language.
kyegomez
An open-source PyTorch implementation of OmniAlignNet from the OmniVinci paper, designed to align vision and audio embeddings in a shared omni-modal space.
alamutsinbad
No description available
vonerikbarren
No description available
alamutsinbad
No description available
Ark-Ntech
AI powered Tiger trafficking investigation app built with fastapi & react. SOTA Omnivinci & ML models. Datasets included.
All 6 repositories loaded