Found 67,869 repositories (showing 30)
chatwoot
Open-source live-chat, email support, omni-channel desk. An alternative to Intercom, Zendesk, Salesforce Service Cloud etc. 🔥💬
microsoft
A simple screen parsing tool towards pure vision based GUI agent
omnivore-app
Omnivore is a complete, open source read-it-later solution for people who like reading.
modelscope
Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3.5, DeepSeek-R1, GLM-5, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Llava, Phi4, ...) (AAAI 2025).
iib0011
Self-hosted collection of powerful web-based tools for everyday tasks. No ads, no tracking, just fast, accessible utilities right from your browser!
omniauth
OmniAuth is a flexible authentication system utilizing Rack middleware.
alyssaxuu
The all-in-one tool to supercharge your productivity ⌨️
adithya-s-k
Ingest, parse, and optimize any data format ➡️ from documents to multimedia ➡️ for enhanced compatibility with GenAI frameworks
thephpleague
A framework agnostic, multi-gateway payment processing library for PHP 5.6+
commonsguy
Source code to omnibus edition of _The Busy Coder's Guide to Android Development_
VectorSpaceLab
OmniGen: Unified Image Generation. https://arxiv.org/pdf/2409.11340
vllm-project
A framework for efficient model inference with omni-modality models
VectorSpaceLab
OmniGen2: Exploration to Advanced Multimodal Generation. https://arxiv.org/abs/2506.18871
QwenLM
Qwen2.5-Omni is an end-to-end multimodal model by Qwen team at Alibaba Cloud, capable of understanding text, audio, vision, video, and performing real-time speech generation.
QwenLM
Qwen3-omni is a natively end-to-end, omni-modal LLM developed by the Qwen team at Alibaba Cloud, capable of understanding text, audio, images, and video, as well as generating speech in real time.
gpt-omni
Open-source multimodal large language model that can hear and talk while thinking, featuring real-time end-to-end speech input and streaming audio output for conversation.
OmniDB
Web tool for database management
ictnlp
LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.
k2-fsa
High-Quality Voice Cloning TTS for 600+ Languages
federicoiosue
Open source note-taking application for Android
facebookresearch
Omnilingual ASR: Open-Source Multilingual Speech Recognition for 1600+ Languages
diegosouzapw
OmniRoute is an AI gateway for multi-provider LLMs: an OpenAI-compatible endpoint with smart routing, load balancing, retries, and fallbacks. Add policies, rate limits, caching, and observability for reliable, cost-aware inference.
OmniSVG
[NeurIPS 2025] OmniSVG is the first family of end-to-end multimodal SVG generators that leverage pre-trained Vision-Language Models (VLMs), capable of generating complex and detailed SVGs, from simple icons to intricate anime characters.
omnigroup
Source for many of The Omni Group's frameworks
qianqianwang68
No description available
JIA-Lab-research
This project is the official implementation of 'DreamOmni2: Multimodal Instruction-based Editing and Generation (CVPR2026 Highlight)'
OmniSharp
OmniSharp server (HTTP, STDIO) based on Roslyn workspaces
scambier
A search engine that "just works" for Obsidian. Supports OCR and PDF indexing.
gpt-omni
Towards Open-source GPT-4o with Vision, Speech and Duplex Capabilities.
Music-and-Culture-Technology-Lab
Omniscient Mozart, being able to transcribe everything in the music, including vocal, drum, chord, beat, instruments, and more.