Found 15,903 repositories (showing 30)
PaddlePaddle
Easy-to-use image segmentation library with awesome pre-trained model zoo, supporting a wide range of practical tasks in Semantic Segmentation, Interactive Segmentation, Panoptic Segmentation, Image Matting, 3D Segmentation, etc.
chaiNNer-org
A node-based image processing GUI aimed at making chaining image processing tasks easy and customizable. Born as an AI upscaling application, chaiNNer has grown into an extremely flexible and powerful programmatic image processing application.
zuruoke
A machine learning image inpainting task that instinctively removes watermarks from images, producing results indistinguishable from the ground truth image
EvolvingLMMs-Lab
One-for-All Multimodal Evaluation Toolkit Across Text, Image, Video, and Audio Tasks
tinyvision
A Semantic Controllable Self-Supervised Learning Framework to learn general human representations from massive unlabeled human images, which can benefit downstream human-centric tasks to the maximum extent
twolfson
Grunt task for converting a set of images into a spritesheet and corresponding CSS variables
AHEKOT
Visual Novel Character Creation Suite is a comprehensive tool for creating character sprites for visual novels. It allows you to create unique characters with a consistent appearance across all images, which was previously a challenging task when using neural networks.
MrGiovanni
[MICCAI 2019 Young Scientist Award] [MEDIA 2020 Best Paper Award] Models Genesis, one of the first "foundation" models in medical image analysis for multiple downstream tasks
jayleicn
[CVPR 2021 Best Student Paper Honorable Mention, Oral] Official PyTorch code for ClipBERT, an efficient framework for end-to-end learning on image-text and video-text tasks.
mikeroyal
Image Signal Processing (ISP) Guide. Learn all about the process of converting an image/video into digital form by performing tasks like noise reduction, filtering, auto exposure, autofocus, HDR correction, and image sharpening with a specialized type of media processor.
ZZZHANG-jx
[CVPR 2024] DocRes: A Generalist Model Toward Unifying Document Image Restoration Tasks
wl-zhao
[ICCV 2023] VPD is a framework that leverages the high-level and low-level knowledge of a pre-trained text-to-image diffusion model to downstream visual perception tasks.
cferdinandi
A simple boilerplate for using NPM tasks to build and compile JavaScript, CSS, and image files.
26hzhang
Several image/video enhancement methods, implemented in Java, to tackle common tasks such as dehazing, denoising, backscatter removal, low-illuminance enhancement, featuring, smoothing, etc.
Start building and deploying Python packages and Docker images for MLOps tasks.
SHI-Labs
[IJCV] Pyramid Attention Networks for Image Restoration: new SOTA results on multiple image restoration tasks: denoising, demosaicing, compression artifact reduction, super-resolution
keplerlab
Tool for automating common video key-frame extraction, video compression and Image Auto-crop/Image-resize tasks
hanna-xu
Code of U2Fusion: a unified unsupervised image fusion network for multiple image fusion tasks, including multi-modal, multi-exposure and multi-focus image fusion.
karolzak
Helper package with multiple U-Net implementations in Keras, as well as utility tools helpful when working with image semantic segmentation tasks. This library and its underlying tools come from multiple projects I carried out on semantic segmentation tasks.
aws-actions
Inserts a container image URI into an Amazon ECS task definition JSON file.
HiLab-git
Semi-supervised Medical Image Segmentation through Dual-task Consistency
google-deepmind
A C++/Python implementation of the StreetLearn environment based on images from Street View, as well as a TensorFlow implementation of goal-driven navigation agents solving the task published in “Learning to Navigate in Cities Without a Map”, NeurIPS 2018
Linfeng-Tang
The code of "Image fusion in the loop of high-level vision tasks: A semantic-aware real-time infrared and visible image fusion network"
corentin-dfg
This page presents a list of satellite imagery datasets with a temporal dimension, mainly satellite image time series (SITS) and satellite videos, for various computer vision and deep learning tasks. It covers multi-temporal datasets with more than two acquisitions but not bi-temporal datasets.
UniFlowMatch
UFM: A Unified Dense Image Correspondence Estimator for both Optical Flow & Wide Baseline Matching Tasks. Matches any pair of images. (NeurIPS 2025)
A curated list of foundation models for vision and language tasks in medical imaging
RollingPlain
Infrared and Visible Image Fusion: From Data Compatibility to Task Adaption. A brand-new survey for infrared and visible image fusion.
chaozhong2010
HRSID: high-resolution SAR image dataset for ship detection, semantic segmentation, and instance segmentation tasks.
leeesangwon
A PyTorch framework for an image retrieval task including implementation of N-pair Loss (NIPS 2016) and Angular Loss (ICCV 2017).
lzyhha
[ICCV 2025] VisualCloze: A universal image generation framework that can support a wide range of in-domain tasks and generalize to unseen ones. (🔥 🔥 🔥 Merged into official pipelines of diffusers.)