Search Results

Found 1,519 repositories(showing 30)

semantic-segmentation-pytorch

CSAILVision

💛86

Pytorch implementation for Semantic Segmentation/Scene Parsing on MIT ADE20K dataset

5.1k

1.1k

BSD-3-Clause

Python

Updated 2 days ago

ade20kpytorchscene-recognition+1

PySceneDetect

Breakthrough

💛79

:movie_camera: Python and OpenCV-based scene cut/transition detection program & library.

4.7k

480

BSD-3-Clause

Python

Updated 1 day ago

analysisimage-processingopencv+6

Awesome-Scene-Text-Recognition

chongyangtao

💛71

A curated list of resources dedicated to scene text localization and recognition

1.7k

408

Updated 6 days ago

natural-imagesscene-textstext-detection+1

CRNN_Tensorflow

MaybeShewill-CV

💛75

Convolutional Recurrent Neural Networks(CRNN) for Scene Text Recognition

1.0k

385

MIT

Python

Updated 3 days ago

chinese-ocrcrnn-tensorflowctc-loss+3

sceneReco

bear63

❤️40

ctpn+crnn Scene character recognition

944

371

Python

Updated 5 months ago

AttentionOCR

zhang0jhon

❤️43

Scene text recognition

839

251

Python

Updated 1 month ago

DeepStack

johnolafenwa

🧡67

The World's Leading Cross Platform AI Engine for Edge Devices

810

122

Apache-2.0

Python

Updated 3 days ago

ai-enginecomputer-visiondeepstack+4

SceneTextPapers

Jyouhou

🧡57

Tracking the latest progress in Scene Text Detection and Recognition: Must-read papers well organized

789

157

Updated 3 weeks ago

parseq

baudm

🧡67

Scene Text Recognition with Permuted Autoregressive Sequence Models (ECCV 2022)

699

155

Apache-2.0

Python

Updated 2 days ago

computer-visioneccveccv2022+5

MORAN_v2

Canjie-Luo

🧡62

MORAN: A Multi-Object Rectified Attention Network for Scene Text Recognition

649

151

MIT

Python

Updated 1 week ago

attention-mechanismimage-deformationimage-rectification+2

Scene-Text-Recognition

HCIILAB

🧡57

No description available

619

117

Updated 9 hours ago

see

Bartzi

❤️37

Code for the AAAI 2018 publication "SEE: Towards Semi-Supervised End-to-End Scene Text Recognition"

576

150

GPL-3.0

Python

Updated 4 months ago

chainercnncomputer-vision+3

vedastr

Media-Smart

🧡52

A scene text recognition toolbox based on PyTorch

535

101

Apache-2.0

Python

Updated 1 month ago

ocrocr-recognitionpytorch+3

ABINet

FangShancheng

🧡61

Read Like Humans: Autonomous, Bidirectional and Iterative Language Modeling for Scene Text Recognition

461

NOASSERTION

Jupyter Notebook

Updated 6 days ago

FudanOCR

FudanVI

❤️46

A toolbox of scene text super-resolution and recognition

431

Python

Updated 1 week ago

Scene-Text-Understanding

tangzhenyu

🧡57

OCR, Scene-Text-Understanding, Text Recognition

377

115

C++

Updated 1 week ago

Scene-Text-Recognition-Recommendations

HCIILAB

🧡51

Papers, Datasets, Algorithms, SOTA for STR. Long-time Maintaining

353

MIT

Python

Updated 2 months ago

aster-pytorchcrnn-pytorchdatasets+4

deep-text-recognition-benchmark

roatienza

❤️36

PyTorch code of my ICDAR 2021 paper Vision Transformer for Fast and Efficient Scene Text Recognition (ViTSTR)

313

Apache-2.0

Jupyter Notebook

Updated 3 months ago

ocrstrvision-transformer+1

Android-Object-Detection

tzutalin

❤️47

:coffee: Fast-RCNN and Scene Recognition using Caffe

307

141

Java

Updated 2 months ago

androidcaffedetection+1

The aim of this work is to recognize the six emotions (happiness, sadness, disgust, surprise, fear and anger) based on human facial expressions extracted from videos. To achieve this, we are considering people of different ethnicity, age and gender where each one of them reacts very different when they express their emotions. We collected a data set of 149 videos that included short videos from both, females and males, expressing each of the the emotions described before. The data set was built by students and each of them recorded a video expressing all the emotions with no directions or instructions at all. Some videos included more body parts than others. In other cases, videos have objects in the background an even different light setups. We wanted this to be as general as possible with no restrictions at all, so it could be a very good indicator of our main goal. The code detect_faces.py just detects faces from the video and we saved this video in the dimension 240x320. Using this algorithm creates shaky videos. Thus we then stabilized all videos. This can be done via a code or online free stabilizers are also available. After which we used the stabilized videos and ran it through code emotion_classification_videos_faces.py. in the code we developed a method to extract features based on histogram of dense optical flows (HOF) and we used a support vector machine (SVM) classifier to tackle the recognition problem. For each video at each frame we extracted optical flows. Optical flows measure the motion relative to an observer between two frames at each point of them. Therefore, at each point in the image you will have two values that describes the vector representing the motion between the two frames: the magnitude and the angle. In our case, since videos have a resolution of 240x320, each frame will have a feature descriptor of dimensions 240x320x2. So, the final video descriptor will have a dimension of #framesx240x320x2. In order to make a video comparable to other inputs (because inputs of different length will not be comparable with each other), we need to somehow find a way to summarize the video into a single descriptor. We achieve this by calculating a histogram of the optical flows. This is, separate the extracted flows into categories and count the number of flows for each category. In more details, we split the scene into a grid of s by s bins (10 in this case) in order to record the location of each feature, and then categorized the direction of the flow as one of the 8 different motion directions considered in this problem. After this, we count for each direction the number of flows occurring in each direction bin. Finally, we end up with an s by s by 8 bins descriptor per each frame. Now, the summarizing step for each video could be the average of the histograms in each grid (average pooling method) or we could just pick the maximum value of the histograms by grid throughout all the frames on a video (max pooling For the classification process, we used support vector machine (SVM) with a non linear kernel classifier, discussed in class, to recognize the new facial expressions. We also considered a Naïve Bayes classifier, but it is widely known that svm outperforms the last method in the computer vision field. A confusion matrix can be made to plot results better.

299

112

Python

Updated 1 week ago

SwinTextSpotter

mxin262

❤️36

Pytorch re-implementation of Paper: SwinTextSpotter: Scene Text Spotting via Better Synergy between Text Detection and Text Recognition (CVPR 2022)

288

Python

Updated 1 month ago

MASTER-pytorch

wenwenyu

❤️36

Code for the paper "MASTER: Multi-Aspect Non-local Network for Scene Text Recognition" (Pattern Recognition 2021)

280

MIT

Python

Updated 5 months ago

non-local-networkocrscene-text-recognition+2

Recommendations-Diffusion-Text-Image

yeungchenwa

🧡65

A paper collection of recent diffusion models for text-image generation tasks, e,g., visual text generation, font generation, text removal, text image super resolution, text editing, handwritten generation, scene text recognition and scene text detection.

273

Updated 2 days ago

awesomediffusion-modelsfont-generation+9

straug

roatienza

🧡51

Image transformations designed for Scene Text Recognition (STR) data augmentation. Published at ICCV 2021 Workshop on Interactive Labeling and Data Augmentation for Vision.

266

Apache-2.0

Python

Updated 2 months ago

data-augmentationscene-text-recognitionstr

Transformer_STR

opconty

❤️46

PyTorch implementation of my new method for Scene Text Recognition (STR) based on Transformer,Equipped with Transformer, this method outperforms the best model of the aforementioned deep-text-recognition-benchmark by 7.6% on CUTE80.

226

Python

Updated 3 weeks ago

ocrtext-recognitiontransformer-str

Union14M

Mountchicken

💛70

[ICCV 2023] Code base for Revisiting Scene Text Recognition: A Data Perspective

204

MIT

Python

Updated 5 days ago

datasetmaemmocr+3

Keras-VGG16-places365

GKalliatakis

🧡61

Keras code and weights files for the VGG16-places365 and VGG16-hybrid1365 CNNs for scene classification

202

MIT

Python

Updated 1 week ago

baseline-cnnscnndeep-learning+7

SRN.pytorch

chenjun2hao

❤️31

Unofficial PyTorch implementation of Towards Accurate Scene Text Recognition with Semantic Reasoning Networks

192

Python

Updated 1 year ago

cvpr2020ocr-recognitionsrn

Automatic_Number_Plate_Recognition_YOLO_OCR

mftnakrsu

🧡61

Automatic number plate recognition using tech: Yolo, OCR, Scene text detection, scene text recognation, flask, torch

189

MIT

Python

Updated 2 weeks ago

aiartificial-intelligencecomputer-vision+16

STR-Fewer-Labels

ku21fan

❤️40

Scene Text Recognition (STR) methods trained with fewer real labels (CVPR 2021)

184

MIT

Jupyter Notebook

Updated 4 months ago

cvpr2021deep-learningocr+6

GitHub Explorer

Search Results

semantic-segmentation-pytorch

PySceneDetect

Awesome-Scene-Text-Recognition

CRNN_Tensorflow

sceneReco

AttentionOCR

DeepStack

SceneTextPapers

parseq

MORAN_v2

Scene-Text-Recognition

see

vedastr

ABINet

FudanOCR

Scene-Text-Understanding

Scene-Text-Recognition-Recommendations

deep-text-recognition-benchmark

Android-Object-Detection

Emotion-Detection-in-Videos

SwinTextSpotter

MASTER-pytorch

Recommendations-Diffusion-Text-Image

straug

Transformer_STR

Union14M

Keras-VGG16-places365

SRN.pytorch

Automatic_Number_Plate_Recognition_YOLO_OCR

STR-Fewer-Labels

semantic-segmentation-pytorch

PySceneDetect

Awesome-Scene-Text-Recognition

CRNN_Tensorflow

sceneReco

AttentionOCR

DeepStack

SceneTextPapers

parseq

MORAN_v2

Scene-Text-Recognition

see

vedastr

ABINet

FudanOCR

Scene-Text-Understanding

Scene-Text-Recognition-Recommendations

deep-text-recognition-benchmark

Android-Object-Detection

Emotion-Detection-in-Videos

SwinTextSpotter

MASTER-pytorch

Recommendations-Diffusion-Text-Image

straug

Transformer_STR

Union14M

Keras-VGG16-places365

SRN.pytorch

Automatic_Number_Plate_Recognition_YOLO_OCR

STR-Fewer-Labels