Found 272 repositories(showing 30)
omarelhedaby
Implementation of CNN on ZYNQ FPGA to classify handwritten numbers using MNIST database
hunterlew
CNN acceleration on virtex-7 FPGA with verilog HDL
lulinchen
A hardware implementation of CNN, written by Verilog and synthesized on FPGA
GuoningHuang
2023集创赛国二。基于脉动阵列写的一个简单的卷积层加速器,支持yolov3-tiny的第一层卷积层计算,可根据FPGA端DSP资源灵活调整脉动阵列的结构以实现不同的计算效率。
Project is about designing a Trained Neural Network on FPGA to classify an Image Input using CNN.
MasLiang
FPGA
Squeezenet V1.1 on Cyclone V SoC-FPGA at 450ms/image, 20x faster than ARM A9 processor alone. A project for 2017 Innovate FPGA design contest.
tirumalnaidu
OpenCL HLS based CNN Accelerator on Intel DE10 Nano FPGA.
atrifex
Implementing CNN code in CUDA and OpenCL to evaluate its performance on NVIDIA GPUs, AMD GPUs, and an FPGA platform.
cxdzyq1110
To help machines learn what we human beings are doing via a camera is important. Once it comes true, machines can make different responses to all kinds of human's postures. But the process is very difficult as well, because usually it is very slow and power-consuming, and requires a very large memory space. Here we focus on real-time posture recognition, and try to make the machine "know" what posture we make. The posture recognition system is consisted of DE10-Nano SoC FPGA Kit, a camera, and an HDMI monitor. SoC FPGA captures video streams from the camera, recognizes human postures with a CNN model, and finally shows the original video and classification result (standing, walking, waving, etc.) via HDMI interface.
ralbertazzi
Training and Implementation of a CNN for image classification with binary weights and activations on FPGA with HLS tools
eda-lab
CNN-Accelerator based on FPGA developed by verilog HDL.
mertz1999
implement convolution neural network on FPGA based on VHDL design
amiq-consulting
How to Accelerate an Image Upscaling CNN on FPGA Using HLS
No description available
LukiBa
YOLO example implementation using Intuitus CNN accelerator on ZYBO ZYNQ-7000 FPGA board
patryk-oleniuk
FPGA accelerator and port of the emotion recognition CNN running in C on Xilinx ZYNQ
A CNN-based hardware digit/image recognition module designed on PyTorch and then implemented with Verilog on FPGA
anupam-io
ES-203 Computer Organization & Architecture CNN on FPGA board
CAS-CLab
[TCAD 2021] Block Convolution: Towards Memory-Efficient Inference of Large-Scale CNNs on FPGA
Who doesn’t dream of a new FPGA family that can provide embedded hard neurons in its silicon architecture fabric instead of the conventional DSP and multiplier blocks? The optimized hard neuron design will allow all the software and hardware designers to create or test different deep learning network architectures, especially the convolutional neural networks (CNN), more easily and faster in comparing to any previous FPGA family in the market nowadays. The revolutionary idea about this project is to open the gate of creativity for a precise-tailored new generation of FPGA families that can solve the problems of wasting logic resources and/or unneeded buses width as in the conventional DSP blocks nowadays. The project focusing on the anchor point of the any deep learning architecture, which is to design an optimized high-speed neuron block which should replace the conventional DSP blocks to avoid the drawbacks that designers face while trying to fit the CNN architecture design to it. The design of the proposed neuron also takes the parallelism operation concept as it’s primary keystone, beside the minimization of logic elements usage to construct the proposed neuron cell. The targeted neuron design resource usage is not to exceeds 500 ALM and the expected maximum operating frequency of 834.03 MHz for each neuron. In this project, ultra-fast, adaptive, and parallel modules are designed as soft blocks using VHDL code such as parallel Multipliers-Accumulators (MACs), RELU activation function that will contribute to open a new horizon for all the FPGA designers to build their own Convolutional Neural Networks (CNN). We couldn’t stop imagining INTEL ALTERA to lead the market by converting the proposed designed CNN block and to be a part of their new FPGA architecture fabrics in a separated new Logic Family so soon. The users of such proposed CNN blocks will be amazed from the high-speed operation per seconds that it can provide to them while they are trying to design their own CNN architectures. For instance, and according to the first coding trial, the initial speed of just one MAC unit can reach 3.5 Giga Operations per Second (GOPS) and has the ability to multiply up to 4 different inputs beside a common weight value, which will lead to a revolution in the FPGA capabilities for adopting the era of deep learning algorithms especially if we take in our consideration that also the blocks can work in parallel mode which can lead to increasing the data throughput of the proposed project to about 16 Tera Operations per Second (TOPS). Finally, we believe that this proposed CNN block for FPGA is just the first step that will leave no areas for competitions with the conventional CPUs and GPUs due to the massive speed that it can provide and its flexible scalability that it can be achieved from the parallelism concept of operation of such FPGA-based CNN blocks.
trung-pham-dinh
No description available
redlightASl
FPGA-CNN Application for fruit detection based on Logos-PGL22G Board
Hand Writing Digital Recognization Based on FPGA, we desiged a SoC embeded a Cortex M3 core and other peripherals,this SoC run a CNN. The SoC worked not bad in the end the success rate up to 90%。.
one-ware
A resource-efficient VHDL-based framework for deploying CNNs and Feedforward Networks on FPGAs, featuring core modules, MNIST examples, and hardware-specific optimizations.
CNN on Artix-7 FPGA to perform pattern detection from a pool of objects
pratikpv
Benchmarking execution time of AlexNet CNN on FPGA and GPU. Developed AlexNet in opencl.
eda-lab
CNN-Accelerator based on FPGA developed by verilog HDL.
FPGA Implementation of Image Processing for MNIST Dataset Based on Convolutional Neural Network Algorithm (CNN)
KaestnerFlorian
Landmark Detection with CNN on FPGA including DPR