Search Results

Found 6,708 repositories (showing 30)

taskflow

💚91

A General-purpose Task-parallel Programming System in C++

11.9k

1.4k

NOASSERTION

C++

Updated 1 hour ago

concurrent-programming, cuda-programming, gpu-programming, +12

metaflow

Netflix

💛88

Build, Manage and Deploy AI/ML Systems

10.0k

1.3k

Apache-2.0

Python

Updated 2 hours ago

agents, ai, aws, +17

MoltenVK

MoltenVK is a Vulkan Portability implementation. It layers a subset of the high-performance, industry-standard Vulkan graphics and compute API over Apple's Metal graphics framework, enabling Vulkan applications to run on macOS, iOS and tvOS.

5.6k

486

Apache-2.0

Objective-C++

Updated 6 hours ago

tf-quant-finance

google

💛82

High-performance TensorFlow library for quantitative finance.

5.3k

679

Apache-2.0

Python

Updated 1 hour ago

finance, gpu, gpu-computing, +9

FluidX3D

ProjectPhysX

💛75

The fastest and most memory efficient lattice Boltzmann CFD software, running on all GPUs and CPUs via OpenCL. Free for non-commercial use.

5.0k

453

NOASSERTION

C++

Updated 5 hours ago

benchmark, cfd, computational-fluid-dynamics, +17

course

parallel101

💛80

High-Performance Parallel Programming and Optimization - Course Slides

4.2k

561

NOASSERTION

C++

Updated 1 day ago

course, cpp, cpp17, +3

alpa

alpa-projects

💛72

Training and serving large-scale neural networks with auto parallelization.

3.2k

361

Apache-2.0

Python

Updated 14 hours ago

alpa, auto-parallelization, compiler, +7

thread-pool

bshoshany

💛71

BS::thread_pool: a fast, lightweight, modern, and easy-to-use C++17 / C++20 / C++23 thread pool library

2.9k

318

MIT

C++

Updated 7 hours ago

concurrency, cplusplus, cplusplus-17, +16

awesome-tensor-compilers

merrymercy

💛71

A list of awesome compiler projects and papers for tensor computation and deep learning.

2.7k

324

Updated 15 hours ago

code-generation, compiler, deep-learning, +4

blis

flame

💛72

BLAS-like Library Instantiation Software Framework

2.6k

417

NOASSERTION

Updated 1 hour ago

blas, blas-libraries, blis, +11

kokkos

🧡62

Kokkos C++ Performance Portability Programming Ecosystem: The Programming Model - Parallel Execution and Memory Abstraction

2.5k

490

NOASSERTION

C++

Updated 19 hours ago

abstraction, c-plus-plus, high-performance-computing, +4

turbit

jofpin

💛71

Build applications, scripts, and automations powered by high-performance multicore computing using Node.js

2.5k

368

MIT

JavaScript

Updated 16 hours ago

cpu-booster, high-performance, javascript, +6

boinc

BOINC

🧡62

Open-source software for volunteer computing and grid computing.

2.4k

512

LGPL-3.0

PHP

Updated 1 hour ago

android, boinc, c-plus-plus, +12

mfem

🧡63

Lightweight, general, scalable C++ library for finite element methods

2.1k

612

BSD-3-Clause

C++

Updated 8 hours ago

amr, computational-science, fem, +8

cubecl

tracel-ai

🧡64

Multi-platform high-performance compute language extension for Rust.

2.1k

165

Apache-2.0

Rust

Updated 18 hours ago

cuda, gpgpu, gpu, +4

awesome-programming-books

zero-equals-false

🧡69

📚 A curated list of awesome programming books (Algorithms and data structures, Artificial intelligence, Software Architecture, Human–computer interaction, Operating Systems, Database Systems, IT Security, Concurrency, Interpreters and Compilers, High-Performance Computing, Distributed Systems, Game Development, Mathematical optimization)

2.1k

171

Updated 1 day ago

awesome, awesome-list, books, +2

chapel

chapel-lang

🧡56

a Productive Parallel Programming Language

2.0k

445

NOASSERTION

Chapel

Updated 9 hours ago

chapel, compiler, concurrency, +12

hermit-rs

hermit-os

💛73

Hermit for Rust.

1.9k

105

Apache-2.0

Rust

Updated 25 minutes ago

cloud-computing, high-performance-computing, operating-system, +6

AdaptiveCpp

🧡64

Compiler for multiple programming models (SYCL, C++ standard parallelism, HIP/CUDA) for CPUs and GPUs from all vendors: The independent, community-driven compiler for C++-based heterogeneous programming models. Lets applications adapt themselves to all the hardware in the system - even at runtime!

1.8k

213

BSD-2-Clause

C++

Updated 16 hours ago

adaptivecpp, compiler, gpgpu, +8

NNPACK

Maratyszcza

🧡60

Acceleration package for neural networks on multi-core CPUs

1.7k

319

BSD-2-Clause

Updated 1 week ago

convolutional-layers, cpu, fast-fourier-transform, +9

sse2neon

DLTcollab

🧡69

A translator from Intel SSE intrinsics to Arm/Aarch64 NEON implementation

1.5k

233

MIT

C++

Updated 1 day ago

aarch64, apple-silicon, arm, +14

kernel

hermit-os

🧡58

A Rust-based, lightweight unikernel.

1.4k

113

Apache-2.0

Rust

Updated 4 hours ago

cloud-computing, high-performance-computing, kernel, +7

Arraymancer

mratsim

🧡52

A fast, ergonomic and portable tensor library in Nim with a deep learning focus for CPU, GPU and embedded devices via OpenMP, Cuda and OpenCL backends

1.4k

100

Apache-2.0

Nim

Updated 2 weeks ago

autograd, automatic-differentiation, cuda, +17

Trilinos

trilinos

🧡63

Primary repository for the Trilinos Project

1.4k

614

NOASSERTION

C++

Updated 1 day ago

c-plus-plus, high-performance-computing, hpc, +5

drake

ropensci

🧡53

An R-focused pipeline toolkit for reproducibility and high-performance computing

1.3k

130

GPL-3.0

Updated 2 months ago

data-science, drake, high-performance-computing, +10

envpool

sail-sg

💛73

C++-based high-performance parallel environment execution engine (vectorized env) for general RL environments.

1.3k

129

Apache-2.0

C++

Updated 10 hours ago

atari-games, box2d, cpp17, +13

PixelFlow

diwi

💛73

A Processing/Java library for high performance GPU-Computing (GLSL). Fluid Simulation + SoftBody Dynamics + Optical Flow + Rendering + Image Processing + Particle Systems + Physics +...

1.3k

134

MIT

Java

Updated 4 hours ago

antialiasing, cloth-simulation, filters, +7

How_to_optimize_in_GPU

Liu-xiandong

💛73

This is a series of GPU optimization topics. Here we will introduce how to optimize the CUDA kernel in detail. I will introduce several basic kernel optimizations, including: elementwise, reduce, sgemv, sgemm, etc. The performance of these kernels is basically at or near the theoretical limit.

1.3k

180

Apache-2.0

Cuda

Updated 1 hour ago

elementwise, gpu-acceleration, high-performance-computing, +4

awesome-high-performance-computing

trevor-vincent

🧡67

A curated list of awesome high performance computing resources

1.2k

125

Updated 1 day ago

awesome, awesome-list, hpc

neanderthal

uncomplicate

💛72

Fast Clojure Matrix Library

1.1k

EPL-1.0

Clojure

Updated 4 days ago

api, clojure, clojure-library, +13
