Found 63 repositories (showing 30)
AI-Maker-Space
Large Language Model Engineering (LLM Engineering) refers to the emerging best practices and tools for pretraining, post-training, and optimizing LLMs prior to production deployment. Pre- and post-training techniques include unsupervised pretraining, supervised fine-tuning, alignment, model merging, distillation, quantization, and others.
evelyyyyynnnnn
Engineering toolkit for building scalable AI systems, including model training pipelines, LLM optimization utilities, and data engineering tools.
Ashx098
A ground-up LLM engineering project: tokenizer → architecture → training → scaling laws → inference. Starts at 80M, engineered to scale into 1B+ models with minimal changes. Clean, research-ready code for anyone serious about understanding and building LLMs from first principles.
aisummerofcode
The world's No. 1 classroom for incubating AI talent. Four months of hands-on, no-BS technical training in ML, LLMs, and production AI engineering.
AmanPriyanshu
GeneticPromptLab uses genetic algorithms for automated prompt engineering (for LLMs), enhancing quality and diversity through iterative selection, crossover, and mutation, while efficiently exploring minimal yet diverse samples from the training set.
buildwithfiroz
Web2LLM.txt – A fast, open-source website-to-LLM context file generator. Paste any https:// URL and instantly get a clean llm.txt file with token & cost estimation—ideal for RAG, prompt engineering, and AI training workflows.
ybenkirane
An LLM-powered automated tutoring program that will converse with you on any given branch of topics (technical or soft). Has practical uses for quantitative training in finance, economics training in investment banking, software engineering, or data science. Can teach basic STEM topics as well as the arts and humanities.
Bansnetsajak007
An engineering-focused project covering data acquisition, dataset curation, tokenization, training, and evaluation of an AI/ML-focused LLM.
maxmoundas
Course on LLMs and Prompt Engineering. Covers LLM fundamentals, training, evaluation, prompting techniques, RAG, multimodal capabilities, agents, MCP, and LLM-powered software engineering tools.
Eric-LLMs
The Full-Stack LLM Engineering Playbook. Architectural patterns for Agents (MCP) & RAG, coupled with advanced Post-Training recipes (SFT, DPO, QLoRA) for domain adaptation. Covers Data Pipelines, Evaluation Frameworks, and System Design.
Warishayat
This project focuses on text-generation LLMs, offering a deep dive into building and fine-tuning large language models for generating human-like text. It covers key techniques in training, prompt engineering, and model optimization, enabling the creation of powerful, context-aware text generation applications for diverse use cases.
LLaMA Factory is an end-to-end LLM fine-tuning and deployment pipeline, integrating data augmentation & engineering, cloud-based training, and cloud data management. With one-click fine-tuning and local deployment, it enables rapid iteration of models while keeping your infrastructure flexible and secure. Perfect for enterprise-grade applications.
Designed for generating prompts that train LLMs to reason, analyze, and generate prompts, whether for internal tooling, agents, or fine-tuning.
tuanthi
🚀 Production ML Engineering: The Complete Guide to Distributed LLM Training & Serving. Master the art of building, optimizing, and deploying large-scale ML systems in production environments. 🎯 This repository is your complete handbook for becoming a production LLM engineer.
Predictive-Systems-Inc
Training materials for LLM engineering
mleanca
Training LLMs, Jupyter Notebook, ChatGPT Prompt Engineering for Devs
peeyushsinghal
All things AI Engineering : Models, Transformers, LLMs, FrontEnd, BackEnd, Distributed Training, On Cloud
ice188
NLP research project: automated prompt engineering method for training LLM on logic and reasoning
hanasobi
Production-grade LLM fine-tuning tutorial: Dataset engineering, LoRA training, vLLM serving - completely self-hosted
readytensor
Week 4 of LLM Engineering Certification: Learn memory limits, distributed training, and production-ready workflows.
hongcanauro-auro
Benchmarking AdamW, SophiaG, and AdamSNSM optimizers for training NanoGPT under resource-constrained conditions (single GPU with 8 GB VRAM). Includes a reproducible training pipeline, experimental results, and engineering practices for memory-efficient LLM training.
mostafa-kermaninia
An end-to-end Data Science pipeline for analyzing mathematical misconceptions, featuring automated ETL, MySQL integration, and feature engineering for LLM training. Dockerized & CI/CD enabled.
anupaminnit
Interactive Gen AI training site built for covering LLM fundamentals, RAG architecture, Copilot 365, prompt engineering, and a role-specific prompt library. Pure HTML/CSS/JS.
RamonKaspar
Final project of the course "Large Scale AI Engineering" at ETH Zürich, FS2025. Implementation and benchmarking of pretokenization and Distributed Data Parallel (DDP) for efficient LLM training on the CSCS Alps supercomputer.
SaadBrohi
DocuShield is a hybrid AI document risk analysis system that processes legal contracts using AWS Textract for OCR and Groq-powered LLMs for reasoning. It runs on Kubernetes, stores results in DynamoDB, supports clause-level RAG, and focuses on real-world AI system engineering, not model training.
Kgresmer
No description available
ksdiwe
No description available
hoomanete
Learning LLM engineering through an online course on Udemy taught by Ed Donner.
davinashk
No description available