A LLaMA2-7b chatbot with memory that runs on CPU and is optimized using smooth quantization, 4-bit quantization, or Intel® Extension for PyTorch with bfloat16.
Stars: 15
Forks: 0
Watchers: 15
Open Issues: 0
69 commits
6 commits
01182b4 Merge pull request #6 from aahouzi/dependabot/pip/streamlit-1.30.0
eb220a2 Merge pull request #5 from aahouzi/dependabot/pip/langchain-0.1.0
2754278 Merge pull request #4 from aahouzi/dependabot/pip/langchain-0.0.329
fdbfb0f Merge pull request #3 from aahouzi/dependabot/pip/transformers-4.36.0
c54f22d Merge pull request #2 from aahouzi/dependabot/pip/smoothquant/transformers-4.36.0
8ff4693 Bump transformers from 4.31.0 to 4.36.0 in /smoothquant
87489eb Merge pull request #1 from aahouzi/dependabot/pip/langchain-0.0.325