Search Results

Found 24 repositories(showing 24)

LLM-Sec-Evaluation

Coldwave96

❤️35

Scripts for evaluating LLM security abilities.

Python

Updated 8 months ago

baichuan-13bchatglm2-6bllm-test

LLM-as-Judge-Security-Evaluation-Framework

SamSon1402

❤️45

No description available

HTML

Updated 3 weeks ago

SecureDocChat-Secure-Multi-PDF-Chatbot-with-LLM-Evaluation

RathanVeer

❤️35

A secure multi-PDF Retrieval-Augmented Generation (RAG) chatbot that enables question answering over documents with built-in safety guards. The system integrates LLM-based input/output validation and automatic evaluation of responses on faithfulness, coherence, and completeness.

Python

Updated 3 months ago

A-Secure-Sandbox-for-Pre-Clinical-Evaluation-of-LLM-in-Patient-Portal-Message-Management

thom22

❤️45

Secure AI sandbox for patient portal messaging, developed in collaboration with Johns Hopkins Bayview Medical Center and a faculty member from the Johns Hopkins School of Nursing, using IRB-approved data. Due to data governance and institutional policies, the full source code and datasets are not publicly shared.

Updated 1 month ago

epichealthcare-aillm+2

LLM-Security-Evaluations

manjunathnp

❤️25

No description available

Jupyter Notebook

Updated 8 months ago

LLM_Security_Evaluation

Harshitha0531

❤️45

No description available

Jupyter Notebook

Updated 4 weeks ago

LLM-Security-Evaluation-Platform

Nikhil-UCEOU

🧡55

No description available

Python

Updated 6 days ago

llm-judge-security-evaluation

apesseu

🧡50

LLM-as-Judge framework for automated chatbot security evaluation (MLOps + Docker)

MIT

Python

Updated 1 month ago

llm-security-evaluation-protocol

IDSDataset

❤️35

Supplementary appendix describing the full evaluation setup, threat model, defense layers, metrics, and release-gate procedures for secure agentic LLM systems. Includes system configuration, attacker capabilities, telemetry, reproducibility details, and parameter glossary.

Updated 4 months ago

LLM-Security-OWASP-Evaluation

BRupani

❤️35

No description available

Python

Updated 2 months ago

LLM-evaluation-and-security

SiddharthWayne

❤️25

No description available

Updated 11 months ago

ai-financial-document-analysis

reetu95

❤️45

Evaluating LLMs on SEC 10K filings using RAG

Python

Updated 1 month ago

LLM-Evaluation-Security-Test-Suite-for-Multilingual-Content

SamiraSamrose

❤️25

No description available

Jupyter Notebook

Updated 6 months ago

An-Empirical-Security-Evaluation-of-LLM-Generated-Cryptographic-Rust-Code

anonymous-project-2026

❤️35

No description available

Jupyter Notebook

Updated 1 month ago

An-Empirical-Security-Evaluation-of-LLM-Generated-Cryptographic-Rust-Code

MohamedSobhy11

❤️35

No description available

Updated 1 month ago

Presidio-Based-LLM-Security-Mini-Gateway-Design-Implementation-and-Quantitative-Evaluation

AtazazMalik

❤️45

No description available

Python

Updated 3 weeks ago

promptfoo-llm-quality-gate

achamorrofdz14

❤️40

Testing LLM evaluation framework for financial RAG systems using Promptfoo. Evaluates models, retrievers, prompts, and security with SEC 10-Q filings.

MIT

Python

Updated 3 months ago

llmllmopsmlops+1

rag-financial-filing-analyst

faheemahmad02042019

🧡50

RAG-powered financial filing analyst using LangChain, LlamaIndex, and LLMs for SEC 10-K analysis with numerical reasoning, guardrails, and evaluation pipelines

MIT

Python

Updated 1 month ago

Enterprise-RAG-System

kumaraadya

❤️45

Production RAG system with multi-stage retrieval (FAISS dense + BM25 sparse + cross-encoder reranking) over SEC 10-K filings. Fine-tuned transformer models, LLM integration (GPT-4), evaluation framework, and FastAPI deployment. Python | PyTorch | FAISS | FastAPI

Python

Updated 1 month ago

Financial-Analyst-Agent

nilesh-auradkar05

💛70

AI-powered financial analyst agent that autonomously researches companies, analyzes SEC filings, evaluates market sentiment, and generates investment memos with citations. Built with LangGraph, RAG, and local LLMs.

MIT

Python

Updated 5 days ago

Local-AI-Assistant-Ollama-FastAPI

shiva17

🧡55

An end-to-end local AI assistant running open-source LLMs via Ollama with a FastAPI interface. Benchmarks multiple models (Llama3, Mistral, Phi) using metrics like latency, tokens/sec, and time-to-first-token. Includes Pydantic-validated structured outputs, retry logic, and a model evaluation framework.

Python

Updated 2 weeks ago

agentic-financial-auditor

xnaleb-ml

🧡65

A dual-architecture agentic RAG pipeline for auditing SEC filings. Version 1 features a FastAPI backend powered by cloud LLMs. Version 2 is fully localized, serving open-weight models via vLLM. Built with LangGraph and evaluated on FinanceBench.

Updated 5 days ago

QA_system_SEC_filling_dataset

iliaadam

❤️40

Build a Financial QA System using SEC Filings data. Employ two pre-trained Language Models (LLMs) like BERT and ELECTRA to compare performance using BLEU scores and latency. Find the code to implement, preprocess, and evaluate the QA system with ease.

Apache-2.0

Jupyter Notebook

Updated 2 years ago

Financial_LLM_Audit

RhondaMeloMsc

❤️45

Comprehensive adversarial red-teaming audit of a simulated institutional financial advisory LLM. This repository documents high-fidelity attack vectors—including persona-based context injection and narrative stress induction—to evaluate regulatory compliance (SEC/MiFID II) and logic integrity in high-stakes AI governance scenarios.

Updated 1 month ago

All 24 repositories loaded

GitHub Explorer

Search Results

LLM-Sec-Evaluation

LLM-as-Judge-Security-Evaluation-Framework

SecureDocChat-Secure-Multi-PDF-Chatbot-with-LLM-Evaluation

A-Secure-Sandbox-for-Pre-Clinical-Evaluation-of-LLM-in-Patient-Portal-Message-Management

LLM-Security-Evaluations

LLM_Security_Evaluation

LLM-Security-Evaluation-Platform

llm-judge-security-evaluation

llm-security-evaluation-protocol

LLM-Security-OWASP-Evaluation

LLM-evaluation-and-security

ai-financial-document-analysis

LLM-Evaluation-Security-Test-Suite-for-Multilingual-Content

An-Empirical-Security-Evaluation-of-LLM-Generated-Cryptographic-Rust-Code

An-Empirical-Security-Evaluation-of-LLM-Generated-Cryptographic-Rust-Code

Presidio-Based-LLM-Security-Mini-Gateway-Design-Implementation-and-Quantitative-Evaluation

promptfoo-llm-quality-gate

rag-financial-filing-analyst

Enterprise-RAG-System

Financial-Analyst-Agent

Local-AI-Assistant-Ollama-FastAPI

agentic-financial-auditor

QA_system_SEC_filling_dataset

Financial_LLM_Audit

LLM-Sec-Evaluation

LLM-as-Judge-Security-Evaluation-Framework

SecureDocChat-Secure-Multi-PDF-Chatbot-with-LLM-Evaluation

A-Secure-Sandbox-for-Pre-Clinical-Evaluation-of-LLM-in-Patient-Portal-Message-Management

LLM-Security-Evaluations

LLM_Security_Evaluation

LLM-Security-Evaluation-Platform

llm-judge-security-evaluation

llm-security-evaluation-protocol

LLM-Security-OWASP-Evaluation

LLM-evaluation-and-security

ai-financial-document-analysis

LLM-Evaluation-Security-Test-Suite-for-Multilingual-Content

An-Empirical-Security-Evaluation-of-LLM-Generated-Cryptographic-Rust-Code

An-Empirical-Security-Evaluation-of-LLM-Generated-Cryptographic-Rust-Code

Presidio-Based-LLM-Security-Mini-Gateway-Design-Implementation-and-Quantitative-Evaluation

promptfoo-llm-quality-gate

rag-financial-filing-analyst

Enterprise-RAG-System

Financial-Analyst-Agent

Local-AI-Assistant-Ollama-FastAPI

agentic-financial-auditor

QA_system_SEC_filling_dataset

Financial_LLM_Audit