Found 53 repositories (showing 30)
0ca
A modular framework for benchmarking LLMs and agentic strategies on security challenges across HackTheBox, TryHackMe, PortSwigger Labs, Cybench, picoCTF and more.
centerforaisafety
WMDP is an LLM proxy benchmark for hazardous knowledge in bio, cyber, and chemical security. We also release code for RMU, an unlearning method that reduces LLM performance on WMDP while retaining general capabilities.
alibaba
SecCodeBench is a benchmark suite for evaluating the security of code generated by large language models (LLMs).
LLM agent solution traces, leaderboards, and benchmark results across security CTF and hacking platforms
SEC-bench
Automated Benchmarking of LLM Agents on Real-World Software Security Tasks [NeurIPS 2025]
KadirArslan
Mithra Scanner is an interactive API testing tool for prompt injection, refusal detection, and LLM security benchmarking. It supports YAML-based rule definitions, custom refusal lists, REST API integration, and provides detailed CLI output for security testing of language model endpoints.
toxy4ny
Red Team AI Benchmark: Evaluating Uncensored LLMs for Offensive Security
Hackerbone
A framework for benchmarking LLMs specifically on penetration testing use cases, intended to streamline the security assessment process.
Giskard-AI
Phare is an LLM benchmark that evaluates models across key AI security & safety dimensions
toxy4ny
Kidnapp-AI-Benchmark is a modular, extensible framework designed to systematically test and evaluate privacy leakage, data extraction, and adversarial vulnerabilities in large language models (LLMs) and other generative AI systems. Built for red teamers, penetration testers, and AI security researchers.
FuzzingLabs
Benchmarking 12 LLMs for vulnerability research
rapticore
A multi-LLM benchmark suite for evaluating security analysis and vulnerability detection capabilities across OpenAI, Anthropic, and Google models.
ColeMurray
A comprehensive benchmark system for evaluating whether Large Language Models (LLMs) can be tricked into ignoring security vulnerabilities through deceptive code patterns and misleading comments.
davcoservices
A repository dedicated to benchmarking lightweight, open-source large language models (LLMs) for their effectiveness in providing security guidance. This project uses the SECURE dataset as a foundation to replicate research and evaluate selected models on predefined cybersecurity tasks.
priamai
A benchmark for evaluating cybersecurity knowledge in LLMs
ImBIOS
AI SysAdmin Trust Benchmark - Comprehensive testing suite for evaluating LLM competence in system administration. Real-world scenarios covering setup, security, networking, monitoring, and troubleshooting.
Qiyuan0130
TrustMH_Bench is a trustworthiness benchmark for general-purpose and mental-health LLMs in mental health settings. It evaluates models across fairness, privacy, reliability, security, crisis identification and escalation, ethics, robustness, and sycophancy. Supports standardized, reproducible evaluation for researchers and developers.
kwangilkimkenny
This repository presents the results of a comprehensive multi-LLM security benchmark study evaluating the effectiveness of the AEGIS PALADIN 6-Layer Defense System as a deterministic guardrail across six major Large Language Models.
alby-shinoj
Benchmarks the security and performance of open-source large language models (LLMs) from Hugging Face
caspiankeyes
AART provides security researchers, AI labs, and red teams with a structured framework for conducting thorough adversarial evaluations of LLM systems. The framework implements a multi-dimensional assessment methodology that systematically probes model boundaries, quantifies security vulnerabilities, and benchmarks defensive robustness in frontier AI.
yagobski
A curated list of papers on privacy, security, and compliance in LLM-based agent systems — attacks, defenses, benchmarks, and regulatory frameworks.
tmpoulionis
Security benchmarking of low-parameter (<3B) LLMs using NVIDIA's garak tool.
MarcT0K
TOSSS, an extensible LLM security benchmark based on the CVE database
Shubham-Kumar-Sinhaa
This repository explores the security vulnerabilities of large language models (LLMs) to prompt injection attacks. It includes a research paper, benchmarks, attack/defense taxonomies, and illustrations of both direct and indirect prompt injections. Ideal for researchers, developers, and security practitioners working on LLM safety.
maferrag
α³-SecBench is a large-scale benchmark for evaluating the security, resilience, and trustworthiness of LLM-based UAV agents under realistic adversarial conditions in 6G-enabled networks, featuring layered attack taxonomies and CWE-aligned evaluation.
No description available
No-N4me
Benchmarking LLM security tools
Smart-Labs-AI
Benchmarking the security of various LLMs
FSI-AI
Financial Security Knowledge Understanding Benchmark for LLMs
shinjadong
LLM Security Benchmark Hub - monitoring for 6 benchmarks + 90 white-hat hacker resources