Found 47 repositories(showing 30)
zzhang-cn
A flipped classroom series on understanding LLMs for non-CS/AI students
XuandongZhao
[ICLR 2025] Permute-and-Flip: An optimally robust and watermarkable decoder for LLMs
adarshpalaskar1
Interactive LLM for logs and security analysis with vectorstores
eager-flipstudio
⚡ Flip LLM API — Access 35+ AI models (Mistral, Llama 4, DeepSeek, GLM, Qwen 3, Kimi K2, Phi 3, Minimax) through a single unified REST API. Features session memory via Firebase RTDB, custom system prompts, multi-turn conversations, and support for cURL, Python, JavaScript & Java. Fast • Free • No Auth Required • Built by Aquib
Yuzhe-W
CausalFlip is an LLM causal-judgment benchmark featuring semantically similar, label-flipped question pairs over event triples with causal structures of confounder, chain, and collider, designed to evaluate causal judgments grounded in causal structure rather than spurious semantic correlations.
Ch3rry-Pi3-AI
No description available
Mansi168
No description available
heyiamantara
Flipkart product recommendation system using RAG and LLMOps, deployed with Docker, Kubernetes, and cloud infrastructure.
sashsinha
NQMP is a tiny, deterministic llm benchmark focused on logical sensitivity to small prompt flips.
dorkish
Behavioral phase transitions in LLM belief revision: inverted-U flip rate, two-channel dissociation, and framing-only interventions. Paper + data + code.
callmesadbaby
PASB-bench (Lite) is a minimal benchmark for testing stable regimes (UR) in LLMs. Includes Persona Flip, Non-commutativity, and Antilexical tests with stability metrics. Supports OpenAI API and HuggingFace models. First step toward an open standard of LLM stability.
DeependraVerma
No description available
Ravi-Teja4
No description available
gfmasca
Personal project for my brother
bair-climate-initiative
No description available
Ravi-Teja-konda
Tiny experiments that evaluate how different social-cue prompts influence large language model (LLM) decision-making on HLE dataset multiple-choice questions.
Manikhandan
No description available
ElDincexYT
This repository explores how social cues impact decision-making in large language models using a structured pipeline. Join us in analyzing various prompts and their effects on responses to multiple-choice questions! 🐙✨
sanmj-learning
No description available
samlangborgh
A slightly overkill way to flip a coin
rcorona
No description available
sushant097
No description available
vibecodervicky
No description available
Aakash326
No description available
lokesh-kummari
No description available
amaithi-sam
Groq, HF, Langchain, Flask, Docker, AWS, grafana, kubectl, promtheus, astra DB
aignishant
No description available
numan8
No description available
No description available
jwjeong92
Framework for testing bit flip resilient LLMs.