AI/LLM Security Articles

back to AI/LLM Security

All AI/LLM Security articles

AI/LLM Security → All AI/LLM Security articles

AI/LLM Security Premium

3 weeks ago

AI Red Teaming Methodology

Red-teaming an AI system is not a classic pentest — the failures are behavioural and probabilistic, so you hunt for harmful output…

18m

AI/LLM Security Premium

3 weeks ago

LLM Agent Security

Give a model tools and a loop and it stops being a chatbot and becomes an actor — it can send the email, run the code, move the mo…

17m

AI/LLM Security Premium

3 weeks ago

Training Data Extraction from LLMs

Large language models do not only generalise — they memorise, and memorised text can be pulled back out word for word. Feed the ri…

15m

AI/LLM Security Premium

3 weeks ago

Model Inversion & Membership Inference

These attacks do not make a model misbehave — they make it confess. By reading ordinary outputs (a label, a confidence score), an …

17m

AI/LLM Security Premium

3 weeks ago

Adversarial ML — Evasion Attacks

A perturbation too small for a human to see can flip a model from "panda, 58%" to "gibbon, 99%". Evasion attacks nudge an input ac…

12m

AI/LLM Security Premium

3 weeks ago

Vector Database Security

A vector database is still a database — it just holds embeddings and metadata behind an API. In the rush to ship RAG, teams skippe…

12m

AI/LLM Security Premium

3 weeks ago

RAG Pipeline Attacks

RAG bolts a search step onto the model: before answering, the app retrieves chunks from a knowledge base and pastes them into the …

16m

AI/LLM Security Premium

3 weeks ago

Insecure Output Handling

A model's output is untrusted input to whatever consumes it next. The app trusts it because "we generated it" — but the model is s…

14m

AI/LLM Security Premium

3 weeks ago

LLM Data Exfiltration

A model holds secrets in its context — the system prompt, retrieved documents, earlier turns, tool outputs. Exfiltration is the pr…

14m

AI/LLM Security Premium

3 weeks ago

Jailbreaking Techniques

A refusal is a learned behaviour, not an enforced rule — which is exactly why it can be steered around. Personas, fictional framin…

16m