AI/LLM Security Articles

back to AI/LLM Security

AI/ML Security & Red Teaming

AI/LLM Security → AI/ML Security & Red Teaming

AI/LLM Security Members

2 weeks ago

The AI Feature Nobody Audited

Companies bolt an LLM chatbot onto their product and nobody pen-tests it — "it is just the AI feature." But the AI is a new untrus…

20m

Deepfakes & Synthetic Media for Social Engineering

AI/LLM Security Members

3 weeks ago

Deepfakes & Synthetic Media for Social Engineering

Three seconds of audio clones a voice. One photo drives a live face-swap on a video call. The $25M Arup wire fraud ran on a fully …

22m

AI/LLM Security Members

3 weeks ago

AI Code Assistant Risks

The AI pair-programmer reads your code, suggests new code, and increasingly edits files and runs commands — and each is a security…

12m

Defenses — Guardrails & Constitutional AI

AI/LLM Security Members

3 weeks ago

Defenses — Guardrails & Constitutional AI

Every attack in this track meets the same answer: defense-in-depth, because no single control holds. Safety gets baked into the we…

16m

AI/LLM Security Members

3 weeks ago

LLM-assisted Pentesting

Turn the model around and it becomes the attacker's tool — a copilot that parses recon and drafts payloads, or an autonomous agent…

14m

AI/LLM Security Premium

3 weeks ago

AI Red Teaming Methodology

Red-teaming an AI system is not a classic pentest — the failures are behavioural and probabilistic, so you hunt for harmful output…

18m

AI/LLM Security Members

3 weeks ago

Backdoored Models

A backdoored model is correct on everything you test — and wrong exactly when the attacker wants. A secret trigger, a pixel patch …

14m

AI/LLM Security Premium

3 weeks ago

Model Inversion & Membership Inference

These attacks do not make a model misbehave — they make it confess. By reading ordinary outputs (a label, a confidence score), an …

17m

AI/LLM Security Premium

3 weeks ago

Adversarial ML — Evasion Attacks

A perturbation too small for a human to see can flip a model from "panda, 58%" to "gibbon, 99%". Evasion attacks nudge an input ac…

12m

AI/LLM Security Members

3 weeks ago

Model Supply Chain

Downloading a model is running a stranger's code and data. You pull weights from a hub by name, often automatically at runtime, an…

13m