Quiz Report Card: Kubernetes SSRF

Reference Answer

Based on the referenced sources, there are four main SSRF vectors in a standard kubeadm cluster:

Control Plane SSRFs

1. API Server Proxy (most significant)

The API server has built-in proxy endpoints: nodes/proxy, pods/proxy, services/proxy
The API server makes HTTP requests on behalf of the authenticated user from the control plane’s network position
Advanced techniques: fake node objects with arbitrary addresses, pod status IP manipulation, CVE-2020-8562 TOCTOU DNS rebinding bypass, API server self-authentication
RBAC: requires get on {nodes,pods,services}/proxy
Highest impact — full request manipulation with custom headers and HTTP methods

2. Validating/Mutating Admission Webhooks

Creating ValidatingWebhookConfiguration or MutatingWebhookConfiguration pointing to internal targets
API server makes HTTP POST to the webhook URL during admission control
Enables blind SSRF and port scanning from the API server’s network perspective via error message differentiation
RBAC: requires ability to create webhook configurations

Node SSRFs

3. Pod Image Reference

Container image specifications are processed as URLs by the kubelet
Error messages reveal connectivity details (open/closed/unreachable)
Enables blind SSRF from the node’s network perspective
RBAC: requires ability to create pods

4. Pod Readiness/Liveness Probes and Lifecycle Hooks

HTTP-based probes are executed by the kubelet with support for custom headers
Enables flexible blind SSRF from the node network
RBAC: requires ability to create pods

Scoring Criteria

API Server Proxy: The primary vector — should be identified with specific sub-resources
Admission Webhooks: Important secondary vector — port scanning capability
Pod Image Reference: Node-level SSRF via kubelet image pulls
Pod Probes/Hooks: Node-level SSRF via HTTP probes
Security implications: Discussion of what an attacker can achieve (IMDS access, internal scanning, etc.)
Accuracy: Vectors must be part of standard kubeadm — Ingress controllers, dashboards, etc. are additional software

Results Summary

Model	Score	API Proxy	Webhooks	Image Ref	Probes	Extra Vectors	Accuracy
anthropic/claude-opus-4.7	7/10	Yes	Yes	Yes	No	APIService, ExternalName	Accurate
google/gemini-3-flash-preview	8/10	Yes	Yes	Yes	No	CRD webhooks, APIService, PV	Accurate
anthropic/claude-sonnet-4.6	5/10	Yes (detailed)	No	No	No	None	Accurate
deepseek/deepseek-v3.2	5/10	Yes	Yes (brief)	No	No	CVE-2020-8555	Includes non-standard
openai/gpt-5.4	4/10	Yes	No	No	No	None	Accurate
minimax/minimax-m2.5	3/10	Buried in footnote	Yes (brief)	No	No	Ingress (wrong)	Focuses on non-standard
minimax/minimax-m2.7	6/10	Partial	Yes	No	No	Some	Good
qwen/qwen3.6-plus	4/10	Yes	No	No	No	None	Fabricated version history
deepseek/deepseek-v4-pro	6/10	Yes	Yes	No	No	None	Limited breadth
deepseek/deepseek-v4-flash	5/10	Yes	Yes	No	No	None	Misses API proxy detail
moonshotai/kimi-k2.6	5/10	Yes	No	No	No	None	Only covers API proxy
openai/gpt-5.5	6/10	Yes	No	No	No	Services/Endpoints/EndpointSlices	Accurate but incomplete
qwen/qwen3.6-35b-a3b (LOCAL)	5/10	Yes	Partial	No	No	Missing pod image pull and probe vectors
anthropic/claude-opus-4.8	8/10	Yes	Yes	No	No	APIService, ExternalName	Accurate
google/gemma-4-31b (LOCAL)	6/10	Yes	Yes	No	No	Good on main vectors; missing node-level SSRFs
qwen/qwen3.7-plus	8/10	Yes	Yes	No	No	APIService, ExternalName	Accurate
minimax/minimax-m3	9/10	Yes	Yes	No	No	APIService, CVEs	Accurate
anthropic/claude-fable-5	0/10	—	—	—	—	—	EMPTY response
moonshotai/kimi-k2.7-code	7/10	Yes	Yes	No	No	APIService, DNS	Accurate
z-ai/glm-5.2	8/10	Yes	Yes	No	No	APIService, ExternalName	Accurate
mistralai/mistral-medium-3-5	7/10	Yes	Yes	No	No	APIService	Includes non-SSRF items
anthropic/claude-sonnet-5	5/10	Yes	No	No	No	None	Accurate but narrow
tencent/hy3	6/10	Yes	Yes	No	No	APIService	Incorrectly prioritises aggregation over API proxy
openai/gpt-5.6-terra	6/10	Yes	No	No	No	None	Only covers API proxy
openai/gpt-5.6-sol	5/10	Yes	No	No	No	None	Only covers API proxy
moonshotai/kimi-k3	9/10	Yes	Yes	No	No	Endpoints/EndpointSlices, APIService, image pulls	Accurate
xiaomi/mimo-v2.5	4/10	Yes	No	No	No	None	Only covers API proxy

Detailed Analysis

anthropic/claude-opus-4.7 — 7/10

Strengths:

Covers 3 of 4 reference vectors: API server proxy, admission webhooks, container image pulling
APIService/aggregation layer mentioned (bonus)
ExternalName services noted as related concern

Weaknesses:

Missing pod probes/lifecycle hooks (4th core vector)
OIDC/service-account-issuer-discovery is speculative

Comparison vs Opus 4.6 (6): Improvement. Better structured, covers webhooks and image reference more clearly.

Notable: A significant step up from Opus 4.6 (which only covered API proxy). Covering 3 of 4 core vectors puts Opus 4.7 second only to Gemini 3 Flash on this question.

google/gemini-3-flash-preview — 8/10

Strengths:

Best coverage by far — identifies 3 of the 4 reference SSRF vectors plus 3 additional valid vectors
API Server Proxy: Well explained with mechanism, impact, and IMDS risk
Admission Webhooks: Correctly identifies both validating and mutating webhooks with port scanning use case and data exfiltration risk
Pod Image Reference: Correctly identifies kubelet image pull as SSRF via timing/behaviour observation
CRD Conversion Webhooks: Valid additional vector (API server makes requests to conversion webhook URLs)
Aggregated API Servers: Valid additional vector (APIService objects can point to arbitrary targets)
PV Provisioning: Valid additional vector (controller-manager contacts storage endpoints)
Good mitigation section including IMDSv2 recommendation
All vectors are part of standard Kubernetes or legitimate extensions

Weaknesses:

Missing pod probes/lifecycle hooks as an SSRF vector
Could have provided more detail on the admission webhook port scanning technique (error message differentiation)

Notable: The most comprehensive SSRF answer across all models by a wide margin. Identifies vectors at both the control plane level (API server proxy, webhooks, APIService) and node level (image pulls), plus storage-level vectors. Shows genuine understanding of the SSRF attack surface.

anthropic/claude-sonnet-4.6 — 5/10

Strengths:

API Server Proxy covered in excellent detail: all three sub-resources (nodes/proxy, pods/proxy, services/proxy), concrete curl examples, RBAC permissions table
Good explanation of why proxy is SSRF: “API server itself makes outbound HTTP requests on behalf of the attacker”
Specific IMDS/metadata endpoint risk noted
Good mitigation examples (RBAC restriction, NetworkPolicy for metadata blocking)

Weaknesses:

Only covers one of four reference vectors — missing admission webhooks, pod image reference, and pod probes entirely
Mentions kubectl exec and port-forward as SSRF — these are more accurately lateral movement/pivoting, not true SSRF
DNS-Based SSRF section is vague and doesn’t describe a concrete mechanism
No mention of advanced proxy techniques (fake nodes, pod status manipulation, CVE-2020-8562)

Notable: Deep coverage of the API server proxy vector but a narrow answer. The RBAC permissions table is the most practical of all responses for the proxy vector specifically, but missing three other vectors limits the score significantly.

deepseek/deepseek-v3.2 — 5/10

Strengths:

API Server Proxy well covered with specific endpoint paths
Admission Webhooks mentioned (brief but present)
References CVE-2020-8555 (LoadBalancer/ExternalIP SSRF) — a real Kubernetes SSRF CVE, showing awareness of the CVE landscape
Good mitigation section with NetworkPolicy example
Practical monitoring suggestions (multiple proxy requests, metadata access)

Weaknesses:

Includes non-standard features as vectors: Kubernetes Dashboard and Ingress Controllers are not part of a standard kubeadm cluster — the question specifically says “standard Kubeadm based cluster”
Admission webhooks coverage is very brief (one sentence)
Missing pod image reference and pod probes vectors
The Dashboard section is particularly misleading — kubeadm doesn’t install the Dashboard

Notable: Shows broader security awareness (CVE references, monitoring suggestions) but accuracy suffers from including non-standard components. The CVE-2020-8555 reference is interesting but is a different class of SSRF than the reference material’s focus.

openai/gpt-5.4 — 4/10

Strengths:

API Server Proxy correctly identified with all three sub-resources
Correctly identifies nodes/proxy as the most dangerous
Accurate explanation of why it constitutes SSRF (API server becomes the requester)
No incorrect claims — everything stated is accurate

Weaknesses:

Only covers one vector — API server proxy alone
Very brief response with no concrete examples or RBAC details
No mention of admission webhooks, pod image reference, or pod probes
No discussion of advanced techniques, CVEs, or mitigations
Offered to provide more detail but didn’t in the initial response

Notable negative: The most minimal answer across all models. While accurate, a question about SSRF attack surface deserves more than one vector. The brevity suggests limited awareness of the full SSRF landscape in Kubernetes.

minimax/minimax-m2.5 — 3/10

Strengths:

Admission webhooks mentioned (section 5)
API server proxy mentioned in the final note
Some awareness of the API server as SSRF source

Weaknesses:

Focuses on Ingress Controllers as the “Key Attack Vector” — Ingress controllers are NOT part of standard kubeadm. This is the wrong answer to the question.
The actual main SSRF vector (API server proxy) is buried in a footnote at the very end, almost as an afterthought
Section on ServiceAccount tokens and kubelet API describes pivoting/lateral movement, not SSRF
Missing pod image reference and pod probes vectors
The “Key Attack Vector” section being wrong fundamentally undermines the response
Vague throughout — no concrete examples, no specific endpoint paths

Notable negative: Focusing on Ingress controllers as the primary SSRF vector in a “standard Kubeadm based cluster” is a significant error. Kubeadm doesn’t install an Ingress controller. The real answer (API server proxy) is mentioned almost parenthetically at the bottom.

minimax/minimax-m2.7 — 6/10

Strengths:

Covers webhook configurations (mutating/validating) as SSRF vector
Mentions API server proxy
Covers aggregated API servers
Custom controllers/operators discussed
Good mitigation strategies

Weaknesses:

Missing detailed coverage of API server proxy (the most important vector)
Doesn’t deeply explore webhook-based SSRFs
kubectl proxy section focuses on client-side rather than server-side
Kubelet API section mentions exec/attach/portforward but unclear SSRF explanation

Notable: Good breadth at high level but lacks depth in the most important areas. Similar profile to Opus (also 6/10).

qwen/qwen3.6-plus — 4/10

Strengths:

Correctly identifies the API Server Proxy as the primary SSRF vector with all three sub-resources: services/proxy, nodes/proxy, pods/proxy
Good explanation of why it constitutes SSRF: API server makes requests from the control plane’s network position
Correct RBAC requirements for proxy access
Accurate list of targets: cloud metadata, internal services, etcd

Weaknesses:

Only covers one vector — API server proxy alone, missing admission webhooks, pod image reference, and pod probes
Fabricates version history: Claims “Kubernetes 1.20+: Proxy endpoints are disabled by default” and references --enable-legacy-api-endpoints=false — this flag does not exist and proxy endpoints remain fully functional in modern Kubernetes. This is a significant factual error that could give false confidence.
No mention of admission webhook port scanning
No mention of node-level SSRFs (image pulls, probes)
The evolution/mitigation section contains multiple inaccurate claims

Notable negative: The fabricated version history is the most concerning aspect. Claiming proxy endpoints are “disabled by default” in 1.20+ is wrong — they are core API server functionality that cannot be disabled. An administrator reading this would incorrectly believe they are protected when they are not.

deepseek/deepseek-v4-pro — 6/10

Strengths:

Identifies API server proxy subresources as the primary SSRF vector
Correctly covers dynamic admission webhooks as a secondary vector

Weaknesses:

Lacks depth and misses other important attack paths (pod image reference, pod probes)
Limited breadth compared to top scorers

Notable: An improvement over DeepSeek V3.2 (5/10), which included non-standard components (Dashboard, Ingress). V4 Pro correctly scopes to standard kubeadm and covers the two main vectors, but doesn’t match Gemini 3 Flash’s (8/10) comprehensive coverage.

deepseek/deepseek-v4-flash — 5/10

Strengths:

Identifies admission webhooks as an SSRF vector
Mentions API server proxy at a basic level

Weaknesses:

Misses the API server proxy as the primary and most significant vector — while mentioned, lacks the depth needed (specific sub-resources, RBAC requirements, advanced techniques)
Missing pod image reference SSRF (node-level)
Missing pod probes/lifecycle hooks SSRF (node-level)
Limited breadth overall

Notable: Scores below V4 Pro (6/10) on this question. Gets admission webhooks correct but doesn’t identify the API server proxy as the dominant SSRF vector with appropriate detail. The entire DeepSeek family (V3.2: 5, V4 Pro: 6, V4 Flash: 5) struggles with SSRF breadth compared to Gemini 3 Flash (8) and Opus 4.7 (7).

openai/gpt-5.5 — 6/10

Strengths:

Correctly identifies the core issue: API server proxy subresources (pods/proxy, services/proxy, nodes/proxy)
Good explanation of the SSRF mechanism: “the request originates from the API server’s network position”
Correctly notes RBAC permissions required for proxy access
Mentions Services, Endpoints, and EndpointSlices as objects an attacker can manipulate to control the proxy target

Weaknesses:

Misses the validating/mutating admission webhook SSRF vector entirely — this is the second most important vector after API server proxy
Missing pod image reference as an SSRF vector (node-level)
Missing pod probes/lifecycle hooks as an SSRF vector (node-level)
Lacks breadth for a “list as many as possible” style question — only covers one vector family
No mention of advanced techniques (fake nodes, pod status manipulation, CVE-2020-8562)

Notable: Scores slightly above GPT 5.4 (4/10) thanks to the Services/Endpoints/EndpointSlices detail, which shows understanding of the proxy manipulation mechanism. However, like GPT 5.4, it only covers the API server proxy vector family. Both OpenAI models struggle with SSRF breadth compared to Gemini 3 Flash (8/10) and Opus 4.7 (7/10).

moonshotai/kimi-k2.6 — 5/10

Strengths:

Correctly identifies the API Server Proxy as the primary SSRF vector
Accurate explanation of the SSRF mechanism

Weaknesses:

Only covers the API proxy SSRF vector — misses admission webhooks, pod image reference, and pod probes
No mention of webhook-based port scanning
No mention of node-level SSRFs (image pulls, probes)

Notable: Matches Sonnet and DeepSeek V3.2/V4 Flash at 5/10. Like most models, only identifies the API server proxy vector without covering the broader SSRF attack surface. The question rewards breadth, and covering only one vector family limits the score.

qwen/qwen3.6-35b-a3b (LOCAL) — 5/10

Strengths:

Identifies API server proxy subresource as the primary SSRF vector
Mentions admission webhooks as a secondary vector (partial coverage)
Accurate explanation of the API server proxy SSRF mechanism

Weaknesses:

Missing pod image reference SSRF (node-level) — kubelet image pulls as blind SSRF
Missing pod readiness/liveness probes and lifecycle hooks as SSRF vectors (node-level)
Limited depth on cloud metadata service access via proxy paths
Webhook coverage is partial — doesn’t explain the port scanning technique via error message differentiation

Notable: Matches Sonnet, DeepSeek V3.2/V4 Flash, and Kimi K2.6 at 5/10. Covers slightly more than the single-vector models (GPT 5.4, Qwen 3.6 Plus at 4/10) by mentioning webhooks, but lacks the breadth to score higher. No fabricated vectors — errors are omissions.

google/gemma-4-31b (LOCAL) — 6/10

Strengths:

Correctly identifies the API server proxy as the primary SSRF vector with the key sub-resources (nodes/proxy, pods/proxy, services/proxy)
Covers admission webhooks as a secondary SSRF vector — correctly identifies the port scanning technique via error message differentiation
Accurate explanation of what an attacker can achieve (cloud metadata service access, internal scanning)
No fabricated vectors — all items are part of standard kubeadm

Weaknesses:

Missing pod image reference SSRF (node-level) — kubelet image pulls as blind SSRF
Missing pod readiness/liveness probes and lifecycle hooks as SSRF vectors (node-level)
Limited depth on advanced API proxy techniques (fake node objects, CVE-2020-8562)

Notable: Scores above the 5/10 cluster by correctly covering both the API proxy and admission webhook vectors with reasonable depth. Matches MiniMax M2.7, DeepSeek V4 Pro, and GPT 5.5 at 6/10. The node-level SSRF vectors (image pulls, probes) remain the differentiator for the top scorers — only Gemini 3 Flash (8/10) and Opus 4.7 (7/10) identified them.

anthropic/claude-opus-4.8 — 8/10

Strengths:

Covers the two main SSRF vectors: API server proxy and admission webhooks
APIService/aggregation layer mentioned as additional vector (bonus)
ExternalName services noted as related concern
Cloud metadata risk well covered
Good explanation of webhook port scanning technique

Weaknesses:

Missing pod image reference as SSRF vector (node-level)
Missing pod probes/lifecycle hooks as SSRF vectors (node-level)

Comparison vs Opus 4.7 (7): Improvement. Better depth on webhook SSRF and aggregation layer. Ties with Gemini 3 Flash for the top score.

Notable: Ties with Gemini 3 Flash at 8/10, making these two the only models to score 8 on SSRF. A significant improvement over Opus 4.7 (7/10) — covers webhooks with better depth and the aggregation layer more thoroughly. The node-level SSRFs (image pulls, probes) remain the gap that prevents a higher score, but the control plane SSRF coverage is comprehensive.

qwen/qwen3.7-plus — 8/10

Strengths:

Correctly identifies the API server proxy as the primary SSRF vector with key sub-resources (nodes/proxy, pods/proxy, services/proxy)
Covers admission webhooks (validating/mutating) as a secondary SSRF vector with port scanning use case
APIService/aggregation layer mentioned as additional vector (bonus)
ExternalName services noted as related concern
Cloud metadata risk well covered
All vectors correctly scoped to standard kubeadm

Weaknesses:

Missing pod image reference as SSRF vector (node-level)
Missing pod probes/lifecycle hooks as SSRF vectors (node-level)

Notable: Ties with Gemini 3 Flash and Opus 4.8 at 8/10 — a dramatic improvement over Qwen 3.6 Plus (4/10, which only covered API proxy and had fabricated version history). The Qwen 3.7 Plus response covers both main control plane SSRF vectors without the factual errors that plagued its predecessor. The node-level SSRFs (image pulls, probes) remain the gap that prevents a higher score.

minimax/minimax-m3 — 9/10

Strengths:

Identifies all three main SSRF vectors: API server proxy (with specific sub-resources), admission webhooks (validating/mutating), and API aggregation layer
References CVEs related to Kubernetes SSRF, demonstrating awareness of the real-world attack landscape
Comprehensive coverage of the API server proxy vector with mechanism explanation
All vectors correctly scoped to standard kubeadm — no non-standard components included

Weaknesses:

Missing pod image reference as SSRF vector (node-level)
Missing pod probes/lifecycle hooks as SSRF vectors (node-level)
Minor — could discuss SSRF via CRI socket or other edge cases

Notable: NEW SOLE LEADER at 9/10, surpassing the previous three-way tie at 8/10 (Opus 4.8, Gemini 3 Flash, Qwen 3.7 Plus). The strongest SSRF answer across all models, covering both main control plane vectors plus the aggregation layer with CVE references. A dramatic improvement over MiniMax M2.5 (3/10, which focused on non-standard Ingress controllers) and MiniMax M2.7 (6/10). The node-level SSRFs (image pulls, probes) remain the only gap, but the control plane SSRF coverage is the most thorough of any model.

anthropic/claude-fable-5 — 0/10

Strengths:

None — no content generated.

Weaknesses:

Completely empty response — Fable 5 produced no content at all for this question. This appears to be a safety guardrail refusing to engage with the topic of SSRF attack vectors in Kubernetes.

Notable: The second question where Fable 5 scores 0/10 due to an empty response (alongside Kubelet API). The SSRF question explicitly asks about “malicious authenticated attacker” scenarios, which likely triggered safety guardrails. This is the same pattern seen on the Kubelet API question — both involve offensive security topics. Even MiniMax M2.5 (3/10, the previous lowest scorer on SSRF) attempted an answer. The safety refusal represents a fundamental limitation for security assessment use cases where understanding attack vectors is essential for defence.

moonshotai/kimi-k2.7-code — 7/10

Strengths:

Proxy subresources correctly identified (pods, services, nodes)
Admission webhooks as SSRF vectors
API aggregation layer as an attack surface
Mentions service discovery and DNS-based SSRF

Weaknesses:

Coverage is correct but brief — top scorers provide more detailed exploitation scenarios
Missing specific mention of how node/proxy can be used to reach the kubelet API
Could elaborate more on the aggregation layer attack surface

Notable: An improvement over K2.6 (5/10), which only covered the API proxy vector. K2.7 Code gains 2 points by covering admission webhooks and the aggregation layer. Matches Opus 4.7 at 7/10, below MiniMax M3’s sole lead at 9/10.

z-ai/glm-5.2 — 8/10

Strengths:

Correctly identifies the API server proxy as the primary SSRF vector with key sub-resources (nodes/proxy, pods/proxy, services/proxy)
Covers admission webhooks (validating/mutating) as a secondary SSRF vector
Good coverage of aggregated API servers and ExternalName services as additional vectors
All vectors correctly scoped to standard kubeadm — no non-standard components included

Weaknesses:

Missing pod image reference as SSRF vector (node-level)
Missing pod probes/lifecycle hooks as SSRF vectors (node-level)

Notable: Ties with Opus 4.8, Gemini 3 Flash, and Qwen 3.7 Plus at 8/10, just below MiniMax M3’s sole lead at 9/10. A strong SSRF showing with good coverage of the main control plane vectors plus additional valid vectors (APIService, ExternalName). The node-level SSRFs (image pulls, probes) remain the gap that separates the 8/10 cluster from a higher score.

mistralai/mistral-medium-3-5 — 7/10

Strengths:

Correctly identifies the API server proxy as the primary SSRF vector with key sub-resources
Covers admission webhooks (validating/mutating) as a secondary SSRF vector
Good coverage of the API aggregation layer as an additional vector

Weaknesses:

Includes non-SSRF items: hostNetwork and hostPath are listed as SSRF vectors when they are not — hostNetwork provides direct network access (not server-side request forgery) and hostPath provides filesystem access, not HTTP request manipulation
Missing pod image reference as SSRF vector (node-level)
Missing pod probes/lifecycle hooks as SSRF vectors (node-level)

Notable: Matches Opus 4.7 and Kimi K2.7 Code at 7/10 with good coverage of the main control plane SSRF vectors. The inclusion of hostNetwork and hostPath as SSRF vectors is a common conflation between different attack categories — these provide direct access rather than indirect request forgery. The node-level SSRFs (image pulls, probes) remain the gap that separates the 7/10 cluster from the top scorers.

anthropic/claude-sonnet-5 — 5/10

Strengths:

Correctly identifies the API Server Proxy as the primary SSRF vector with key sub-resources (nodes/proxy, pods/proxy, services/proxy)
Accurate explanation of the SSRF mechanism
Good coverage of IMDS/metadata endpoint risks

Weaknesses:

Missing admission webhooks as an SSRF vector — the second most important vector after API server proxy
Missing pod image reference as SSRF vector (node-level)
Missing pod probes/lifecycle hooks as SSRF vectors (node-level)
Only covers the API server proxy vector family

Notable: Matches Sonnet 4.6, Kimi K2.6, DeepSeek V3.2/V4 Flash, and Qwen-35b at 5/10. Like most models, only identifies the API server proxy vector without covering the broader SSRF attack surface. The question rewards breadth, and covering only one vector family limits the score.

tencent/hy3 — 6/10

Strengths:

Identifies multiple SSRF vectors including API server proxy and admission webhooks
Covers the API aggregation layer as an additional vector
All vectors correctly scoped to standard kubeadm — no non-standard components included

Weaknesses:

Incorrectly prioritises API aggregation over API server proxy — the API server proxy (nodes/proxy, pods/proxy, services/proxy) is the most significant SSRF vector, but HY3 gives more weight to the aggregation layer
Missing pod image reference as SSRF vector (node-level)
Missing pod probes/lifecycle hooks as SSRF vectors (node-level)

Notable: Matches MiniMax M2.7, DeepSeek V4 Pro, GPT 5.5, and Gemma 4 31B at 6/10. Covers both main control plane SSRF vectors (API proxy and webhooks) but the incorrect prioritisation and missing node-level vectors limit the score. The question rewards breadth and accuracy of prioritisation.

openai/gpt-5.6-terra — 6/10

Strengths:

Covers API server proxy subresources (pods/proxy, services/proxy, nodes/proxy) as SSRF vectors

Weaknesses:

Completely misses validating admission webhooks as an SSRF mechanism
Only addresses one category of SSRF vectors rather than the multiple categories expected

Notable: Matches GPT 5.5 at 6/10 — both OpenAI flagship models struggle with SSRF breadth, covering only the API server proxy vector family without identifying admission webhooks or node-level SSRFs. The SSRF question rewards breadth across multiple vector categories, and the OpenAI family consistently covers only one (GPT 5.4: 4, GPT 5.5: 6, GPT 5.6 Terra: 6). Remains well below the top scorers: MiniMax M3 (9), Opus 4.8/Gemini 3 Flash/Qwen 3.7 Plus/GLM-5.2 (8).

openai/gpt-5.6-sol — 5/10

Strengths:

Covers API server proxy subresources (pods/proxy, services/proxy, nodes/proxy) as SSRF vectors
Accurate explanation of the SSRF mechanism

Weaknesses:

Completely misses admission webhook SSRF vectors — validating/mutating webhook configurations are the second most important SSRF vector after API server proxy, enabling port scanning from the API server’s network perspective
Only addresses one category of SSRF vectors rather than the multiple categories expected
Missing pod image reference as SSRF vector (node-level)
Missing pod probes/lifecycle hooks as SSRF vectors (node-level)

Notable: Scores below GPT 5.6 Terra (6/10) on this question. The SSRF question rewards breadth across multiple vector categories, and the OpenAI family consistently covers only the API server proxy family (GPT 5.4: 4, GPT 5.5: 6, GPT 5.6 Terra: 6, GPT 5.6 Sol: 5). Remains well below the top scorers: MiniMax M3 (9), Opus 4.8/Gemini 3 Flash/Qwen 3.7 Plus/GLM-5.2 (8).

moonshotai/kimi-k3 – 9/10

Strengths:

Exceptionally comprehensive SSRF coverage across multiple vector categories
API server proxy subresources correctly identified (pods/proxy, services/proxy, nodes/proxy) as the primary SSRF vector
Manual Endpoints and EndpointSlices manipulation for arbitrary-IP SSRF – a technique for redirecting service proxy traffic to attacker-controlled destinations
Validating/Mutating admission webhook configurations as blind SSRF and port scanning vectors
Aggregated API server registration (APIService objects pointing to internal targets)
Image pulls as blind SSRF from the node’s network perspective
IMDS metadata theft chain via selector-less services – demonstrates understanding of the cloud metadata exploitation path

Weaknesses:

Missing pod readiness/liveness probes and lifecycle hooks as SSRF vectors (node-level)
Could have elaborated more on CVE references

Notable: Ties with MiniMax M3 at 9/10 as co-leaders on SSRF. Both models cover the main control plane vectors comprehensively. Kimi K3’s response is notable for the Endpoints/EndpointSlices technique and the IMDS metadata theft chain, which demonstrate deeper understanding of SSRF exploitation paths. The Moonshot AI family’s SSRF trajectory: K2.6 (5), K2.7 Code (7), K3 (9) – a dramatic improvement across generations. Only the node-level probe SSRF vector remains uncovered, preventing a higher score.

xiaomi/mimo-v2.5 — 4/10

Strengths:

Correctly identifies the API server proxy as the primary SSRF enabler with a valid services/{name}:{port}/proxy example
Reasonable mitigation discussion (NetworkPolicies, RBAC, monitoring)

Weaknesses:

Single-vector answer to a question that rewards breadth — misses pods/proxy and nodes/proxy, validating/mutating admission webhooks (the documented API-server-as-port-scanner vector), aggregated API servers, manually created Endpoints, and image-pull blind SSRF
Some muddled framing (e.g., “authentication bypass” describing normal proxy credential handling)

Notable: Covers only the API-server-proxy vector on a question that rewards breadth — one of MiMo’s weakest answers.

Key Findings

Gemini 3 Flash dominated this question: With 3 of 4 reference vectors plus 3 valid bonus vectors, Gemini 3 Flash’s answer was dramatically more comprehensive than any other model’s. The gap between 1st and 2nd place is the largest across all quiz questions scored so far.
Most models only know the API server proxy: Four of five models identified the proxy endpoints, but only Gemini 3 Flash went beyond this to cover webhooks, image pulls, and other vectors. The SSRF attack surface in Kubernetes is broader than most models recognise.
Admission webhooks as port scanners is under-known: Only Gemini 3 Flash and DeepSeek V3.2 mentioned webhooks, and only Gemini 3 Flash described the port scanning technique. This is a documented and practical attack that most models missed.
Node-level SSRFs are almost completely missed: Pod image reference SSRF (only Gemini 3 Flash) and pod probe SSRF (no model) demonstrate that the kubelet’s HTTP request behaviour is poorly understood as an attack vector.
“Standard kubeadm” is an important qualifier: Both MiniMax M2.5 and DeepSeek V3.2 included Ingress controllers or Dashboards as vectors, but these are not part of stock kubeadm. Paying attention to the question’s scope matters.
Breadth matters for SSRF questions: This question specifically asks about “functionality” (plural implied) that enables SSRF. Identifying only the proxy endpoint — while correct — is an incomplete answer to a question about the attack surface.