How to scan an AI agent skill for malware before you install it
Most users install ClawHub skills without reviewing the code. Here is a repeatable, five-minute process that catches the threats that visual review and star ratings miss.
Why this matters
- Snyk's ToxicSkills research found that 13.4% of scanned skills have critical-level security issues — most are invisible to casual review.
- A structured scan takes under five minutes and catches C2 callbacks, data exfiltration, prompt injection, and obfuscated payloads.
- Running the scan before every install, not just for unfamiliar authors, is the correct security posture.
- Scan results are only useful if you understand what each finding means. TrustSkills explains every flag in plain English.
Why visual review is not enough
A common instinct is to open the skill's CLAUDE.md or AGENTS.md file and read through it. That catches obvious problems, but it misses the threats that ClawHavoc actually used: C2 callback URLs embedded in benign-looking tool definitions, environment variable exfiltration disguised as normal HTTP requests, and obfuscated payloads using base64 or eval that only decode at runtime.
Snyk's ToxicSkills study found 76 skills across ClawHub and skills.sh that contained malicious payloads in their markdown instructions alone — content that looks like documentation but is actually attacker-controlled instructions passed to the model. None of those would be caught by checking star ratings or publisher reputation.
Step 1: Download the skill zip from ClawHub
On any ClawHub skill page, find the 'Download zip' button. Click it to save the .zip file to your local machine. Do not install or run the skill yet.
If you are evaluating a skill you found outside ClawHub — on GitHub, a blog post, or a third-party marketplace — download the package in whatever format it is distributed and confirm it contains a CLAUDE.md, AGENTS.md, or equivalent skill descriptor file.
Step 2: Upload to TrustSkills
Go to trustskills.app and drop the .zip file into the scanner. You can also paste a ClawHub skill URL directly if you prefer not to download the file first.
The scan runs server-side. No data is stored. Results are typically returned in under ten seconds.
Step 3: Read the report
TrustSkills runs 10+ static checks and returns a risk level (safe, low, medium, high, or critical) along with a plain-English explanation of each finding. The report groups findings by category: C2 callbacks, data exfiltration, obfuscated payloads, reverse shells, prompt injection, permission mismatches, and suspicious network patterns.
For each finding, the report explains what the code is doing, why it is suspicious, and what the practical risk is in non-technical language. The goal is to give you enough context to make a real decision, not just a list of flags to investigate.
Step 4: Act on the findings
If the report is clean, you can proceed with installation using your standard process. If the report returns any medium, high, or critical findings, decide whether the skill is necessary and whether the risk is acceptable given your environment.
- C2 callback findings: The skill is communicating with an external server that is not declared in its documentation. This is a strong indicator of malicious intent.
- Data exfiltration findings: The skill is attempting to read and transmit sensitive files such as .env, SSH keys, or AWS credentials. Do not install.
- Prompt injection findings: The skill contains instructions embedded in its descriptor that could alter model behavior in unintended ways. Treat as high risk.
- Obfuscated payload findings: The skill contains encoded or eval-wrapped code that obscures what it actually executes. Review carefully before installing.
- Excessive permission findings: The skill declares shell execution or write access that is not consistent with its stated purpose. Consider whether you need it.
Step 5: Make scanning part of your install process
A one-time scan is better than nothing, but the correct posture is to scan every skill before every install, even from publishers you have used before. ClawHavoc showed that trusted-looking publisher accounts can be compromised. A publisher who shipped a clean skill last month might ship a malicious update today.
For teams managing multiple OpenClaw deployments, use the TrustSkills waitlist to get notified when monitoring features launch. The monitoring tier will track hash changes and alert you when a skill you have already installed changes its behavior.
Trusted sources
Snyk
ToxicSkills: Malicious Payloads in AI Agent Skills Supply Chain
Source for the 13.4% critical findings rate and the 76 markdown-embedded payload finding.
Palo Alto Networks Unit 42
OpenClaw's Skill Marketplace and the Emerging AI Supply Chain Threat
Source for the behavioral integrity mismatch data that motivates static analysis beyond visual review.
OWASP
LLM03:2025 Supply Chain
Framework for supply chain risk evaluation and provenance verification.