Parse academic papers and research documents from PDF using MinerU. Extracts structured content including title, abstract, sections, figures, tables, formula...
Security Analysis
High confidence: The skill's requirements, runtime instructions, and install options are consistent with a PDF-to-structured-paper parser built on MinerU and do not request unrelated access.
The name/description (paper parsing via MinerU) matches the declared binary (mineru-open-api), the declared primary env var (MINERU_TOKEN), and the documented CLI commands.
SKILL.md only instructs use of the mineru-open-api CLI (extract/auth/crawl) on PDFs and URLs; it does not direct the agent to read unrelated files, secrets, or system config.
Install options are standard: an npm package and a Go install from a GitHub repo. The spec contains no downloads from untrusted URLs and no archive-extraction steps.
Only one credential (MINERU_TOKEN) is required and is appropriate for an authenticated API/CLI. No unrelated secrets, config paths, or broad credential requests are present.
The skill is not forced always-on, does not modify other skills, and limits its runtime behavior to invoking the mineru-open-api CLI. Autonomous invocation is allowed, which is the platform default.
Guidance
This skill appears coherent for parsing academic PDFs with MinerU. Before installing, verify that the npm package name and the GitHub repo (https://github.com/opendatalab/MinerU-Ecosystem) match the vendor site (https://mineru.net), inspect the repo or package code if possible, and limit the scope of MINERU_TOKEN by creating a dedicated token. Install the CLI in a controlled environment, and avoid supplying unrelated or high-privilege credentials. If you work with sensitive documents, test on non-sensitive PDFs first to confirm outputs and behavior.
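Part of this guidance can be checked mechanically before the skill is first used. The following is a minimal sketch in Python, assuming only what the listing states: the binary is named mineru-open-api and the credential is read from MINERU_TOKEN. The function name preflight and its parameters are hypothetical; it does not invoke the CLI, and it cannot confirm that the token is valid or narrowly scoped.

```python
import os
import shutil


def preflight(env=None, which=shutil.which):
    """Return a list of problems that would stop the skill from running.

    Hypothetical helper: `env` and `which` are injectable so the check
    can be exercised without touching the real environment.
    """
    env = os.environ if env is None else env
    problems = []
    # The listing declares mineru-open-api as the required binary.
    if which("mineru-open-api") is None:
        problems.append("mineru-open-api not found on PATH")
    # The listing declares MINERU_TOKEN as the only required credential.
    if not env.get("MINERU_TOKEN"):
        problems.append("MINERU_TOKEN is not set")
    return problems


if __name__ == "__main__":
    for problem in preflight():
        print("preflight:", problem)
```

An empty list means both declared requirements are present. Verifying the package source and token scope still has to be done by hand.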
Latest Release
v0.4.0
SEO: expand description for better ClawHub vector search discovery
Published by @mzlzyca on ClawHub